Kullback–Leibler Divergence and Mutual Information of Partitions in Product MV Algebras

Markechová, Dagmar; Riečan, Beloslav

doi:10.3390/e19060267


by Dagmar Markechová ¹,* and Beloslav Riečan ²,³

¹ Department of Mathematics, Faculty of Natural Sciences, Constantine the Philosopher University in Nitra, A. Hlinku 1, SK-949 01 Nitra, Slovakia
² Department of Mathematics, Faculty of Natural Sciences, Matej Bel University, Tajovského 40, SK-974 01 Banská Bystrica, Slovakia
³ Mathematical Institute, Slovak Academy of Sciences, Štefánikova 49, SK-814 73 Bratislava, Slovakia
* Author to whom correspondence should be addressed.

Entropy 2017, 19(6), 267; https://doi.org/10.3390/e19060267

Submission received: 11 May 2017 / Revised: 5 June 2017 / Accepted: 7 June 2017 / Published: 10 June 2017

(This article belongs to the Section Information Theory, Probability and Statistics)


Abstract

The purpose of the paper is to introduce, using the known results concerning the entropy in product MV algebras, the concepts of mutual information and Kullback–Leibler divergence for the case of product MV algebras and to examine algebraic properties of the proposed measures. In particular, convexity of the Kullback–Leibler divergence with respect to states in product MV algebras is proved, and chain rules for mutual information and Kullback–Leibler divergence are established. In addition, the data processing inequality for conditionally independent partitions in product MV algebras is proved.

Keywords: product MV algebra; partition; Shannon’s entropy; Kullback–Leibler divergence; mutual information; conditional mutual information

1. Introduction

The notions of entropy and mutual information are fundamental concepts in information theory [1]; they are used as measures of the information obtained from a realization of the considered experiments. The standard approach in information theory is based on the Shannon entropy [2]. Consider a finite measurable partition $A$ of a probability space $(Ω, S, P)$ with probabilities $p_{1}, ..., p_{n}$ of the corresponding elements of $A$. We recall that the Shannon entropy of $A$ is the number

$$H (A) = - \sum_{i = 1}^{n} F (p_{i}),$$

where the function $F : [0, \infty) \to \mathbb{R}$ is defined by $F (x) = x \log x$ if $x > 0$, and $F (0) = 0$. A crucial step in the application of the Shannon entropy in another scientific field was the discovery of Kolmogorov and Sinai [3] (see also [4,5]). They showed the existence of non-isomorphic Bernoulli shifts, which describe independent repetitions of random experiments with a finite number of outcomes. If two dynamical systems are isomorphic, they have the same Kolmogorov–Sinai entropy; Kolmogorov and Sinai therefore constructed two Bernoulli shifts with different entropies, which are hence non-isomorphic. Naturally, this modification of entropy has been used in many mathematical structures. In [6], we generalized the notion of Kolmogorov–Sinai entropy to the case when the considered probability space is a fuzzy probability space $(Ω, M, μ)$ defined by Piasecki [7]. This structure can serve as an alternative mathematical model of probability theory for situations where the observed events are described vaguely (so-called fuzzy events). Other fuzzy generalizations of Shannon’s and Kolmogorov–Sinai’s entropy can be found e.g., in [8,9,10,11,12,13,14,15,16,17]. It is known that there are many possibilities for defining operations with fuzzy sets; an overview can be found in [18]. It should be noted that while the model presented in [6] was based on the Zadeh connectives [19], in our recently published paper [14] the Łukasiewicz connectives were used to define the fuzzy set operations. In [20], the mutual information of fuzzy partitions of a given fuzzy probability space $(Ω, M, μ)$ was defined. It was shown that the entropy of fuzzy partitions introduced and studied in [6] can be considered as a special case of their mutual information.

In classical information theory the mutual information is a special case of a more general quantity called Kullback–Leibler divergence (K–L divergence for short), which was originally introduced by Kullback and Leibler in 1951 [21] (see also [22]) as the divergence between two probability distributions. It plays an important role, as a mathematical tool, in the stability analysis of master equations [23] and Fokker–Planck equations [24], and in isothermal equilibrium fluctuations and transient nonequilibrium deviations [25] (see also [24,26]). In [27], we have introduced the concept of K–L divergence for the case of fuzzy probability spaces.

A natural generalization of some families of fuzzy sets is the notion of an MV algebra introduced by Chang [28]. An MV algebra is an algebraic structure that models the Łukasiewicz multivalued logic, more precisely the fragment of that calculus which deals with the basic logical connectives “and”, “or”, and “not” in a multivalued context. MV algebras play a similar role in multivalued logic as Boolean algebras do in classical two-valued logic. Recall also that families of fuzzy sets can be embedded into suitable MV algebras. MV algebras have been studied by many authors (see e.g., [29,30,31,32,33]) and, of course, there are also many results about the entropy on this structure (cf. [34,35]). The theory of fuzzy sets is a rapidly developing area of theoretical and applied mathematical research. In addition to MV algebras, generalizations of MV algebras such as D-posets (cf. [36,37,38]), effect algebras (cf. [39]), or A-posets (cf. [40,41]) are currently the subject of intensive research. Some results about the entropy on these structures can be found e.g., in [42,43,44].

A special class of MV algebras is a class of product MV algebras. They have been introduced independently in [45] from the point of view of probability theory, and in [46] from the point of view of mathematical logic. Product MV algebras have been studied e.g., in [47,48]. A suitable theory of entropy of Kolmogorov type for the case of product MV algebras has been constructed in [35,49,50].

The purpose of this contribution is to define, using the results concerning the entropy in product MV algebras, the concepts of mutual information and Kullback–Leibler divergence for the case of product MV algebras and to study properties of the suggested measures. The main results of the contribution are presented in Section 3 and Section 4. In Section 3 the notions of mutual information and conditional mutual information in product MV algebras are introduced and basic properties of the suggested measures are proved, inter alia, the data processing inequality for conditionally independent partitions. In Section 4 we define the Kullback–Leibler divergence in product MV algebras and its conditional version and examine the algebraic properties of the proposed measures. Our results are summarized in the final section.

2. Basic Definitions, Notations and Facts

In this section, we recall some definitions and basic facts which will be used in the following ones. An MV algebra [30] is a system $(M, \oplus, \otimes, *, 0, 1)$, where $M$ is a non-empty set, $\oplus$, $\otimes$ are binary operations on $M$, $*$ is a unary operation on $M$ and 0, 1 are fixed elements of $M$, such that the following conditions are satisfied:

(i): $a \oplus b = b \oplus a$ ;
(ii): $a \oplus (b \oplus c) = (a \oplus b) \oplus c;$
(iii): $a \oplus 0 = a;$
(iv): $a \oplus 1 = 1;$
(v): ${(a^{*})}^{*} = a;$
(vi): $0^{*} = 1;$
(vii): $a \oplus a^{*} = 1;$
(viii): ${(a^{*} \oplus b)}^{*} \oplus b = {(a \oplus b^{*})}^{*} \oplus a;$
(ix): $a \otimes b = {(a^{*} \oplus b^{*})}^{*} .$

An example of an MV algebra is the real interval $[0, 1]$ equipped with the operations $x \oplus y = \min (1, x + y)$, $x \otimes y = \max (0, x + y - 1)$ and $x^{*} = 1 - x$. It is interesting that any MV algebra has a similar structure. In fact, by the Mundici theorem [33] any MV algebra can be represented by a lattice-ordered Abelian group (shortly Abelian l-group). Recall that an Abelian l-group is an algebraic system $(G, +, \leq)$, where $(G, +)$ is an Abelian group, $(G, \leq)$ is a partially ordered set that is a lattice, and $a \leq b$ implies $a + c \leq b + c$.
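The standard example on $[0, 1]$ can be checked mechanically. The following is a minimal sketch, not part of the original paper: it evaluates conditions (i)–(ix) on a finite grid of rational points, so it illustrates the axioms rather than proving them. The helper names `oplus`, `otimes`, `neg`, `GRID` and `check_axioms` are introduced here for illustration only.

```python
from fractions import Fraction
from itertools import product


def oplus(x, y):
    """Lukasiewicz sum on [0, 1]: x (+) y = min(1, x + y)."""
    return min(Fraction(1), x + y)


def otimes(x, y):
    """Lukasiewicz product on [0, 1]: x (x) y = max(0, x + y - 1)."""
    return max(Fraction(0), x + y - 1)


def neg(x):
    """Unary operation on [0, 1]: x* = 1 - x."""
    return 1 - x


# Illustrative sample of points; exact rationals avoid rounding issues.
GRID = [Fraction(k, 8) for k in range(9)]


def check_axioms(points):
    """Check conditions (i)-(ix) on every triple of sample points."""
    assert neg(Fraction(0)) == 1                                   # (vi)
    for a, b, c in product(points, repeat=3):
        assert oplus(a, b) == oplus(b, a)                          # (i)
        assert oplus(a, oplus(b, c)) == oplus(oplus(a, b), c)      # (ii)
        assert oplus(a, Fraction(0)) == a                          # (iii)
        assert oplus(a, Fraction(1)) == 1                          # (iv)
        assert neg(neg(a)) == a                                    # (v)
        assert oplus(a, neg(a)) == 1                               # (vii)
        assert (oplus(neg(oplus(neg(a), b)), b)
                == oplus(neg(oplus(a, neg(b))), a))                # (viii)
        assert otimes(a, b) == neg(oplus(neg(a), neg(b)))          # (ix)
    return True


check_axioms(GRID)
```

Both sides of condition (viii) evaluate to $\max (a, b)$ in this example, which is why the identity holds for every pair of points.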

Let $(G, +, \leq)$ be an Abelian l-group, 0 be a neutral element of $(G, +)$ and $u \in G$, $u > 0$. On the interval $[0, u] = \{h \in G;\ 0 \leq h \leq u\}$ we define the following operations: $a^{*} = u - a$, $a \oplus b = (a + b) \land u$, $a \otimes b = (a + b - u) \lor 0$. Then the system $M_{G} = ([0, u], \oplus, \otimes, *, 0, u)$ becomes an MV algebra. The Mundici theorem states that to any MV algebra $M$ there exists an Abelian l-group $G$ with a strong unit $u$ (i.e., to every $a \in G$ there exists $n \in N$ with the property $a \leq n u$) such that $M \cong M_{G}$.

In this contribution we shall consider MV algebras with a product. We recall that the definition of product MV algebra is based on Mundici’s categorical representation of MV algebra by an Abelian l-group, i.e., the sum in the following definition of product MV algebra, and subsequently in the next text, means the sum in the Abelian l-group associated to the given MV algebra. Similarly, the element u is a strong unit of this group. More details can be found in [45,46].

Definition 1. A product MV algebra is a couple $(M, \cdot)$, where $M$ is an MV algebra and $\cdot$ is a commutative and associative operation on $M$ satisfying the following conditions:

(i): for any $a \in M$ , $u \cdot a = a$ ;
(ii): if $a, b, c \in M$ , $a + b \leq u$ , then $c \cdot a + c \cdot b \leq u,$ and $c \cdot (a + b) = c \cdot a + c \cdot b$ .

In addition, we shall consider a finitely additive state defined on a product MV algebra.

Definition 2 [30]. Let $(M, \cdot)$ be a product MV algebra. A map $m : M \to [0, 1]$ is said to be a state if the following properties are satisfied:

(i): $m (u) = 1;$
(ii): if $a = \sum_{i = 1}^{n} a_{i},$ then $m (a) = \sum_{i = 1}^{n} m (a_{i})$ .

In product MV algebras a suitable entropy theory has been provided in [35,49,50]. In the following we present the main idea and some results of this theory which will be used in the contribution.

Definition 3. By a partition in a product MV algebra $(M, \cdot)$ we mean a finite collection $A = \{a_{1}, ..., a_{n}\} \subset M$ such that $\sum_{i = 1}^{n} a_{i} = u$.

Let $m$ be a state on a product MV algebra $(M, \cdot)$. In the set of all partitions of $(M, \cdot)$ the relation $\prec$ is defined in the following way: Let $A = \{a_{1}, ..., a_{n}\}$ and $B = \{b_{1}, ..., b_{k}\}$ be two partitions of $(M, \cdot)$. We say that $B$ is a refinement of $A$ (with respect to $m$), and write $A \prec B$, if there exists a partition $I (1), I (2), ..., I (n)$ of the set $\{1, 2, ..., k\}$ such that $m (a_{i}) = \sum_{j \in I (i)} m (b_{j})$, for every $i = 1, 2, ..., n$.

Given two partitions $A = \{a_{1}, ..., a_{n}\}$ and $B = \{b_{1}, ..., b_{k}\}$ of $(M, \cdot)$, their join $A \lor B$ is defined as the system $A \lor B = \{a_{i} \cdot b_{j};\ i = 1, ..., n,\ j = 1, ..., k\}$, if $A \neq B$, and $A \lor A = A$. Since

$$\sum_{i = 1}^{n} \sum_{j = 1}^{k} a_{i} \cdot b_{j} = \left(\sum_{i = 1}^{n} a_{i}\right) \cdot \left(\sum_{j = 1}^{k} b_{j}\right) = u \cdot \left(\sum_{j = 1}^{k} b_{j}\right) = \sum_{j = 1}^{k} b_{j} = u,$$

the system $A \lor B$ is a partition of $(M, \cdot)$, too. If $A_{1}, A_{2}, ..., A_{n}$ are partitions in a product MV algebra $(M, \cdot)$, then we put $\lor_{i = 1}^{n} A_{i} = A_{1} \lor A_{2} \lor ... \lor A_{n}$.

Let $A = \{a_{1}, ..., a_{n}\}$ be a partition in a product MV algebra $(M, \cdot)$ and $m$ be a state on $(M, \cdot)$. Then the entropy of $A$ with respect to $m$ is defined by Shannon’s formula:

$$H_{m} (A) = - \sum_{i = 1}^{n} F (m (a_{i})), \tag{1}$$

where:

$$F : [0, \infty) \to \mathbb{R}, \quad F (x) = \begin{cases} x \log x, & \text{if } x > 0; \\ 0, & \text{if } x = 0 . \end{cases}$$

If $A = \{a_{1}, ..., a_{n}\}$ and $B = \{b_{1}, ..., b_{k}\}$ are two partitions of $(M, \cdot)$, then the conditional entropy of $A$ given $B$ is defined by:

$$H_{m} (A / B) = - \sum_{i = 1}^{n} \sum_{j = 1}^{k} m (a_{i} \cdot b_{j}) \cdot \log \frac{m (a_{i} \cdot b_{j})}{m (b_{j})} .$$

In accordance with the classical theory the log is to the base 2 and the entropy is expressed in bits. Note that we use the convention (based on continuity arguments) that $x \log \frac{x}{0} = \infty$ if $x > 0$, and $0 \log \frac{0}{x} = 0$ if $x \geq 0$.

Example 1. Consider any product MV algebra $(M, \cdot)$ and a state $m$ defined on $M$. Then the set $E = \{u\}$ is a partition of $(M, \cdot)$ such that $E \prec A$ for any partition $A$ of $(M, \cdot)$. Its entropy is $H_{m} (E) = 0$. Let $a \in M$ be such that $m (a) = p$, where $p \in (0, 1)$. Evidently, $m (u - a) = 1 - p$, and the set $A = \{a, u - a\}$ is a partition of $(M, \cdot)$. Its entropy is $H_{m} (A) = - p \log p - (1 - p) \log (1 - p)$. In particular, if $p = \frac{1}{2}$, then $H_{m} (A) = \log 2 = 1$ bit.

The entropy and the conditional entropy of partitions in a product MV algebra satisfy all properties analogous to properties of Shannon’s entropy of measurable partitions in the classical case; the proofs can be found in [35,49,50]. We present those that will be further exploited. Let $A, B, C$ be any partitions of a product MV algebra $(M, \cdot)$. Then the following properties hold:

(E1): $H_{m} (A) \geq 0$;
(E2): $B \prec C$ implies $H_{m} (A / C) \leq H_{m} (A / B)$;
(E3): $H_{m} (A \lor B / C) = H_{m} (A / C) + H_{m} (B / C \lor A)$;
(E4): $H_{m} (A \lor B) = H_{m} (A) + H_{m} (B / A)$;
(E5): $H_{m} (A \lor B / C) \leq H_{m} (A / C) + H_{m} (B / C)$.
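The entropy and the conditional entropy depend only on the state values $m (a_{i} \cdot b_{j})$, so they can be evaluated from a table of these values. The sketch below is an illustration under stated assumptions, not the authors' code: the table `joint` holds hypothetical state values, the value $m (b_{j})$ is obtained as the column sum $\sum_{i} m (a_{i} \cdot b_{j})$ (which follows from the additivity of the state), and the function names are introduced here only for the example. It checks property (E4) numerically.

```python
import math


def F(x):
    """F(x) = x log2 x for x > 0 and F(0) = 0, as in formula (1)."""
    return x * math.log2(x) if x > 0 else 0.0


def entropy(values):
    """Entropy of a partition from the state values of its elements."""
    return -sum(F(p) for p in values)


def conditional_entropy(joint):
    """H_m(A / B) from joint[i][j] = m(a_i . b_j).

    m(b_j) is taken to be the j-th column sum of the table.
    Terms with m(a_i . b_j) = 0 contribute 0 by the stated convention.
    """
    cols = [sum(row[j] for row in joint) for j in range(len(joint[0]))]
    h = 0.0
    for row in joint:
        for j, p in enumerate(row):
            if p > 0:
                h -= p * math.log2(p / cols[j])
    return h


# Hypothetical state values m(a_i . b_j), chosen only for illustration.
joint = [[0.1, 0.3],
         [0.4, 0.2]]

transposed = [list(col) for col in zip(*joint)]        # entries m(b_j . a_i)
h_a = entropy([sum(row) for row in joint])             # H_m(A)
h_join = entropy([p for row in joint for p in row])    # H_m(A v B)
h_b_given_a = conditional_entropy(transposed)          # H_m(B / A)

# Property (E4): H_m(A v B) = H_m(A) + H_m(B / A)
print(h_join, h_a + h_b_given_a)
```

The two printed numbers should agree up to floating-point rounding; the same functions can be reused to test (E5) on any table of state values.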

3. Mutual Information of Partitions in Product MV Algebras

In this section the results concerning the entropy in product MV algebras are used in developing information theory for the case of product MV algebras. We define the notions of mutual information and conditional mutual information of partitions in a product MV algebra and prove basic properties of the proposed measures.

Definition 4. Let $A, B$ be partitions in a given product MV algebra $(M, \cdot)$. Then we define the mutual information of $A$ and $B$ by the formula:

$$I_{m} (A, B) = H_{m} (A) - H_{m} (A / B) . \tag{2}$$

Remark 1. As a simple consequence of (E4) we get:

$$I_{m} (A, B) = H_{m} (A) + H_{m} (B) - H_{m} (A \lor B) . \tag{3}$$

Subsequently we see that $I_{m} (A, A) = H_{m} (A)$, i.e., the entropy of partitions in product MV algebras can be considered as a special case of their mutual information. Moreover, we see that $I_{m} (A, B) = I_{m} (B, A)$, and hence we can also write:

$$I_{m} (A, B) = H_{m} (B) - H_{m} (B / A) . \tag{4}$$

Example 2. Consider the measurable space $(Ω, S)$, where $Ω$ is the unit interval $[0, 1]$, and $S$ is the $σ$-algebra of all Borel subsets of $[0, 1]$. Let F be the family of all $S$-measurable functions $f : Ω \to [0, 1]$ (i.e., $[α, β] \subset [0, 1] \Rightarrow f^{- 1} ([α, β]) \in S$). F is the so-called full tribe of fuzzy sets [30] (see also [14,29]); it is also closed under the natural product of fuzzy sets and represents a special case of product MV algebras. On the product MV algebra F we define a state $m$ by the formula $m (f) = \int_{0}^{1} f (x) d x$, for every $f \in$ F. Evidently, the sets $A = \{x, 1 - x\}$ and $B = \{x^{2}, 1 - x^{2}\}$ are two partitions of F with the m-state values $\frac{1}{2}, \frac{1}{2}$ and $\frac{1}{3}, \frac{2}{3}$ of the corresponding elements of $A$ and $B$, respectively. By simple calculations we obtain the entropy $H_{m} (A) = \log 2 = 1$ bit, and the entropy $H_{m} (B) = - \frac{1}{3} \cdot \log \frac{1}{3} - \frac{2}{3} \cdot \log \frac{2}{3} = 0.9183$ bit. The join of $A$ and $B$ is the system $A \lor B = \{x^{3}, x^{2} (1 - x), x (1 - x^{2}), (1 - x) (1 - x^{2})\}$ with the m-state values $\frac{1}{4}, \frac{1}{12}, \frac{1}{4}, \frac{5}{12}$ of the corresponding elements. The entropy of $A \lor B$ is the number:

$$H_{m} (A \lor B) = - \frac{1}{4} \cdot \log \frac{1}{4} - \frac{1}{12} \cdot \log \frac{1}{12} - \frac{1}{4} \cdot \log \frac{1}{4} - \frac{5}{12} \cdot \log \frac{5}{12} = 1.8250 \text{ bit} .$$

Since:

$$\begin{aligned} H_{m} (A / B) &= - m (x^{3}) \cdot \log \frac{m (x^{3})}{m (x^{2})} - m (x (1 - x^{2})) \cdot \log \frac{m (x (1 - x^{2}))}{m (1 - x^{2})} \\ &\quad - m ((1 - x) x^{2}) \cdot \log \frac{m ((1 - x) x^{2})}{m (x^{2})} - m ((1 - x) (1 - x^{2})) \cdot \log \frac{m ((1 - x) (1 - x^{2}))}{m (1 - x^{2})} \\ &= - \frac{1}{4} \cdot \log \frac{\frac{1}{4}}{\frac{1}{3}} - \frac{1}{4} \cdot \log \frac{\frac{1}{4}}{\frac{2}{3}} - \frac{1}{12} \cdot \log \frac{\frac{1}{12}}{\frac{1}{3}} - \frac{5}{12} \cdot \log \frac{\frac{5}{12}}{\frac{2}{3}} = 0.9067 \text{ bit}, \end{aligned}$$

the mutual information of $A$ and $B$ is the number:

$$I_{m} (A, B) = H_{m} (A) - H_{m} (A / B) = 1 - 0.9067 = 0.0933 \text{ bit} .$$

We can also see that Equation (3) is fulfilled:

$$H_{m} (A) + H_{m} (B) - H_{m} (A \lor B) = 1 + 0.9183 - 1.8250 = 0.0933 \text{ bit} .$$
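The numbers in Example 2 can be reproduced with exact arithmetic. In the sketch below (an illustration, not the authors' code) the elements of the partitions are polynomials stored as coefficient lists, the product of fuzzy sets is polynomial multiplication, and the state $m (f) = \int_{0}^{1} f (x) d x$ is evaluated term by term. The helper names `poly_mul`, `state` and `entropy` are assumptions made for this example.

```python
from fractions import Fraction
import math


def poly_mul(p, q):
    """Product of two polynomials given as coefficient lists [c0, c1, ...]."""
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            out[i + j] += pi * qj
    return out


def state(p):
    """m(f) = integral of f over [0, 1]; x^k integrates to 1 / (k + 1)."""
    return sum(Fraction(c) / (k + 1) for k, c in enumerate(p))


def entropy(values):
    """Shannon entropy in bits of a list of state values."""
    return -sum(float(v) * math.log2(float(v)) for v in values if v > 0)


A = [[0, 1], [1, -1]]          # x, 1 - x
B = [[0, 0, 1], [1, 0, -1]]    # x^2, 1 - x^2

# joint[i][j] = m(a_i . b_j), the state values of the join A v B
joint = [[state(poly_mul(a, b)) for b in B] for a in A]

h_a = entropy([state(a) for a in A])
h_b = entropy([state(b) for b in B])
h_join = entropy([v for row in joint for v in row])
mutual = h_a + h_b - h_join    # Equation (3)

print(joint)
print(round(h_a, 4), round(h_b, 4), round(h_join, 4), round(mutual, 4))
```

The table `joint` contains the fractions 1/4, 1/4, 1/12 and 5/12, matching the m-state values listed in the example up to the order of the elements.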

In the following we will use the assertions of Propositions 1 and 2.

Proposition 1. If $A = \{a_{1}, ..., a_{n}\}$ and $B = \{b_{1}, ..., b_{k}\}$ are two partitions of $(M, \cdot)$, then we have:

(i): $m (a_{i}) = \sum_{j = 1}^{k} m (a_{i} \cdot b_{j}),$ for $i = 1, 2, ..., n$ ;
(ii): $m (b_{j}) = \sum_{i = 1}^{n} m (a_{i} \cdot b_{j}),$ for $j = 1, 2, ..., k .$

Proof. By the assumption $\sum_{j = 1}^{k} b_{j} = u$; therefore, according to Definitions 1 and 2, we get:

$$m (a_{i}) = m (u \cdot a_{i}) = m \left(\left(\sum_{j = 1}^{k} b_{j}\right) \cdot a_{i}\right) = m \left(\sum_{j = 1}^{k} (b_{j} \cdot a_{i})\right) = \sum_{j = 1}^{k} m (a_{i} \cdot b_{j}), \quad \text{for } i = 1, 2, ..., n .$$

The equality (ii) can be obtained in the same way. □

The following proposition implies that, for all partitions $A, B$ of $(M, \cdot)$, the set $A \lor B$ is a common refinement of $A$ and $B$.

Proposition 2. $A \prec A \lor B$, for all partitions $A, B$ of $(M, \cdot)$.

Proof. Assume that $A = \{a_{1}, ..., a_{n}\}$ and $B = \{b_{1}, ..., b_{k}\}$. Since the set $A \lor B$ is indexed by $\{(i, j);\ i = 1, ..., n,\ j = 1, 2, ..., k\}$, we put $I (i) = \{(i, 1), ..., (i, k)\}$, $i = 1, 2, ..., n$. In view of Proposition 1, we have:

$$m (a_{i}) = \sum_{j = 1}^{k} m (a_{i} \cdot b_{j}) = \sum_{(l, j) \in I (i)} m (a_{l} \cdot b_{j}), \quad \text{for } i = 1, 2, ..., n .$$

This means that $A \prec A \lor B$. □

Theorem 1. For any partitions $A, B$ and $C$ in a product MV algebra $(M, \cdot)$, we have:

$$I_{m} (A \lor B, C) \geq I_{m} (A, C) .$$

Proof. By Equation (2) and the properties (E3) and (E4), we get:

$$\begin{aligned} I_{m} (A \lor B, C) &= H_{m} (A \lor B) - H_{m} (A \lor B / C) \\ &= H_{m} (A) + H_{m} (B / A) - H_{m} (A / C) - H_{m} (B / C \lor A) \\ &= I_{m} (A, C) + H_{m} (B / A) - H_{m} (B / C \lor A) . \end{aligned}$$

According to Proposition 2, $A \prec C \lor A$, and therefore by (E2) $H_{m} (B / A) \geq H_{m} (B / C \lor A)$. This yields the inequality:

$$I_{m} (A \lor B, C) \geq I_{m} (A, C) .$$

□

Proposition 3. If $A = \{a_{1}, ..., a_{n}\}$ and $B = \{b_{1}, ..., b_{k}\}$ are two partitions of $(M, \cdot)$, then:

$$I_{m} (A, B) = \sum_{i = 1}^{n} \sum_{j = 1}^{k} m (a_{i} \cdot b_{j}) \cdot \log \frac{m (a_{i} \cdot b_{j})}{m (a_{i}) \cdot m (b_{j})} . \tag{5}$$

Proof. Since by Proposition 1 it holds:

$$m (a_{i}) = \sum_{j = 1}^{k} m (a_{i} \cdot b_{j}), \quad \text{for } i = 1, 2, ..., n,$$

we get:

$$\begin{aligned} I_{m} (A, B) &= - \sum_{i = 1}^{n} m (a_{i}) \cdot \log m (a_{i}) + \sum_{i = 1}^{n} \sum_{j = 1}^{k} m (a_{i} \cdot b_{j}) \cdot \log \frac{m (a_{i} \cdot b_{j})}{m (b_{j})} \\ &= - \sum_{i = 1}^{n} \sum_{j = 1}^{k} m (a_{i} \cdot b_{j}) \cdot \log m (a_{i}) + \sum_{i = 1}^{n} \sum_{j = 1}^{k} m (a_{i} \cdot b_{j}) \cdot \log \frac{m (a_{i} \cdot b_{j})}{m (b_{j})} \\ &= \sum_{i = 1}^{n} \sum_{j = 1}^{k} m (a_{i} \cdot b_{j}) \cdot \left[\log \frac{m (a_{i} \cdot b_{j})}{m (b_{j})} - \log m (a_{i})\right] \\ &= \sum_{i = 1}^{n} \sum_{j = 1}^{k} m (a_{i} \cdot b_{j}) \cdot \log \frac{m (a_{i} \cdot b_{j})}{m (a_{i}) \cdot m (b_{j})} . \end{aligned}$$

□

Definition 5. Two partitions $A = \{a_{1}, ..., a_{n}\}$ and $B = \{b_{1}, ..., b_{k}\}$ of $(M, \cdot)$ are called statistically independent, if $m (a_{i} \cdot b_{j}) = m (a_{i}) \cdot m (b_{j})$, for $i = 1, 2, ..., n$, $j = 1, 2, ..., k$.

Theorem 2. Let $A, B$ be partitions in a product MV algebra $(M, \cdot)$. Then $I_{m} (A, B) \geq 0$ with the equality if and only if the partitions $A, B$ are statistically independent.

Proof. Assume that $A = \{a_{1}, ..., a_{n}\}$ and $B = \{b_{1}, ..., b_{k}\}$. We use the inequality $\log x \leq x - 1$, which is valid for the natural logarithm and all real numbers $x > 0$, with the equality if and only if $x = 1$. In this proof $\log$ may be taken to be the natural logarithm, since a change of base multiplies $I_{m} (A, B)$ by a positive constant and affects neither its sign nor the condition for equality. We get:

$$m (a_{i} \cdot b_{j}) \cdot \log \frac{m (a_{i}) \cdot m (b_{j})}{m (a_{i} \cdot b_{j})} \leq m (a_{i} \cdot b_{j}) \cdot \left[\frac{m (a_{i}) \cdot m (b_{j})}{m (a_{i} \cdot b_{j})} - 1\right] = m (a_{i}) \cdot m (b_{j}) - m (a_{i} \cdot b_{j}) .$$

The equality holds if and only if $\frac{m (a_{i}) \cdot m (b_{j})}{m (a_{i} \cdot b_{j})} = 1$, i.e., when $m (a_{i} \cdot b_{j}) = m (a_{i}) \cdot m (b_{j})$. Therefore using Equation (5) and Proposition 1 we have:

$$\begin{aligned} - I_{m} (A, B) &= \sum_{i = 1}^{n} \sum_{j = 1}^{k} m (a_{i} \cdot b_{j}) \cdot \log \frac{m (a_{i}) \cdot m (b_{j})}{m (a_{i} \cdot b_{j})} \leq \sum_{i = 1}^{n} \sum_{j = 1}^{k} [m (a_{i}) \cdot m (b_{j}) - m (a_{i} \cdot b_{j})] \\ &= \sum_{i = 1}^{n} \sum_{j = 1}^{k} m (a_{i}) \cdot m (b_{j}) - \sum_{i = 1}^{n} \sum_{j = 1}^{k} m (a_{i} \cdot b_{j}) = \sum_{i = 1}^{n} m (a_{i}) \cdot \sum_{j = 1}^{k} m (b_{j}) - \sum_{i = 1}^{n} m (a_{i}) \\ &= m \left(\sum_{i = 1}^{n} a_{i}\right) \cdot m \left(\sum_{j = 1}^{k} b_{j}\right) - m \left(\sum_{i = 1}^{n} a_{i}\right) = m (u) \cdot m (u) - m (u) = 1 \cdot 1 - 1 = 0 . \end{aligned}$$

It follows that $I_{m} (A, B) \geq 0$ with the equality if and only if $m (a_{i} \cdot b_{j}) = m (a_{i}) \cdot m (b_{j})$, for $i = 1, 2, ..., n$, $j = 1, 2, ..., k$, i.e., when the partitions $A, B$ are statistically independent. □
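Formula (5) and Theorem 2 lend themselves to a quick numerical check. The sketch below is an illustration, not the authors' code: it evaluates (5) from a table of state values $m (a_{i} \cdot b_{j})$, taking $m (a_{i})$ and $m (b_{j})$ as row and column sums (Proposition 1). The function names and the second, independent table are assumptions made for the example; the first table reuses the state values from Example 2.

```python
import math


def mutual_information(joint):
    """I_m(A, B) by formula (5), with joint[i][j] = m(a_i . b_j)."""
    rows = [sum(r) for r in joint]                                   # m(a_i)
    cols = [sum(r[j] for r in joint) for j in range(len(joint[0]))]  # m(b_j)
    total = 0.0
    for i, r in enumerate(joint):
        for j, p in enumerate(r):
            if p > 0:  # terms with m(a_i . b_j) = 0 contribute 0
                total += p * math.log2(p / (rows[i] * cols[j]))
    return total


def product_table(pa, pb):
    """Table with m(a_i . b_j) = m(a_i) * m(b_j), i.e. independent partitions."""
    return [[x * y for y in pb] for x in pa]


dependent = [[1 / 4, 1 / 4], [1 / 12, 5 / 12]]          # Example 2
independent = product_table([0.5, 0.5], [0.25, 0.75])   # hypothetical values

print(mutual_information(dependent))     # strictly positive
print(mutual_information(independent))   # zero
```

For the independent table every ratio inside the logarithm equals 1, so each term of (5) vanishes, which is the equality case of Theorem 2.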

Theorem 2 implies the subadditivity and additivity of entropy in a product MV algebra, as shown by the following theorem.

Theorem 3 (Subadditivity and additivity of entropy). For arbitrary partitions $A, B$ in a product MV algebra $(M, \cdot)$, it holds that $H_{m} (A \lor B) \leq H_{m} (A) + H_{m} (B)$ with the equality if and only if the partitions $A, B$ are statistically independent.

Proof. It follows by Equation (3) and Theorem 2. □

Theorem 4. For arbitrary partitions $A, B$ in a product MV algebra $(M, \cdot)$, it holds that $H_{m} (A / B) \leq H_{m} (A)$ with the equality if and only if the partitions $A, B$ are statistically independent.

Proof. The assertion is a simple consequence of Equation (2) and Theorem 2. □

Definition 6. Let $A, B$ and $C$ be partitions in a given product MV algebra $(M, \cdot)$. Then the conditional mutual information of $A$ and $B$ given $C$ is defined by the formula

$$I_{m} (A, B / C) = H_{m} (A / C) - H_{m} (A / B \lor C) . \tag{6}$$

Remark 2. Notice that the conditional mutual information is nonnegative: since $C \prec B \lor C$ by Proposition 2, the property (E2) gives $H_{m} (A / C) \geq H_{m} (A / B \lor C)$.

Theorem 5. For any partitions $A, B$ and $C$ in a product MV algebra $(M, \cdot)$, we have:

$$I_{m} (A, B \lor C) = I_{m} (A, C) + I_{m} (A, B / C) = I_{m} (A, B) + I_{m} (A, C / B) .$$

Proof. Let us calculate:

$$\begin{aligned} I_{m} (A, C) + I_{m} (A, B / C) &= H_{m} (A) - H_{m} (A / C) + H_{m} (A / C) - H_{m} (A / B \lor C) \\ &= H_{m} (A) - H_{m} (A / B \lor C) = I_{m} (A, B \lor C) . \end{aligned}$$

In a similar way we obtain also the second equality. □

Theorem 6 (Chain rules). Let $A_{1}, A_{2}, ..., A_{n}$ and $C$ be partitions in a product MV algebra $(M, \cdot)$. Then, for $n = 2, 3, ...$, the following equalities hold:

(i): $H_{m} (A_{1} \lor A_{2} \lor ... \lor A_{n}) = H_{m} (A_{1}) + \sum_{i = 2}^{n} H_{m} (A_{i} / \lor_{k = 1}^{i - 1} A_{k});$
(ii): $H_{m} (\lor_{i = 1}^{n} A_{i} / C) = H_{m} (A_{1} / C) + \sum_{i = 2}^{n} H_{m} (A_{i} / (\lor_{k = 1}^{i - 1} A_{k}) \lor C);$
(iii): $I_{m} (\lor_{i = 1}^{n} A_{i}, C) = I_{m} (A_{1}, C) + \sum_{i = 2}^{n} I_{m} (A_{i}, C / \lor_{k = 1}^{i - 1} A_{k}) .$

Proof. (i) By the property (E4) we have:

$$H_{m} (A_{1} \lor A_{2}) = H_{m} (A_{1}) + H_{m} (A_{2} / A_{1}) .$$

By (E3) and (E4) we get:

$$\begin{aligned} H_{m} (A_{1} \lor A_{2} \lor A_{3}) &= H_{m} (A_{1}) + H_{m} (A_{2} \lor A_{3} / A_{1}) = H_{m} (A_{1}) + H_{m} (A_{2} / A_{1}) + H_{m} (A_{3} / A_{2} \lor A_{1}) \\ &= H_{m} (A_{1}) + \sum_{i = 2}^{3} H_{m} (A_{i} / \lor_{k = 1}^{i - 1} A_{k}) . \end{aligned}$$

Now let us suppose that the result is true for a given $n \in N$. Then:

$$\begin{aligned} H_{m} (A_{1} \lor A_{2} \lor ... \lor A_{n} \lor A_{n + 1}) &= H_{m} (A_{1} \lor A_{2} \lor ... \lor A_{n}) + H_{m} (A_{n + 1} / A_{1} \lor A_{2} \lor ... \lor A_{n}) \\ &= H_{m} (A_{1}) + \sum_{i = 2}^{n} H_{m} (A_{i} / \lor_{k = 1}^{i - 1} A_{k}) + H_{m} (A_{n + 1} / A_{1} \lor A_{2} \lor ... \lor A_{n}) \\ &= H_{m} (A_{1}) + \sum_{i = 2}^{n + 1} H_{m} (A_{i} / \lor_{k = 1}^{i - 1} A_{k}) . \end{aligned}$$

(ii) For $n = 2$, using (E3) we obtain:

$$H_{m} (A_{1} \lor A_{2} / C) = H_{m} (A_{1} / C) + H_{m} (A_{2} / A_{1} \lor C) .$$

Suppose that the result is true for a given $n \in N$. Then:

$$\begin{aligned} H_{m} (A_{1} \lor A_{2} \lor ... \lor A_{n} \lor A_{n + 1} / C) &= H_{m} (\lor_{i = 1}^{n} A_{i} / C) + H_{m} (A_{n + 1} / A_{1} \lor ... \lor A_{n} \lor C) \\ &= H_{m} (A_{1} / C) + \sum_{i = 2}^{n} H_{m} (A_{i} / (\lor_{k = 1}^{i - 1} A_{k}) \lor C) + H_{m} (A_{n + 1} / (\lor_{k = 1}^{n} A_{k}) \lor C) \\ &= H_{m} (A_{1} / C) + \sum_{i = 2}^{n + 1} H_{m} (A_{i} / (\lor_{k = 1}^{i - 1} A_{k}) \lor C) . \end{aligned}$$

(iii) By Equation (2), the equalities (i) and (ii) of this theorem, and Equation (6), we obtain:

$$\begin{aligned} I_{m} (\lor_{i = 1}^{n} A_{i}, C) &= H_{m} (\lor_{i = 1}^{n} A_{i}) - H_{m} (\lor_{i = 1}^{n} A_{i} / C) \\ &= H_{m} (A_{1}) + \sum_{i = 2}^{n} H_{m} (A_{i} / \lor_{k = 1}^{i - 1} A_{k}) - H_{m} (A_{1} / C) - \sum_{i = 2}^{n} H_{m} (A_{i} / (\lor_{k = 1}^{i - 1} A_{k}) \lor C) \\ &= I_{m} (A_{1}, C) + \sum_{i = 2}^{n} \left(H_{m} (A_{i} / \lor_{k = 1}^{i - 1} A_{k}) - H_{m} (A_{i} / (\lor_{k = 1}^{i - 1} A_{k}) \lor C)\right) \\ &= I_{m} (A_{1}, C) + \sum_{i = 2}^{n} I_{m} (A_{i}, C / \lor_{k = 1}^{i - 1} A_{k}) . \end{aligned}$$

□

Definition 7. Let $A, B$ and $C$ be partitions in a product MV algebra $(M, \cdot)$. We say that $A$ is conditionally independent of $C$ given $B$ (and write $A \to B \to C$) if $I_{m} (A, C / B) = 0$.

Theorem 7. For partitions $A, B$ and $C$ in a product MV algebra $(M, \cdot)$, $A \to B \to C$ if and only if $C \to B \to A$.

Proof. Let $A \to B \to C$. Then $0 = I_{m} (A, C / B) = H_{m} (A / B) - H_{m} (A / B \lor C)$. Therefore by (E4) we get:

$$H_{m} (A / B) = H_{m} (A / B \lor C) = H_{m} (A \lor B \lor C) - H_{m} (B \lor C) .$$

Let us calculate:

$$\begin{aligned} I_{m} (C, A / B) &= H_{m} (C / B) - H_{m} (C / A \lor B) \\ &= H_{m} (C \lor B) - H_{m} (B) - H_{m} (A \lor B \lor C) + H_{m} (A \lor B) \\ &= H_{m} (A \lor B) - H_{m} (B) - H_{m} (A / B) = H_{m} (A / B) - H_{m} (A / B) = 0 . \end{aligned}$$

The result means that $C \to B \to A$. The reverse implication follows by interchanging the roles of $A$ and $C$. □

Remark 3. According to the above theorem, we may say that $A$ and $C$ are conditionally independent given $B$ and write $A \leftrightarrow B \leftrightarrow C$ instead of $A \to B \to C$.

Theorem 8. Let $A, B$ and $C$ be partitions in a given product MV algebra $(M, \cdot)$ such that $A \to B \to C$. Then we have:

(i): $I_{m} (A \lor B, C) = I_{m} (B, C);$
(ii): $I_{m} (B, C) = I_{m} (C, A) + I_{m} (C, B / A);$
(iii): $I_{m} (A, B / C) \leq I_{m} (A, B);$
(iv): $I_{m} (A, B) \geq I_{m} (A, C)$ (data processing inequality).

Proof. (i) By the assumption we have $I_{m} (A, C / B) = 0$. Hence using the chain rule for the mutual information (Theorem 6 (iii)), we obtain:

$$I_{m} (A \lor B, C) = I_{m} (B \lor A, C) = I_{m} (B, C) + I_{m} (A, C / B) = I_{m} (B, C) .$$

(ii) By the equality (i) of this theorem and Theorem 5, we can write:

$$I_{m} (B, C) = I_{m} (A \lor B, C) = I_{m} (C, B \lor A) = I_{m} (C, A) + I_{m} (C, B / A) .$$

(iii) Since $I_{m} (C, A) \geq 0$ by Theorem 2, the equality (ii) implies the inequality $I_{m} (B, C) \geq I_{m} (C, B / A)$. Interchanging $A$ and $C$ (which is possible by Theorem 7) we obtain:

$$I_{m} (A, B) \geq I_{m} (A, B / C) .$$

(iv) By the assumption we have $I_{m} (A, C / B) = 0$. Therefore by Theorem 5 we get:

$$I_{m} (A, B \lor C) = I_{m} (A, B) + I_{m} (A, C / B) = I_{m} (A, B) .$$

Thus by the same theorem we can write:

$$I_{m} (A, B) = I_{m} (A, B \lor C) = I_{m} (A, C) + I_{m} (A, B / C) .$$

Since $I_{m} (A, B / C) \geq 0$, it holds that $I_{m} (A, B) \geq I_{m} (A, C)$. □
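The data processing inequality can be illustrated numerically. The sketch below is not from the paper: it assumes a hypothetical table of state values $m (a_{i} \cdot b_{j} \cdot c_{l})$ on the triple join, built so that $I_{m} (A, C / B) = 0$, and evaluates all quantities through entropies of joins using Equation (3) and property (E4). The names `table`, `marginal`, `H` and the numerical values are assumptions made for the example.

```python
import math
from itertools import product


def entropy(values):
    """Shannon entropy in bits of a collection of state values."""
    return -sum(p * math.log2(p) for p in values if p > 0)


def marginal(table, keep):
    """State values of the join of the partitions whose axes are in `keep`."""
    out = {}
    for index, p in table.items():
        key = tuple(index[axis] for axis in keep)
        out[key] = out.get(key, 0.0) + p
    return out


def H(table, keep):
    return entropy(marginal(table, keep).values())


def mutual_information(table, x, y):
    # Equation (3): I(X, Y) = H(X) + H(Y) - H(X v Y)
    return H(table, x) + H(table, y) - H(table, x + y)


def conditional_mutual_information(table, x, y, z):
    # Equation (6) with (E4): I(X, Y / Z) = H(X v Z) - H(Z) - H(X v Y v Z) + H(Y v Z)
    return H(table, x + z) - H(table, z) - H(table, x + y + z) + H(table, y + z)


A, B, C = (0,), (1,), (2,)        # axes of the table

# Hypothetical values m(a_i . b_j . c_l) = p_a[i] * t_ab[i][j] * t_bc[j][l]
p_a = [0.3, 0.7]
t_ab = [[0.9, 0.1], [0.2, 0.8]]
t_bc = [[0.6, 0.4], [0.3, 0.7]]
table = {(i, j, l): p_a[i] * t_ab[i][j] * t_bc[j][l]
         for i, j, l in product(range(2), repeat=3)}

print(conditional_mutual_information(table, A, C, B))   # close to 0: A -> B -> C
print(mutual_information(table, A, B), mutual_information(table, A, C))
```

With this construction the first printed value vanishes up to rounding, and the second line shows $I_{m} (A, B)$ to be at least $I_{m} (A, C)$, in line with Theorem 8 (iv).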

In the following, we study the concavity of the entropy $H_{m} (A)$ and of the mutual information $I_{m} (A, B)$ as functions of $m$. We recall, for the convenience of the reader, the definitions of convex and concave functions.

A real-valued function $f$ is said to be convex over an interval $[a, b]$ if for every $x_{1}, x_{2} \in [a, b]$ and for any real number $α \in [0, 1]$:

$$f (α x_{1} + (1 - α) x_{2}) \leq α f (x_{1}) + (1 - α) f (x_{2}) .$$

A real-valued function $f$ is said to be concave over an interval $[a, b]$ if for every $x_{1}, x_{2} \in [a, b]$ and for any real number $α \in [0, 1]$:

$$f (α x_{1} + (1 - α) x_{2}) \geq α f (x_{1}) + (1 - α) f (x_{2}) .$$

In the following, we will use the symbol $F$ to denote the family of all states on a given product MV algebra $(M, \cdot)$. It is easy to prove the following proposition:

Proposition 4. If $m_{1}, m_{2} \in F$, then, for every real number $α \in [0, 1]$, $α m_{1} + (1 - α) m_{2} \in F$.

Theorem 9 (Concavity of entropy). Let $A$ be a partition in a given product MV algebra $(M, \cdot)$. Then, for every $m_{1}, m_{2} \in F$, and every real number $α \in [0, 1]$, the following inequality holds:

$$α H_{m_{1}} (A) + (1 - α) H_{m_{2}} (A) \leq H_{α m_{1} + (1 - α) m_{2}} (A) .$$

Proof. Assume that $A = \{a_{1}, ..., a_{n}\}$. Since the function $F (x) = x \log x$ is convex, we get:

$$\begin{aligned} α H_{m_{1}} (A) + (1 - α) H_{m_{2}} (A) &= - α \sum_{i = 1}^{n} F (m_{1} (a_{i})) - (1 - α) \sum_{i = 1}^{n} F (m_{2} (a_{i})) \\ &= - \sum_{i = 1}^{n} \left(α F (m_{1} (a_{i})) + (1 - α) F (m_{2} (a_{i}))\right) \\ &\leq - \sum_{i = 1}^{n} F (α m_{1} (a_{i}) + (1 - α) m_{2} (a_{i})) \\ &= - \sum_{i = 1}^{n} F ((α m_{1} + (1 - α) m_{2}) (a_{i})) = H_{α m_{1} + (1 - α) m_{2}} (A), \end{aligned}$$

which proves that the entropy $m \mapsto H_{m} (A)$ is a concave function on the family $F$. □
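The inequality of Theorem 9 can be observed on concrete numbers. In the sketch below (an illustration only) two states are represented by hypothetical lists of their values $m_{1} (a_{i})$ and $m_{2} (a_{i})$ on a fixed three-element partition, and the gap between the two sides of the inequality is evaluated for several values of $α$. The names `mix`, `concavity_gap`, `v1` and `v2` are introduced for this example.

```python
import math


def entropy(values):
    """H_m(A) in bits from the state values m(a_i)."""
    return -sum(p * math.log2(p) for p in values if p > 0)


def mix(v1, v2, alpha):
    """Values of the state alpha*m1 + (1 - alpha)*m2 on the partition."""
    return [alpha * x + (1 - alpha) * y for x, y in zip(v1, v2)]


def concavity_gap(v1, v2, alpha):
    """Entropy of the mixed state minus the mixture of the two entropies."""
    mixed = entropy(mix(v1, v2, alpha))
    return mixed - (alpha * entropy(v1) + (1 - alpha) * entropy(v2))


# Hypothetical state values of m1 and m2 on a partition {a_1, a_2, a_3}.
v1 = [0.7, 0.2, 0.1]
v2 = [0.1, 0.3, 0.6]

for step in range(11):
    alpha = step / 10
    print(alpha, concavity_gap(v1, v2, alpha))   # never negative
```

The gap is zero at $α = 0$ and $α = 1$ and positive in between for these values, as expected for a concave function.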

In the proof of concavity of mutual information $I_{m} (A, B)$ we will need the assertion of Proposition 5. First, we introduce the following notation. Let $m$ be a state on a product MV algebra $(M, \cdot)$, and let $a, b \in M$. Then we denote:

$$\dot{m} (a / b) = \begin{cases} \frac{m (a \cdot b)}{m (b)}, & \text{if } m (b) > 0; \\ 0, & \text{if } m (b) = 0 . \end{cases}$$

Proposition 5. If $A = \{a_{1}, ..., a_{n}\}$ and $B = \{b_{1}, ..., b_{k}\}$ are two partitions of $(M, \cdot)$, then

$$H_{m} (B / A) = - \sum_{i = 1}^{n} \sum_{j = 1}^{k} m (a_{i}) \cdot F (\dot{m} (b_{j} / a_{i})) . \tag{7}$$

Proof. Let us calculate:

$$\begin{aligned} - \sum_{i = 1}^{n} \sum_{j = 1}^{k} m (a_{i}) \cdot F (\dot{m} (b_{j} / a_{i})) &= - \sum_{i : m (a_{i}) > 0} \sum_{j = 1}^{k} m (a_{i}) \cdot F \left(\frac{m (b_{j} \cdot a_{i})}{m (a_{i})}\right) \\ &= - \sum_{i : m (a_{i}) > 0} \sum_{j = 1}^{k} m (a_{i}) \cdot \frac{m (b_{j} \cdot a_{i})}{m (a_{i})} \cdot \log \frac{m (b_{j} \cdot a_{i})}{m (a_{i})} \\ &= - \sum_{i : m (a_{i}) > 0} \sum_{j = 1}^{k} m (b_{j} \cdot a_{i}) \cdot \log \frac{m (b_{j} \cdot a_{i})}{m (a_{i})} \\ &= - \sum_{i = 1}^{n} \sum_{j = 1}^{k} m (b_{j} \cdot a_{i}) \cdot \log \frac{m (b_{j} \cdot a_{i})}{m (a_{i})} = H_{m} (B / A) . \end{aligned}$$

In the last step, we used the implication $m (a_{i}) = 0 \Rightarrow m (b_{j} \cdot a_{i}) = 0$, which follows from the equality $m (a_{i}) = \sum_{j = 1}^{k} m (a_{i} \cdot b_{j})$ shown in Proposition 1. □

Remark 4. By Proposition 5 there exist $c_{i j} = - F (\dot{m} (b_{j} / a_{i})) \geq 0$ such that

$$H_{m} (B / A) = \sum_{i = 1}^{n} \sum_{j = 1}^{k} c_{i j} \cdot m (a_{i}) .$$

Definition 8. Let $A = \{a_{1}, ..., a_{n}\}$, $B = \{b_{1}, ..., b_{k}\}$ be two partitions of $(M, \cdot)$. Put

$$K = \left\{m \in F;\ H_{m} (B / A) = \sum_{i = 1}^{n} \sum_{j = 1}^{k} c_{i j} \cdot m (a_{i})\right\} .$$

Theorem 10

(Concavity of mutual information). The mutual information

m \mapsto I_{m} (A, B)

is a concave function on the family

K

.

Proof.

By Equation (4) we can write:

I_{m} (A, B) = H_{m} (B) - H_{m} (B / A) .

In view of Theorem 9 and Remark 4, the function

m \mapsto I_{m} (A, B)

is the sum of two concave functions on the family

K

:

m \mapsto H_{m} (B),

and

m \mapsto - H_{m} (B / A) .

Since the sum of two concave functions is itself concave, we have the statement.

□

4. Kullback–Leibler Divergence in Product MV Algebras

In this section we introduce the concept of Kullback–Leibler divergence in product MV algebras. We prove basic properties of this measure, in particular Gibbs' inequality. Finally, using the notion of conditional Kullback–Leibler divergence, we establish a chain rule for Kullback–Leibler divergence with respect to additive states defined on a given product MV algebra. In the proofs we use the following known log-sum inequality: for non-negative real numbers

x_{1}, x_{2}, ..., x_{n},

y_{1}, y_{2}, ..., y_{n}

, it holds:

\sum_{i = 1}^{n} x_{i} \cdot \log \frac{x_{i}}{y_{i}} \geq (\sum_{i = 1}^{n} x_{i}) \cdot \log \frac{\sum_{i = 1}^{n} x_{i}}{\sum_{i = 1}^{n} y_{i}}

(8)

with the equality if and only if

\frac{x_{i}}{y_{i}}

is constant. Recall that we use the convention that

x \log \frac{x}{0} = \infty

if

x > 0,

and

0 \log \frac{0}{x} = 0

if

x \geq 0

.
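The log-sum inequality (8) and the two conventions can be tried out on concrete numbers. The sketch below is an illustration only: the helper `xlog_ratio` is a hypothetical name, the logarithm is assumed to be to base 2, and the sample numbers are arbitrary.

```python
import math

def xlog_ratio(x, y):
    """x * log2(x / y) with 0 * log(0 / y) = 0 and x * log(x / 0) = infinity for x > 0."""
    if x == 0:
        return 0.0
    if y == 0:
        return math.inf
    return x * math.log2(x / y)

def log_sum_sides(xs, ys):
    """Return (left-hand side, right-hand side) of the log-sum inequality (8)."""
    lhs = sum(xlog_ratio(x, y) for x, y in zip(xs, ys))
    rhs = xlog_ratio(sum(xs), sum(ys))
    return lhs, rhs

# Arbitrary non-negative numbers: the inequality is strict here.
lhs, rhs = log_sum_sides([0.2, 0.5, 1.3], [0.4, 0.1, 1.0])
print(lhs >= rhs)

# Constant ratio x_i / y_i = 1/2: both sides coincide.
eq_lhs, eq_rhs = log_sum_sides([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
print(math.isclose(eq_lhs, eq_rhs))
```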

Definition 9.

Let

m_{1},

m_{2}

be states defined on a given product MV algebra

(M, \cdot),

and

A = {a_{1}, ..., a_{n}}

be a partition of

(M, \cdot) .

Then we define the Kullback–Leibler divergence

D_{A} (m_{1} ‖ m_{2})

by:

D_{A} (m_{1} ‖ m_{2}) = \sum_{i = 1}^{n} m_{1} (a_{i}) \cdot \log \frac{m_{1} (a_{i})}{m_{2} (a_{i})} .

Remark 5.

It is obvious that

D_{A} (m ‖ m) = 0

. The Kullback–Leibler divergence is not a metric in the true sense, since it is not symmetric, i.e., the equality

D_{A} (m_{1} ‖ m_{2}) = D_{A} (m_{2} ‖ m_{1})

is not necessarily true (as shown in the following example), and it does not satisfy the triangle inequality.

Example 3.

Consider any product MV algebra

(M, \cdot)

and two states

m_{1},

m_{2}

defined on M. Let

a \in M

such that

m_{1} (a) = p,

and

m_{2} (a) = q,

where

p, q \in (0, 1) .

Evidently,

m_{1} (u - a) = 1 - p,

m_{2} (u - a) = 1 - q,

and the set

A = {a, u - a}

is a partition of

(M, \cdot) .

Let us calculate:

D_{A} (m_{1} ‖ m_{2}) = p \cdot \log \frac{p}{q} + (1 - p) \cdot \log \frac{1 - p}{1 - q}, and D_{A} (m_{2} ‖ m_{1}) = q \cdot \log \frac{q}{p} + (1 - q) \cdot \log \frac{1 - q}{1 - p} .

If

p = q,

then

D_{A} (m_{1} ‖ m_{2}) = D_{A} (m_{2} ‖ m_{1}) = 0

. If

p = \frac{1}{2},

q = \frac{1}{4},

then we have:

D_{A} (m_{1} ‖ m_{2}) = \frac{1}{2} \cdot \log \frac{\frac{1}{2}}{\frac{1}{4}} + \frac{1}{2} \cdot \log \frac{\frac{1}{2}}{\frac{3}{4}} = \frac{1}{2} \cdot \log 2 + \frac{1}{2} \cdot \log \frac{2}{3} = 0.207519 \text{ bit},

and:

D_{A} (m_{2} ‖ m_{1}) = \frac{1}{4} \cdot \log \frac{\frac{1}{4}}{\frac{1}{2}} + \frac{3}{4} \cdot \log \frac{\frac{3}{4}}{\frac{1}{2}} = \frac{1}{4} \cdot \log \frac{1}{2} + \frac{3}{4} \cdot \log \frac{3}{2} = 0.188722 \text{ bit} .

The result means that

D_{A} (m_{1} ‖ m_{2}) \neq D_{A} (m_{2} ‖ m_{1}),

in general.
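The two values of Example 3 can be reproduced with a few lines of code. This is a minimal sketch, assuming that the states enter only through their values on the elements of the partition and that the logarithm is to base 2; the function name `kl_divergence` is hypothetical.

```python
import math

def kl_divergence(v1, v2):
    """D_A(m1 || m2) in bits, from the values m1(a_i) and m2(a_i)."""
    total = 0.0
    for p, q in zip(v1, v2):
        if p == 0:
            continue          # convention 0 * log(0 / q) = 0
        if q == 0:
            return math.inf   # convention p * log(p / 0) = infinity for p > 0
        total += p * math.log2(p / q)
    return total

p, q = 0.5, 0.25
d12 = kl_divergence([p, 1 - p], [q, 1 - q])   # partition {a, u - a}
d21 = kl_divergence([q, 1 - q], [p, 1 - p])
print(round(d12, 6), round(d21, 6))  # 0.207519 0.188722
```

Swapping the arguments changes the result, which is the asymmetry noted in Remark 5.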

Theorem 11.

Let

m_{1},

m_{2}

be states defined on a product MV algebra

(M, \cdot),

and

A = {a_{1}, ..., a_{n}}

be a partition of

(M, \cdot) .

Then

D_{A} (m_{1} ‖ m_{2}) \geq 0

(Gibbs' inequality), with equality if and only if

m_{1} (a_{i}) = m_{2} (a_{i}),

for

i = 1, 2, ..., n .

Proof.

If we put

x_{i} = m_{1} (a_{i})

and

y_{i} = m_{2} (a_{i}),

for

i = 1, 2, ..., n,

then

x_{1}, x_{2}, ..., x_{n},

y_{1}, y_{2}, ..., y_{n}

are non-negative real numbers such that

\sum_{i = 1}^{n} x_{i} = 1

and

\sum_{i = 1}^{n} y_{i} = 1

. Indeed,

\sum_{i = 1}^{n} x_{i} = \sum_{i = 1}^{n} m_{1} (a_{i}) = m_{1} (\sum_{i = 1}^{n} a_{i}) = m_{1} (u) = 1;

analogously we obtain

\sum_{i = 1}^{n} y_{i} = 1

. Thus, using the log-sum inequality we can write:

D_{A} (m_{1} ‖ m_{2}) = \sum_{i = 1}^{n} m_{1} (a_{i}) \cdot \log \frac{m_{1} (a_{i})}{m_{2} (a_{i})} = \sum_{i = 1}^{n} x_{i} \cdot \log \frac{x_{i}}{y_{i}} \geq (\sum_{i = 1}^{n} x_{i}) \cdot \log \frac{\sum_{i = 1}^{n} x_{i}}{\sum_{i = 1}^{n} y_{i}} = 1 \cdot \log \frac{1}{1} = 0

with the equality if and only if

\frac{m_{1} (a_{i})}{m_{2} (a_{i})} = α

for

i = 1, 2, ..., n,

where

α

is constant. Taking the sum for all

i = 1, 2, ..., n,

we obtain

\sum_{i = 1}^{n} m_{1} (a_{i}) = α \sum_{i = 1}^{n} m_{2} (a_{i}),

which implies that

α = 1

. This means that

D_{A} (m_{1} ‖ m_{2}) = 0

if and only if

m_{1} (a_{i}) = m_{2} (a_{i}),

for

i = 1, 2, ..., n .

□

Theorem 12.

Let

A

be a partition of

(M, \cdot)

and

ν

be a state on

(M, \cdot)

uniform over

A

. Then, for the entropy of

A

with respect to any state

m

from

F,

we have:

H_{m} (A) = \log \operatorname{card} A - D_{A} (m ‖ ν) .

Proof.

Assume that

A = {a_{1}, ..., a_{n}} .

Then

ν (a_{i}) = \frac{1}{n},

for

i = 1, 2, ..., n .

Let us calculate:

D_{A} (m ‖ ν) = \sum_{i = 1}^{n} m (a_{i}) \cdot \log \frac{m (a_{i})}{ν (a_{i})} = \sum_{i = 1}^{n} m (a_{i}) \cdot \log \frac{m (a_{i})}{\frac{1}{n}} = \sum_{i = 1}^{n} m (a_{i}) \cdot (\log m (a_{i}) - \log n^{- 1}) = \sum_{i = 1}^{n} m (a_{i}) \cdot \log m (a_{i}) + \log n = \log \operatorname{card} A - H_{m} (A) .

□

As a consequence we obtain the following property of entropy of partitions in product MV algebras.

Corollary 1.

For any partition

A

of

(M, \cdot),

it holds

H_{m} (A) \leq \log \operatorname{card} A,

with the equality if and only if m is uniform over the partition

A

.

Proof.

Assume that

A = {a_{1}, ..., a_{n}}

and consider a state

ν

on

(M, \cdot)

uniform over

A,

i.e., it holds

ν (a_{i}) = \frac{1}{n},

for

i = 1, 2, ..., n .

Then, by Theorem 12 we get:

D_{A} (m ‖ ν) = \log \operatorname{card} A - H_{m} (A) .

Since by Theorem 11

D_{A} (m ‖ ν) \geq 0,

the following inequality holds:

H_{m} (A) \leq \log \operatorname{card} A .

Further, by Theorem 11

D_{A} (m ‖ ν) = 0

if and only if

m (a_{i}) = ν (a_{i}),

for

i = 1, 2, ..., n .

This means that the equality

H_{m} (A) = \log \operatorname{card} A

holds if and only if

m (a_{i}) = \frac{1}{n},

for

i = 1, 2, ..., n .

□
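The identity of Theorem 12 and the bound of Corollary 1 can be illustrated numerically. The sketch below assumes base-2 logarithms and represents each state by its values on the partition; the names `entropy` and `kl_divergence` and the sample values are hypothetical.

```python
import math

def entropy(values):
    """H_m(A) in bits from the state values m(a_i); 0 * log 0 = 0."""
    return -sum(p * math.log2(p) for p in values if p > 0)

def kl_divergence(v1, v2):
    """D_A(m1 || m2) in bits with the usual conventions for zero values."""
    total = 0.0
    for p, q in zip(v1, v2):
        if p == 0:
            continue
        if q == 0:
            return math.inf
        total += p * math.log2(p / q)
    return total

m_values = [0.7, 0.2, 0.1]      # illustrative state values, card A = 3
n = len(m_values)
nu_values = [1 / n] * n         # state uniform over the partition

lhs = entropy(m_values)
rhs = math.log2(n) - kl_divergence(m_values, nu_values)
print(math.isclose(lhs, rhs))                          # Theorem 12
print(lhs <= math.log2(n))                             # Corollary 1
print(math.isclose(entropy(nu_values), math.log2(n)))  # equality for the uniform state
```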

Theorem 13

(Convexity of K–L divergence). Let

A

be a partition in a product MV algebra

(M, \cdot) .

The K–L divergence

D_{A} (m_{1} ‖ m_{2})

is convex in the pair

(m_{1}, m_{2}),

i.e., if

(m_{1}^{'}, m_{2}^{'}),

(m_{1}^{″}, m_{2}^{″})

are pairs of states from

F

, then, for any real number

α \in [0, 1],

the following inequality holds:

D_{A} (α m_{1}^{'} + (1 - α) m_{1}^{″} ‖ α m_{2}^{'} + (1 - α) m_{2}^{″}) \leq α D_{A} (m_{1}^{'} ‖ m_{2}^{'}) + (1 - α) D_{A} (m_{1}^{″} ‖ m_{2}^{″}) .

(9)

Proof.

Assume that

A = {a_{1}, ..., a_{n}}

and fix

i \in {1, 2, ..., n} .

Putting

x_{1} = α m_{1}^{'} (a_{i}),

x_{2} = (1 - α) m_{1}^{″} (a_{i}),

y_{1} = α m_{2}^{'} (a_{i}),

y_{2} = (1 - α) m_{2}^{″} (a_{i})

in the log-sum inequality, we obtain:

(α m_{1}^{'} (a_{i}) + (1 - α) m_{1}^{″} (a_{i})) \cdot \log \frac{α m_{1}^{'} (a_{i}) + (1 - α) m_{1}^{″} (a_{i})}{α m_{2}^{'} (a_{i}) + (1 - α) m_{2}^{″} (a_{i})} \leq α m_{1}^{'} (a_{i}) \cdot \log \frac{α m_{1}^{'} (a_{i})}{α m_{2}^{'} (a_{i})} + (1 - α) m_{1}^{″} (a_{i}) \cdot \log \frac{(1 - α) m_{1}^{″} (a_{i})}{(1 - α) m_{2}^{″} (a_{i})} .

Summing these inequalities over

i = 1, 2, ..., n,

we obtain the inequality (9).

□

The result of Theorem 13 is illustrated in the following example.

Example 4.

Consider the product MV algebra F from Example 2 and the real functions

F_{1}, F_{2}, F_{3}, F_{4}

defined by

F_{1} (x) = x, F_{2} (x) = x^{2}, F_{3} (x) = x^{3}, F_{4} (x) = x^{4},

for every

x \in ℜ .

On the product MV algebra F we define the states

m_{1}, m_{2}, m_{3}, m_{4}

by the following formulas:

m_{1} (f) = \int_{0}^{1} f (x) d F_{1} (x) = \int_{0}^{1} f (x) d x, f \in F;

m_{2} (f) = \int_{0}^{1} f (x) d F_{2} (x) = \int_{0}^{1} f (x) 2 x d x, f \in F;

m_{3} (f) = \int_{0}^{1} f (x) d F_{3} (x) = \int_{0}^{1} f (x) 3 x^{2} d x, f \in F;

m_{4} (f) = \int_{0}^{1} f (x) d F_{4} (x) = \int_{0}^{1} f (x) 4 x^{3} d x, f \in F .

In addition, we will consider the partition

A = {x, 1 - x}

of F. It is easy to calculate that it has the

m_{1}

-state values

\frac{1}{2}, \frac{1}{2};

the

m_{2}

-state values

\frac{2}{3}, \frac{1}{3};

the

m_{3}

-state values

\frac{3}{4}, \frac{1}{4};

and the

m_{4}

-state values

\frac{4}{5}, \frac{1}{5}

of the corresponding elements. In the previous theorem we put

α = 0.2

. We will show that:

D_{A} (0.2 m_{1} + 0.8 m_{3} ‖ 0.2 m_{2} + 0.8 m_{4}) \leq 0.2 D_{A} (m_{1} ‖ m_{2}) + 0.8 D_{A} (m_{3} ‖ m_{4}) .

(10)

Let us calculate:

D_{A} (m_{1} ‖ m_{2}) = \frac{1}{2} \cdot \log \frac{\frac{1}{2}}{\frac{2}{3}} + \frac{1}{2} \cdot \log \frac{\frac{1}{2}}{\frac{1}{3}} = 0.085 \text{ bit};

D_{A} (m_{3} ‖ m_{4}) = \frac{3}{4} \cdot \log \frac{\frac{3}{4}}{\frac{4}{5}} + \frac{1}{4} \cdot \log \frac{\frac{1}{4}}{\frac{1}{5}} = 0.01065 \text{ bit};

D_{A} (0.2 m_{1} + 0.8 m_{3} ‖ 0.2 m_{2} + 0.8 m_{4}) = 0.7 \cdot \log \frac{0.7}{0.7733} + 0.3 \cdot \log \frac{0.3}{0.2267} = 0.020682 \text{ bit} .

Since

0.020682 \leq 0.2 \cdot 0.085 + 0.8 \cdot 0.01065 = 0.02552,

the inequality (10) holds.
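The computations of Example 4 can be retraced in code. The sketch below relies on assumptions that are not part of the paper: the elements of the partition are encoded as lists of polynomial coefficients, the state m_k is evaluated through the elementary integral of x^j · k x^(k-1) over [0, 1], which equals k / (j + k), and logarithms are to base 2. All function names are hypothetical.

```python
from fractions import Fraction
import math

def state(k, coeffs):
    """m_k(f) for the polynomial f(x) = sum_j coeffs[j] * x**j.

    Uses integral_0^1 x**j * k * x**(k - 1) dx = k / (j + k).
    """
    return sum(Fraction(c) * Fraction(k, j + k) for j, c in enumerate(coeffs))

def kl_divergence(v1, v2):
    """D_A(m1 || m2) in bits from the state values of the partition elements."""
    total = 0.0
    for p, q in zip(v1, v2):
        if p == 0:
            continue           # convention 0 * log(0 / q) = 0
        if q == 0:
            return math.inf    # convention p * log(p / 0) = infinity
        total += float(p) * math.log2(float(p) / float(q))
    return total

def mix(alpha, v1, v2):
    """State values of alpha*m' + (1 - alpha)*m'' on the same partition."""
    return [alpha * p + (1 - alpha) * q for p, q in zip(v1, v2)]

A = [[0, 1], [1, -1]]   # coefficient lists of x and 1 - x
values = {k: [state(k, f) for f in A] for k in (1, 2, 3, 4)}

alpha = Fraction(1, 5)
lhs = kl_divergence(mix(alpha, values[1], values[3]),
                    mix(alpha, values[2], values[4]))
rhs = 0.2 * kl_divergence(values[1], values[2]) + 0.8 * kl_divergence(values[3], values[4])
print(lhs <= rhs)       # inequality (10)
```

With exact state values the left-hand side comes out close to 0.0207; the small difference from 0.020682 appears to stem from the rounded values 0.7733 and 0.2267 used in the example.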

In the final part, we define the conditional Kullback–Leibler divergence and, using this notion, we establish the chain rule for Kullback–Leibler divergence.

Definition 10.

Let

m_{1},

m_{2}

be states on a given product MV algebra

(M, \cdot)

and

A = {a_{1}, ..., a_{n}},

B = {b_{1}, ..., b_{k}}

be two partitions of

(M, \cdot) .

Then we define the conditional Kullback–Leibler divergence

D_{B / A} (m_{1} ‖ m_{2})

by:

D_{B / A} (m_{1} ‖ m_{2}) = \sum_{i = 1}^{n} m_{1} (a_{i}) \sum_{j = 1}^{k} {\dot{m}}_{1} (b_{j} / a_{i}) \cdot \log \frac{{\dot{m}}_{1} (b_{j} / a_{i})}{{\dot{m}}_{2} (b_{j} / a_{i})} .

Theorem 14

(Chain rule for K–L divergence). Let

m_{1},

m_{2}

be states on a given product MV algebra

(M, \cdot) .

If

A

,

B

are two partitions of

(M, \cdot),

then:

D_{A \lor B} (m_{1} ‖ m_{2}) = D_{A} (m_{1} ‖ m_{2}) + D_{B / A} (m_{1} ‖ m_{2}) .

(11)

Proof.

Assume that

A = {a_{1}, ..., a_{n}}

and

B = {b_{1}, ..., b_{k}} .

We will consider the following two cases: (i) there exists

i_{0} \in {1, ..., n}

such that

m_{2} (a_{i_{0}}) = 0;

(ii)

m_{2} (a_{i}) > 0

for

i = 1, 2, ..., n .

In the first case, both sides of Equation (11) are equal to

\infty,

thus the equality holds. Let us now assume that

m_{2} (a_{i}) > 0,

for

i = 1, 2, ..., n .

We get:

\begin{aligned}
& D_{A} (m_{1} ‖ m_{2}) + D_{B / A} (m_{1} ‖ m_{2}) \\
& = \sum_{i = 1}^{n} m_{1} (a_{i}) \cdot \log \frac{m_{1} (a_{i})}{m_{2} (a_{i})} + \sum_{i = 1}^{n} m_{1} (a_{i}) \sum_{j = 1}^{k} {\dot{m}}_{1} (b_{j} / a_{i}) \cdot \log \frac{{\dot{m}}_{1} (b_{j} / a_{i})}{{\dot{m}}_{2} (b_{j} / a_{i})} \\
& = \sum_{i : m_{1} (a_{i}) > 0} \sum_{j = 1}^{k} m_{1} (a_{i} \cdot b_{j}) \cdot \log \frac{m_{1} (a_{i})}{m_{2} (a_{i})} + \sum_{i : m_{1} (a_{i}) > 0} \sum_{j = 1}^{k} m_{1} (a_{i} \cdot b_{j}) \cdot \log \frac{{\dot{m}}_{1} (b_{j} / a_{i})}{{\dot{m}}_{2} (b_{j} / a_{i})} \\
& = \sum_{i : m_{1} (a_{i}) > 0} \sum_{j = 1}^{k} m_{1} (a_{i} \cdot b_{j}) \cdot (\log \frac{m_{1} (a_{i})}{m_{2} (a_{i})} + \log \frac{{\dot{m}}_{1} (b_{j} / a_{i})}{{\dot{m}}_{2} (b_{j} / a_{i})}) \\
& = \sum_{i : m_{1} (a_{i}) > 0} \sum_{j = 1}^{k} m_{1} (a_{i} \cdot b_{j}) \cdot \log \frac{m_{1} (a_{i}) {\dot{m}}_{1} (b_{j} / a_{i})}{m_{2} (a_{i}) {\dot{m}}_{2} (b_{j} / a_{i})} \\
& = \sum_{i : m_{1} (a_{i}) > 0} \sum_{j = 1}^{k} m_{1} (a_{i} \cdot b_{j}) \cdot \log \frac{m_{1} (a_{i} \cdot b_{j})}{m_{2} (a_{i} \cdot b_{j})} \\
& = \sum_{i = 1}^{n} \sum_{j = 1}^{k} m_{1} (a_{i} \cdot b_{j}) \cdot \log \frac{m_{1} (a_{i} \cdot b_{j})}{m_{2} (a_{i} \cdot b_{j})} = D_{A \lor B} (m_{1} ‖ m_{2}) .
\end{aligned}

In the last step, analogously as in the proof of Proposition 5, we used the implication

m_{1} (a_{i}) = 0 \Rightarrow m_{1} (a_{i} \cdot b_{j}) = 0

which follows from the equality

m_{1} (a_{i}) = \sum_{j = 1}^{k} m_{1} (a_{i} \cdot b_{j})

shown in Proposition 1.

□

In the following example, we illustrate the result of the previous theorem.

Example 5.

Consider the partitions

A = {x, 1 - x},

B = {x^{2}, 1 - x^{2}}

of the product MV algebra F from Example 2. In addition, let

m_{1}, m_{2}

be the states on F, defined in Example 4. Then the partitions

A

and

B

have the

m_{1}

-state values

\frac{1}{2}, \frac{1}{2}

and

\frac{1}{3}, \frac{2}{3}

of the corresponding elements, respectively, and the

m_{2}

-state values

\frac{2}{3}, \frac{1}{3}

and

\frac{1}{2}, \frac{1}{2}

of the corresponding elements, respectively. The join of partitions

A

and

B

is the system

A \lor B = {x^{3}, x^{2} (1 - x), x (1 - x^{2}), (1 - x) (1 - x^{2})};

it has the

m_{1}

-state values

\frac{1}{4}, \frac{1}{12}, \frac{1}{4}, \frac{5}{12},

and the

m_{2}

-state values

\frac{2}{5}, \frac{1}{10}, \frac{4}{15}, \frac{7}{30}

of the corresponding elements. By simple calculations we obtain:

D_{A} (m_{1} ‖ m_{2}) = 0.085 \text{ bit}, \quad D_{A \lor B} (m_{1} ‖ m_{2}) = 0.134 \text{ bit}, \quad D_{B / A} (m_{1} ‖ m_{2}) = 0.049 \text{ bit} .

It is possible to verify that

D_{A \lor B} (m_{1} ‖ m_{2}) = D_{A} (m_{1} ‖ m_{2}) + D_{B / A} (m_{1} ‖ m_{2})

.
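The verification mentioned in Example 5 can be carried out as follows. This is a sketch under assumptions that are not taken from the paper: the elements of A and B are encoded as lists of polynomial coefficients, the join is formed by multiplying these polynomials, m_k is evaluated via the integral of x^j · k x^(k-1) over [0, 1], which equals k / (j + k), the values m(a_i) are obtained as row sums (Proposition 1), and logarithms are to base 2. All names are hypothetical.

```python
from fractions import Fraction
import math

def poly_mul(f, g):
    """Coefficient list of the product of two polynomials."""
    out = [0] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            out[i + j] += a * b
    return out

def state(k, coeffs):
    """m_k(f) = integral_0^1 f(x) k x**(k - 1) dx for f(x) = sum_j coeffs[j] x**j."""
    return sum(Fraction(c) * Fraction(k, j + k) for j, c in enumerate(coeffs))

def xlog_ratio(x, y):
    """x * log2(x / y) with 0 * log(0 / y) = 0 and x * log(x / 0) = infinity."""
    if x == 0:
        return 0.0
    if y == 0:
        return math.inf
    return float(x) * math.log2(float(x) / float(y))

def cond(value, total):
    """Conditional value m(b_j / a_i); 0 when m(a_i) = 0."""
    return value / total if total > 0 else Fraction(0)

A = [[0, 1], [1, -1]]          # x, 1 - x
B = [[0, 0, 1], [1, 0, -1]]    # x^2, 1 - x^2

def joint(k):
    """Table of m_k(a_i . b_j)."""
    return [[state(k, poly_mul(a, b)) for b in B] for a in A]

j1, j2 = joint(1), joint(2)
a1 = [sum(row) for row in j1]  # m_1(a_i)
a2 = [sum(row) for row in j2]  # m_2(a_i)

d_A = sum(xlog_ratio(p, q) for p, q in zip(a1, a2))
d_join = sum(xlog_ratio(p, q) for r1, r2 in zip(j1, j2) for p, q in zip(r1, r2))
d_cond = sum(
    float(a1[i]) * sum(xlog_ratio(cond(j1[i][j], a1[i]), cond(j2[i][j], a2[i]))
                       for j in range(len(B)))
    for i in range(len(A))
)
print(round(d_A, 3), round(d_join, 3), round(d_cond, 3))  # 0.085 0.134 0.049
print(math.isclose(d_join, d_A + d_cond))                 # chain rule (11)
```

Here the conditional divergence is computed directly from Definition 10 rather than as a difference, so the last line is a genuine check of the chain rule.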

5. Discussion

In this paper, we have extended the study of entropy in product MV algebras. The main aim of the paper was to introduce, using known results concerning the entropy in product MV algebras, the concepts of mutual information and Kullback–Leibler divergence for the case of product MV algebras and examine algebraic properties of the proposed measures. Our results have been presented in Section 3 and Section 4.

In Section 3 we have introduced the notions of mutual information and conditional mutual information of partitions of product MV algebras and proved some basic properties of the suggested measures. It was shown that the entropy of partitions of product MV algebras can be considered as a special case of their mutual information. Specifically, it was proved that subadditivity and additivity of entropy follow from the properties of mutual information (Theorem 3). Theorem 6 provides the chain rule for mutual information. In addition, the data processing inequality for conditionally independent partitions in product MV algebras was proved. Moreover, the concavity of mutual information has been studied.

In Section 4 the notion of Kullback–Leibler divergence in product MV algebras was introduced and the basic properties of this measure were shown. In particular, the convexity of Kullback–Leibler divergence with respect to additive states defined on a given product MV algebra was proved. Theorem 11 allows Kullback–Leibler divergence to be interpreted as a measure of how different two states on a common product MV algebra are (over the same partition). The relationship between K–L divergence and entropy is given in Theorem 12: the more a state

m \in F

diverges from the state

ν \in F

uniform over

A

(over the same partition

A

), the smaller the entropy

H_{m} (A)

is and vice versa. Finally, a conditional version of the Kullback–Leibler divergence in product MV algebras has been defined and the chain rule for Kullback–Leibler divergence with respect to additive states defined on a given product MV algebra has been established.

Notice that in [14] (see also [29,30]) the entropy on a full tribe F of fuzzy sets has been studied. The tribe F is also closed under the natural product of fuzzy sets and represents a special case of product MV algebras. Accordingly, the theory presented in this contribution can also be applied to the mentioned case of tribes of fuzzy sets.

In [51,52,53,54,55] a more general fuzzy theory, that of intuitionistic fuzzy sets (IF-sets for short), has been developed. While a fuzzy set is a mapping

μ_{A} : Ω \to [0, 1]

(where the considered fuzzy set is identified with its membership function

μ_{A}

), the Atanassov IF-set is a pair

A = (μ_{A}, ν_{A})

of functions

μ_{A}, ν_{A} : Ω \to [0, 1]

with

μ_{A} + ν_{A} \leq 1

. The function

μ_{A}

is interpreted as a membership function of IF-set

A,

and the function

ν_{A}

as a non-membership function of IF-set

A .

Evidently, any fuzzy set

μ_{A} : Ω \to [0, 1]

can be considered as an IF-set

A = (μ_{A}, 1 - μ_{A}) .

Any result holding for IF-sets is applicable also to fuzzy sets. Of course, the opposite implication is not true; the theory of intuitionistic fuzzy sets presents a non-trivial generalization of the fuzzy set theory. So IF-sets present possibilities for modeling a larger class of real situations. Note that some results about the entropy on IF-sets can be found e.g., in [56,57,58,59]. These results could be used in developing information theory for the case of IF-sets.

To make it possible to apply MV algebra results to families of IF-experiments as well, one can use the Mundici characterization of MV algebras. In the family of IF-sets it is natural to define the partial ordering relation

\leq

in the following way: if

A = (μ_{A}, ν_{A}),

and

B = (μ_{B}, ν_{B})

are two IF-sets, then

A \leq B

if and only if

μ_{A} \leq μ_{B},

and

ν_{A} \geq ν_{B} .

Namely, in the fuzzy case

μ_{A} \leq μ_{B}

implies

ν_{A} = 1 - μ_{A} \geq 1 - μ_{B} = ν_{B} .

Therefore we can consider the Abelian l-group

(ℜ^{2}, +, \leq)

putting

A + B = (μ_{A} + μ_{B}, 1 - (1 - ν_{A} + 1 - ν_{B})) = (μ_{A} + μ_{B}, ν_{A} + ν_{B} - 1)

with the zero element

0 = (0, 1) .

(In fact,

A + 0 = (μ_{A}, ν_{A}) + (0, 1) = (μ_{A}, ν_{A}) = A

.) The partial ordering

\leq

in the l-group

(ℜ^{2}, +, \leq)

is defined by the prescription

A \leq B

if and only if

μ_{A} \leq μ_{B},

and

ν_{A} \geq ν_{B} .

Then a suitable MV algebra is e.g., the system

M = {(μ_{A}, ν_{A}); (0, 1) \leq (μ_{A}, ν_{A}) \leq (1, 0)}

. Moreover, this MV algebra is a product MV algebra with the product defined by

A \cdot B = (μ_{A} \cdot μ_{B}, 1 - (1 - ν_{A}) \cdot (1 - ν_{B})) = (μ_{A} \cdot μ_{B}, ν_{A} + ν_{B} - ν_{A} \cdot ν_{B}) .
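The ordering, the group operation and the product described above can be tried out on concrete pairs. The sketch below is a simplification made only for illustration: an IF-set is represented by the pair of its values (μ, ν) at one fixed point of Ω, exact rational arithmetic is used, and the function names `leq`, `add`, `prod` and `in_M` are hypothetical.

```python
from fractions import Fraction as Fr

ZERO = (Fr(0), Fr(1))   # zero element (0, 1)
UNIT = (Fr(1), Fr(0))   # greatest element (1, 0)

def leq(a, b):
    """A <= B iff mu_A <= mu_B and nu_A >= nu_B."""
    return a[0] <= b[0] and a[1] >= b[1]

def add(a, b):
    """Group operation: (mu_A + mu_B, nu_A + nu_B - 1)."""
    return (a[0] + b[0], a[1] + b[1] - 1)

def prod(a, b):
    """Product: (mu_A * mu_B, nu_A + nu_B - nu_A * nu_B)."""
    return (a[0] * b[0], a[1] + b[1] - a[1] * b[1])

def in_M(a):
    """Membership in M = {A; (0, 1) <= A <= (1, 0)}."""
    return leq(ZERO, a) and leq(a, UNIT)

a = (Fr(1, 2), Fr(1, 3))   # illustrative pairs with mu + nu <= 1
b = (Fr(1, 4), Fr(1, 2))

print(add(a, ZERO) == a)   # (0, 1) is neutral for +
print(prod(a, UNIT) == a)  # (1, 0) is neutral for the product
print(in_M(prod(a, b)))    # the product stays inside M
```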

The presented MV algebra approach offers an elegant and practical way of obtaining new results in the intuitionistic fuzzy case as well. We note that this approach was used to construct the Kolmogorov-type entropy theory for IF systems in [58], drawing on entropy results for product MV-algebras published in [35,49,50]. In this way it is also possible to develop the theory of information and K–L divergence for IF-sets.

Acknowledgments

The authors thank the editor and the referees for their valuable comments and suggestions. The authors thank Constantine the Philosopher University in Nitra for covering the costs to publish in open access.

Author Contributions

Both authors contributed equally and significantly to the theoretical work as well as to the creation of illustrative examples. Dagmar Markechová wrote the paper. Both authors have read and approved the final manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Gray, R.M. Entropy and Information Theory; Springer: Berlin/Heidelberg, Germany, 2009.
2. Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423.
3. Kolmogorov, A.N. New Metric Invariant of Transitive Dynamical Systems and Automorphisms of Lebesgue Spaces. Dokl. Russ. Acad. Sci. 1958, 119, 861–864.
4. Sinai, Y.G. Ergodic Theory with Applications to Dynamical Systems and Statistical Mechanics; Springer: Berlin/Heidelberg, Germany, 1990.
5. Sinai, Y.G. On the Notion of Entropy of a Dynamical System. Dokl. Russ. Acad. Sci. 1959, 124, 768–771.
6. Markechová, D. The entropy of fuzzy dynamical systems and generators. Fuzzy Sets Syst. 1992, 48, 351–363.
7. Piasecki, K. Probability of fuzzy events defined as denumerable additive measure. Fuzzy Sets Syst. 1985, 17, 271–284.
8. Mesiar, R. The Bayes principle and the entropy on fuzzy probability spaces. Int. J. Gen. Syst. 1991, 20, 67–72.
9. Mesiar, R.; Rybárik, J. Entropy of Fuzzy Partitions—A General Model. Fuzzy Sets Syst. 1998, 99, 73–79.
10. Dumitrescu, D. Entropy of a fuzzy dynamical system. Fuzzy Sets Syst. 1995, 70, 45–57.
11. Rahimi, M.; Riazi, A. On local entropy of fuzzy partitions. Fuzzy Sets Syst. 2014, 234, 97–108.
12. Rahimi, M.; Assari, A.; Ramezani, F. A Local Approach to Yager Entropy of Dynamical Systems. Int. J. Fuzzy Syst. 2015, 1, 1–10.
13. Srivastava, P.; Khare, M.; Srivastava, Y.K. m-Equivalence, entropy and F-dynamical systems. Fuzzy Sets Syst. 2001, 121, 275–283.
14. Markechová, D.; Riečan, B. Entropy of Fuzzy Partitions and Entropy of Fuzzy Dynamical Systems. Entropy 2016, 18, 19.
15. Riečan, B. An entropy construction inspired by fuzzy sets. Soft Comput. 2003, 7, 486–488.
16. Riečan, B. On a type of entropy of dynamical systems. Tatra Mt. Math. Publ. 1992, 1, 135–140.
17. Riečan, B. On some modifications of the entropy of dynamical systems. Atti Semin. Mat. Fis. dell’Univ. Modena 1994, 42, 157–166.
18. Dubois, D.; Prade, M. A review of fuzzy set aggregation connectives. Inf. Sci. 1985, 36, 85–121.
19. Zadeh, L.A. Fuzzy Sets. Inf. Control 1965, 8, 338–358.
20. Markechová, D. Entropy and mutual information of experiments in the fuzzy case. Neural Netw. World 2013, 23, 339–349.
21. Kullback, S.; Leibler, R.A. On information and sufficiency. Ann. Math. Stat. 1951, 22, 79–86.
22. Kullback, S. Information Theory and Statistics; John Wiley & Sons: New York, NY, USA, 1959.
23. Schnakenberg, J. Network theory of microscopic and macroscopic behavior of master equation systems. Rev. Mod. Phys. 1976, 48, 571–585.
24. Risken, H. The Fokker-Planck Equation, Methods of Solution and Applications; Springer: New York, NY, USA, 1984.
25. Qian, H. Relative Entropy: Free Energy Associated with Equilibrium Fluctuations and Nonequilibrium Deviations. arXiv, 2001; arXiv:math-ph/0007010v2.
26. Ellis, R.S. Entropy, Large Deviations, and Statistical Mechanics; Springer: New York, NY, USA, 1985.
27. Markechová, D. Kullback–Leibler Divergence and Mutual Information of Experiments in the Fuzzy Case. Axioms 2017, 6, 5.
28. Chang, C.C. Algebraic analysis of many valued logics. Trans. Am. Math. Soc. 1958, 88, 467–490.
29. Riečan, B.; Mundici, D. Probability on MV-algebras. In Handbook of Measure Theory; Pap, E., Ed.; Elsevier: Amsterdam, The Netherlands, 2002; pp. 869–910.
30. Riečan, B.; Neubrunn, T. Integral, Measure and Ordering; Springer: Dordrecht, The Netherlands, 1997.
31. Dvurečenskij, A.; Pulmannová, S. New Trends in Quantum Structures; Springer: Dordrecht, The Netherlands, 2000.
32. Mundici, D. MV Algebras: A Short Tutorial. 2007. Available online: http://www.matematica.uns.edu.ar/IXCongresoMonteiro/Comunicaciones/Mundici_tutorial.pdf (accessed on 26 May 2007).
33. Mundici, D. Interpretation of AF C*-algebras in Lukasiewicz sentential calculus. J. Funct. Anal. 1986, 56, 889–894.
34. Di Nola, A.; Dvurečenskij, A.; Hyčko, M.; Manara, C. Entropy on Effect Algebras with the Riesz Decomposition Property II: MV-Algebras. Kybernetika 2005, 41, 161–176.
35. Riečan, B. Kolmogorov–Sinaj entropy on MV-algebras. Int. J. Theor. Phys. 2005, 44, 1041–1052.
36. Kôpka, F.; Chovanec, F. D-posets. Math. Slovaca 1994, 44, 21–34.
37. Kôpka, F. Quasiproduct on Boolean D-posets. Int. J. Theor. Phys. 2008, 47, 26–35.
38. Frič, R. On D-posets of fuzzy sets. Math. Slovaca 2014, 64, 545–554.
39. Foulis, D.J.; Bennet, M.K. Effect algebras and unsharp quantum logics. Found. Phys. 1994, 24, 1331–1352.
40. Frič, R.; Papčo, M. Probability domains. Int. J. Theor. Phys. 2010, 49, 3092–3100.
41. Skřivánek, V.; Frič, R. Generalized random events. Int. J. Theor. Phys. 2015, 54, 4386–4396.
42. Di Nola, A.; Dvurečenskij, A.; Hyčko, M.; Manara, C. Entropy on Effect Algebras with the Riesz Decomposition Property I: Basic Properties. Kybernetika 2005, 41, 143–160.
43. Giski, Z.E.; Ebrahimi, M. Entropy of Countable Partitions on effect Algebras with the Riesz Decomposition Property and Weak Sequential Effect Algebras. Cankaya Univ. J. Sci. Eng. 2015, 12, 20–39.
44. Ebrahimi, M.; Mosapour, B. The Concept of Entropy on D-posets. Cankaya Univ. J. Sci. Eng. 2013, 10, 137–151.
45. Riečan, B. On the product MV-algebras. Tatra Mt. Math. 1999, 16, 143–149.
46. Montagna, F. An algebraic approach to propositional fuzzy logic. J. Log. Lang. Inf. 2000, 9, 91–124.
47. Jakubík, J. On product MV algebras. Czech. Math J. 2002, 52, 797–810.
48. Di Nola, A.; Dvurečenskij, A. Product MV-algebras. Mult. Valued Log. 2001, 6, 193–215.
49. Petrovičová, J. On the entropy of partitions in product MV-algebras. Soft Comput. 2000, 4, 41–44.
50. Petrovičová, J. On the entropy of dynamical systems in product MV-algebras. Fuzzy Sets Syst. 2001, 121, 347–351.
51. Atanassov, K. Intuitionistic Fuzzy Sets: Theory and Applications; Physica Verlag: New York, NY, USA, 1999.
52. Atanassov, K. Intuitionistic fuzzy sets. Fuzzy Sets Syst. 1986, 20, 87–96.
53. Atanassov, K. More on intuitionistic fuzzy sets. Fuzzy Sets Syst. 1989, 33, 37–45.
54. Atanassov, K.; Riečan, B. On two operations over intuitionistic fuzzy sets. J. Appl. Math. Stat. Inform. 2006, 2, 145–148.
55. Riečan, B. Probability theory on IF events. In Algebraic and Proof-Theoretic Aspects of Non-Classical Logics; Papers in Honor of Daniele Mundici on the Occasion of his 60th Birthday; Lecture Notes in Computer Science; Springer: New York, NY, USA, 2007; pp. 290–308.
56. Farnoosh, R.; Rahimi, M.; Kumar, P. Removing noise in a digital image using a new entropy method based on intuitionistic fuzzy sets. In Proceedings of the International Conference on Fuzzy Systems, Vancouver, BC, Canada, 24–29 July 2016; Institute of Electrical and Electronics Engineers Inc.: New York, NY, USA, 2016; pp. 1328–1332.
57. Burillo, P.; Bustince, H. Entropy on intuitionistic fuzzy sets and on interval-valued fuzzy sets. Fuzzy Sets Syst. 1996, 78, 305–316.
58. Ďurica, M. Entropy on IF-events. Notes Intuit. Fuzzy Sets 2007, 13, 30–40.
59. Szmidt, E.; Kacprzyk, J. Entropy for intuitionistic fuzzy sets. Fuzzy Sets Syst. 2001, 118, 467–477.

© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
