Trace distance: Difference between revisions

Latest revision as of 02:45, 13 February 2023

In quantum mechanics, and especially quantum information and the study of open quantum systems, the trace distance T is a metric on the space of density matrices and gives a measure of the distinguishability between two states. It is the quantum generalization of the Kolmogorov distance for classical probability distributions.

Definition

The trace distance is defined as half of the trace norm of the difference of the matrices: $T(\rho ,\sigma ):={\frac {1}{2}}\|\rho -\sigma \|_{1}={\frac {1}{2}}\mathrm {Tr} \left[{\sqrt {(\rho -\sigma )^{\dagger }(\rho -\sigma )}}\right],$ where $\|A\|_{1}\equiv \operatorname {Tr} [{\sqrt {A^{\dagger }A}}]$ is the trace norm of $A$, and ${\sqrt {A}}$ is the unique positive semidefinite $B$ such that $B^{2}=A$ (which is always defined for positive semidefinite $A$). Equivalently, ${\sqrt {A}}$ is obtained from $A$ by replacing each eigenvalue with its non-negative square root while keeping the eigenvectors fixed. For the trace distance, we more specifically have an expression of the form $|C|\equiv {\sqrt {C^{\dagger }C}}={\sqrt {C^{2}}}$ where $C=\rho -\sigma$ is Hermitian. Its trace equals the sum of the singular values of $C$, which, since $C$ is Hermitian, equals the sum of the absolute values of its eigenvalues. More explicitly, $T(\rho ,\sigma )={\frac {1}{2}}\operatorname {Tr} |\rho -\sigma |={\frac {1}{2}}\sum _{i=1}^{r}|\lambda _{i}|,$ where $\lambda _{i}\in \mathbb {R}$ is the $i$-th nonzero eigenvalue of $\rho -\sigma$, and $r$ is its rank.

The factor of one half ensures that the trace distance between normalized density matrices takes values in the range $[0,1]$, since $\|\rho -\sigma \|_{1}\leq \|\rho \|_{1}+\|\sigma \|_{1}=2$.
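
The eigenvalue formula above translates directly into a few lines of numerical code. The following is a minimal sketch, assuming NumPy is available and that the inputs are valid density matrices; the function name `trace_distance` is illustrative and does not come from any particular library.

```python
import numpy as np

def trace_distance(rho, sigma):
    """Half the sum of the absolute eigenvalues of the Hermitian matrix rho - sigma."""
    diff = np.asarray(rho, dtype=complex) - np.asarray(sigma, dtype=complex)
    eigenvalues = np.linalg.eigvalsh(diff)  # real eigenvalues of a Hermitian matrix
    return 0.5 * float(np.sum(np.abs(eigenvalues)))

# Example: the pure state |0><0| versus the maximally mixed qubit state I/2.
rho = np.array([[1.0, 0.0], [0.0, 0.0]])
sigma = np.eye(2) / 2
print(trace_distance(rho, sigma))  # the difference has eigenvalues +1/2 and -1/2
```

Using `eigvalsh` rather than a general singular value decomposition relies on the fact that $\rho -\sigma$ is Hermitian, so its singular values are the absolute values of its eigenvalues.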

Connection with the total variation distance

The trace distance can be seen as a direct quantum generalization of the total variation distance between probability distributions. Given a pair of probability distributions $P,Q$, their total variation distance is $\delta (P,Q)={\frac {1}{2}}\|P-Q\|_{1}={\frac {1}{2}}\sum _{k}|P_{k}-Q_{k}|.$ Applying this definition directly to quantum states raises the problem that a quantum state gives rise to different probability distributions depending on how it is measured. A natural choice is then to consider the total variation distance between the classical probability distributions obtained by measuring the two states, maximized over the possible choices of measurement. This results precisely in the trace distance between the quantum states. More explicitly, this is the quantity $\max _{\Pi }{\frac {1}{2}}\sum _{i}|\operatorname {Tr} (\Pi _{i}\rho )-\operatorname {Tr} (\Pi _{i}\sigma )|,$ with the maximization performed with respect to all possible POVMs $\{\Pi _{i}\}_{i}$.

To see why this is the case, we start by observing that there is a unique decomposition $\rho -\sigma =P-Q$ with $P,Q\geq 0$ positive semidefinite matrices with orthogonal support. With these operators we can write concisely $|\rho -\sigma |=P+Q$. Furthermore, since each $\Pi _{i}$ is positive semidefinite, $\operatorname {Tr} (\Pi _{i}P),\operatorname {Tr} (\Pi _{i}Q)\geq 0$, and thus $|\operatorname {Tr} (\Pi _{i}P)-\operatorname {Tr} (\Pi _{i}Q)|\leq \operatorname {Tr} (\Pi _{i}P)+\operatorname {Tr} (\Pi _{i}Q)$. Using the completeness relation $\sum _{i}\Pi _{i}=\mathbb {I}$ in the last step, we thus have $\sum _{i}|\operatorname {Tr} (\Pi _{i}(\rho -\sigma ))|=\sum _{i}|\operatorname {Tr} (\Pi _{i}(P-Q))|\leq \sum _{i}\operatorname {Tr} (\Pi _{i}(P+Q))=\operatorname {Tr} |\rho -\sigma |.$ This shows that $\max _{\Pi }\delta (P_{\Pi ,\rho },P_{\Pi ,\sigma })\leq T(\rho ,\sigma ),$ where $P_{\Pi ,\rho }$ denotes the classical probability distribution resulting from measuring $\rho$ with the POVM $\Pi$, $(P_{\Pi ,\rho })_{i}\equiv \operatorname {Tr} (\Pi _{i}\rho )$, and the maximum is taken over all POVMs $\Pi \equiv \{\Pi _{i}\}_{i}$.

To conclude that the inequality is saturated by some POVM, we need only consider the projective measurement whose elements are the projectors onto the eigenvectors of $\rho -\sigma$. With this choice, $\delta (P_{\Pi ,\rho },P_{\Pi ,\sigma })={\frac {1}{2}}\sum _{i}|\operatorname {Tr} (\Pi _{i}(\rho -\sigma ))|={\frac {1}{2}}\sum _{i}|\lambda _{i}|=T(\rho ,\sigma ),$ where $\lambda _{i}$ are the eigenvalues of $\rho -\sigma$.
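
The saturation argument can be checked numerically. The sketch below, assuming NumPy and using illustrative helper names (`trace_distance`, `measured_tv_distance`), compares the total variation distance obtained from a computational-basis measurement with the one obtained from measuring in the eigenbasis of $\rho -\sigma$, for the example states $|0\rangle \langle 0|$ and $|+\rangle \langle +|$.

```python
import numpy as np

def trace_distance(rho, sigma):
    # Half the sum of |eigenvalues| of the Hermitian difference.
    return 0.5 * float(np.sum(np.abs(np.linalg.eigvalsh(rho - sigma))))

def measured_tv_distance(rho, sigma, basis):
    # Columns of `basis` are orthonormal vectors |v_i>; the projective
    # measurement {|v_i><v_i|} yields outcome probabilities <v_i|state|v_i>.
    p = np.real(np.einsum("ji,jk,ki->i", basis.conj(), rho, basis))
    q = np.real(np.einsum("ji,jk,ki->i", basis.conj(), sigma, basis))
    return 0.5 * float(np.sum(np.abs(p - q)))

rho = np.array([[1.0, 0.0], [0.0, 0.0]])    # |0><0|
sigma = np.array([[0.5, 0.5], [0.5, 0.5]])  # |+><+|

_, eigenbasis = np.linalg.eigh(rho - sigma)  # eigenvectors of rho - sigma
tv_computational = measured_tv_distance(rho, sigma, np.eye(2))
tv_eigenbasis = measured_tv_distance(rho, sigma, eigenbasis)
t = trace_distance(rho, sigma)
print(tv_computational, tv_eigenbasis, t)
```

For these states the computational-basis measurement gives a strictly smaller total variation distance than the trace distance, while the eigenbasis measurement attains it up to floating-point error.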

Physical interpretation

By using the Hölder duality for Schatten norms, the trace distance can be written in variational form as ^[1]

$T(\rho ,\sigma )={\frac {1}{2}}\sup _{-\mathbb {I} \leq U\leq \mathbb {I} }\mathrm {Tr} [U(\rho -\sigma )]=\sup _{0\leq P\leq \mathbb {I} }\mathrm {Tr} [P(\rho -\sigma )].$

As for its classical counterpart, the trace distance can be related to the maximum probability of distinguishing between two quantum states:

For example, suppose Alice prepares a system in either the state $\rho$ or $\sigma$, each with probability ${\frac {1}{2}}$, and sends it to Bob, who has to discriminate between the two states using a binary measurement. Let Bob associate the measurement outcome $0$ with the POVM element $P_{0}$ and the outcome $1$ with the POVM element $P_{1}=1-P_{0}$, and let him guess the state $\rho$ or $\sigma$, respectively, upon obtaining them. His expected probability of correctly identifying the incoming state is then given by

$p_{\text{guess}}={\frac {1}{2}}p(0|\rho )+{\frac {1}{2}}p(1|\sigma )={\frac {1}{2}}\mathrm {Tr} (P_{0}\rho )+{\frac {1}{2}}\mathrm {Tr} (P_{1}\sigma )={\frac {1}{2}}\left(1+\mathrm {Tr} \left(P_{0}(\rho -\sigma )\right)\right).$

Therefore, when applying an optimal measurement, Bob has the maximal probability

$p_{\text{guess}}^{\text{max}}=\sup _{P_{0}}{\frac {1}{2}}\left(1+\mathrm {Tr} \left(P_{0}(\rho -\sigma )\right)\right)={\frac {1}{2}}(1+T(\rho ,\sigma ))$

of correctly identifying in which state Alice prepared the system.^[2]
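
As an illustration of the discrimination task, the sketch below builds one measurement that attains the supremum, taking $P_{0}$ to be the projector onto the eigenvectors of $\rho -\sigma$ with positive eigenvalue. This choice and the helper names (`optimal_projector`, `guessing_probability`) are assumptions made for the example rather than something stated in the text above; NumPy is assumed to be available.

```python
import numpy as np

def trace_distance(rho, sigma):
    return 0.5 * float(np.sum(np.abs(np.linalg.eigvalsh(rho - sigma))))

def optimal_projector(rho, sigma):
    # Projector onto the eigenvectors of rho - sigma with positive eigenvalue.
    eigenvalues, eigenvectors = np.linalg.eigh(rho - sigma)
    positive = eigenvectors[:, eigenvalues > 0]
    return positive @ positive.conj().T

def guessing_probability(p0, rho, sigma):
    # p_guess = 1/2 Tr(P0 rho) + 1/2 Tr(P1 sigma), with P1 = 1 - P0.
    p1 = np.eye(rho.shape[0]) - p0
    return 0.5 * float(np.real(np.trace(p0 @ rho) + np.trace(p1 @ sigma)))

rho = np.array([[1.0, 0.0], [0.0, 0.0]])    # |0><0|
sigma = np.array([[0.5, 0.5], [0.5, 0.5]])  # |+><+|

p_naive = guessing_probability(np.diag([1.0, 0.0]), rho, sigma)  # P0 = |0><0|
p_best = guessing_probability(optimal_projector(rho, sigma), rho, sigma)
t = trace_distance(rho, sigma)
print(p_naive, p_best, 0.5 * (1 + t))
```

Because $\rho -\sigma$ is traceless, its positive eigenvalues sum to $T(\rho ,\sigma )$, which is why this projector reproduces ${\frac {1}{2}}(1+T(\rho ,\sigma ))$.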

Properties

The trace distance has the following properties^[1]

It is a metric on the space of density matrices, i.e. it is non-negative, symmetric, and satisfies the triangle inequality, and $T(\rho ,\sigma )=0\Leftrightarrow \rho =\sigma$
$0\leq T(\rho ,\sigma )\leq 1$ and $T(\rho ,\sigma )=1$ if and only if $\rho$ and $\sigma$ have orthogonal supports
It is preserved under unitary transformations: $T(U\rho U^{\dagger },U\sigma U^{\dagger })=T(\rho ,\sigma )$
It is contractive under trace-preserving CP maps, i.e. if $\Phi$ is a CPT map, then $T(\Phi (\rho ),\Phi (\sigma ))\leq T(\rho ,\sigma )$
It is convex in each of its inputs. E.g. $T(\sum _{i}p_{i}\rho _{i},\sigma )\leq \sum _{i}p_{i}T(\rho _{i},\sigma )$
On pure states, it can be expressed in terms of the inner product of the states: $T(|\psi \rangle \langle \psi |,|\phi \rangle \langle \phi |)={\sqrt {1-|\langle \psi |\phi \rangle |^{2}}}$ ^[3]

For qubits, the trace distance is equal to half the Euclidean distance in the Bloch representation.
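
The qubit statement can be verified with a short numerical sketch. It assumes NumPy and the usual Bloch parametrization $\rho ={\frac {1}{2}}(\mathbb {I} +{\vec {r}}\cdot {\vec {\sigma }})$ with $|{\vec {r}}|\leq 1$; the helper names and the example Bloch vectors are illustrative.

```python
import numpy as np

# Pauli matrices X, Y, Z stacked along the first axis.
PAULIS = np.array([
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
], dtype=complex)

def qubit_state(bloch_vector):
    # rho = (I + r . sigma) / 2 for a Bloch vector r with |r| <= 1.
    r = np.asarray(bloch_vector, dtype=float)
    return 0.5 * (np.eye(2) + np.tensordot(r, PAULIS, axes=1))

def trace_distance(rho, sigma):
    return 0.5 * float(np.sum(np.abs(np.linalg.eigvalsh(rho - sigma))))

r = np.array([0.0, 0.0, 1.0])  # pure state |0><0|
s = np.array([0.6, 0.0, 0.0])  # a mixed state on the x axis

from_matrices = trace_distance(qubit_state(r), qubit_state(s))
from_bloch = 0.5 * float(np.linalg.norm(r - s))
print(from_matrices, from_bloch)
```

The agreement follows from $\rho -\sigma ={\frac {1}{2}}({\vec {r}}-{\vec {s}})\cdot {\vec {\sigma }}$, whose eigenvalues are $\pm {\frac {1}{2}}|{\vec {r}}-{\vec {s}}|$.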

Relationship to other distance measures

Fidelity

The fidelity of two quantum states $F(\rho ,\sigma )$ is related to the trace distance $T(\rho ,\sigma )$ by the inequalities

$1-{\sqrt {F(\rho ,\sigma )}}\leq T(\rho ,\sigma )\leq {\sqrt {1-F(\rho ,\sigma )}}\,.$

The upper bound inequality becomes an equality when $\rho$ and $\sigma$ are pure states. Note that the fidelity used here is the square of the one used in Nielsen and Chuang.
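
The two bounds can be spot-checked on random states. The sketch below assumes NumPy and takes the fidelity to be $F(\rho ,\sigma )=\left(\operatorname {Tr} {\sqrt {{\sqrt {\rho }}\sigma {\sqrt {\rho }}}}\right)^{2}$, which is the squared convention described in the note above. The helper names, the sampling scheme, and the tolerance of $10^{-9}$ are illustrative choices.

```python
import numpy as np

def psd_sqrt(a):
    # Square root of a positive semidefinite matrix via its eigendecomposition.
    w, v = np.linalg.eigh((a + a.conj().T) / 2)
    return (v * np.sqrt(np.clip(w, 0.0, None))) @ v.conj().T

def fidelity(rho, sigma):
    # Squared convention: F = (Tr sqrt(sqrt(rho) sigma sqrt(rho)))^2.
    s = psd_sqrt(rho)
    return float(np.real(np.trace(psd_sqrt(s @ sigma @ s)))) ** 2

def trace_distance(rho, sigma):
    return 0.5 * float(np.sum(np.abs(np.linalg.eigvalsh(rho - sigma))))

def random_density_matrix(rng, dim):
    # Normalized G G^dagger for a complex Gaussian matrix G (full rank almost surely).
    g = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
    rho = g @ g.conj().T
    return rho / np.real(np.trace(rho))

rng = np.random.default_rng(0)
violations = 0
for _ in range(100):
    rho = random_density_matrix(rng, 3)
    sigma = random_density_matrix(rng, 3)
    t = trace_distance(rho, sigma)
    f = fidelity(rho, sigma)
    lower = 1.0 - np.sqrt(f)
    upper = np.sqrt(max(0.0, 1.0 - f))
    if not (lower - 1e-9 <= t <= upper + 1e-9):
        violations += 1
print(violations)
```

A check like this does not prove the inequalities, but it is a quick way to catch a mismatch between fidelity conventions, since using the unsquared fidelity in the same formula gives different bounds.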

Total variation distance

The trace distance is a generalization of the total variation distance, and for two commuting density matrices, has the same value as the total variation distance of the two corresponding probability distributions.
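
For commuting density matrices this reduction is easy to confirm numerically. The sketch below, assuming NumPy, builds two states that are diagonal in the same randomly chosen orthonormal basis, with eigenvalue distributions `p` and `q` picked for illustration, and compares their trace distance with the total variation distance of `p` and `q`.

```python
import numpy as np

def trace_distance(rho, sigma):
    return 0.5 * float(np.sum(np.abs(np.linalg.eigvalsh(rho - sigma))))

# Two probability distributions used as eigenvalue spectra.
p = np.array([0.5, 0.3, 0.2])
q = np.array([0.2, 0.2, 0.6])

# A common eigenbasis: the unitary factor of a QR decomposition.
rng = np.random.default_rng(1)
u, _ = np.linalg.qr(rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3)))

rho = u @ np.diag(p) @ u.conj().T
sigma = u @ np.diag(q) @ u.conj().T

total_variation = 0.5 * float(np.sum(np.abs(p - q)))
print(trace_distance(rho, sigma), total_variation)
```

Since both states share the eigenbasis `u`, the difference $\rho -\sigma$ has eigenvalues $p_{k}-q_{k}$, so the two numbers agree up to floating-point error.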

References

[1] Nielsen, Michael A.; Chuang, Isaac L. (2010). "9. Distance measures for quantum information". Quantum Computation and Quantum Information (2nd ed.). Cambridge: Cambridge University Press. ISBN 978-1-107-00217-3. OCLC 844974180.
[2] Barnett, S. M. (2009). Quantum Information. Oxford University Press. Chapter 4.
[3] Wilde, Mark (2017). Quantum Information Theory. arXiv:1106.1445. doi:10.1017/9781316809976. ISBN 9781107176164. S2CID 2515538.

@@ Line 2: / Line 2: @@
 == Definition ==
+The trace distance is defined as half of the [[trace norm]] of the difference of the matrices:<math display="block">T(\rho,\sigma) := \frac{1}{2}\|\rho - \sigma\|_{1} = \frac{1}{2} \mathrm{Tr} \left[ \sqrt{(\rho-\sigma)^\dagger (\rho-\sigma)} \right],</math>where <math>\|A\|_1\equiv \operatorname{Tr}[\sqrt{A^\dagger A}]</math> is the trace norm of <math>A</math>, and <math>\sqrt A</math> is the unique positive semidefinite <math>B</math> such that <math>B^2=A</math> (which is always defined for positive semidefinite <math>A</math>). This can be thought of as the matrix obtained from <math>A</math> taking the algebraic square roots of its eigenvalues. For the trace distance, we more specifically have an expression of the form <math>|C|\equiv \sqrt{C^\dagger C}=\sqrt{C^2}</math> where <math>C=\rho-\sigma</math> is Hermitian. This quantity equals the sum of the singular values of <math>C</math>, which being <math>C</math> Hermitian, equals the sum of the absolute values of its eigenvalues. More explicitly,
-The trace distance is just half of the [[trace norm]] of the difference of the matrices:
+<math display="block">T(\rho,\sigma) = \frac12 \operatorname{Tr}|\rho-\sigma| = \frac12\sum_{i=1}^{r}|\lambda_i|,</math>
+where <math>\lambda_i\in\mathbb R</math> is the <math>i</math>-th eigenvalue of <math>\rho-\sigma</math>, and <math>r</math> is its rank.
+The factor of two ensures that the trace distance between normalized density matrices takes values in the range <math>[0,1]</math>.
-:<math>T(\rho,\sigma) := \frac{1}{2}||\rho - \sigma||_{1} = \frac{1}{2} \mathrm{Tr} \left[ \sqrt{(\rho-\sigma)^\dagger (\rho-\sigma)} \right] .</math>
+== Connection with the total variation distance ==
-(The trace norm is the [[Schatten norm]] for ''p''=1.)  The purpose of the factor of two is to restrict the trace distance between two normalized density matrices to the range [0,&nbsp;1] and to simplify formulas in which the trace distance appears.
+The trace distance can be seen as a direct quantum generalization of the [[Total variation distance of probability measures|total variation distance]] between probability distributions. Given a pair of probability distributions <math>P,Q</math>, their total variation distance is<math display="block">\delta(P,Q) = \frac12\|P-Q\|_1 = \frac12 \sum_k |P_k-Q_k|.</math>Attempting to directly apply this definition to quantum states raises the problem that quantum states can result in different probability distributions depending on how they are measured. A natural choice is then to consider the total variation distance between the classical probability distribution obtained measuring the two states, maximized over the possible choices of measurement, which results precisely in the trace distance between the quantum states. More explicitly, this is the quantity<math display="block">\max_\Pi \frac12\sum_i |\operatorname{Tr}(\Pi_i \rho) - \operatorname{Tr}(\Pi_i\sigma)|,</math>with the maximization performed with respect to all possible [[POVM|POVMs]] <math>\{\Pi_i\}_i</math>.
+To see why this is the case, we start observing that there is a unique decomposition <math>\rho-\sigma=P-Q</math> with <math>P,Q \ge 0</math> positive semidefinite matrices with orthogonal support. With these operators we can write concisely <math>|\rho-\sigma|=P+Q</math>. Furthermore <math>\operatorname{Tr}(\Pi_i P),\operatorname{Tr}(\Pi_i Q)\ge0</math>, and thus <math>|\operatorname{Tr}(\Pi_iP)-\operatorname{Tr}(\Pi_i Q))|
-Since density matrices are [[Hermitian matrix|Hermitian]],
+\le \operatorname{Tr}(\Pi_iP)+\operatorname{Tr}(\Pi_i Q))</math>. We thus have<math display="block">\sum_i |\operatorname{Tr}(\Pi_i (\rho-\sigma))|
+=\sum_i |\operatorname{Tr}(\Pi_i (P-Q))|
+\le \sum_i \operatorname{Tr}(\Pi_i(P+Q))
+= \operatorname{Tr}|\rho-\sigma|.</math>This shows that<math display="block">\max_\Pi \delta(P_{\Pi,\rho},P_{\Pi,\sigma}) \le T(\rho,\sigma), </math>where <math>P_{\Pi,\rho}</math> denotes the classical probability distribution resulting from measuring <math>\rho</math> with the POVM <math>\Pi</math>, <math>(P_{\Pi,\rho})_i \equiv \operatorname{Tr}(\Pi_i \rho)</math>, and the maximum is performed over all POVMs <math>\Pi\equiv\{\Pi_i\}_i</math>.
+To conclude that the inequality is saturated by some POVM, we need only consider the projective measurement with elements corresponding to the eigenvectors of <math>\rho-\sigma</math>. With this choice,<math display="block">\delta(P_{\Pi,\rho},P_{\Pi,\sigma}) =
-:<math>T(\rho,\sigma) = \frac{1}{2} \mathrm{Tr} \left[ \sqrt{(\rho-\sigma)^2} \right] = \frac{1}{2} \sum_i | \lambda_i | , </math>
+\frac12\sum_i |\operatorname{Tr}(\Pi_i(\rho-\sigma))|
-where the <math>\lambda_i</math> are eigenvalues of the Hermitian, but not necessarily positive, matrix <math>(\rho-\sigma)</math>.
+= \frac12 \sum_i |\lambda_i| = T(\rho,\sigma), </math>where <math>\lambda_i</math> are the eigenvalues of <math>\rho-\sigma</math>.
 == Physical interpretation ==
-By using the Hölder duality for [[Schatten norm|Schatten norms]], the trace distance can be written in variational form as <ref name="nielsen">{{Cite book|last1=Nielsen|first=Michael A.|authorlink1=Michael Nielsen|last2=Chuang|first2=Isaac L.|authorlink2=Isaac Chuang|title=[[Quantum Computation and Quantum Information (book)|Quantum Computation and Quantum Information]]|publisher=Cambridge University Press|location=Cambridge|year=2010|edition=2nd|oclc=844974180|isbn=978-1-107-00217-3|chapter=9. Distance measures for quantum information}}</ref>
+By using the Hölder duality for [[Schatten norm]]s, the trace distance can be written in variational form as <ref name="nielsen">{{Cite book|last1=Nielsen|first=Michael A.|authorlink1=Michael Nielsen|last2=Chuang|first2=Isaac L.|authorlink2=Isaac Chuang|title=[[Quantum Computation and Quantum Information (book)|Quantum Computation and Quantum Information]]|publisher=Cambridge University Press|location=Cambridge|year=2010|edition=2nd|oclc=844974180|isbn=978-1-107-00217-3|chapter=9. Distance measures for quantum information}}</ref>
 :<math>
 T(\rho,\sigma) = \frac{1}{2}\sup_{-\mathbb{I}\leq U \leq \mathbb{I}} \mathrm{Tr}[U(\rho-\sigma)]
@@ Line 36: / Line 43: @@
 =\frac 12 (1 + T(\rho,\sigma))
 </math>
-of correctly identifying in which state Alice prepared the system.<ref>S. M. Barnett, "Quantum Information", Oxford University Press, 2009, Chapter 4</ref>.
+of correctly identifying in which state Alice prepared the system.<ref>S. M. Barnett, "Quantum Information", Oxford University Press, 2009, Chapter 4</ref>
 == Properties ==
@@ Line 45: / Line 52: @@
 * It is contractive under [[Quantum operation|trace-preserving CP maps]], i.e. if <math>\Phi</math> is a CPT map, then <math>T(\Phi(\rho),\Phi(\sigma))\leq T(\rho,\sigma)</math>
 * It is convex in each of its inputs. E.g. <math>T(\sum_i p_i \rho_i,\sigma) \leq \sum_i p_i T(\rho_i,\sigma)</math>
+* On pure states, it can be expressed uniquely in term of the inner product of the states: <math>T(|\psi\rangle\langle\psi|,|\phi\rangle\langle\phi|) = \sqrt{1-|\langle\psi | \phi\rangle|^2} </math> <ref>{{cite book|last1=Wilde |first1=Mark |title=Quantum Information Theory |date=2017 |doi=10.1017/9781316809976 |arxiv=1106.1445|isbn=9781107176164 |s2cid=2515538 }}</ref>
 For [[qubits]], the trace distance is equal to half the [[Euclidean distance]] in the [[Bloch sphere|Bloch representation]].
@@ Line 53: / Line 61: @@
 :<math>
-1-F(\rho,\sigma) \le T(\rho,\sigma) \le\sqrt{1-F(\rho,\sigma)} \, .
+1-\sqrt{F(\rho,\sigma)} \le T(\rho,\sigma) \le\sqrt{1-F(\rho,\sigma)} \, .
 </math>
-The upper bound inequality becomes an equality when <math>\rho</math> and <math>\sigma</math> are [[quantum state#pure states|pure states]].
+The upper bound inequality becomes an equality when <math>\rho</math> and <math>\sigma</math> are [[quantum state#pure states|pure states]]. [Note that the definition for Fidelity used here is the square of that used in Nielsen and Chuang]
 ==== Total variation distance ====
@@ Line 66: / Line 74: @@
 [[Category:Quantum information science]]
+{{quantum-stub}}
-[[Category:Quantum mechanics]]