Bounding quantum uncommon information with quantum neural estimators
Abstract
In classical information theory, uncommon information refers to the amount of information that is not shared between two messages, and it admits an operational interpretation as the minimum communication cost required to exchange the messages. Extending this notion to the quantum setting, quantum uncommon information is defined as the amount of quantum information necessary to exchange two quantum states. While the value of uncommon information can be computed exactly in the classical case, no direct method is currently known for calculating its quantum analogue. Prior work has primarily focused on deriving upper and lower bounds for quantum uncommon information. In this work, we propose a new approach for estimating these bounds by utilizing the quantum Donsker–Varadhan representation and implementing a gradient-based optimization method. Our results suggest a pathway toward efficient approximation of quantum uncommon information using variational techniques grounded in quantum neural architectures.
Contents
I Introduction
In classical information theory, a message can be modeled as a sequence of values independently drawn from a discrete probability source . The entropy of the source, denoted by , quantifies the inherent randomness in , and admits an operational interpretation as the minimal amount of information required to faithfully encode outcomes sampled from .
More generally, in a base- encoding scheme, the entropy is defined as , a quantity known as the Shannon entropy [Sha1948EN]. Operationally, corresponds to the expected number of -ary digits needed to represent an outcome from . In the special case , we adopt the shorthand and .
Consider now a communication scenario in which a sender, Alice, wishes to transmit the source to a receiver, Bob, through a noisy communication channel . The channel transforms the input into an output source , inducing a joint distribution over and . This statistical dependence enables Bob to infer partial information about from his observation of , given knowledge of the channel. The amount of information shared between and is captured by the mutual information, defined as
| (I.1) |
where denotes the Kullback–Leibler divergence [KL1997], and is the product of the marginals of and . This can also be expressed in terms of Shannon entropy as , which corresponds to the sum of the individual entropies minus the joint entropy. Intuitively, mutual information quantifies the extent to which the joint distribution deviates from independence. A natural question then arises:
“How much additional information must Bob acquire
in order to fully reconstruct the message ?”
The amount of information not shared with is captured by the partial information of relative to , given by . In classical information theory, this quantity coincides with the conditional entropy [Sle1971EN]. The operational meaning of this quantity as the minimum information required for reconstruction is formally established by the Slepian-Wolf theorem. It states that in the asymptotic limit, if Bob already possesses the correlated message Y, Alice need only send her message X at a rate of bits per symbol for Bob to reconstruct the message with a probability of error that approaches zero.
Now consider a bidirectional communication setting in which Alice and Bob respectively possess correlated random variables and , and aim to exchange their messages. Due to the correlation, it suffices to exchange only the parts of the messages that are not mutually shared. The total amount of information required for this task is given by , which is referred to as the uncommon information between and . In classical settings, since this quantity is expressible solely in terms of conditional and mutual entropies, it is often subsumed under the broader framework of mutual information. Refer to Fig. 1 for expressions related to classical information content.
In quantum information theory, classical notions are extended to the quantum regime by replacing classical probability distributions with quantum states. The entropy of a quantum state is defined as , known as the von Neumann entropy [von1955QI]. This quantity characterizes the asymptotic number of qubits required to encode independently prepared copies of , which is approximately qubits [Sch1995QI]. When a quantum system is described by the reduced state , its entropy is denoted by .
To extend the notion of mutual information to the quantum setting, it is crucial not to define it as the information about one system obtainable by measuring the other, since quantum measurements generally disturb the state and irreversibly alter the system. Instead, for a bipartite quantum system described by , where and represent subsystems held by two parties, the quantum mutual information is defined as
| (I.2) |
where denotes the Umegaki relative entropy. This quantity measures the total correlations between and by quantifying the deviation of the joint state from a product state. Similarly, this quantity can be written using the von Neumann entropy as .
To formulate a quantum analogue of partial information, one may consider the quantum state merging protocol [Hor2005QSM]. Let be a purification of the mixed state , where is a reference system purifying . The protocol addresses the task of transferring system from one party to the other, so that the receiving party ultimately holds the entire pure state . The quantum communication cost of this task is given by the quantum conditional entropy .
Unlike its classical counterpart, this quantity can be negative. In such cases, the task can be completed without any quantum communication, and furthermore, entanglement is generated in the process. This reflects the fact that when and are highly entangled, transferring provides more than just its marginal information. While other positive-definite definitions of quantum relative entropy exist, they do not correspond to the operational cost of this fundamental state merging task, which provides a conceptual foundation for more elaborate protocols like state exchange.
The concept of uncommon information also admits a quantum generalization. Extending the state merging protocol, one may consider the quantum state exchange protocol [Jon2008QUI, Lee2024QUI], where the goal is to exchange the respective parts of the two parties so that the final global state becomes . The minimum quantum communication cost of achieving this transformation defines the quantum uncommon information.
In contrast to the classical case, the quantum uncommon information cannot be expressed as . Since this sum can be negative, iterating the protocol would imply the possibility of generating unbounded entanglement, which contradicts the no-cloning theorem. Therefore, no straightforward quantum analogue of the classical expression exists, and computing quantum uncommon information requires fundamentally different techniques [Hor1998QI]. Refer to Fig. 2 for expressions related to quantum information content.
The closed-form expression for the quantum uncommon information is not yet known. Although several studies have investigated specific instances [Lee2019QSE, Lee2019QSE2, Lee2021QSE], for the general case, only upper and lower bounds have been established [Jon2008QUI, Lee2024QUI].
Quantum uncommon information is not merely a theoretical construct. It serves as a key metric for quantifying communication costs in various quantum network protocols, such as distributed quantum computing [App1999DQC, App2004DQC, App2013DQC, App2016DQC], quantum key agreement [App2017QKD, App2020QKD, App2023QKD], and quantum secret sharing [App1999QSS, App2008QSS].
In this work, we address this gap by proposing a method for predicting bounds on the quantum uncommon information of a given quantum state. Our approach leverages quantum machine learning techniques based on the quantum Donsker–Varadhan representation [Shin2024QMINE, Ziv2024QMINE], which we employ to estimate the von Neumann entropy. These entropy estimates then enable the derivation of corresponding bounds on the quantum uncommon information.
II Quantum uncommon information
II.1 Quantum state exchange protocol
The quantum uncommon information is defined operationally via the quantum state exchange protocol [Jon2008QUI, Lee2024QUI]. This protocol is framed within the paradigm of entanglement as a resource, where any task beyond the scope of local operations and classical communication (LOCC) must be paid for by consuming pre-shared entanglement.
In this scenario, two parties, Alice and Bob, hold quantum systems and , which are part of a larger pure state on a system , where is a reference system that purifies such that and . The goal is to exchange their respective systems, and , using only LOCC assisted by an initial amount of shared pure entanglement. The cost of the protocol is then the net amount of entanglement, measured in units of maximally entangled states (ebits), consumed to achieve this exchange. This resource-centric view motivates the formal definition of the protocol that follows. Refer to Fig. 3 for the description of the quantum state exchange protocol.
Definition II.1 (Quantum state exchange protocol).
Let be a pure state on , and let and be maximally entangled states on auxiliary systems and , respectively. Let denote the target state obtained from by swapping systems and . A quantum state exchange protocol with error is a quantum channel
| (II.1) |
satisfying
| (II.2) |
where is the permissible error.
Now consider the asymptotic setting, where each party holds copies of the initial state . If the error vanishes as , the protocol achieves asymptotically faithful exchange. The quantum uncommon information is then defined as the asymptotic entanglement cost of this task.
Definition II.2 (Quantum uncommon information).
Let be a pure state on , and let be the target state with systems and exchanged. Consider a sequence of quantum channels
| (II.3) |
where and are maximally entangled states on and , respectively. Suppose the protocol satisfies
| (II.4) |
with as . Let and denote the Schmidt ranks of and . Then, the quantum uncommon information is defined as
| (II.5) |
where the infimum is taken over all such sequences of protocols achieving vanishing error.
II.2 Bounds on quantum uncommon information
There is no known closed-form expression for the quantum uncommon information . However, several studies have investigated upper and lower bounds using various strategies [Jon2008QUI, Lee2024QUI].
II.2.1 Upper bounds
An upper bound on can be obtained by constructing an explicit quantum state exchange protocol and calculating the corresponding entanglement rate. As a baseline, consider the protocol in which Alice first transmits her state to Bob, and then Bob sends his state to Alice. This approach effectively applies the quantum state merging protocol twice. The total entanglement consumption in this case is , and the strategy is referred to as the merge-and-send protocol.
This basic protocol can be improved by identifying and removing redundant components that do not require exchange. The common subspace refers to a part of the quantum state that remains invariant under the exchange and hence need not be transmitted, reducing the total entanglement cost.
Definition II.3 (Common subspace).
Let be a pure state on the tripartite system . A subspace of both and is called a common subspace if there exist unitary operators on and on such that
| (II.6) | ||||
| (II.7) | ||||
| (II.8) | ||||
| (II.9) |
where denotes the projection operator onto subspace , and denotes the state with systems and exchanged.
Let and be ancillary systems with the same dimensions as and , respectively. For fixed pure states and , define the unitary operator
| (II.10) |
acting on systems and . Then the following pure state is called the stretched state:
| (II.11) |
By converting the original state into the stretched state and applying a subspace exchange protocol that swaps only and , the resulting entanglement cost is given by the conditional entropy .
II.2.2 Lower bounds
A lower bound on can be derived by observing that the sum of the initial entanglement and the entanglement used in the protocol must be at least the final entanglement.
From Alice’s perspective, the initial entanglement of system is , while after the exchange, the entanglement becomes . Therefore,
| (II.12) |
A symmetric argument from Bob’s perspective yields the lower bound
| (II.13) |
This idea can be refined by considering copies of the state and analyzing the entanglement structure across them via a decomposition protocol.
Definition II.4 (Decomposed states).
Let be a pure state on , and let , , and denote systems corresponding to Alice, Bob, and a reference, respectively. Suppose that for the -fold product state , there exists a reversible transformation such that, for some error ,
| (II.14) |
where are non-negative rational numbers. If such a transformation exists, the four disjoint pure states are referred to as the decomposed states, and the corresponding entanglement cost is given by .
The best known bounds can now be summarized as follows.
Proposition II.1 (Bounds on quantum uncommon information).
Let denote the quantum uncommon information between quantum systems and . Then the following inequalities hold:
| (II.15) | ||||
| (II.16) |
where the upper bound is defined by , and the lower bound is defined by .
Remark II.1.
In general, it is computationally challenging to evaluate the infimum over all possible common subspaces or the supremum over all valid decompositions . In this work, we focus on scenarios where the parties possess additional structural knowledge about the state, enabling efficient identification of the common subspace. For decomposed states, we restrict our attention to special cases in which such a decomposition is guaranteed. Further details are provided in Section IV.
III Quantum Donsker–Varadhan representation
The Donsker–Varadhan representation provides a lower bound on the divergence between two probability distributions [von1932DVR]. It has been widely applied to mutual information estimation by optimizing over a neural network. The quantum Donsker–Varadhan representation is the quantum analogue of this formulation, enabling entropy estimation via the Gibbs variational principle [Shin2024QMINE, Ziv2024QMINE].
Proposition III.1 (Quantum Donsker–Varadhan representation).
Let be a -dimensional density matrix with rank , and define the function by
| (III.1) |
where is a Hermitian operator and is a positive constant. Then, for any , we have
| (III.2) |
provided that is an -rank density matrix and .
According to this formulation, one can estimate the von Neumann entropy by minimizing the function over rank- Hermitian operators . The optimal value approximates up to arbitrary precision .
III.1 Neural estimation of quantum entropy
Suppose is a density matrix of rank . Let be a parameterized unitary operator with parameters . We define a parameterized ansatz for the Hermitian operator as
| (III.3) |
where , , and .
Then the function can be rewritten as
| (III.4) |
The first term can be computed on a quantum computer using a parameterized quantum circuit, while the second term is evaluated classically. Accordingly, we used the function from Section III.1 directly as the loss function. We then obtained an estimate of the von Neumann entropy by updating the parameters to minimize the value of this loss function.
The gradient of with respect to can be computed classically, and the gradient with respect to can be evaluated using the parameter-shift rule [Mit2018QML]. This allows the use of hybrid quantum–classical optimization techniques, such as gradient descent, to minimize and thereby estimate the von Neumann entropy of [Shin2024QMINE, Ziv2024QMINE].
In our numerical simulations, we employed the Adam (Adaptive Moment Estimation) optimizer to update the variational parameters. The initial learning rate was set to 0.1, and a StepLR learning rate scheduler was also applied, which decays the learning rate by a factor of 0.9 every 10 optimization steps to enhance the stability of the training process.
It has been shown that the number of copies of required for this learning process scales as , where is the rank of the state [Shin2024QMINE]. This implies that the quantum Donsker–Varadhan representation is particularly effective when applied to low-rank quantum states.
IV Estimation of bounds
Based on the above, the quantum Donsker–Varadhan representation can be utilized to estimate the von Neumann entropy of a given density matrix. Since the bounds of quantum uncommon information are expressed in terms of von Neumann entropy, applying the same approach allows quantum Donsker–Varadhan representation to estimate the bounds of quantum uncommon information between given quantum states. Notably, as the process of expressing the bounds reduces the size of the states whose von Neumann entropy needs to be estimated, the rank of these states also decreases. Therefore, the method proves to be effective in such cases.
Consider the case of estimating the bounds of quantum uncommon information for quantum systems and . In general, let us consider a given pure state , where is a purifying system of .
IV.1 The loose bounds
For the loose upper bound and lower bound , the von Neumann entropy of each state can be directly estimated to obtain these values. Consequently, for loose bounds, it is relatively straightforward to approximate their values. However, for tight bounds, direct estimation becomes more challenging.
Therefore, we impose several constraints to determine the form of the state, which will then allow us to estimate the bound value. When the tight bound can be measured, the significance of the wide bound value becomes irrelevant. However, for the purpose of comparison, we can easily estimate the bound using a state of the form by applying the aforementioned method. In the following, we will explore methods for estimating tight upper and lower bounds, respectively.
IV.2 The tight bounds
IV.2.1 Common subspaces
Before estimating the tight upper bound, we first examine the properties of the common subspace.
Suppose the common subspace with respect to a specific basis is given by . If and satisfy the conditions for a common subspace without the need for additional unitary transformations, then when expressed in matrix form with respect to the same basis, the following must hold:
| (IV.1) |
where is an matrix, and , are matrices. Moreover, the spectrum of must correspond to the spectrum of and . Therefore, the common subspace is contained in the subspace corresponding to the components where and have the same eigenvalues.
Now suppose the density matrices are given by
| (IV.2) |
We are allowed to apply arbitrary unitary operations to systems and , respectively. Since a unitary operator acts as a basis change, we can align the states to a desired basis to identify the common subspace. Thus, the following holds.
Proposition IV.1 (Partial spectral alignment and unitary diagonalization).
Let and be density matrices of the same dimension , with eigenvalues ordered as and . Suppose there exists a permutation such that for some , the condition holds for . Then there exist unitary operators and such that
Let be the basis that maximizes the overlap of eigenvalues. Then, from the above, we obtain:
Theorem IV.1 (Unitary mapping of a subspace to a fixed basis segment).
Let be a common subspace. Then there exists a unitary such that
Now consider a decomposition of into subspaces and :
where is an orthonormal basis of . In general, we have:
Proposition IV.2 (Characterization of common subspaces via a nonzero-structure relation).
Define a relation on by if or for some . Then the equivalence classes define all possible common subspaces of the form
To identify , prepare two copies of , construct a swapped version by exchanging , and run a swap test. Let be the union of subspaces that pass this test. Then, by Theorem IV.1, this gives the infimum for .
IV.2.2 Decomposed states
For the lower bound, we classify the structure of entanglement. Consider a decomposition into EPR and GHZ states [Vidal2000, Lee2024QUI]. Suppose:
| (IV.3) |
Then we have:
Proposition IV.3 (Reversible entanglement decomposition of tensor powers of tripartite states).
For , there exists a reversible map such that
| (IV.4) |
where .
It is known that not every tripartite state admits such decomposition [Acin2003], and there is no general method to construct it. Thus, we restrict estimation to cases where such decompositions are explicitly given.
V Numerical simulations
To verify whether the proposed method accurately estimates the bounds, we conducted numerical simulations. First, we implemented a quantum machine learning algorithm to estimate the von Neumann entropy of a given state. In this setup, the parameterized unitary was constructed using rotation gates along each axis, together with CNOT gates. The specific structure of the ansatz is shown in Fig. 4.
Subsequently, we describe the estimation procedures for each of the bounds. To verify the performance and stability of our proposed method, we conducted extensive numerical simulations. Each simulation reported in this section was performed for 10 independent runs. For each run, the variational parameters of the quantum circuit were initialized with different random values.
The figures presented in this section depict the aggregated results of these multiple runs. The solid line represents the mean of the entropy estimates at each iteration, while the shaded area corresponds to the standard deviation. This shaded region thus represents the statistical variability of our algorithm’s convergence behavior, originating from the different random initial starting points in the optimization landscape.
V.1 Estimation of the loose bounds
For the loose upper bound and the loose lower bound , we directly estimated each von Neumann entropy of the given . To simulate this, we first generated random quantum states and partitioned the system equally, assigning half to and the other half to , thereby constructing the target state for entropy estimation. Then, using the entropy estimation algorithm described in the previous section, we estimated the corresponding entropies. This simulation was conducted for both 4, 6, and 8 qubit systems.
The simulation results for the upper bound are shown in Fig. 5. The estimation of stabilized after approximately 100 optimization steps for the 4-qubit system and around 200 steps for the 6-qubit and 8-qubit system.
For the lower bound , since only half of the system is used to estimate and , the actual subsystems involved correspond to 2-qubit, 3-qubit, and 4-qubit reduced states, respectively. The results are presented in Fig. 6.
The estimation of stabilized after roughly 50 optimization steps. Theoretically, since the error in the quantum Donsker–Varadhan representation-based method is proportional to the rank of the density matrix, larger systems require more optimization steps to achieve stable estimates.
V.2 Estimation of the tight upper bound
To estimate the tight upper bound , we first identify the common subspace of systems and from the given state , then construct the stretched state , and perform entropy estimation on the full system . In practice, when the system size is small, it is often the case that no nontrivial common subspace exists. Therefore, in this simulation, we fixed the common subspace in advance and proceeded with the estimation.
Specifically, let the dimensions of systems and be , and fix an integer . Denote the computational basis of and by , and define the common subspace as . In other words, for the tripartite system , we ensured that the components supported on were symmetric under the exchange of systems and , while the components supported on were randomly generated.
Simulations were conducted for total system sizes of 4 and 8 qubits for . For the 4-qubit system, we set ; for the 8-qubit system, we set . The results are shown in Fig. 7.
The estimation of stabilized after approximately 200 optimization steps. Since the ranks of the reduced states on and , which are used in the entropy estimation process, depend on the value of , the number of steps required for convergence increases with the size of the common subspace.
Finally, Fig. 8 compares the estimation results for the loose and tight upper bounds of the quantum uncommon information of the given state. The difference between the tight and loose bounds leads to distinct convergence behaviors in their corresponding estimated values.
V.3 Estimation of the tight lower bound
For the tight lower bound , the simulation was conducted as described earlier, assuming that the state is given by Section IV.2.2.
According to Proposition IV.3, the state can be decomposed into EPR and GHZ states, which allows us to estimate the value of the lower bound. Specifically, the coefficients are randomly chosen such that remains a pure state. Based on these coefficients, the values of can be computed. Since are EPR states and is a GHZ state, we estimate the corresponding entropies to obtain the value of . Fig. 9 presents the simulation results. The estimation of stabilized after roughly 50 optimization steps.
VI Concluding remarks
In this paper, we proposed a quantum machine learning approach to estimate the bounds of the quantum uncommon information, which represents the minimum amount of entanglement required to exchange the given state between subsystems and . Since quantum Donsker–Varadhan representation allows the von Neumann entropy of a state to be expressed as the infimum of a specific function, we designed a machine learning algorithm that utilizes this as a cost function to estimate entropy. As the bounds of quantum uncommon information are expressed in terms of von Neumann entropy, the same algorithm can be used to estimate their values. Notably, the number of copies required for training with quantum Donsker–Varadhan representation scales as with respect to the rank , providing an advantage in estimating bounds that involve entropies of subsystems of a given state. In addition, the method proposed in this paper for identifying the common subspace of a given state can be extended to other estimation techniques.
Our estimation method requires different types of information from Alice and Bob depending on the bound to be estimated. Fundamentally, the use of quantum Donsker–Varadhan representation requires knowledge of the rank of each state. In scenarios where the ranks are not known, one may employ techniques such as quantum rank estimation [Ryan2019VQD].
To determine the common subspace, Alice and Bob must additionally know the marginal states they individually possess and how the entire system can be decomposed. In cases where the marginal states are not known, one can employ a variational quantum circuit to search for unitaries and . However, this approach still requires access to information about the global system.
Moreover, a decomposed state requires prior knowledge of whether the given state can be expressed in a specific canonical form. Therefore, depending on the context in which quantum uncommon information is applied, one must estimate different bounds accordingly.
Quantum uncommon information can be broadly applied in scenarios where the parties are required to exchange their quantum states completely, including entanglement, and thus provides a natural measure of communication cost in such contexts. These situations frequently arise during entanglement distribution protocols in quantum networks [AppQN1997, AppQN2016, App1999DQC, App2004DQC, App2013DQC, App2016DQC, App2017QKD, App2020QKD, App2023QKD, App1999QSS, App2008QSS].
In many of these cases, Alice and Bob have prior knowledge about their respective states, particularly when the state preparation process is known. Furthermore, in these cases, the states often consist of EPR or GHZ states. Under these conditions, the requirements for identifying the common subspace and the decomposed state are often satisfied, and the estimation method for quantum uncommon information can be utilized effectively.
To allow for more general application, certain aspects need to be improved. First, if the process of identifying the common subspace can be refined to work without prior knowledge of the structure of the original state, then it may be possible to determine the common subspace based solely on information from the partial systems held by Alice and Bob. Furthermore, our approach remains applicable even when different forms of decomposed states are considered.
Although this work assumes an ideal quantum system, the presence of noise in realistic quantum devices could affect the performance of the proposed methodology. For instance, errors in state preparation or gate operations within the variational circuit could degrade the accuracy of the loss function calculation, potentially introducing a bias to the entropy estimate or hindering convergence.
A crucial direction for future research is therefore to integrate quantum error mitigation techniques into our algorithm to test its robustness in noisy environments. Such an extension would significantly enhance the practical applicability of the proposed method.
Data availability statement
The data and software that support the findings of this study can be found in the following repository: https://github.com/donghwa722/QUINE
Acknowledgements
J.L. acknowledges helpful discussions with Ju-Young Ryu. This work was supported by the National Research Foundation of Korea (NRF) through a grant funded by the Ministry of Science and ICT (Grant No. RS-2025-00515537). This work was also supported by the Institute for Information & Communications Technology Promotion (IITP) grant funded by the Korean government (MSIP) (Grant Nos. RS-2019-II190003 and RS-2025-02304540), the National Research Council of Science & Technology (NST) (Grant No. GTL25011-401), and the Korea Institute of Science and Technology Information (KISTI) (Grant No. P25026). I.K.S. acknowledges support by Quantum Computing based on Quantum Advantage challenge research through the National Research Foundation of Korea (NRF) funded by the Korean government (MSIT) (Grant No. RS-2023-00256221).