The Theory of Observations
The Theory of Observations
Scientists have been debating the meaning of quantum mechanics for over a
century. This book for graduate students and researchers gets to the root of
the problem: the contextual nature of empirical truth, the laws of observation,
and how these impact on our understanding of quantum physics. Bridging the
gap between nonrelativistic quantum mechanics and quantum field theory, this
novel approach to quantum mechanics extends the standard formalism to cover
the observer and their apparatus. The author demystifies some of the aspects
of quantum mechanics that have traditionally been regarded as extraordinary,
such as wave–particle duality and quantum superposition, by emphasizing the
scientific principles. Including key experiments and worked examples throughout
to encourage the reader to focus on empirically sound concepts, this book avoids
metaphysical speculation and alerts the reader to the use of computer algebra to
explore quantum experiments of virtually limitless complexity.
G E O RG E J A RO S Z K I E W I C Z
University Printing House, Cambridge CB2 8BS, United Kingdom
One Liberty Plaza, 20th Floor, New York, NY 10006, USA
477 Williamstown Road, Port Melbourne, VIC 3207, Australia
314–321, 3rd Floor, Plot 3, Splendor Forum, Jasola District Centre, New Delhi – 110025, India
79 Anson Road, #06–04/06, Singapore 079906
www.cambridge.org
Information on this title: www.cambridge.org/9781107136236
DOI: 10.1017/9781316477182
© George Jaroszkiewicz 2017
This publication is in copyright. Subject to statutory exception
and to the provisions of relevant collective licensing agreements,
no reproduction of any part may take place without the written
permission of Cambridge University Press.
First published 2017
Printed in the United Kingdom by Clays, St Ives plc
A catalogue record for this publication is available from the British Library.
Library of Congress Cataloging-in-Publication Data
Names: Jaroszkiewicz, George, author.
Title: Quantized detector networks : the theory of observation / George
Jaroszkiewicz (University of Nottingham (retired)).
Other titles: Cambridge monographs on mathematical physics.
Description: Cambridge, United Kingdom ; New York, NY : Cambridge University
Press, 2017. | Series: Cambridge monographs on mathematical physics
Identifiers: LCCN 2017028906 | ISBN 9781107136236 (hardback) |
ISBN 1107136237 (hardback)
Subjects: LCSH: Quantum theory.
Classification: LCC QC174.13 .J38 2017 | DDC 530.12–dc23 LC record
available at https://lccn.loc.gov/2017028906
ISBN 978-1-107-13623-6 Hardback
Cambridge University Press has no responsibility for the persistence or accuracy of
URLs for external or third-party internet websites referred to in this publication
and does not guarantee that any content on such websites is, or will remain,
accurate or appropriate.
Contents
Preface page xv
Acronyms xviii
1 Introduction 1
1.1 Motivation 1
1.2 Physics, Not Metaphysics 3
1.3 A Brief History of Quantum Interpretation 6
1.4 Plan of This Book 9
1.5 Guidelines for Reading This Book 10
1.6 Terminology and Conventions 11
3 Classical Bits 28
3.1 Binary Questions 28
3.2 Question Cardinality 28
3.3 Classical Binary Questions 29
3.4 Classical Bits 29
3.5 Signal Bits 30
3.6 Nodes 31
3.7 Dual Bits 32
3.8 The Interpretation of Bits and Their Duals 32
viii Contents
4 Quantum Bits 46
4.1 Quantum Bits 46
4.2 Preferred Bases 46
4.3 Qubit Properties 47
4.4 Qubit Operators 48
4.5 Signal Bit Operators 50
4.6 The Standard Born Interpretation 50
4.7 The Born Interpretation in QDN 53
4.8 Classical and Quantum Ensembles 53
4.9 Basis Transformations 54
4.10 The Preferred Basis Problem 55
4.11 Rank-One Qubit Evolution 57
4.12 Mixed Qubit States 58
4.13 Density Operators 58
11 Modules 148
11.1 Modules 148
11.2 The Vacuum 149
11.3 The Wollaston Prism 150
11.4 The Newtonian Prism 152
11.5 Nonpolarizing Beam Splitters 157
11.6 Mirrors 160
11.7 Phase Changers 160
11.8 Polarization Rotators 160
11.9 Null Modules 161
13 Interferometers 172
13.1 Introduction 172
13.2 The Mach–Zehnder Interferometer 172
13.3 Brandt’s Network 174
13.4 The Two-Photon Interferometer 176
Contents xi
16 Nonlocality 217
16.1 Introduction 217
16.2 Active and Passive Transformations 221
16.3 Local Operations 225
16.4 Primary and Secondary Observers 226
16.5 Subregister Bases 228
16.6 Local and Remote Evolution 229
16.7 The No-Communication Theorem 229
24 Oscillators 309
24.1 Introduction 309
24.2 The Classical Oscillator Register 310
24.3 Quantization 314
24.4 Bosonic Operators 315
24.5 Quantum Register Oscillator Operators 317
Contents xiii
26 Conclusions 343
Appendix 345
A.1 QDN Notation 345
A.2 Lab Time and Frame Fields 346
A.3 Lab Time and Stages 346
A.4 Ensembles 348
A.5 Vector Spaces 350
References 357
Index 365
Preface
1
I don’t imply there are other forms of reality. I can say nothing about that.
xvi Preface
CA computer algebra
CBR computational basis representation
CM classical mechanics
CPT charge, parity, and time
CST causal set theory
CTC closed timelike curve
DS double-slit
EPR Einstein, Podolsky, and Rosen
eQDN extended quantized detector networks
FLS Feynman, Leighton, and Sands
GP generalized proposition
GPC general proposition classification
GR general relativity
HV hidden variables
LG Leggett–Garg
LSZ Lehmann–Symanzik–Zimmermann
MDS monitored double-slit
MS Misra and Sudarshan
NEO null evolution operator
ONB orthonormal basis
POVM positive operator-valued measure
PVM projection-valued measure
QDN quantized detector network
QED quantum electrodynamics
QFT quantum field theory
QM quantum mechanics
REC relative external context
RIC relative internal context
RQFT relativistic quantum field theory
SBR signal basis representation
SG Stern–Gerlach
SR special relativity
SUO system under observation
ToE Theory of Everything
1
Introduction
1.1 Motivation
The aim of this book is to introduce, develop, and apply quantized detector
networks (QDN), an information-based, observer-centric approach to quantum
mechanics (QM). Six reasons motivating our development of QDN are the
following.
Strength in Diversity
There is no evidence for the view that there is, or even should be, a single, best
way to do QM. For instance, while we believe that quantum electrodynamics
(QED) can explain almost everything about the hydrogen atom, the Schrödinger
wave equation does a good enough job most of the time and far more econom-
ically. QDN is but another way to describe and formulate quantum physics; it
will have advantages in some contexts and disadvantages in others.
Logical Development
QDN is not intended to replace QM. On the contrary, QM works brilliantly and
it would be foolish to think we could do better. QDN is intended to enhance
QM in areas where little or no attention has been paid hitherto: the observer
and their apparatus. QDN seems to us to be a logical application of successful
principles that sooner or later would be developed further along the lines taken
in this book.
1
Not in every case, unfortunately.
1.2 Physics, Not Metaphysics 3
Human Conditioning
There is now a great deal of empirical evidence for the proposition that humans
see the world not as it is but how they have been conditioned to see it (Halligan
and Oakley, 2000; Kahneman, 2011). This conditioning occurs because of various
forces: evolutionary, genetic, cultural, linguistic, familial, political, and so on. CM
is based on ordinary, everyday-life human experience. QM was developed recently,
only just over a hundred years ago, and only once sufficient technology had been
developed that could reveal the deficiencies in our classical conditioning. We have
not caught up with QM in our modes of thinking, and it is not obvious that we
ever will. It remains a natural tendency for even the most experienced quantum
theorist to explain things in familiar terms, such as space, time, and particles.
QDN can be seen as an attempt to steer the theorist away from those conditioned
thoughts whenever it is wise to do so.
2
This may well change with advances in the neurosciences. Observers are fundamental to
physics, so how observers operate and fit into the grand scheme of things seems a
reasonable subject for investigation.
4 Introduction
is the only way that physics is done. Significant examples are the concepts of
SUO, observer, particle, atom, molecule, wave function, space, time, apparatus,
and so on. These are all mental constructs and essentially metaphysical in nature.
There is a paradox of sorts here involving the use of mathematics in physics. A
conventional, indeed perhaps ubiquitous, view about empirical physics is that the
empirically discovered laws of physics are independent of observers. Yet observers
are indispensable. Observers are needed first to formulate theories (which do not
exist except in the minds of theorists) and then to test those theories empirically.
It is a metaphysical proposition to assert or to believe the proposition that the
properties of objects “exist” independently of observers or observation. How
could we prove that proposition? We shall call this idea the realist interpretation
of physics, otherwise known as realism.
The paradox is that the realist interpretation has been remarkably successful in
classical physics, that is, the interpretation of phenomena according to the laws
of CM based on the Newtonian space-time paradigm or the Minkowski/general
relativistic spacetime paradigm. That success has led many physicists to believe
that this interpretation should apply to the whole of physics.
The paradox we refer to above has an additional twist: the classical realist
position simply does not work when applied to real physics experiments that go
beyond a certain basic level of sophistication. It just does not. The empirical
evidence against the classical realist position as the be-all and end-all is now
virtually unbreakable, although there will always be theorists who do not accept
that statement. We have in mind the Bell inequality-type experiments that are
discussed in Chapter 17. These have now established beyond much reasonable
doubt that classical intuition is deficient when it comes to quantum physics (by
which term we mean those experiments where quantum phases are significant).
Therefore, something has to give.
This book is an explanation of our interpretation of what quantum mechanics
really means, which is that it is a theory of observation. This involves the observer
as well as the observed.
The resolution of the above-mentioned paradox that led us to QDN was the
painful acceptance of the fact that physics is done by humans. This fact is painful
in that the current tendency in physics is to avoid reference to humans specifically,
and to marginalize observers as much as possible. There is a standard reason
given for this, often referred to as the principle of general covariance. This is the
assertion that the laws of physics are intrinsic and independent of observers.
The principle of general covariance is, when examined properly, a vacuous
proposition, but the paradox is that it works in CM, particularly in general
relativity (GR). Some theorists even employ it in their formulations of quantum
theory. Such efforts invariably end up as branches of mathematical metaphysics,
because QM is all about real world physics and experimentation. We think that
perhaps science has reached the point where observers should be factored much
more into the equations somehow. That agenda is not easy, and is what this book
1.2 Physics, Not Metaphysics 5
is all about. What helps greatly here is the hard fact that, logically, there can be
no empirical evidence for the realist position or for the universality of the principle
of general covariance, because the very notion of empirical evidence necessarily
involves human observers. You cannot have your cake and eat it. You cannot use
observation to prove that the laws of physics are independent of observation.
If there is one thing that QM teaches us, then, it is the uncomfortable lesson
that not all intuition stands up. Sometimes, particularly when it comes to quan-
tum processes, the natural human interpretation of what is believed to be going
on is inconsistent, or just meaningless.
Our considered view, then, is that QM is not a theory describing objects per se
but a theory of entitlement. QM provides us with a set of rules that tells us what
we as observers are entitled to say in any particular context, and no more. The
good news is that this theory of entitlement is self-consistent and has definite
mathematical rules that have never yet been found to fail.
We illustrate the sort of scientific philosophy that guides us in this book with
the following, scenario A. Suppose a hundred, a thousand, a million observers
had independently measured some quantifiable property about some SUO S and
had come up with the same numerical value to within experimental error. Then
it would be natural to assert that S “has” that property. That is something we
do all the time. We say that our house “has” four bedrooms, that our car “is”
white, that John “is” tall, and so on. The logic seems inescapable. The world
around us “has” properties that we discover.
But when it comes to quantum physics, there is now enough evidence that
tells us that we cannot always rely on the above assertion. We cannot exclude
the possibility that the very processes used to “see” those presumed properties
of SUOs create, in some way, those very properties. If we take this as a warning,
then the only thing that we would be entitled to say about scenario A with real
confidence is only what we could be sure of: that a hundred, a thousand, a million
observers had independently performed some procedure and obtained the same
number to within experimental error.
This may sound limited, cautious, lacking in vision, and so on. But the history
of empirical quantum science over the last several decades has shown unam-
biguously that if we made the realist assertion that SUOs “have” measurable
properties, then sooner or later we would come to an empirically observed break-
down of some prediction based on that assertion. The theorist Wheeler went so
far as to formulate this principle of entitlement in the form of what he called the
participatory principle:
Stronger than the anthropic principle is what I might call the participatory
principle. According to it we could not even imagine a universe that did not
somewhere and for some stretch of time contain observers because the very
building materials of the universe are these acts of observer-participancy.
6 Introduction
You wouldn’t have the stuff out of which to build the universe otherwise.
This participatory principle takes for its foundation the absolutely central
point of the quantum: No elementary phenomenon is a phenomenon until
it is an observed (or registered) phenomenon.
(Wheeler, 1979)
environment, and it is these models that were used by theorists such as Newton,
Lagrange, and Hamilton as the basis on which to construct CM. Four of these
critical facts are the following.
Daylight Is Bright
To the human eye, daylight is relatively bright. The human brain interprets
objects that it sees visually not in photonic, that is, discrete, terms but rather
as a continuous process. The net effect is to create the illusion that well-defined
objects or SUOs exist over extended time scales in continuous space and time.
These facts and others, such as the way that the human brain processes infor-
mation, all conspire to give the impression that objects under observation “have”
physical properties independent of any observer. Moreover, these properties can
be measured nondestructively and understood by the rules of classical mechanics.
For example, in Newton’s laws of motion, there is no mention of any apparatus
or observer, apart from the implication that these laws are stated relative to
inertial frames of reference.
After 1900
Up to 1900, virtually all physicists thought about space, time, matter, and the
way physics was to be done in the above classical terms. The advent of special
relativity (SR) in 1905 in no way changed this perspective: special relativity is
8 Introduction
all about the differences between observers and how they see things classically,
and not about those things themselves. Indeed, theorists such as FitzGerald
(FitzGerald, 1889), Larmor (Larmor, 1897), and Lorentz (Lorentz, 1899) had
been working out many of the details of SR well before 1900, basing their work
on the Newtonian space-time paradigm. Their work and that of Einstein in his
landmark 1905 paper on SR (Einstein, 1905b) had nothing to do with what
Planck started in 1900. So little, in fact, that bridging the gap between SR and
QM remains perhaps the greatest challenge in mathematical physics.
Up to 1900, Planck had been trying to understand the empirical data from
experiments on black body cavities, using Newtonian physical principles and
Maxwell’s theory of electromagnetism. That approach led to the prediction that
the intensity of light emitted from a black body would grow with frequency,
but the data indicated otherwise. In 1900, in order to understand the data, he
approached the problem in a new way, departing from conventional principles.
To understand properly what Planck did not say in 1900, we need to under-
stand what Einstein did say in 1905. In that year, Einstein explained the pho-
toelectric effect in terms of particles of light (Einstein, 1905a). These particles
subsequently became known as photons (Lewis, 1926). What seemed paradoxical,
and still does to this day, is that photons are regarded as particles associated with
Maxwell’s theory of electromagnetism, a classical theory that predicts that light
is a wave process and not a particulate process.
Planck did not assert that light itself was quantized. What he said was that the
black body data could be explained if the assumed atomic oscillators lining the
walls of a black body cavity could absorb and/or emit electromagnetic radiation
only in well-defined amounts, known as quanta:
E + E + E + · · · = E0
of the energy over the separate resonators of each group, first of all the
distribution of the energy E over the N resonators of frequency ν. If E
considered to be continuously divisible quantity, this distribution is possible
in infinitely many ways. We consider, however – this is the most essential
point of the whole calculation – E to be composed of a definite number of
equal parts and use thereto the constant of nature h = 6.55 × 10−27 erg·sec.
This constant multiplied by the common frequency ν of the resonators gives
us the energy element ε in erg, and dividing E by ε we get the number P of
energy elements which must be divided over the N resonators. If the ratio
is not an integer, we take for P an integer in the neighbourhood.
(Planck, 1900a)
Retractions
If f is a function that maps a set A into a set B, we denote the set of image
points in B by f (A). In this book we encounter situations where f (A) is a proper
subset of B, which means that there are elements in B that are not images of
any element in A. If f is an injection, which means that there is a one-to-one
correspondence between elements in A and elements in f (A), then the retraction
f of f is the well-defined function with domain f (A) that takes elements from
f (A) back into A, given by f (f (a)) = a for any element a of A. We shall use
retractions a lot in this book.
A f
B
a b
f(a)
f(A)
f
Figure 1.1. If f : A → B is an injection and f (A) is a proper subset of B,
then there are elements such as b in B for which the retraction f of f is not
defined.
12 Introduction
spaces. The first Hilbert space H will contain the system states, the imagined
states of the SUO, and for those we shall use the conventional and familiar Dirac
bra-ket notation |Ψ, with inner product Φ|Ψ, and so on. The second Hilbert
space, Q, will be a quantum register containing the labstates, the quantum states
of the apparatus. For these labstates we shall use bold notation j and their duals
will be denoted j. Labstate inner products are denoted ij, and so on. Total states
are elements of the total Hilbert space and denoted by |Ψ, i) ≡ |Ψ ⊗ i, noting
the round bracket on the left-hand side of this definition. Inner products of total
states are given by
The use of bold font for labstates and overlined bold font for their duals is
consistent with the question and answer formalism described in the next chapter.
2
Questions and Answers
Matters are never as simple as that. We will clarify presently what we mean
by observers, SUOs, and so on. At this point, we observe that the above answer
QI begs a number of questions that should have been asked and answered first.
Specifically, we should also ask the questions
A≡ Who is asking this question?
B≡ Why is this question being asked?
C≡ Who or what is being asked?
D≡ What will be done with the answer?
and perhaps more. It is the failure to ask questions A, B, and so on, that seems
to us to be the root cause of many if not all of the conceptual problems people
have in the interpretation of quantum mechanics (QM).
We could go further. The context, or circumstances, under which we ask our
questions is extremely important. For instance, physicists use experiments to
answer their empirical questions. The apparatus that they use is part of that
context: if the apparatus cannot be constructed, then the questions cannot even
be asked. When we look carefully at what we are doing when we do physics,
it soon becomes obvious that we have to take great care in how we ask our
questions.
We should say at this point that we do have a specific view about how questions
should be asked in physics, particularly quantum physics. This view is based
on a concept that we call contextual completeness, discussed in Section 2.12.
Contextual completeness is a measure, based on a simple algorithm, for deciding
whether any given assertion or proposition is a reasonable scientific proposition
or not. Specifically, we can establish whether it could be investigated according
to proper scientific protocols, or whether it is a vacuous proposition, meaning
that it cannot be empirically tested for a truth value. One of the surprising
features of our analysis is that it shows that there are several degrees of contextual
completeness, ranging from the completely vacuous (characteristic of metaphysics
and philosophy) to the fully contextually complete proposition required in QM.
Manifold Time
Manifold Time is the view that time is a “thing,” an objectivized parameter
modeled as a real geometrical dimension. An important development in Manifold
Time occurred in 1908, when Minkowski proposed that time could be mod-
eled geometrically, as one of the dimensions of spacetime, the four-dimensional
Lorentz-signature manifold of special relativity (SR), in which all past, present,
and future events are contained (Minkowski, 1908; Petkov, 2012). This model of
the universe is often referred to as the Block Universe (Price, 1997).
There are several significant factors commonly used by physicists to support
Manifold Time and the Block Universe concepts. Some of these are the following.
Relativity
Manifestly Block Universe theories such as SR and general relativity (GR) are
based on spacetime manifolds with Lorentzian signature metrics. These are dis-
cussed in the Appendix, Section A.2.
Reversibility
The laws of Newtonian classical mechanics excluding friction are generally time-
reversible. Lines in geometry do not carry any intrinsic direction. Therefore, if
time is reversible, it may be possible to model it geometrically.
Unitary Evolution
In QM, states of SUOs evolving without observational intervention are described
by Schrödinger unitary evolution, and this is regarded as reversible.
CPT Theorem
In high energy particle physics, the well-known charge, parity, time (CPT) rever-
sal theorem involves symmetry operations in spacetime that are consistent with
all experiments to date, particularly particle scattering amplitudes, which have
empirically confirmed crossing symmetries (Eden et al., 1966).
Differential Equations
Many differential equations in mathematical physics, such as Maxwell’s equations
for electrodynamics and Einstein’s field equations in GR, contain no dissipative
terms, leading to both advanced and retarded solutions to the field equations
with no natural (to the theory) reason to eliminate the classically unacceptable
advanced solutions.
Process Time
In contrast to Manifold Time, Process Time (Encyclopaedia Britannica, 2000)
is based on an acknowledgment that there are irreversible processes going on in
the Universe and that humans experience a feeling that there is a “moment of
16 Questions and Answers
the now.” Time is no longer viewed as a thing but as a process. This implies
the existence (in this context) of the systems or objects that are processing in
time. According to QDN, the observers of those processes are fundamental to
this discussion as well. Factors that we take here as supporting the Process Time
paradigm are the following.
Irreversibility
Thermodynamics, one of the fundamental branches of physics, has irreversibility
built into it.
Observers
Observers have memory and intention, which are asymmetric regarding the direc-
tion of time. Observers operate contextually in an irreversible way, acquiring
information as time progresses.
Hubble Expansion
The Universe appears to be expanding in an irreversible way from a definite
point in the past. The Block Universe model has no empirical justification for
postulating events earlier than that point. Indeed, the Steady State paradigm
(Bondi and Gold, 1948; Hoyle, 1948) that postulated an indefinite past has been
generally abandoned by cosmologists.
Contextual Incompleteness
The Block Universe paradigm is contextually incomplete with a generalized
propositional classification of zero (contextual completeness and generalized
propositional classification are discussed further on in this chapter). It is a
vacuous, metaphysical concept.
Emergence, on the other hand, is an acceptance that there are situations that
transcend reductionism. For example, life is a process, not a collection of atoms;
there is no mention of life in the standard model of high energy physics.
Situations in physics that fall out of the scope of reductionism and require
emergent thinking generally involve observers and large-scale processes typically
involving numbers of particles on the scale of Avogadro’s number and more, that
is, of the order 1026 . Examples in the physical sciences that require emergent
concepts are thermodynamics, gravitation, cosmology, and quantum physics. We
take the view in this book that it is a fundamental category error to believe that
reductionism alone can explain any of those concepts.
The question remains as to the status of reductionism in QDN. Does the
relatively successful reductionist Standard Model of particle physics have a place
in QDN?
Our answer is emphatically yes. Reductionism has a specific role in those
aspects of observation that do not involve apparatus explicitly, and that is in the
information void , the conceptual region between state preparation and outcome
detection. We shall be greatly preoccupied with the information void, for it is in
that region where the idealized Lagrangians of particle physics can be explored
and used to work out the quantum signal transition amplitudes that QDN needs.
Indeed, QDN has no way by itself of generating those amplitudes. In this respect,
QDN is analogous to scattering matrix (S-Matrix) theory1 in hadronic physics
(Eden et al., 1966), which deals with the architecture of scattering processes
but cannot account for the explicit details of amplitudes without introducing
Lagrangians.
An important and useful point here is that, by definition, we cannot “know”
what the information void is. We can only make models for the transmission of
information from preparation device to outcome detector. This is useful because it
means we are free in our choice of the models we use to calculate the transmission
amplitudes from source to detector across information voids. These models need
not be of the standard Lorentzian signature, spacetime manifold type that are
used in relativistic quantum field theory. They could be more complicated, such
as Snyder’s quantized spacetime algebra (Snyder, 1947a,b), or even manifesta-
tions of superstrings. At the end of the day, such models simply provide “black
box” recipes for the calculation of amplitudes that will have various degrees of
empirical correctness but should never be regarded as reflecting absolute truth
in any way. Of course, some models will be much better than others.
1
S stands for scattering.
18 Questions and Answers
2
By this we mean whether we have decided to use classical mechanics (CM) or QM in our
discussion of the experiment.
2.6 Answer Set Collapse 19
Null Tests
The question Q1 could be completely nondestructive, giving an answer yet leaving
the original state unaltered. This scenario is assumed throughout CM, classical
questions generally being assumed completely nondestructive. In standard QM
this scenario is called a null test (Peres, 1995), being described by a state that
is an eigenstate of the observable concerned. In his famous book on QM, Dirac
discussed this scenario in terms of polarized light passing through two or more
polarizing crystals (Dirac, 1958). If an unpolarized monochromatic beam of light
is passed through a polarizing crystal, two polarized beams of light are observed
to emerge. If either of these beams is then passed through an identical crystal
with the same orientation as the first, that beam then goes through the second
crystal without further splitting.
A spectacular example of a null test was discussed by Newton in his astonishing
book Opticks published in 1704 (Newton, 1704). Newton showed that if a beam of
sunlight were passed through a prism P 1 , then it would be split into a set of sub-
beams of different colors, defining a spectrum. If that spectrum was then carefully
focussed onto a second prism P 2 , then P 2 would undo the action of P 1 , resulting
in a reconstituted beam emerging from P 2 that had the same spectral properties
as the initial incident beam. In essence, the combination P 1 followed by P 2 acts
as a null test on the original incident beam. This experiment is discussed in detail
in Section 11.4.
Partial Destruction
It could be the case that Q1 is partially destructive, also known as a measurement
of the first kind (Paris, 2012). After the answer Q1 Ψ has been obtained, the SUO
is then in some altered state Ψ different from the original state Ψ. Any attempt
now by the observer to ask question Q2 would then be extracting not the answer
Q2 Ψ but the answer Q2 Ψ .
Total Destruction
It could be the case that Q1 is totally destructive, also known as a measurement of
the second kind , or demolitive measurement (Paris, 2012). The process of asking
Q1 totally destroys the original state Ψ, so that Q2 could not even be asked.
In the real world, human observers get a lot of information about objects
around them optically, which is by and large a weakly destructive process that
gives the impression of being totally nondestructive. This has led to the classical
conditioning that regards the ordering of questions as not significant. In contrast,
QM is based on the empirical evidence that all observations that extract genuine
information are never nondestructive, Planck’s constant being a manifestation of
that fact. The temporal ordering of questions is significant in QM and is generally
2.7 Incompatible Questions and Category Errors 21
empirically in the case of two-path interferometry (Afshar, 2005), but the general
consensus at this time is that they have not succeeded (Jacques et al., 2008).3
Suppose a detector does exist in a laboratory. It could be faulty, so that any
attempt to detect a signal would be doomed to failure. Another possibility is that
a detector exists and has registered a signal that the observer knows about (such
as in a teleportation experiment), and then that detector has been deliberately
decommissioned in some way by the observer so as to prevent further signals
being generated. Any further attempts to detect a new signal automatically then
involve a incompatible question.
Finally, even if a detector exists and is working properly, its compatibility will
be contextual on the states being asked. For instance, a photon detector will not
detect neutrinos.
There is a class of questions that appear to be nominally physically based,
but which have an empty answer set under all known empirical circumstances.
This means that there is currently no way known of answering such questions
empirically. For example, the question Is time travel possible? is one such
question. We just don’t know the answer. We may refer to such questions as
speculative. There is a place for them in science, in that they can stimulate
reasonable discussion.
A more disturbing class of question is associated with vacuous theories, which
are theories such that none of their critical propositions can be tested by any
known practical means.4 Such theories are often designated as not even wrong
(Woit, 2006). Examples currently are the Multiverse paradigm, quantum gravity,
and string theory. What is disturbing is not so much that they cannot be tested,
but that some of their advocates claim that they represent real physics simply
because their apparent mathematical beauty (a subjective opinion) is regarded
by their authors as more important than a total lack of critical empirical evidence
at this time (2017) (Woit, 2006). Vacuous theories may well be consistent with
all currently known empirically established facts, but that is not enough to
validate them. A common characteristic is that although they have a generalized
proposition classification (GPC, discussed in Section 2.12) of zero or at best one,
many of their supporters claim that they represent valid quantum physics (which
requires a GPC of three). We disagree. They are speculations and we classify such
theories as mathematical metaphysics.
2.8 Propositions
In this book, a proposition is defined as any string of symbols that may be of
interest to an observer. For example, P 1 ≡ Today is Tuesday, P 2 ≡ XY Z,
3
We are not at all critical of Afshar’s attempts per se. Our view is that such attempts should
always be encouraged. That is what science is all about. There should always be someone
trying to upset the standard paradigms.
4
A critical proposition would be a specific claim made by that theory alone and no other,
that would, if confirmed, establish the scientific worth of that theory.
2.9 Negation, Context, Validation 23
5
The explanation that humans do physics in order to understand the universe does not
explain why they find themselves needing to do this. No other species seems to have this
ambition.
24 Questions and Answers
yes and a probability of no (Jaynes, 2003). Another example is QM, where the
answers are complex-valued amplitudes.
This last point is critical in QM. The rules of logic in QM have to be carefully
respected because they are not quite the same as those of CM. The reason has
to do with contextuality, which is not factored into classical logic in general. In
particular, we should not make the following error of physics. Suppose that P is
a physical proposition and we have constructed an apparatus A that allows us
to conclude that P is true. Then we may write
V(P , A) = 1. (2.3)
In other words, we have factored into our notation the context (apparatus A)
that is needed to empirically confirm the truth of P . In words, we would express
(2.3) as the statement that proposition P is true relative to context A.
because the apparatus A that allows us to test for the truth of P might be com-
pletely useless to test for the truth of ¬P . For instance, suppose P is the propo-
sition Supersymmetric particles exist. To date (2017), experiments at the
Large Hadron Collider have found no sign of such things. That does not imply the
truth of the negation of P , which is ¬P ≡ Supersymmetric particles do not
exist. In order to prove empirically that such particles did not exist, we would
have to find apparatus A such that V(¬P , A ) = 1. It is most unlikely that such
apparatus could ever be constructed, because proof of a negation is very often
impossible (we cannot prove that fairies do not exist, for instance).
2.12 Generalized Propositions and Their Classification 25
We conclude that the rule (2.2) that holds in classical two-valued logical cannot
always be replaced in physics by the rule
observer and the wider environment/universe in which they are performing their
experiments.
A GP addresses the issues we raised in Section 2.1: ideally, any GP gives
information about who is making the proposition, what that proposition is,
and how the truth value of that proposition can be determined. This raises the
following two points.
Identification
We should always ask the question:
For whom is this GP relevant?
The answer is: for the observer involved and not necessarily for anyone or any-
thing else. It is, after all, an essential requirement of observation that an observer
understands their own relationship to their environment. Even in CM, frames of
reference have to be identified. No experiment makes sense if any aspect of the
GP concerned is unknown to the observer concerned.
Missing Context
What happens if some or all of the contextual information in a GP is missing?
We address this fundamental point now.
Definition 2.4 Given a GP P ≡ (P, Cint |O, Cext ), its generalized proposition
classification (GPC) #P is given by
#P = α + 2β, (2.8)
where
0, Cint = ∅ 0, Cext = ∅
α= β= (2.9)
1, Cint = ∅, 1, Cext = ∅,
where the empty set symbol ∅ denotes missing context.
Metaphysical Propositions
These are completely contextually incomplete GPs of the form
(P , ∅|∅, ∅), (2.10)
2.12 Generalized Propositions and Their Classification 27
Mathematical Propositions
These are partially contextually complete GPs of the form
Classical Propositions
These are partially contextually complete GPs of the form
Complete Propositions
These are contextually complete GPs of the form
of the atoms had landed. It was the observers’ decision to interpret these spots
as of empirical significance. Stern and Gerlach did not know that their observed
spots were a marker of electron spin. In 1922, electron spin was unknown; Stern
and Gerlach were trying to validate Bohr’s Old Quantum Mechanics theory of
the atom. The electron spin interpretation came into focus a few years later.
Classical bits are sets, not vector spaces, because there is no obvious math-
ematical or physical meaning to the multiplication of a bit state by a real or
complex number, or to the addition of two bit states. This is no longer the case
when we generalize bits to their probabilistic counterparts, where they are known
as stochastic bits (or s-bits) and to their quantum counterparts, where they are
known as quantum bits (or qubits). Nevertheless, it is useful and convenient
to represent bits via two-dimensional complex vector spaces. This allows us to
represent bit states as vectors and bit operators as matrices, but it should be
kept in mind that classical bit states cannot be added in principle. An exception
is in Boolean algebra, where rules such as 1 + 1 = 0 have a contextual meaning.
3.6 Nodes
QDN analyzes experiments in the simplest form possible, which is in terms of
binary questions and answers. We will show in later chapters how any given
apparatus is represented in QDN as a collection of binary questions and answers
forming a stage network , a collection of nodes connected by links across which
quantum information is transmitted.
Nodes come in two forms, external and internal .
External Nodes
External nodes correspond to physically existing equipment and come in two
varieties: sources (preparation devices) and detectors. Observers input contextual
information into stage networks via sources and extract signal information via
detectors.
Internal Nodes
These occur in the information void , the region of space and time where no
information is extracted. Internal nodes can be thought of as virtual detectors,
as they usually correspond to places in a network where a real detector could
have been placed, if the observer had so chosen.
Example 3.2 In the double-slit experiment, the two slits are identified with
internal nodes, while the source of the incoming beam and the detecting screen
are identified with external nodes.
While it will always be easy to identify external nodes in any stage diagram,
that will not always be the case for the internal nodes. The rule for assigning
internal nodes is that any such node represents a potential opportunity for
information extraction, if the observer so chooses. So for instance in the double-
slit experiment example considered above, not only could the observer place
detectors at any of the two slits, but the observer could conceivably fill the space
between source and screen with a vast number of internal nodes, if that was
needed. In the limit of extremely large numbers, the QDN description would begin
to look more like quantum field theory. The art in QDN is to find the simplest
possible description of an experiment in terms of a limited number of nodes.
form ij, and then the answer is given by δ ij . If δ ij = 0 (the number zero), then
that means that the answer is no, whereas if δ ij = 1 (the number one), then that
means that the answer is yes. We shall refer to each side of expressions such as
(3.1) as a classical answer .
where = denotes “is represented by.” For the dual bits, we have the row matrix
R
representation
0 = 1 0 , 1= 0 1 . (3.4)
R R
These four matrices form a basis for the four-dimensional vector space of complex
2 × 2 matrices, so we can use them to construct other useful matrices. There are
four bit operators, labelled I, F , D, and U here, that are occasionally useful.
They are defined and represented as follows.
The four operators I, F , U , and D will be used in bit state dynamics, and
then it will be convenient to define O 1 ≡ I, O 2 ≡ F , O 3 ≡ D, and O 4 ≡ U .
3.11 Labstates
Our objective in this book is to interpret quantum mechanics via signal states
of apparatus instead of states of systems under observation (SUOs). In order to
keep this in mind, we shall use the term labstate whenever we refer to the former,
reserving the term system state for the latter.
3.12 Time and the Stages Concept 35
In this book, we shall deal with three kinds of labstate associated with a
single detector: classical labstates, stochastic labstates, and quantum labstates.
Classical and stochastic labstates are discussed in this chapter, quantum labstates
are discussed in the next chapter. Each form of labstate has its own dynamical
evolution rules, which we shall discuss separately in some detail. Because only
one detector is involved in these preliminary discussions, such dynamics will
be referred to as rank one. If two detectors were involved, then we would be
discussing rank two dynamics, and so on.
Given a single detector, the observer would find it either in its signal ground
labstate 0 or in its signal labstate 1, assuming the detector existed, was not
faulty, and that the observer actually looked.
Quantum state preparation occurs at the source nodes and generally takes
place over some contextually “small” or negligible interval of labtime (time as
measured in the laboratory). States then evolve undisturbed over temporal links
and are detected by the observer at the detectors, again over contextually “small”
or negligible intervals of labtime.
The power of QM in general is that detailed modeling of what actually goes
on at the nodes seems to be less significant than the detailed modeling of the
transition amplitudes evolving over temporal links. For example, Feynman dia-
grams are used in relativistic quantum field theory to calculate those amplitudes,
with virtually no modeling of the detection equipment that would be needed in
practice.
This view of quantum processes was taken to an extreme with the development
of the Multiverse paradigm (Deutsch, 1997). In that paradigm, only evolution
of the wave function for the Multiverse in the information void is asserted to be
significant. But because real, empirical information is extracted in the laboratory
at nodes only, it should not come as a surprise that the Multiverse concept turns
out to be empirically vacuous. So where does QDN stand in relation to these
nodes and links?
QDN is an attempt to investigate the nodal aspect of quantum physics more
than has been hitherto the case. As with scattering matrix (S-Matrix) theory
(Eden et al., 1966), QDN provides a framework for discussing the spatiotemporal
architecture of observation but does not provide the dynamical details of the
amplitudes involved.
Our aim in this book is to discuss a multi-detector approach to quantum
physics. By this we mean to discuss real experiments that involve perhaps many
preparation channels; large numbers of modules such as beam splitters, mirrors,
and suchlike; and batteries of outcome detectors. In such circumstances, we
naturally find ourselves encountering the dictates of special and general rela-
tivity (GR). To date there has been no empirical evidence that the principles
of relativity and of quantum mechanics are incompatible. There is in practice
“peaceful co-existence” between GR and QM, and that has to be respected in
QDN.
The grouping of nodes into sets called stages reflects classical causality, the
notion that an event can influence some events dynamically but not others. In
relativity, events outside each other’s light cones cannot be causally related. The
analogous concept in QDN is that nodes in the same stage cannot transmit or
receive quantum information from each other.
3.12 Time and the Stages Concept 37
There is a subtlety here, however, to do with shielding. This needs some expla-
nation. Consider two detectors, A and B, that are, in relativistic parlance, time-
like separated. This means that one of the detectors, say B, is inside the forward
light cone of A, viewed from the conventional relativistic perspective. Therefore,
by standard causal physics, B could in principle be affected dynamically by
whatever was done at A. But suppose B was shielded in some way from any
dynamical effects from A (such as being placed inside a Faraday cage, in the
case of electromagnetic interactions). Then for all practical purposes, we could
regard A and B as if they were dynamically independent. It would not then
be inconsistent to assign them to the same QDN stage, even though they were
not on any hypersurface of simultaneity in physical spacetime. On this basis the
QDN definition of simultaneity is contextual.
S W B
Figure 3.1. A typical stage diagram. Dotted lines represent individual stages,
labeled by subscripts, the nth stage being denoted Σn . Circles represent nodes,
where information either enters or leaves the network, or passes on to other
nodes. Shaded circles represent actual outcome detectors. Boxes represent
modules such as the source (S), a Wollaston prism (W ), a phase-changer (φ),
a mirror (M ), and a beam splitter (B). Solid lines represent transmission in
the information void between nodes and modules.
Intervention
An intervention is a single act of information extraction from a set of detectors
at a single stage.
Run
A run is the complete process involved in a given sequence of actions, which
starts with initial labstate preparation and ends with final labstate detection.
3.15 Transtemporal Identity 39
Experiment
An experiment consists of a given number of runs, each following an identical
protocol, or experimental procedure.
Runtime
Runtime is time required to perform a given run, as measured by the observer’s
laboratory clocks.
Measurement
A measurement is the statistical result of data accumulated over one or more
runs of a given experiment.
justify giving them their particular identities. This is related to the phenomenon
of persistence, which depends on the time scales over which objects can be taken
reliably to have significance.
Such an approach is taken in conventional classical mechanics (CM), where
SUOs are represented by points moving about phase space, and in QM, where
state vectors move about in Hilbert space.
The alternative to this is to imagine, like Heraclitus, that everything changes,
nothing persists. According to this view, objects such as SUOs and apparatus
only appear to persist because sufficient patterns of mass and energy are repeated
sufficiently unchanged over certain time scales as judged by some observer, and it
is this that gives that observer the impression that “objects” exist in the universe.
We shall adopt the Heraclitian point of view, because we need to consider the
possibility that observers and apparatus can be created and destroyed. This will
mean changing, over time, the dimensions of the mathematical spaces used to
model processes of observation.
Therefore, when we discuss a rank-one labstate evolving from stage ΣM to
stage ΣN , we shall think of it as a succession of detectors, denoted, say, by
ΔM , ΔM +1 , . . . , ΔN . Each detector Δn is associated with a classical bit Bn and
its dual B n , and that detector exists only at stage Σn . But if persistence is
assumed, then the whole set {Δi : i = M, M + 1, . . . , N } of detectors may be
thought of as a single detector with an enduring, transtemporal identity existing
over the time interval [tM , tN ].
It is important not to mix different theoretical spaces associated with different
times. While we can ask the question in j n for M ≤ n ≤ N , we are not allowed
to ask the question in j m for n = m. The reason is obvious when stated in words:
we cannot observe today what does not yet exist or what used to exist. All actual
observations are done in process time. Past and future are inferred from the data
so acquired: archaeologists do not dig up the past – they dig up traces of the
past embedded in the present. Improper questions such as in j m , n = m, are not
defined mathematically either, in the same way that dual vectors (one-forms) are
defined only by their action on their individual, associated vector space.
1
1
O kMM+1,M ≡ aM +1 OkabM bM (3.14)
a=0 b=0
and OkabM are the ab components of the corresponding classical bit matrix. This
operator takes us from the two-dimensional vector space QM in which we have
embedded 0M and 1M into the two-dimensional vector space QM +1 in which we
have embedded 0M +1 and 1M +1 .
This process can be continued. Specifically, for any time n such that M ≤ n <
N we may write
in → in+1 ≡ O kn+1,n
n
in , (3.15)
from which we find
iM → iN ≡ O N,M iM , (3.16)
where the complete evolution operator O N,M is given by
O N,M ≡ O N,N −1 O N −1,N −2 . . . O M +1,M
1 1
(3.17)
= aN O kN −1 O kN −2 . . . O kM ab bM .
a=0 b=0
If now at time N, the observer stepped in and looked at the detector, they would
normally ask the question 1N ≡ Is there a signal here? The answer is given
by 1N iN , where a value one represents an answer yes while a value zero represents
an answer no.
Each set of integers {kM , kM +1 , . . . , kN −1 } in (3.17) represents a specific “oper-
ator chain” of labstate dynamical changes. Each of the integers kn can be 1, 2, 3,
or 4, so there is in total 4N −M different operator chains. However, because the
multiplication of the operator matrices is closed, the net result is that there are
only four possible overall complete evolution operators, given by
1
1
O kN,M ≡ k
aN Oab bM , k = 1, 2, 3, 4. (3.18)
a=0 b=0
but not the initial labstate, they could not say for sure what that initial state
was. Retrodiction is therefore impossible if a D or U transition occurs even
once.
The mathematical reason why the occurrence of just a single D or U transition
generates irreversibility is that each of these maps is two-to-one, and so their
matrix representations are singular.
Now suppose that the observer wanted to know if there was a signal in the
detector. They need to ask the question
1 ≡ Is the detector in its signal state? (3.20)
In our formalism, the answer is given by
p
1Ψ = 0 1 = q, (3.21)
R q
where we interpret the right-hand side as a number. In this case, the answer is
not zero or one as in the classical case but interpreted as the probability of there
44 Classical Bits
where S ij
n are the components of the left-stochastic matrix Sn , where
an bn
Sn ≡ . (3.26)
1 − an 1 − bn
Then Ψn+1 is also a stochastic labstate. The determinant |Sn | of the stochastic
matrix Sn in (3.26) is given by |Sn | = an − bn , which means that stochastic
evolution is irreversible for an = bn .
3.21 Stochastic Questions 45
Exercise 3.5 Prove that the product of any two left-stochastic matrices
is also a left-stochastic matrix. Prove also that the inverse of a nonsingular
left-stochastic matrix is also a left-stochastic matrix.
ij = δ ij , i, j = 0, 1, (4.1)
4.3 Qubit Properties 47
where 0 and 1 are the duals of 0 and 1, respectively, and δ ij is the Kronecker
delta. Then the relationship between the bit B ≡ {0, 1} and its quantum coun-
terpart Q is this: the elements of B are identified as the natural orthonormal
basis for Q, known as the preferred basis.
The existence of the preferred basis is fundamental to our approach to physics
and is in no way controversial. We do not claim that there is a unique or absolute
preferred frame in the Universe. Any preferred frame is preferred only by virtue
of the relative context associated with a given observer. By definition of what
is meant by “observer,” each observer is assumed always to know the empirical
context associated with each signal qubit that they use, being nothing other than
a mathematical representation of some detector in the observer’s laboratory. We
can be confident that there is always going to be such a local preferred basis,
because the idea that an observer could extract real information in an experiment
with no knowledge about their apparatus makes no sense whatsoever.
By definition, each detector has only two possible outcome states, known as
ground state and signal state. This contextually defines the preferred basis: 0
represents the ground (no signal) state of the apparatus and 1 represents the
signal state of the apparatus.
We emphasize the following point. There runs throughout QM a strand of
thinking, conditioned by experience with CM, that states of SUO have some sort
of existence of their own. According to this logic, such states do not need any
preferred bases for their mathematical representation. This line of thinking then
leads to the notion that observers are not needed either.
It is along such realist lines of thinking that Hidden Variables theory (Bohm,
1952), decoherence (in its original form) (Joos, 2012), and the Multiverse (Many
Worlds) (Deutsch, 1997) are based. The problem with those interpretations of
QM is that they are each contextually incomplete with a generalized proposition
classification1 of zero, meaning that those theories are empirically vacuous.
1
Generalized propositions and their classification is discussed in Section 2.12.
48 Quantum Bits
of a stochastic bit state such as (3.19) are nonnegative real numbers representing
conditional probabilities. Because the components of a quantum bit state are
not even real, let alone nonnegative, we encounter here the first of several issues
in the interpretation of QM. This particular issue is generally regarded as being
resolved by Born’s interpretation of the above complex components as probability
amplitudes (Born, 1926), discussed in Section 4.6.
If we choose to use the matrix representation of bits (3.3), then we may write
α
Ψ= . (4.3)
R β
Ψ ≡ α0 + β1 = α∗ 0 + β ∗ 1, (4.4)
where α∗ and β ∗ are the complex conjugates of α and β, respectively. Then the
“inner product” ΨΨ is given, using linearity and (4.1), by
A normalized signal qubit state is one for which ΨΨ = 1, that is, |α|2 + |β|2 = 1.
We shall deal extensively with normalized signal qubit states, as these are asso-
ciated with probability conservation in QM.
Normalized signal qubit states have been defined in terms of their components
relative to the preferred basis. However, we can discuss them more generally as
qubit states, that is, drop the observational context and think of them as just
elements of some qubit. A qubit is a Hilbert space, a concept that is independent
of basis and therefore does not require a preferred basis. We can discuss qubits
in this context in a more abstract way as follows.
Given any element Φ of a Hilbert space, then by definition it has a norm Φ
or length given by
Φ ≡ (Φ, Φ), ≥ 0, (4.6)
where (Φ, Φ) is the inner product of Φ with itself. This length is basis
independent.
A normalized qubit state therefore is an element of a complex two-dimensional
Hilbert space and has unit norm.
We define a qubit map to be any map from a qubit into itself. Specifically,
given such a map M , then for any element Ψ of qubit Q, the object M (Ψ) is
some element of Q. For example, the identity map I satisfies the rule I(Ψ) = Ψ
for any element Ψ of Q.
This definition of qubit map makes no reference to linearity, so a qubit map
need not be linear, as in the following example.
Example 4.1 Given a qubit Q with preferred basis {0, 1}, define the qubit
map M by M (Ψ) = 0 for every element Ψ of Q. Then M is a nonlinear map,
as, for example, we have M (0 + 1) = 0 but M (0) + M (1) = 0 + 0 = 20.
The map in the above example is not unphysical. It has the interpretation of
a resetting or preparation process that prepares a detector to be in its ground
state, ready to receive a signal, regardless of the state it is currently in. We shall
use such a process in later chapters.
Because we are concerned in this book with quantum processes, almost all
of the qubit maps we shall deal with will be linear. Linear qubit maps will be
called qubit operators. It is conventional in the case of linear operators to drop
the round brackets of the argument; that is, we shall write OΨ to mean O(Ψ)
whenever O is a linear operator. Then for linear operator O, for any elements
Ψ, Φ of Q, and for any complex numbers α, β, we have the rule
O{αΨ + βΦ} = αOΨ + βOΦ. (4.7)
In QM, linear operators are often associated with dynamical variables and so
additional mathematical structure is introduced. Given two qubit operators O 1
and O 2 over a qubit Q, we define the linear combination αO 1 + βO 2 of these
two operators to satisfy the rule
(αO 1 + βO 2 )Ψ ≡ (αO 1 Ψ) + (βO 2 Ψ), (4.8)
for arbitrary complex numbers α, β, and arbitrary elements Ψ of Q.2 Then we
state without proof that the set L(Q) of all qubit operators over a qubit Q has
all the properties of a four-dimensional complex Hilbert space (Paris, 2012).
The structure of the vector space L(Q) is relatively easily explored. We saw
in the previous chapter that given the preferred basis {0, 1} for Q, the four
elementary transition operators (ETOs) T ij ≡ ij : i, j = 0, 1, form a convenient
basis for L(Q), that is, any qubit operator O can be written in the form
1
1
O= Oij T ij , (4.9)
i=0 j=0
2
Note that the + symbol on the left-hand side of (4.8) denotes operator addition, while the
+ symbol on the right-hand side denotes vector addition.
50 Quantum Bits
, A, and A
The four operators P , P are by inspection just the four ETOs T ij
introduced earlier. The advantage in this new designation is mainly psychological:
the projection operators play one role in our formalism while the signal operators
play another, and it is very helpful to distinguish between them.
In this new notation, the multiplication rule (4.10) is best expressed in the
form of a table, Table 4.1, where the entries are the products LR, operator L
coming from the left-most column and R coming from the top-most row.
L\R P
P A
A
P P Z A Z
P Z
P Z
A
A Z A Z P
A
A Z
P Z
far as physics is concerned. The components of an s-bit are real and interpreted
as conditional probabilities, whereas the components of a qubit relative to its
preferred basis are complex amplitudes and so cannot be probabilities. It was Max
Born who gave the empirically correct interpretation of such complex amplitudes
(Born, 1926), as follows.
Symmetry
There is an inherent symmetry in the relationship between the two states in rule
(4.14), referred to as the microscopic reversibility of quantum processes (Bohm,
1952). This is because of the mathematical equality |(Ψ, Φ)| = |(Φ, Ψ)|, which
immediately leads to the physical prediction P r(Ψ|Φ) = P r(Φ|Ψ). This relation
has been confirmed empirically countless times.
52 Quantum Bits
Complex Amplitudes
The inner product (Ψ, Φ) is complex, which means it does not have the classical
interpretation of a probability. Physicists get around this by referring to such
expressions as “the amplitude for Φ to go to Ψ,” or similar terminology, thereby
endowing it with a touch of familiarity. The fact is, however, no one understands
precisely why complex amplitudes occur in QM and it remains one of the enduring
mysteries of the subject. Schwinger suggested that the appearance of complex
numbers in QM is associated with the existence of antiparticles (Schwinger,
1958). Other physicists have investigated the theoretical and empirical possibility
of replacing complex amplitudes in QM with hypercomplex (or quaternionic)
amplitudes (Adler, 1995; Procopio et al., 2016; Adler, 2016), but there is at this
time no empirical evidence that such a step is necessary.
where IABC represents some novel third-order interference term not predicted
by the above Born rule (Sorkin, 1994). A recent experiment has virtually ruled
out such a term (Sinha et al., 2010).
Sorkin’s motivation in this discussion is directly opposite to ours in this book:
he explicitly states that he does not want to base the interpretation of his
generalized probabilities with “some undefined concept of ‘measurement made by
human observers,”’ and takes “the attitude that the ontology of QM is identical
to that of classical realism” (Sorkin, 1994).
Dynamics
The Born rule is usually applied to states of SUOs evolving in time. In such
a case, care has to be taken with the rules for the conservation of probability.
These rules are as contextual as anything else in an experiment. For instance,
if a state Φi is prepared at initial time ti , allowed to evolve undisturbed until
4.8 Classical and Quantum Ensembles 53
time tf , then the amplitude to find the system under observation in state Ψf at
final time is given by (Ψf , Uf i Φi ), where Uf i is the unitary evolution operator
taking states from initial time to final time. In this scenario, total probability is
conserved. On the other hand, particle decay experiments may appear to involve
a loss of total probability, if the mathematical modeling is done in too basic a
fashion. For example, the Schrödinger wavefunction for a decaying particle SUO
is frequently asserted to be given by a function of the form
Ψ(t, x) e−i(E−iΓ)t/ Φ(x), (4.17)
where Γ is a real constant related to the so-called half-life of the particle. We can
discuss such a scenario in QDN; the QDN approach to particle decay experiments
is covered in Chapter 15.
We return now to the fact that the definition of a Hilbert space is basis
independent. Let us explore this further. Ignoring physical context and looking
at a qubit strictly as a mathematical vector space, we are entitled to change our
basis from the preferred basis. Consider therefore replacing each element i in our
original preferred basis with some new vector i , as follows. First, we note that our
preferred basis B is orthonormal, i.e., ij = δ ij , i, j = 0, 1. Anticipating physical
applications in later chapters, we shall preserve this relationship. Therefore,
we shall require i j = δ ij , i, j = 0, 1. For a complex Hilbert space, such
transformations are called unitary.
Given our initial basis B ≡ {0, 1}, consider a unitary transformation B →
B ≡ {0 , 1 } such that i j = δij , for 0 ≤ i, j ≤ 1. For such a transformation we
may write
1
i → i ≡ U i = j U ji , (4.21)
j=0
3
Everett does explicitly postulate an absolute wave function for the Universe, something
QDN cannot accept, because such a postulate has a GPC of zero.
4.11 Rank-One Qubit Evolution 57
basis, relative to A, B and C are systems under observation and not observers.
QDN is fully compatible with that idea, but regards the Multiverse concept as
anathema.
This operator is one that not only preserves the norm but also preserves inner
products under the transformation from one Hilbert space to the next. Such
a transformation will be called a semi-unitary transformation, rather than a
unitary transformation, for the good reason that in a more general context, the
dimensions of the Hilbert spaces need not be the same. When the dimensions
of the Hilbert spaces are different, then we need to consider the retraction of
an evolution operator, rather than its inverse. We shall discuss this important
aspect of our dynamics in more detail later.
In standard QM, a dynamics that preserves magnitudes of inner products
(as opposed to inner products themselves) is usually realized via either unitary
transformations or anti-unitary transformations. Anti-unitary transformations
are subtle, having a great deal to do with the concept of time reversal in standard
QM. We shall not consider anti-unitary transformations further.
An important issue that impacts on our approach is that one way of discussing
time reversal in QM is to switch bra and ket vectors. Conventionally, this does
not seem to matter but it amounts to switching questions and answers in our
approach, which requires great care in the interpretation.
We may readily generalize our dynamical rule (4.26) to describe evolution from
initial stage SM to final stage SN , where N > M : the product of two semi-unitary
transformations is also a semi-unitary transformation. We find the rule
ΨM → ΨN = U N,M ΨM , (4.28)
58 Quantum Bits
where ω a is the probability that the initial state is actually Ψa . The expectation
value O of an observable O is then given by the standard rule
4.13 Density Operators 59
= ω 1 Ψ1 Ψ1 + ω 2 Ψ2 Ψ2
= p{α0 + β1}{α∗ 0 + β ∗ 1} + (1 − p){γ0 + δ1}{γ ∗ 0 + δ ∗ 1}
(4.33)
= {p|α|2 + (1 − p)|γ|2 }00 + {pαβ ∗ + (1 − p)γδ ∗ }01
+{pβα∗ + (1 − p)δγ ∗ }10 + {p|β|2 + (1 − p)|δ|2 }11.
5.1 Introduction
In this chapter we extend the discussion of the previous chapter, from one
detector to an apparatus with an arbitrary number of nodes at any given stage.
Our labeling convention is followed throughout this book: subscripts always
label stages whereas superscripts always label nodes and modules (discussed
in Chapter 11). The number of nodes at stage Σn will be called the rank of the
apparatus at that stage and denoted rn . The ith node at stage Σn will be denoted
by in , which should not be confused with labstate vectors such as in , which are
always denoted in bold font.
Whenever we are using classical mechanics (CM), the collection of classical
nodes at stage Σn will be called a classical binary register , denoted Rn . In that
case, in is represented by classical bit Bni , so Rn is the Cartesian product of all
the bits at that stage, that is,
Rn ≡ Bn1 × Bn2 × · · · × Bnrn . (5.1)
The cardinality (number of elements) of the rank r classical binary register Rn
[r ]
is dn ≡ 2rn . If we wish to indicate the rank of Rn , we write Rn n .
On the other hand, whenever we are using quantum mechanics (QM), the
collection of nodes at stage Σn will be called a quantum register and denoted
[r ]
Qn or Qn n . In the quantum case, in is represented by qubit Qin . The quantum
register Qn is the tensor product of all the qubits at that stage, that is,
Qn ≡ Q1n ⊗ Q2n ⊗ · · · ⊗ Qrnn . (5.2)
Such a tensor product space is a Hilbert space of dimension dn ≡ 2 . rn
This rule is interpreted as follows. If the right-hand side of (5.5) is zero, then the
elements i1 i2 i3 . . . ir and j 1 j 2 j 3 . . . j r of the Hilbert space Q[r] are orthogonal;
otherwise, they are the same element and it has length (norm) of one.
Example 5.4 According to our notation, indices keep track of factor qubit
spaces in a quantum register. Hence for a rank-four register, for example, we
would find
of the other detectors. Such a question that does not involve all of the detectors
will be called a partial question. Partial questions are discussed in Chapter 8.
A maximal question is one that asks a binary question of every detector in
a register. From (5.5), we see that the elements of the dual basis B [r] can be
interpreted as maximal questions, and there are 2r of them. It is important to
note that in the absence of any theory that quantizes observers and/or their
questions, arbitrary vectors in Q[r] are not in general physically meaningful
maximal questions; only the elements of the preferred dual basis B [r] are maximal
questions in the formulation of QDN being discussed at this point.
It is not impossible, however, to imagine a scenario where questions being asked
correspond to elements of Q[r] that are nontrivial linear combinations of preferred
basis elements. For instance, a given observer may be conducting an experiment
where there are two possible quantum outcomes, such that each outcome would
trigger a different maximal question of some state at a later stage. That would
correspond to quantization of apparatus and/or observers. This sort of scenario
is essentially the focus of Chapter 21.
The maximal questions are all the elements of the dual basis
5.5 Signality
The signality of a basis state corresponds to the total “particle number” being
detected by the apparatus concerned, if any positive signal in any of the detectors
is interpreted as registering a particle in the conventional sense.
which is zero in general except for the particular case when ia = j a for each
possible value of a, and then the answer value is one. This result greatly simplifies
calculations in QDN.
This result contributes to the inherent economy of the particle description in
CM, as the following example illustrates. Suppose we wanted to describe a single
point particle in three-dimensional space. The conventional approach would be
to define a Cartesian coordinate system with coordinates x, y, and z along three
mutually orthogonal axes, relative to some chosen origin of coordinates. Then the
position of a particle P would be fully specified by giving only three numbers,
(xP , yP , zP ), referred to as the position coordinates of the particle. The economy
5.7 Quantum Registers 65
of this approach is literally infinite: saying where the particle is immediately says
where the particle is not, which is every point in space apart from the one with
coordinates (xP , yP , zP ).
This economy of information comes about because of context: we have been
told that there is only one particle.
But suppose now we are presented with a situation where we were not told
beforehand how many particles there were in our laboratory. Then we would
have to look at every point in space if we wanted to know the total particle
number. This is not a trivial point but a hard fact in astrophysics and cosmology:
astronomers have to scan space thoroughly in order to make estimates of matter
and dark matter densities on cosmological scales.
In the quantum context, we may use signality to tell us how many signals
there are in our detector array. A labstate of signality one corresponds closely to
what can be thought of as a single particle state. This means that, if we knew
beforehand that we were dealing with a signality-one state, then it would suffice
to find a positive signal in just one detector in an array to be sure (by context)
that there were no positive signals anywhere else in that array.
It is an intriguing thought that when we specify the classical Cartesian coordi-
nates x, y, z of a point particle, we are not only answering the question where is
the particle? but also saying that it is not at any of the other points in space,
of which there is an uncountable number. Clearly, that is an impressive form of
economy, but one occurring only because of our contextual information.
Consider an arbitrary state Ψ in Q[2] . Given the above preferred basis, we can
always write
Ψ = z 1 01 02 + z 2 11 02 + z 3 01 12 + z 4 11 12 , (5.10)
where the z a are labeled complex numbers, not powers of z. The dual state Ψ is
given by
Ψ ≡ z 1 01 02 + z 2 11 02 + z 3 01 12 + z 4 11 12
= z 1∗ 01 02 + z 2∗ 11 02 + z 3∗ 01 12 + z 4∗ 11 12 , (5.11)
where z a∗ is the complex conjugate of z a . If the state is normalized, we have the
condition
ΨΨ = |z 1 |2 + |z 2 |2 + |z 3 |2 + |z 4 |2 = 1. (5.12)
If now we write z a = xa + iy a , where xa and y a are real, then the normalization
condition (5.12) is equivalent to saying that possible states rank-two quantum
register can be identified one-to-one with points on S 7 , the unit sphere in eight-
dimensional Euclidean space E8 .
The above analysis is for just two detectors. If we had in mind, say, a screen
consisting of a million detectors, or a neural network in a brain, the numbers
shoot up beyond imagination.1 Clearly, qubit register states have much more
structure than their classical counterparts. The exploration of this structure is
in a real sense in its infancy at this time. For example, there is still a great deal
to be understood about the physics of quantum entangled states in low rank
quantum registers. We discuss some aspects of this in Chapter 22.
basis B [r] is useful in some respects but less so in others. A frequently more
useful but equivalent notation for the elements of the preferred basis will be
called the computational basis representation (CBR) and is obtained as follows.
For each element i1 i2 i3 · · · ir of the SBR, there is precisely one element i of the
CBR, where i is an integer in the range [0, 2r − 1], given by the computational
basis map
i = i1 + 2i2 + 4i3 + · · · + 2r−1 ir , (5.13)
3
where i means the third element in the SBR and not i to the power of three.
The computational basis map can be inverted. Given an element p of the CBR,
where p is an integer in the range [0, 2r − 1], we can write
1 a [a]
r
p = p[1] + 2p[2] + 4p[3] + · · · + 2r−1 p[p] = 2 p , (5.14)
2 a=1
1
The average human brain has about a hundred billion neurons.
5.9 Register Operators 67
where the symbol p[a] denotes the signal status (zero or one) of the ath detector
when the apparatus is in the labstate p. The coefficients p[a] will be referred to
as the binary components of the integer p and are discussed in further detail later
on in this chapter. They play an essential role in the formalism of QDN and in
many calculations.
Example 5.8 For the rank-three quantum register Q1 Q2 Q3 , the SBR of the
preferred basis B [3] is
B [3] = {01 02 03 , 11 02 03 , 01 12 03 , 11 12 03 , 01 02 13 , 11 02 13 , 01 12 13 , 11 12 13 }.
(5.15)
The CBR of B [3] is the set
The CBR generally has the advantage over the SBR of being more compact
and on that account is better suited in many but not all calculations. The dual
preferred basis B [r] can also be expressed in CBR terms, and then the inner
product relations (5.5) take the form
ij = δ ij , 0 i, j < 2r , (5.17)
which is very useful.
A disadvantage of the CBR is that it masks the signal properties of a given
state. For example, the CBR element 3 could represent the SBR element 11 12 for
a rank-two apparatus or the SBR element 11 12 03 04 05 of a rank-five apparatus
and so on. However, context will generally make it clear what is meant by a given
CBR expression.
The CBR is useful for representing linear operators over the register. These
will generally be denoted in blackboard font in QDN. For example, the classical
or quantum register identity operator I[r] is expressed in the CBR by
r
2 −1
I[r] = kk. (5.18)
k=0
labeling convention used for states and questions will be used for linear operators:
upper indices refer to qubits and, while lower labels indicate stages.
Suppose On,m is a linear register operator mapping states from an initial stage
Σm rank-rm classical or quantum register into a final stage Σn rank-rn classical
or quantum register. Suppose further that we are given its action on each element
im of the CBR, that is,
n −1
2r
On,m im = ji
On,m jn, i = 0, 1, . . . , 2rm − 1, (5.19)
j=0
ji
where the On,m are complex coefficients.
Now “multiply” each side from the right by im and sum over i:
m −1
2r m −1 2rn −1
2r
On,m im im = ji
On,m j n im (5.20)
i=0 i=0 j=0
n −1
2r
Cn,m im = ji
Cn,m jn, i = 0, 1, 2, . . . , 2rm − 1, (5.24)
j=0
ji
where for a given i, every element Cn,m is zero for every integer j in the interval
[0, 2 − 1] except for one, which has value one. Using the completeness property
rn
where I a is the bit identity operator for the ath bit or qubit and P i , P i are
the bit projection operators for the ith bit or qubit, as discussed in the previous
chapter.
Pa a
P Aa a
A
Pa Pa 0 Aa 0
a
P 0 a
P 0 a
A
Aa 0 Aa 0 Pa
a
A a
A 0 a
P 0
Then an extraordinarily useful fact is that any two elements from different signal
sets commute.
For a given signal set S a , we have the ath signal set algebra, as shown in
Table 5.1. For i = 1, 2, . . . , r we have
Ai Ai = A iA i = 0, i = Pi + P
Ai , A i = I[r] , (5.28)
where square brackets denote a commutator. Note that in the above, the symbol
0 represents the zero operator for the register concerned.
The signal set algebra gives QDN a particular flavor; parts of it are reminiscent
of a theory with fermions (anticommuting objects), while other parts have a
bosonic (commuting) character. At the signal level, however, we are dealing with
neither concept specifically; the signal algebra is determined by the physics of
observation as it relates to the apparatus and has its own logic that is distinct
from that of conventional particle physics.
The same approach can be used to discuss maximal questions. Recall that these
are represented by the dual preferred basis elements of the form i, 0 ≤ i < 2r .
Then we can write
k
0Ai1 Ai2 . . . Aik ≡ 2ij −1 . (5.31)
j=1
Ai 0 = 0, i = 0,
0A 1 ≤ i ≤ r. (5.32)
ij = δ ij , 0 ≤ i, j, < 2r . (5.33)
Example 5.12 For a rank-11 quantum register, the preferred basis state
given by 851 in the CBR is given by 11 12 03 04 15 06 17 08 19 110 011 in the SBR,
from which we can read off its signality as 6. Equivalently, we can see that
this state is given by A 2A
1A 5A
7A
9A
10 0, which clearly has signality 6.
Such examples soon show the enormous advantage of using the CBR for large
rank registers. The CBR is particularly suited for computer algebra calculations.
By inspection, it is easy to see that the preferred basis B [r] separates into r + 1
disjoint signality classes:
signality zero B [r,0] ≡ {0},
signality one B [r,1] ≡ {A a 0 : a = 1, 2, . . . , r},
signality two B [r,2]
≡ {A aA b 0 : 1 a < b r},
(5.35)
.. ..
. .
signality r B [r,r]
≡ {A 1A2 . . . A
r 0}.
Then we may write the preferred basis as the union of all of these signality classes,
that is,
r
B [r] = B [r,a] . (5.36)
a=0
72 Classical and Quantum Registers
We define the signality count σ [r,a] as the number of elements in B [r,a] . Then
r!
σ [r,a] = = ar (5.37)
a!(r − a)!
Hence the total signality σ [r] is given by
r
r
r
σ [r] ≡ σ [r,a] = a = 2a , (5.38)
i=0 a=0
as expected.
A significant feature of QDN is that the addition of register states of different
signality is permitted, unless the dynamics specifically rules it out. This will
happen, for example, whenever the signals are interpreted as electrically charged
particles. Under such circumstances, we expect conservation of total signality.
where each of the binary coefficients k [a] is either zero or one. Table 5.2 shows
the binary decomposition up to k = 9.
From the table, we read off for example that
6 = 0 × 21−1 + 1 × 22−1 + 1 × 23−1 + 0 × 24−1 + 0 × · · · = 2 + 4 = 6.
For each k, the maximum a for which k [a] = 1 is the minimum rank of k,
denoted μ(k) and indicated by a box in Table 5.2. The minimum rank of k is the
rank of the “smallest” quantum register that contains the basis state k.
a → 1 2 3 4 ...
2a−1 → 1 2 4 8 ...
k↓ 0 0 0 0 0 ...
1 1 0 0 0 ...
2 0 1 0 0 ...
3 1 1 0 0 ...
4 0 0 1 0 ...
5 1 0 1 0 ...
6 0 1 1 0 ...
7 1 1 1 0 ...
8 0 0 0 1 ...
9 1 0 0 1 ...
5.15 Computational Basis Representation for Signal Operators 73
μ(1) = 1,
μ(2) = μ(3) = 2,
μ(4) = μ(5) = μ(6) = μ(7) = 3. (5.40)
where k [a] = 0 or 1 is the ath binary component of the integer k; i.e., we have
k = k [1] 21−1 + k [2] 22−1 + k [3] 23−1 + · · · + 2μ(k)−1 . (5.43)
Hence we may equate the CBR and SBR elements in the basis B [μk ]
k[1] k[2]
1
k = k[1] k[2] . . . 1μk = A 2
A ... A μk 0. (5.44)
Now use the results
ak = 0
A if k [a] = 1,
(5.45)
= k + 2a−1 if k [a] = 0.
Defining
k [a] ≡ 1 − k [a] , (5.46)
then we can readily show
ak =
A k [a] k + 2a−1 , a = k [a] k − 2a−1
kA
Aa k = k [a] k − 2a−1 , kAa = k [a] k + 2a−1 . (5.47)
Hence we must have
r
2 −1 r
2 −1
a
A = [a]
k k+2 a−1
k= k [a] kk − 2a−1 , (5.48)
k=0 k=0
r
2 −1 r
2 −1
Aa = k [a] k − 2a−1 k =
k [a] kk + 2a−1 , (5.49)
k=0 k=0
which leads to
r
2 −1 r
2 −1
P =
a
k [a] kk, a =
P k [a] kk. (5.50)
k=0 k=0
74 Classical and Quantum Registers
In general, the CBR for any signal operator in a rank-r register consists of a
sum of 2r−1 transition operators, all of which annihilate each other, including
themselves. Likewise, a product A j of two different signal operators can be
iA
r−2
expressed as a sum of 2 transition operators that mutually annihilate, and
so on. This process of representation can be continued until we arrive at the
saturation operator A 2 . . . A
1A r , which when applied to the signal ground state
creates the antithesis of the ground state, the fully saturated signal state 2r − 1 ≡
11 12 . . . 1r .
A particularly useful expression for the signal creation operators is obtained
by writing any of them in the form
i = 2i−1 0 + Xi ,
A (5.55)
2 −1 [i]
r
where the operator Xi ≡ k=1 k k + 2
i−1
k annihilates the signal ground
state. This expression can be used to greatly simplify calculations for those
experiments involving signality-one labstates, as in single-photon quantum optics
experiments.
6
Classical Register Mechanics
6.1 Introduction
Classical mechanics (CM) is conventionally formulated according to partially
contextually complete Block Universe principles: an exophysical observer sitting
in some frame of reference looks in on a system under observation (SUO) and
assigns truth values to propositions about states of that SUO, but the details of
how these truth values are obtained are not given.
There is a curious aspect to this scenario: the contextual incompleteness of CM
is itself contextual. By this we mean that there are two scenarios, with different
general propositional classifications (GPCs).
6.3 Architecture
In any such discussion it is important to establish the relevant spatiotempo-
ral architecture. There are two aspects of architecture relevant to the present
discussion. One has to do with the unspecified exophysical observer and their
1
It seems invalid to argue that in Newtonian mechanics, we have the laws of motion for
relative internal context and the observer’s frame for relative external context, so that
would appear to give a GPC of 3. That would be mixing theory and experiment to an
unacceptable degree. We have made the point before that mathematics is not physics.
6.3 Architecture 77
apparatus, and the other is to do with the dynamics of the system under obser-
vation. We shall refer to these aspects as the external and internal architectures,
respectively.
External architecture involves time and persistence.
Time
In classical register mechanics, time is modeled as a succession of discrete stages,
ΣM , ΣM +1 , . . . , ΣN , representing the actions or lack of actions by an observer
in a physical laboratory. At stage Σn , the state of the SUO will be denoted Ψn ,
an element of a classical register Rn of rank rn .
Persistence
This refers to the question of identity of a given detector over a succession of
stages. In quantized detector networks (QDN), each detector is identified with
a single stage only. Therefore, there is no concept of “existential persistence” in
QDN. This applies as much to the observer and their apparatus as it does to the
SUOs that they observe. The first law of time (the dictum of Heraclitus), that
everything changes, applies here.
That does not rule out effective persistence, meaning that as the observer’s
time runs from stage to stage, their description of the apparatus seems to be
constant, in terms of the rank of the registers concerned and in terms of the
labeling of the detectors in those registers.
Effective persistence is the rule in ordinary experience, to the extent that
humans are strongly conditioned to believe that they are moving around in
time over a fixed spatial background. This belief carries over into conventional
descriptions of experiments, in which apparatus is assumed to persist while states
of SUOs evolve. What is really going on is that the first law of time always
applies, but the time scales for significant change in apparatus are generally so
great relative to those associated with the SUOs that the former time scales may
be taken to be infinite compared with the latter. This is a common assumption
made in high-energy particle physics scattering experiments, for instance.
Autonomy or Not
An SUO could be autonomous, meaning that it has been effectively isolated
from its environment. Alternatively, the exophysical observer could arrange for
external agencies, such as electric and magnetic forces, to influence the dynamics
of an SUO.
Signality Conservation
Signality is taken in this chapter as a marker of particle number: a signality-
p state of a classical register at a given stage will be interpreted as a state
with p particles. Newtonian mechanics does not readily countenance changes of
particle number. If that has to be considered, then that can be readily modeled
by signality nonconserving register dynamics. The reason for this is that register
mechanics is more like a field theory than a particle theory.
For the rest of this chapter we shall restrict our attention to autonomous SUOs
moving over a succession of classical registers, each of which has the same rank r,
with stage-independent laws of dynamics, as these are generally of most interest.
In principle, there would be no problem in dealing with other forms of dynamics,
including those where the rank of the physical register changes with time. Indeed,
that is a common scenario in the quantum register dynamics we shall discuss in
the next chapter.
We could also deal with classical stochastic mechanics, which would incorpo-
rate Bayesian principles in a natural way, but for the rest of the chapter we shall
deal with deterministic laws of mechanics. In the following, the set of integers
{0, 1, 2, 4, . . . , 2r − 1} is denoted Z [r] .
We shall use the computational basis representation (CBR) Bn ≡ {kn : k ∈
Z [r] } to represent the 2r labstates in a rank r classical register Rn at stage Σn .
Consider the temporal evolution of a system from labstate Ψn in register Rn at
stage Σn to labstate Ψn+1 in register Rn+1 at stage Σn+1 . The dynamical rules
will be encoded into the expression
Ψn → Ψn+1 ≡ Cn+1,n Ψn , (6.3)
where Cn+1,n is a classical register operator. Such an operator maps any one of
the 2r labstates in Bn into precisely one of the 2r labstates in Bn+1 .
The reason for this constraint on any classical register operator comes from
the basic principles of CM: states in CM are well-defined, single-valued elements
of phase space at any given time. This translates in the present context to the
requirement that at any stage Σn , the classical state of an SUO is precisely
one of the 2r possible states of the register Rn . In consequence, any admissible
classical register operator acts on single elements of Rn and maps them into single
elements of Rn+1 . Note that this does not rule out nonsurjective or noninjective
operators; that is, there may be elements in Bn+1 that are not mapped into
(nonsurjective), and it could be the case that more than one element in Bn may
be mapped into the same element in Bn+1 (noninjective).
6.4 Permutation Flows 79
Taking any initial state, there are 2r possible final states. Since there is a totalr
2
of 2r possible initial states, we immediately deduce that there is a total of (2r )
different possible classical register operators mapping from Rn into Rn+1 .
Even for low-rank registers, the number of possible operators can be impres-
sive. For a rank-one system, we deduce that there should be 22 = 4 different
possible such operators. Indeed, that is exactly what we saw in Chapter 3, where
we identified the four classical bit operators I, F , U , and D. The number of
possibilities grows rapidly with r. For example, there are 256 different operators
that can map a rank-two register into another rank-two register, and a total of
88 = 16,777,216 different operators mapping a rank-three register into another
rank-three register.
Given a potentially vast number of possible evolution operators, some criteria
need to be imposed in order to reduce the discussion to manageable and realistic
proportions. What comes to our assistance here is that most of the possible
evolution operators over a classical register will not be useful. Many of them
will correspond to irreversible and/or unphysical dynamical evolution and only
a small subset will be of interest. We need to find some principles to guide us in
our choice of evolution operator.
We turn to standard classical mechanics as discussed in Hamiltonian mechan-
ics. The first thing to note is that classical phase spaces are generally constant
in time. This corresponds to taking the rank of successive classical registers to
be constant, i.e., rn = rn+1 ≡ r for some integer r and for all n in the temporal
interval under discussion. We shall make this assumption from this point on.
Next, we recall that in standard CM of the Hamiltonian variety, Hamilton’s
equations of motion lead to Liouville’s theorem. This tells us that as we track a
small volume element along a classical trajectory,2 this volume remains constant
in magnitude though not necessarily constant in shape or orientation. This leads
to the idea that a system of many noninteracting particles moving along classical
trajectories in phase space behaves as an incompressible fluid, such a phenomenon
being referred to as a Hamiltonian flow.
An important characteristic of Hamiltonian flows is that flow lines never cross.
We shall encode this idea into our development of classical register mechanics.
There are two versions of this mechanics, one of which does not necessarily
conserve signality while the other does. We consider the first one now.
2
A classical trajectory is a phase space trajectory that satisfies Hamilton’s equations of
motion.
80 Classical Register Mechanics
0 → 3 → 4 → 6 → 0, 1 → 1, 2 → 5 → 2, 7 → 7. (6.4)
This permutation flow does not conserve signality. The simplest proof of this
is to note that the signality zero state 0 evolves into a state with nonzero
signality.
i0 → p(i)1 : i = 0, 1, 2, . . . , 2r − 1. (6.5)
i0 → i1 , i = 0, 1, 2, . . . , 2r − 1. (6.6)
the final states, then (6.6) would indeed be trivial. If we had such contextual
information, however, then the relabeling p(i)1 → i1 would have no physical
significance, and the real dynamics would be understood by the observer.
Reversibility
Permutations form a group, which means that given a permutation P , then its
inverse P −1 always exists. Hence permutation flows are reversible.
Orbits
Any SUO evolving under stage-independent permutation dynamics will demon-
strate patterns known as orbits or cycles. Given a permutation of 2r objects,
there will exist subsets known as cycles of the objects being permuted such that
only elements within a given cycle replace each other under the permutation. This
is relevant here because we have chosen to discuss time-independent autonomous
systems, the evolution of which is given by repeated applications of the same
permutation. Therefore, the structure of the cycles does not change and so each
cycle is stable, consisting of the same p elements with a dynamical period p. For
example, the identity permutation gives a trivial form of mechanics where nothing
changes. It has 2r cycles each of period 1. At the other end of the spectrum, the
permutation denoted by (0 → 1 → 2 → · · · → 2r − 1 → 0) has no cycles except
itself and has period 2r .
Any physical register evolving under time-independent, autonomous permu-
tation mechanics must return to its initial labstate no later than after 2r time
steps. This is the analogue of the Poincaré recurrence theorem (Poincaré, 1890).
where k0 is any one of the 2r0 elements of the initial physical register R0 and
the ω k are probabilities summing to unity. The formalism outlined above then
gives the expectation values of operators.
7
Quantum Register Dynamics
7.1 Introduction
In this chapter we move beyond the classical register scenario discussed in the
previous chapter, extending the discussion to experiments described by time-
dependent quantum registers of varying rank. We apply our previous discussion
of the signal basis representation (SBR), the computational basis representation
(CBR), signal operators, signal classes, and the CBR of signal operators to the
quantum case. Our discussion of dynamics covers persistence, that is, the stability
of apparatus, observers, and laboratory time, and the Born probability rule. We
state the principles of quantized detector network (QDN) dynamics and show
how they apply to the description of quantum experiments. We discuss the signal
theorem and path summations.
7.2 Persistence
Our first step is to clarify what QDN assumes about the evolution of appara-
tus in time, because this affects the modeling. QDN is designed to reflect the
behavior of apparatus in the real world and so it is not assumed in general that
a given observer’s apparatus is constant in time, even during a given run of an
experiment.
Although many experiments appear to be carried out with apparatus that
persists over any given run of the experiment, and indeed, perhaps over all the
runs of that experiment, that is just an incidental factor that reflects no more
than an economy in construction. In practice it is invariably easier and more
economical to use the same equipment over and over again rather than use it
once, throw it away after each run, and build a new version ready for the next
run. Lest this be thought of as a trivial point, it is nevertheless an integral
and costly feature of many experiments, involving maintenance and upgrading.
Indeed, in laboratories such as the Large Hadron Collider, actual run time is a
7.3 Quantized Detector Networks 85
small fraction of total project time. A related, significant issue has to do with
the concept of ensemble, discussed in the Appendix.
Persistence has everything to do with time scale, specifically, the relative
laboratory time over which any piece of equipment can be meaningfully discussed
as such. The degree of persistence of apparatus is as contextual as anything else
in physics. If a run is relatively brief, say, over in very small fractions of a second,
as in high-energy particle scattering experiments, then in such an experiment,
the apparatus will behave as if it persists forever. On the other hand, in some
experiments, individual runs can involve enormous intervals of time between
state preparation and observation, as always happens in the case of astrophysical
observations of stars and galaxies. In such cases, light from a distant star may
be received by astronomers long after that star had ceased to exist as a star.
A necessary criterion for an experiment to be describable by QDN is that any
detector exists at least during a given stage. Indeed, we can take the definition
of a stage to be that interval of laboratory time over which a given detector can
be assumed to persist, that is, have an identity that has physical relevance in the
context of the experiment concerned.
In this respect, a stage is to be identified not with a moment or point in time
but with the interval of time over which information could be extracted by the
observer. Such an interval is of contextual temporal length in the laboratory,
being as short or as long as the observer requires to acquire a bit or “quantum”
of information. The temporal divide between two stages will be called a quantum
tick , or q-tick . It is necessarily a heuristic concept, because any attempt to
empirically measure a q-tick and assign a numerical value to a chronon, or
assumed fundamental unit of time, will be self-referential in some way or other.
We note that the concept of Planck time, commonly regarded as the shortest
meaningful interval of time in physics, is a theoretical construct that could
never be observed directly and has been criticized on such grounds (Meschini,
2007).
now it is the observables of the theory that have unitary temporal evolution over
and above any intrinsic, explicit evolution.
In standard QM, these two related architectures were eventually recognized as
too limited. One reason is that in the PVM approach, the number of mutually
orthogonal possible outcome states associated with a given observable is the
number of its eigenstates, which is fixed and cannot exceed the dimension of
the Hilbert space concerned. The positive operator-valued measure (POVM)
formalism on the other hand was developed to deal with the possibility that
the number of observational outcome channels set up in a laboratory may be
different from the dimension of the Hilbert space used (Ludwig, 1983a,b; Kraus,
1983; Peres, 1995) and, indeed, may exceed that number, in the case of finite
dimensions.
In contrast, the QDN formalism assumes from the outset that the Hilbert space
representing outcome possibilities is always different from one stage to the next,
even if the dimensionality remains constant, and even if the detectors involved
in the experiment appear to persist over several or all stages of an experiment.
To be specific, let us denote the observer’s apparatus at any given stage Σn by
An . Some parts of An will correspond to actual, real detectors, while other parts
will be virtual, or potential, detectors. Whatever the case, An will be always be
modeled by a countable number rn of vertices at any stage of the relevant stage
diagram for that experiment. In this book, we denote the ith vertex at stage Σn
by in , where the superscript i runs from 1 to rn and labels all of the distinct
vertices in that stage. Note that in (a vertex) should not be confused with in
(bold font), our notation for a CBR element of the preferred basis at stage Σn .
Note also that detector vertices should not be confused with modules, which are
nondetecting parts of apparatus that sit in the information void between stages.
In the description of real experiments, rn will always be finite, in contrast to
the situation in QM, where the Hilbert space may be infinite dimensional. The
harmonic oscillator in one spatial dimension is an example where QM assumes
either that the position observable has a continuous spectrum or, equivalently,
that the Hamiltonian has a countable infinity of energy eigenstates. The harmonic
oscillator is discussed in Chapter 24. In the world of empirical reality, there are
no such harmonic oscillators, just SUOs for which such a description may be a
reasonable approximation.
The question as to whether rn is finite or not is central to many if not all of
the technical difficulties encountered in the refinement of standard QM known as
quantum field theory. The harmonic oscillator appears to be intimately involved
in many of these problems in one way or another. Although the mathematical
properties of the quantized harmonic oscillator play an essential role in account-
ing for the particle concept in free quantum field theory, those same properties
generate fundamental problems in interacting field theories. For instance, the
ultraviolet divergences encountered in most Feynman loop integrals are linked
to the unbounded energy spectrum of the standard QM oscillator, while infrared
7.3 Quantized Detector Networks 87
divergences are linked to the assumed continuity of spacetime and the zero-point
energy of the quantized oscillator.
The issue of zero-point energy of the quantized oscillator was explicitly
addressed in quantum optics by Glauber in his landmark papers on photon
correlations (Glauber, 1963a,b,c). He pointed out that the irreversibility of
quantum detection processes involves the non-Hermitian positive energy electric
field operator E (+) (t, x) rather than the Hermitian electric field operator
E(t, x) ≡ E (+) (t, x) + E (−) (t, x). It is the vacuum expectation values (VEVs)
involving the E field operators that are affected by zero-point energy, whereas
VEVs involving the E (+) alone are not. Glauber’s conclusion was the following:
The electric field in the vacuum undergoes zero-point oscillations which, in the
correctly formulated theory, have nothing to do with the detection of photons.
(Glauber, 1963c)
1
Charge, space (parity), and time.
88 Quantum Register Dynamics
of apparatus (Jaroszkiewicz, 2004), the only effect being to increase the number
of qubits used in the formalism.
During their operational lifetimes, observers quantify their time in terms of real
numbers, usually read from clocks. Most clocks give only a crude estimate of the
passage of time, and as a result, the ordinary human perception of time as a one-
dimensional continuum is just a convenient approximation. The classical view
of time is that it is a continuum at all scales and for all phenomena. Certainly,
things appear consistent with that view in the ordinary world.
In QM, however, the situation is quite different. What matters in a quantum
experiment is information acquisition from the observer’s apparatus and this can
only ever be done in a discrete way, regardless of any theoretical assumption to
the contrary (Misra and Sudarshan, 1977). While an observer’s effective sense
of time can be modeled accurately as continuous, it is certainly the case that
an observer can look at a detector and determine its status in a discrete way
only. There are no truly continuous-in-time observations. It is important here to
distinguish between what experimentalists actually do in experiments and what
theorists imagine they do.
The discreteness of the information extraction process forms the basis of the
time concept in QDN. In general, a given observer will represent the state of
their apparatus (the labstate) at a finite sequence of their own (observer) times,
denoted by the integer n. These times will be referred to as stages. In QDN, a
pure labstate at stage Σn will be denoted by Ψn .
In QDN, stage Σn+1 is always regarded as definitely later than stage Σn . There
is no scope in QDN for the concept of closed time-like curve (CTC) found in some
GR spacetimes, such as the Gödel model (Gödel, 1949). Invariably, discussions
that do involve CTCs cannot accommodate quantum processes properly, at least
not those that involve probabilities.2
A final point on this topic: there is no need to assume that the laboratory time
interval between stages has a definite value or that a succession of stages involves
equal intervals of labtime.
2
This is for the good reason that the probability concept requires an observer accumulating
information with an irreversible sense of time, something that a genuine CTC is
incompatible with.
7.6 The Born Probability Rule 91
Remark 7.2 This subject is a minefield of issues that lurk unseen until
we stumble across them. Reference was made to “identical state preparation”
in the previous paragraph. That is clearly a vacuous concept. By definition,
different runs of a given experiment cannot have absolutely identical state
preparation: the Universe will have aged for sure during any two runs,
and there will be vast changes in the local environment of any apparatus
on atomic scales. Yet clearly, most of those changes will not influence the
outcome probabilities. As pointed out by Kraus, what appears significant
are equivalence classes of state preparation processes (Kraus, 1983). Exactly
where the line between influence and noninfluence of external factors in state
preparation should be drawn is one of those deep questions that no one knows
anything much about. The Heisenberg cut is a hypothetical line demarcating
the classical world of the observer and the quantum world of the SUO. We
propose the term Kraus cut to denote the hypothetical line between those
factors that have no influence on state preparation or outcome and those
that do. All we can do is to observe that some things are critical in state
preparation, and everything else seems unimportant. Much the same point is
emphasized in Peres (1995).
In QDN, labstates are usually normalized to unity for convenience. Because the
preferred basis states form a complete orthonormal basis set, we may immediately
92 Quantum Register Dynamics
read off from (7.2) the various CBR conditional probabilities Pr(in |Ψn ), which
are given by the rule
Pr(in |Ψn ) is the conditional (Bayesian) probability for the observer to find the
apparatus in signal state in at stage Σn , if the observer looked at their apparatus
at that time. These probabilities are conditional on the observer being sure, just
before they looked, that the labstate at stage Σn was Ψn .
There is no natural restriction in QDN to labstates that are eigenstates of
signality; i.e., superpositions of basis states from different signality classes are
permitted in principle. QDN is analogous in this respect to the extension of
Schrödinger wave mechanics to Fock space and to quantum field theory.
the CBR, this state can be written in the form (7.2) and normalized according
to (7.3).
A given run of an experiment will be described by the observer in terms of
a sequence {Ψn : 0 n N } of normalized labstates, each element of which is
associated with a particular quantum register Qn , ending with state outcome
observation at the final stage ΣN .3 The question now is how successive labstates
relate to each other between times M and N .
Provided each run is prepared in the same way, and provided the apparatus
during eah run is controlled in the same way, we can discuss a typical labstate Ψn
as a representative for an ensemble of runs. Indeed, that is the only interpretation
that makes sense if an objectivized or hidden-variables interpretation of the wave
function is to be avoided. QDN follows the traditional QM view as emphasized
by Peres in this respect (Peres, 1995).
The dynamical transition from labstate Ψn to labstate Ψn+1 involves a map-
ping from quantum register Qn to quantum register Qn+1 that satisfies two
criteria fundamental to quantum mechanics: linearity and norm preservation.
This leads us to give a number of definitions and theorems that have proved
central to QDN.
Born maps are used in QDN in order to preserve total probabilities (hence
the terminology), but unfortunately, their properties are insufficient to model all
quantum processes. Born maps are not necessarily linear, as can be seen from
the elementary example B(Ψ) = |Ψ|Φ for all Ψ in H, where Φ is a fixed element
of H normalized to unity and |Ψ| is the norm of Ψ in H. To go further, it is
necessary to impose linearity.
Henceforth, we adopt the rule that if U is a linear map, then we may drop the
parentheses and write U Ψ instead of U (Ψ).
3
QDN allows partial observations to be made at intermediate stages.
94 Quantum Register Dynamics
Bn+1,n Is Nonlinear
By Theorem 7.5, nonlinearity is unavoidable if the rank rn of Qn is greater than
the rank rn+1 of Qn+1 , but can arise even if this is not the case. Nonlinearity
here is interpreted as a marker of classical intervention by the observer. For
example, switching off any apparatus at stage Σn+1 would be modeled by the
Born map Bn+1,n (Ψn ) = 0n+1 for any state Ψn in Qn , where 0n+1 is the signal
ground state of the apparatus at stage Σn+1 . Another example is state reduction
due to observation; that is, if at stage Σn+1 the observer actually looks at the
apparatus and determines its signal status, then this would be modeled by the
nonlinear Born map Bn+1,n (Ψn ) = kn+1 , where now kn+1 is some element
of the CBR of the preferred basis Bn+1 , chosen randomly with a probability
weighting given by the Born rule. In this particular case, however, there are
actually two labstates associated with stage Σn+1 : Ψn+1 representing the state of
the apparatus immediately prior to state reduction and k representing the actual
observed outcome immediately after. None of this represents anything more than
mathematical modeling of the observer’s actual or potential knowledge about the
signal status of their apparatus. QDN makes no comment on whether anything
deeper than changes in apparatus signal status has occurred. In particular, any
speculation of superluminal information flow concerns correlations, which is a
phenomenon that occurs in all forms of mechanics and has everything to do with
what the observer did in the past.
rM rM +1 · · · rn · · · rN , (7.6)
96 Quantum Register Dynamics
where ΣM is the initial stage and ΣN is the final stage. From this, we can
appreciate that unless experimentalists are careful, their quantum registers will
grow irreversibly in rank. On the other hand, the particle decay experiments
discussed in Chapter 15 specifically require the rank to increase at each time step.
The use of Born maps means total probability is always conserved, even if lin-
earity is absent. In principle, therefore, QDN allows for a discussion of nonlinear
QM, still based on most of the familiar Hilbert space concepts used in QM. As
we have mentioned in the case of nonlinear Born maps, necessarily nonlinear pro-
cesses such as state preparation, state reduction, the switching on and off of appa-
ratus, and so on, which are outside the scope of unitary (Schrödinger) evolution
in standard QM, can all be discussed in QDN in terms of nonlinear Born maps.
Our interest will generally be in experiments based on linear quantum pro-
cesses, so (7.6) will be assumed. For such an experiment running from stage ΣM
to stage ΣN , for N > M , and knowing rn rn+1 , then the labstate Ψn will
change according to the rule
Ψn → Ψn+1 ≡ Un+1,n Ψn , M n < N, (7.7)
where Un+1,n is a semi-unitary operator (this terminology will be used from now
on even in the case where rn = rn+1 ).
The CBR at stages Σn and Σn+1 can be used to represent Un+1,n . Specifically,
we find
2rn+1
−1 2
rn
−1
j,i
Un+1,n = j n+1 Un+1,n in . (7.8)
j=0 i=0
A useful way of thinking about the semi-unitarity condition (7.10) and the
rules of semi-unitary operators is in terms of complex vectors as follows. Let
V be a finite-dimensional complex vector space with inner product a† · b. An
orthonormal d-subset of V is any set of elements {ai : i = 1, 2, . . . , d} of V
that are normalized to unity and mutually orthogonal; that is, we have the rule
ai† · aj = δ ij .
Now let V be another finite-dimensional complex vector space with inner
product a† · b . The question of semi-unitarity reduces to the possibility of
finding an injection from any given orthonormal d-subset of V into at least one
7.10 The Signal Theorem 97
Here k is some integer in the semi-open interval [0, 2rn+1 ), α and β are non-
zero complex numbers, and φn+1 and ψ n+1 are elements in Qn+1 sharing no
signal states in common either with each other or with kn+1 in their CBR
basis expansions, which means
The signal theorem leads to the following important result for conventional
physics. Suppose an observer constructs an apparatus that, if prepared at stage
Σn to be in its signal ground state 0n , would evolve into 0n+1 . If the dynamics
is semi-unitary, then we may write
98 Quantum Register Dynamics
that is, a labstate with no signal ground component (note that the summation
(7.17) in runs from 1, not 0). Then for calibrated apparatus with semi-unitary
labstate evolution, the signal theorem tells us that there can be no signal ground
component in the labstate Ψn+1 at time n + 1, and so we may write
2rn+1
−1
Ψn → Un+1,n Ψn = Φj j n+1 , (7.18)
j=1
where
n −1
2r
j ji
Φ = Un+1,n Ψi . (7.19)
i=1
0n → Un+1,n 0n = 0n+1 ,
1n → Un+1,n 1n = 1n+1 , (7.20)
where |a|2 + |b|2 + |c|2 = 1. Moreover, since the evolution is given as semi-
unitary, then inner products are preserved. Hence we deduce
α∗ a + β ∗ b = γ ∗ a + δ ∗ b = 0. (7.24)
where |α|2 + |β|2 = 1. Suppose further that the dynamics is semi-unitary and
that the apparatus at stage Σn+1 is of rank three. Then by the signal theorem,
semi-unitary evolution such that
100 Quantum Register Dynamics
1 0n → Un+1,n A
A 1 0n = (aA
1 + bA
2 )0n+1 , |a|2 + |b|2 = 1,
n n n+1 n+1
0n → Un+1,n A
A 2 2 0n = (cA
2 + dA
3 )0n+1 , |c|2 + |d|2 = 1, (7.28)
n n n+1 n+1
is not possible.
This result tells us that a double-slit type of experiment where each slit has
only one quantum outcome site in common with the other cannot be physically
constructed. Experiments where two or more quantum outcome sites are in
common are possible, and then inevitable quantum interference terms will occur
in final state amplitudes. For example, in a standard double-slit experiment, every
site on the detector screen is affected by the presence of each of the two slits.
This result reinforces an important rule in QM: we cannot simply add pieces
of apparatus together and expect the result to conform to an addition of classical
expectations. A double-slit experiment where both slits are open is not equivalent
to two single-slit experiments run coincidentally and simultaneously.
Persistence
To understand the action of a null evolution operator, consider the idealized
scenario of an initial labstate Ψn in a rank r quantum register Qn evolving into
labstate Ψn+1 in a quantum register Qn+1 of the same rank r. In the following,
we shall use the CBR at all stages.
Suppose we are given that
r
2 −1
Ψn = Ψin in . (7.29)
i=0
This is a labstate at stage Σn+1 with exactly the same coefficient profile, that is,
set of coefficients {Ψin }, as the initial labstate. It is reasonable in this context to
refer to this phenomenon as an example of persistence and refer to Ψn+1 as a
persistent image of Ψn .
Such an evolution operator will be referred to as a null evolution operator
(NEO). An NEO Nn+1,n makes sense only under particular circumstances: the
rank of the quantum register at stage Σn+1 must be the same as that at stage Σn
and there has to be a one-to-one identification of the elements of the preferred
bases.
This form of evolution demonstrates the two faces of time: on the one hand,
labtime (the time of the observer) goes on as normal, being collated with rel-
ative external context such as the expansion of the Universe. The labtime of a
given observer cannot be reversed relative to other observers, according to all
known current physics, although it can be slowed relative to the labtime of other
observers. On the other hand, some objects such as labstate profiles of persistent
labstates may appear to be indifferent to labtime.
Note that the upper limit on the summation over the index i is 2rn − 1, not
n+2 . Under
2rn+1 − 1. This operator effectively maps states in Un+1,n Qn into Q
evolution generated by Vn+2,n+1 we find
n −1
2r
Ψn+2 ≡ Vn+2,n+1 Ψn+1 = Ψin in+2 , (7.36)
i=0
Inserting a resolution of each evolution operator of the form (7.8), the final state
can be expressed in the form
N −1 2rN −1 −1
2r r0
−1
2
j ,j j ,j j1 ,j0 j0
N N −1 N −1 N −2
Ψn = ... j N UN,N −1 UN −1,N −2 . . . U1,0 Ψ0 . (7.38)
jN =0 jN −1 =0 j0 =0
We may immediately read off from this expression the coefficient of the signal
basis vector iN . This gives the QDN analogue of the quantum mechanics Feyn-
man amplitude Φifinal |Ψinitial for the initial state |Ψinitial to go to a particular
final outcome state |Φifinal . In our case, what we are actually reading off is
A(iN |Ψ0 ), the amplitude for the labstate to propagate from its initial state Ψ0
and then be found in signal basis state i at stage ΣN . We find
2rN
−1 −1 2rN −2 −1
r0
2 −1
i,j j ,j j1 ,j0 j0
A(iN |Ψ0 ) = ... N −1
UN,N N −1 N −2
−1 UN −1,N −2 . . . U1,0 Ψ0 . (7.39)
jN −1 =0 jN −1 =0 j0 =0
The required conditional probabilities are obtained from the Born rule as
discussed above, and so we conclude
2rN −1 −1 rN −2 −1 2r0 −1 2
2 i,j
jN −1 ,jN −2 j1 ,j0 j0
P r(iN |Ψ0 ) = ... N −1
UN,N −1 UN −1,N −2 . . . U1,0 Ψ0 .
jN −1 =0 jN −2 =0 j0 =0
(7.40)
By writing the amplitude (7.39) in the form
2rN
−1 −1
i,j
A(iN |Ψ0 ) = UN,N −1 A(j N −1 |Ψ0 ), (7.41)
jN −1 =0
provides the best way to discuss the quantization of certain classical theories for
which other approaches prove inadequate. In QDN, time is discrete and in that
sense we follow Feynman’s lead while avoiding the pitfalls associated with the
continuum limit, which we do not take in QDN.
This completes our introduction to the QDN formalism. An obvious extension
is to include mixed labstates. These are discussed in Chapter 9.
8
Partial Observations
8.1 Introduction
In this chapter we discuss the quantized detector network (QDN) approach to
partial observations, or the extraction, during an extended-in-time quantum pro-
cess, of only some of the quantum information embedded in a detector amplitude.
In order to deal with partial questions, we need first to discuss how QDN deals
with maximal questions, which means looking at all the detectors at a given stage.
8.2 Observables
Our preoccupation with detectors rather than systems under observation (SUOs)
is nothing new in quantum mechanics (QM). Indeed, the primary significance of
what could be observed was a guiding principle when Heisenberg formulated his
matrix mechanics approach to QM (Heisenberg, 1925). The conventional position
in QM is that the only important operators in the theory are the observables.
These correspond closely to the variables used in classical mechanics (CM) to
describe measurable quantities, such as energy, momentum, electric charge, and
so on.
In standard QM, observables are generally assumed to be self-adjoint operators
(Hermitian operators in the case of finite-dimensional Hilbert spaces) so that they
have real eigenvalues, and these are the QM analogues of CM variables. Apart
from that proviso, operators representing observables have few restrictions.
This generality raises significant questions. A particularly critical question
is whether a given “observable” makes empirical sense. There is no theorem
that proves that every self-adjoint operator corresponds to something that
can be actually observed in the laboratory. For example, in one-dimensional
wave mechanics, the operator pppx
ppp is self-adjoint and its classical analogue,
pppxppp, is a perfectly regular function over phase space. But we know of no
experiment that can measure such a quantity directly, whereas experiments to
measure p or x directly could be devised.
106 Partial Observations
The problem as we see it in this book is that the observable concept in standard
QM puts the cart before the horse: it is first implicitly assumed in QM that states
of SUOs can be created, and only then is the question of what can be done to and
on those states raised. QDN does things the other way around: the apparatus that
creates the states and detects the signals has to come first. Indeed, that is all that
is needed. “State preparation” defines the states contextually and the outcome
detectors define what information can be obtained. In QDN, “observables” are
nothing but the signals in final stage detectors plus the context that informs the
observer as to the meaning of those signals.
1
Once again, we have to resort to realist language to convey our meaning, though it is an
oversimplification to do so.
8.3 Maximal Questions 107
described scenario gives the game away: how could we even say that there was a
galaxy acting as a source of light without having observed sufficient light from it
to establish that fact? That degree of observation thereby provides the context
that allows us to think of that remote galaxy and our detectors as part of the
same experiment.
In fact, such context is generally built up over relatively long periods of time
and depends on technology. Thousands of years ago, astronomers could only see
via their naked eyes a strange blob of diffuse light where M31, the Andromeda
Galaxy, is in the night sky. Then as telescopes were developed, followed by long-
exposure photography, sufficient observations were made to establish the context
that we know today. Only that relatively recent context allows us to say whether
a signal from Andromeda is red-shifted or blue-shifted light.2
2
In fact, Andromeda is moving toward our galaxy, so light from ordinary sources in
Andromeda should show a small blue shift.
108 Partial Observations
ground state is |Ψ1 |2 , the probability of detector 1 being in its ground state and
detector 2 being in its signal state is |Ψ2 |2 , and the probability of both detectors
being in their signal states is |Ψ3 |2 . Note that |Ψ1 |2 is not the probability that
there is a signal in detector 1. That probability is given by |Ψ1 |2 + |Ψ3 |2 .
P r(11 02 03 14 |Φ) = ΦP 4 Φ.
1 P2 P3 P (8.4)
where each of the operators on the right-hand side is a register operator. This
then gives the required result (8.4).
It will be clear from the above that the set of all partial questions involves
expectation values of all possible products of the register projection operators.
This leads to the following theorem.
Theorem 8.4 For a rank-r classical or quantum register Q[r] , the number
of possible partial questions is 3r .
Proof To prove the theorem, we determine the number of partial questions
of each rank and then add up all those numbers.
The observer could go to each of the r detectors one by one and ask one of
two possible questions of it: the two questions that can be asked at detector i
i . These are rank-one partial questions,
are given by the register projectors Pi , P
so we conclude that there is a total of 2r rank-one partial questions.
Assuming r > 1, the observer could now ask rank-two partial questions,
involving only two distinct detectors
in the register. Given a rank-r register,
there is a total of r(r − 1) = 2r distinct pairs, and for each pair of choices, 22
alternative questions can be asked. For example, for the choice i < j, we can
i j i j i j i j
four questions P P , P P , P P , and P P . Therefore, there is a total
ask the
2 r
of 2 2 such partial questions.
We can continue this argument until we reach the maximal questions, which
are rank-r partial questions. There is only one way of choosing r objects from
r objects, and 2r possible maximal questions to be asked of that choice. Hence
we find the grand total TQ of distinct partial question operators to be given by
TQ = 1 + 2 1r + 22 2r + · · · + 2r rr = (1 + 2)r = 3r , (8.11)
as asserted.
P1 22 = 22, 4 22 = 0,
P1 P (8.12)
but the respective eigenvalues cannot now be directly read off. Since the CBR is
generally useful, we need to quantify the action of the partial questions on the
CBR. We do this as follows. The set {I[r] , P1 , . . . , P 2 . . . P
1 P r } of partial questions
8.6 Identity Classes 111
0 1 02 = 0 11 02 = 1 01 1 2 = 2 1 1 12 = 3
r
contains 3r elements. We define Q1 ≡ I[r] ,Q2 ≡ P1 , . . . , Q3 ≡ P 2 . . . P
1 P r . This
choice of labeling is arbitrary, there being no obvious way to order a complete
set of partial questions. Then for any partial question QP , P = 1, 2, . . . , 3r , and
any CBR element i, i = 0, 1, 2, . . . , 2r − 1, we write
QP i = Y P,i i, (8.13)
where the question eigenvalue Y P,i is either zero or unity.
In Table 8.1, the four identity classes are separated by horizontal lines. A given
identity class consists of partial questions of the same rank, so we shall refer to
each class by its rank. In the above example, C 1,1 and C 1,2 are rank-zero identity
classes; C 2,1 , C 2,2 , and C 3,2 are rank-one identity classes; and C 4,2 is a rank-two
identity class.
Identity classes are related to probability conservation. We shall find that if
we want to conserve probability in any calculation, we need to restrict partial
questions to the same identity class. It may then be necessary to relabel the
partial questions and their question eigenvalues with an extra label identifying
individual identity classes.
needle in a haystack.3 At the end of the day, we note that the result of the search
was a single number, that is, one, which means yes, the Higgs particle exists
(according to our contextual interpretation of the data).
This line of thinking also encourages experimentalists not to give up but
to continue searches that appear to be unsuccessful. For instance, there is at
this time (2017) no direct empirical evidence for supersymmetric partners of
various particles such as electrons and photons. Because the number of zero-
value question eigenvalues is potentially vast, a lack of confirmation so far does
not mean that a yes answer does not exist.
3
Which can be done quickly with the right apparatus, such as a metal detector.
9
Mixed States and POVMs
9.1 Introduction
As the decades of the twentieth century rolled on, quantum mechanics (QM)
became more and more sophisticated and mathematical. Understanding what
this theory means intuitively was confounded not only by nonclassical concepts
such as wave–particle duality and quantum interference, but also by issues to do
with observation. The Newtonian classical mechanics (CM) paradigm of reality,
wherein reductionist laws of physics describe observer-independent dynamics of
systems under observation (SUOs) with observer-independent properties, was
found to be inadequate. Quantum theorists were confronted with the measure-
ment problem, which attempts to understand, explain, and rationalize the laws
of QM that underpin the processes of observation that go on in the laboratory.
They are not the same as those of CM in several puzzling respects.
Historically, the first sign of the measurement problem was Planck’s quantiza-
tion of energy (Planck, 1900b) and the second was Bohr’s veto on radiation damp-
ing in hydrogen (Bohr, 1913). These occurred in the first quarter of the twentieth
century, a period in physics often referred to as Old Quantum Mechanics. Another
indicator that intuition was inadequate was Born’s interpretation of the squared
modulus of the Schrödinger wave function as a probability density (Born, 1926).1
That interpretation has everything to do with observers and observation, because
probability without an observer is a vacuous concept.
Eventually, the projection-valued measure (PVM) formalism emerged, cham-
pioned by von Neumann in an influential book on the mathematical formulation
of QM (von Neumann, 1955). Subsequently, pioneers such as Ludwig (Ludwig,
1983a,b) and Kraus (Kraus, 1974, 1983) refined the theory into the general pos-
itive operator-valued measure (POVM) formalism that we shall discuss and use.
1
Born appears to have at first taken the magnitude of the wave function as the probability
density, but corrected himself in time.
9.2 Sets and Measures 115
Definition 9.2 The extended reals R∗ is the set of real numbers R and two
extra elements, +∞ and −∞. These latter two elements are referred to as plus
infinity and minus infinity, respectively, and are interpreted accordingly.
where the xn are real and the eigenvalues of X, and the projectors P n are given
by P n ≡ |xn xn |. Here the normalized eigenvectors |xn satisfy the eigenvalue
equation
X|xn = xn |xn (9.6)
and form an ONB for H.2 Orthonormality then leads to the product rule
P n P m = δ nm P n (9.7)
and the completeness sum
d
P n = I H. (9.8)
n=1
9.5 Trace
If {|ψ n : n = 1, 2, . . . , d} is an ONB for H, then the trace Tr{O} of an operator
O is defined by
d
Tr{O} ≡ ψ n |O|ψ n . (9.9)
n=1
2
This can always be arranged, even if some of the eigenvalues are degenerate.
118 Mixed States and POVMs
where U is the unitary evolution operator taking the initial state |Ψ to its final
state U |Ψ at the time of measurement and is the initial density operator ,
defined by
≡ |ΨΨ| (9.12)
in this instance.
The expectation value XΨ of the observable X conditional on Ψ is given by
d d
XΨ ≡ xPr(xn |Ψ) = xn Tr{P n U U † }
n=1 d n=1
= Tr xn P n U U † = Tr{XU U † }. (9.13)
n=1
Here the possible initial states are normalized but not necessarily mutually
orthogonal. The number K of possibilities is arbitrary and can exceed the dimen-
sion d of HS . The probabilities ω κ are epistemic in character, that is, classical
probabilities. Such a random initial state is referred to as a mixed state when K >
1. When K = 1, the observer’s ignorance about the initial state is zero and the
corresponding unique element of the Hilbert space is referred to as a pure state.
The expectation value X of the observable X conditional on a mixed state
is now
K
X = ω κ ψ κ |U † XU |ψ κ
κ=1
K
= Tr{XU ω κ |ψ κ ψ κ | U † } = Tr{XU U † }, (9.15)
κ=1
where
K
≡ ω κ |ψ κ ψ κ | (9.16)
κ=1
It is easy to show that a density operator is a positive operator and has unit
trace. This means that the eigenvalues of a density operator are nonnegative and
sum to unity.
d
X= xn P n . (9.18)
n=1
3
In standard QM, ancillas are often treated as auxiliary, almost incidental aspects. In QDN,
they are essential and as important as SUOs.
120 Mixed States and POVMs
There are two partial traces available. TrS {V ⊗ W } is a partial trace over the
SUO degrees of freedom, giving
d
d
TrS {V ⊗ W } = sm |V ⊗ W |sm = sm |V |sm W, (9.21)
m=1 m=1
Given a density operator SA over HS ⊗ HA , then we define the partial traces
S ≡ TrA {SA }, A ≡ TrS {SA }. (9.23)
Then S is a density operator over HS and A is a density operator over HA ,
with
TrS {S } = TrA {A } = 1. (9.24)
Circularity
For a single Hilbert space H, the trace of operators A, B, . . . , Z satisfies the
circularity property
TrH {ABC . . . Z} = TrH {BC . . . ZA}. (9.25)
This property holds also for tensor products of Hilbert spaces. If R1 ,R2 , . . . , RN
are operators over HS ⊗ HA , then
TrSA {R1 R2 . . . RN } = TrSA {R2 R3 . . . RN R1 }. (9.26)
Circularity does not hold for partial traces.
9.9 Purification
Suppose HS is a Hilbert space of dimension d, representing states of some SUO.
Consider a density operator S over HS . From previous sections, we can find
a set {|ψ m : m = 1 . . . d} of normalized eigenvectors of S with nonnegative
eigenvalues, that is,
S |ψ m = λm |ψ m , m = 1, 2, . . . , d, λm 0. (9.27)
Then we can write
d
S
= λm |ψ m ψ m |. (9.28)
m=1
d √
|ψ SA ) ≡ λn |ψ n ⊗ |θn . (9.29)
n=1
The density operator SA associated with this pure state is then given by
d √
SA ≡ |ψ SA )(ψ SA | = λm λn |ψ m ψ n | ⊗ |θm θn |. (9.30)
m,n=1
that is,
TrA {SA } = S . (9.32)
In words, we can represent the density operator for a mixed state in one Hilbert
space by the density operator for a pure state in a larger Hilbert space. This fun-
damental result opens the door to Naimark’s theorem (below). The state |ψ SA )
is called a purification of S . There are infinitely many possible purifications of
a given density operator.
9.11 POVMs
The Born rule for mixed states can be rewritten in a form that can be gener-
alized, leading to the POVM formalism, a more general approach to quantum
measurement than the PVM formalism.
In standard QM, given a density operator S on a d-dimensional SUO Hilbert
d
space HS and observable X ≡ m=1 xm P m , then the probability Pr(xm |S ) of
outcome xn is given by
where we use P n P n = P n .
Now suppose we can find a set {M n : n = 1, 2, . . . , N } of operators called
Kraus (or detection) operators such that
N
M n† M n = I S . (9.36)
n=1
E n ≡ M n† M n . (9.38)
E n = E n . (9.40)
Then there exists a Hilbert space HA of dimension at least N , and a pure state
|ω A in HA such that
1. The density operator A ≡ |ω A ω A | ∈ L(HA ), the Hilbert space of linear
operator over HA .
2. There is some unitary evolution operator (in QDN a semi-unitary operator)
U ∈ L(HS ⊗ HA ) such that
U † U = U U † = I SA . (9.42)
3. There is a set {P n } of projectors over HA ,
such that
E n = T rA {I S ⊗ P n U I S ⊗ A U † }. (9.43)
Note that the U operator can encode any dynamical evolution in the
combined system.
Part 2
In this part, we derive an expression for the detection operators {M n }.
Suppose an initial mixed state is prepared, such that in HS ⊗ HA it is
described by density operator
0 SA ≡ 0 S ⊗ |ω0 A ω0 A | (9.44)
and then allowed to evolve under unitary evolution U , giving the final density
operator
SA = U 0 AB U † . (9.45)
Then a projective measurement to test for outcome |xn of observable X is
made via the apparatus. The probability P r(xn |0 SA ) is given by the equivalent
of the Born rule, in this case
124 Mixed States and POVMs
i.e.
where E n ≡ M n† M n .
Notation
In the following, Dirac’s bracket notation |ψn is used for system states; our bold
notation in denotes computational basis representation (CBR) of labstates; and
modified Dirac brackets |Ψn ) = |ψn , in ) ≡ |ψn ⊗ in describe total states. We
make an exception to our rule suppressing tensor product symbols in the case of
total states, in order to separate out system states and labstates.
By definition, system states are unobservable directly and there is no nat-
ural preferred basis for HnS .4 Therefore, we are free to choose any convenient
4
The principle of “general covariance,” or independence of frame of reference, is meaningful
as far as system states are involved, but vacuous as far as labstates are concerned.
9.13 QDN and POVM Theory 125
dn 2 −1
rn
[n]
→ , (9.49)
α=1 i=1 αi
[0]
[0]
|Ψ0 ) ≡ ψ0αi |α0 ⊗ i0 , (Ψ0 | = ψ0αi∗ α0 | ⊗ i0 , (9.50)
i αi
where ψ0αi∗ is the complex conjugate of ψ0αi . If the initial total state is normalized
to unity, then we have
[0]
(Ψ0 |Ψ0 ) = |ψ0αi |2 = 1. (9.51)
αi
where ΣN is the final stage and the contextual evolution operator UN,0 is given by
[N ] [0]
αi,βj
UN,0 ≡ UN,0 |αN β0 | ⊗ iN j 0 . (9.53)
αi βj
[N ] [0]
αi,βj∗
U N,0 = UN,0 |β0 αN | ⊗ j 0 iN . (9.54)
αi βj
The semi-unitary condition U N,0 UN,0 = I0 , where I0 is the identity operator for
the initial total space H0 , requires the conditions
[N ]
αi,βj∗ αi,γk
UN,0 UN,0 = δ βγ δ jk . (9.55)
αi
126 Mixed States and POVMs
i
[N ] [0]
αi,βj∗
M N,0 ≡ U N,0 iN = UN,0 |β0 αN | ⊗ j 0 , (9.56)
α βj
Interpretation
We may readily understand the QDN POVM formalism if we consider the pos-
sible questions that the observer could ask. Those questions can be only asked
about the signal status of the apparatus. If the final stage normalized total state
is |ΨN ) and the final stage apparatus has rank rN , then there is a grand total
of 2rN maximal questions, each of them equivalent to a projector of the form
INS
⊗ iN iN , for i = 0, 1, 2, . . . , 2rN − 1. The probability P (iN |Ψ0 ) that the final
labstate is iN is given by
P (iN |Ψ0 ) ≡ (ΨN |iN iN |ΨN ) = (Ψ0 |U N,0 iN iN UN,0 |Ψ0 )
i
= (Ψ0 |U N,0 iN iN UN,0 |Ψ0 ) = (Ψ0 |M N,0 MN,0
i
|Ψ0 ) (9.59)
i Mi i
M N,0 EN,0
i
N,0
= Tr EN,0 0 , (9.60)
where 0 ≡ |Ψ0 )(Ψ0 | is the initial stage density operator.
We saw in Chapter 8 that we can also ask partial questions, involving only a
subset of all the final state detectors. We can answer these questions as well,
because any partial question is equivalent to some combination of maximal
questions.
Suppose we wanted to ask the partial question P 2 , that is, look only at
detector 2 and ask whether it is in its signal state. Then we use the rule that
i = I, the register identity, to write
for i = 1, 2, ...,d, Pi + P
2 = IP
P 2 I = (P1 + P1 )P
2 (P3 + P3 )
2 P3 + P
= P1 P 2 P3 + P1 P
1 P 2 P
3 + P 2 P
1 P 3
= Q2 + Q3 + Q6 + Q7 . (9.61)
We conclude from this that knowledge of all the maximal questions and their
answers allows us to answer all partial questions.
where the ω are probabilities summing to unity and |Ψκ0 ) is the normalized total
κ
SUO states and treat HS as stage independent.5 That is certainly not the case
for the quantum registers Qn .
The third aspect concerns the notion of purification that we discussed above for
standard QM. There it was pointed out that the dimension of the ancilla Hilbert
space HA had to be at least as great as the number of independent states in
the prepared mixture that was being purified, and that there was always an
infinite number of ancillas that could be employed. In the case of QDN, neither
of these comments applies. The analogue of the ancilla concept is the quantum
register modeling the apparatus, and the rank of that register is determined
by the number of detectors that the observer has constructed. That number is
independent of the dimensionality of HS . Moreover, the concept of preferred
basis does not apply to the standard ancilla concept, whereas it is relevant in
QDN. The moral here is that standard QM treats states of SUO as the primary
objects of interest, whereas QDN relegates them to auxiliary devices and elevates
the apparatus to be the only thing that matters. This is exactly in line with
Wheeler’s participatory principle, mentioned previously.
We start our analysis therefore with a statement as to what is known at initial
stage Σ0 . We will assume that a mixed SUO state has been prepared at stage
Σ0 ,6 such that the initial density matrix 0 SA is an element of L(HS ⊗ Q0 )
given by
1 ,
0 SA ≡ S ⊗ 10 10 = S ⊗ P (9.63)
0
5
In our computer algebra program MAIN, discussed in Chapter 12, we take the system space
to change from stage to stage.
6
States of SUOs are treated in QDN as convenient repositories of context. By their actions
in the state preparation process, an observer will in general be entitled to model some of
that context in terms of a density operator representing a mixed state of an SUO.
9.13 QDN and POVM Theory 129
d
dn+1 dn
Un+1,n = |B, j n+1 )U BA,ji
n+1,n (A, in |. (9.65)
A,B=1 j=0 i=0
d
dn+1 dn
U n+1,n = |A, in )U BA,ji∗
n+1,n (B, j n+1 |. (9.66)
A,B=1 j=0 i=0
By definition, we have
where I S is the identity over HS and I1 A is the identity over Q1 . From this, we
arrive at the semi-unitary conditions
d d
n+1
U CA,ja∗ CB,jb
n+1,n U n+1,n = δ
AB ab
δ . (9.68)
C=1 j=0
evolving from state Σ0 to stage Σ1 , giving total state |Ψ1 ) ≡ U1,0 |Ψ0 ).
Continuing this process, we find the state at stage ΣN is given by |ΨN ) =
UN,0 |Ψ0 ), where UN,0 ≡ UN,N −1 UN −1,N −2 . . . U1,0 .
Suppose now the observer asks a partial question QθN of the state of the
apparatus at stage ΣN . The expectation value E[QθN |Ψ0 ] of the answer is given by
d
E[QθN |Ψ0 ] ≡ (ΨN |I S ⊗ QθN |ΨN ) = 0 Ψ0 A|E N |B,
ΨA∗ B θ
(9.70)
A,B=1
where
If now the initial state is a mixed state, such that the initial SUO state |Ψα
0 is
given with probability ω α , then the expectation value is given by
where
0 ≡
A ω α |Ψα
0 Ψ0 |.
α
(9.73)
α
We note that, provided we sum θ over all elements of an identity class Θ only,7
then
E θN = 10 U N,0 I S ⊗ QθN UN,0 10 = 10 U N,0 I S ⊗ IN UN,0 10 = I S . (9.74)
θ in Θ θ in Θ
Hence
E[QN |0 ] = TrS
θ
0 A
EN θ
= TrS {0 A I S } = 1. (9.75)
θ in Θ θ in Θ
7
Identity classes are discussed in the previous chapter.
10
Double-Slit Experiments
10.1 Introduction
In this chapter we show how the quantized detector network (QDN) formal-
ism describes the double-slit (DS) experiment. This is arguably the simplest
experiment that demonstrates quantum affects such as wave–particle duality and
quantum interference. It continues to be the focus of much debate and experiment
(Mardari, 2005), because theoretical modeling of what is going on reflects current
understanding of quantum physics and hence physical reality. We will apply QDN
to two variants: the original DS experiment and the monitored DS experiment,
where an attempt is made to determine the imagined path of the particle.
The DS experiment is widely acknowledged by physicists to be of importance
to the understanding of quantum mechanics (QM). So much so that in 2002,
the single electron version, first performed by Merli, Missiroli, and Pozzi (Merli
et al., 1976), was voted by readers of Physics World to be “the most beautiful
experiment in physics” (Rosa, 2012).
The DS experiment can be discussed in terms of three stages, shown in
Figure 10.1. By the end of the preparation stage, Σ0 , a monochromatic beam of
light or particles has been prepared by a source P , such as a laser. The beam
emerges from point O and then passes through an information void V1 to the
first stage, Σ1 , which consists of a wall or barrier W . This wall has two openings
denoted A and B that allow parts of the beam to pass through into another
information void V2 and onto the second and final stage Σ2 , which consists of a
detecting screen S.
The screen S is in general some material that can absorb and record particle
impacts. In reality, any screen will consist of a finite number of signal detectors,
such as photosensitive molecules, but the typical QM modeling is done as if
there were a continuum of sites on the screen, such as C, that could register
particles.
132 Double-Slit Experiments
V
D
A
C z
O
P
B
S
E
W
Runs
Any DS experiment will consist of a large, possibly enormous, number of runs,
or repetitions of a basic protocol.
Statistics
The conclusions of the experiment are based on a statistical analysis of the data
averaged over all valid runs.
Preparation Stage
Each run consists of the observer establishing the start of that run, indicated by
stage Σ0 in Figure 10.1. This means that the observer will have reliable contextual
information that some previously agreed-on procedure has been carried out at
device P . If anything occurs to prevent confidence in that information, then that
run is discarded.
Outcome Stage
At the end of each run, corresponding to stage Σ2 in Figure 10.1, the observer
looks at every accessible point/detector on the screen S and records whether
each detector has a signal or not. The data from each run are entirely classical
at this point, being in the form of a vast number of bits of yes/no values, each
value coming from a given detector on the screen S.
Information Void
It is a critical feature of any quantum experiment, apart from the quantum
Zeno-type experiments discussed in Chapter 15, that between stages Σ0 and
10.3 Baseball in the Dark 133
Reset
At the end of each run, the detectors in screen S are reset to their ground states,
ready for the next run, which takes place as if no other run had ever occurred.
Line-of-Sight Violation
With reference to Figure 10.1, suppose opening B is blocked off but otherwise the
experiment runs as described above. According to CM, particles passing through
opening A should pass more or less undisturbed onto position D on the screen
S. Of course, the opening at A would deflect some of the particles, since the
wall W consists of atoms, so it is to be expected that at the edges of opening
A, forces will act on any beam particles passing nearby. However, the fraction of
all the particles passing through either opening that comes close to the edges of
the slits is expected to be relatively small, so we expect to find a relatively large
distribution of particles around position D on the screen S with relatively few
to either side.
10.4 Observed Phenomena 135
What is observed is quite different. There is indeed a broad peak found centered
on D, but it extends further over the screen than expected from naive CM
expectations. Let us call the probability profile of this broad peak P A (z), where
z is the position coordinate of points along the screen as shown in Figure 10.1.
If now instead of B we blocked off opening A, then we would get a similar broad
distribution P B (z), this time centered on E, on the line of sight from O to B.
Interference
What is astonishing is that if now we ran the experiment with both A and B open,
then we would not find a simple distribution P A (z) + P B (z) as expected from
CM principles. Instead, we would find a distribution P A (z) + P B (z) + I AB (z),
where I AB is known as an interference term. It is this interference term that
causes all the fuss.
Self-interference
It was originally believed that a theoretical particle–wave conflict could be
avoided if the explanation of the interference term I AB was that there were
interactions of the particles coming from opening A that were somehow
interacting with particles coming from opening B, thereby disrupting the basic
addition of P A to P B on the detecting screen.
But the mystery was only deepened when DS experiments were done such that
the rate of particles falling on the screen was extremely low (Taylor, 1909). So
low in fact that there would be one (or even fewer than one)1 on average landing
on the screen S per run. It was found that the interference term occurred in the
statistical analysis when a large number of runs was performed, even when only
one particle could possibly have passed through at a time. The point is that the
aforementioned picture of clouds of particles interfering with each other cannot
be valid here when only one particle passes through at a time. The paradox is
that it is then hard if not impossible to understand why the interference term
should occur in the analysis of many separate, uncorrelated runs. When QM
is interpreted in terms of particle–waves interfering with each other, the low-
intensity interference pattern is generally referred to as self-interference, a term
commonly attributed to Dirac in his famous book on QM (Dirac, 1958).
Not all theorists subscribe to this view (Mardari, 2005). From the QDN per-
spective, the term self-interference is a dangerous one to take seriously, as it
invokes a confusing picture of a classical particle that behaves unlike a classical
particle. We do not have to believe in particles: the only things that we can be
sure of are signals in detectors.
1
This is deep. An average of fewer than one particle per run would mean, of course, that
during some runs, no particles were recorded as having landed on the screen. An average of
fewer than one particle per run vindicates the view that what matters is what the observer
does, not what we think they do. Observers push buttons and look at screens. They do not
know for sure what is happening beyond that description.
136 Double-Slit Experiments
Path Indeterminacy
The interference term IAB requires both openings A and B to be present. If any
attempt is made to block off one or another opening, then I AB disappears. Now
according to CM, every time a detector registers a signal at S, that is evidence
for a particle having traveled from source P to that detector. According to CM
principles, that particle had to have traveled along a continuous path from O
through either A or B. If true, it should be possible to establish which of the two
openings it was. But any attempt to do this appears to destroy I AB .
It is as if the observer has two mutually exclusive choices: either have no
knowledge about which path was taken, and then P A , P B , and the interference
term occur, or know which path was taken, such as through A, but then the inter-
ference term and P B disappear and only P A is observed. In QM, the principle
that there is this exclusive choice is generally referred to as complementarity.
It may be possible to trade off information, with the observer having only
a probability estimate for each of these alternatives. In that case, the overall
distribution on the screen would depend on that probability estimate in some
way. However, predicting that distribution would undoubtedly require the most
careful analysis of context. Experiments along such lines have been attempted,
such as that of Afshar (Afshar, 2005). The results of such experiments remain
controversial (Kastner, 2005).
Wave–Particle Duality
We shall see in the next section that physicists can get a good theoretical handle
on the experiment by applying the particle–wave concept inherent to Schrödinger
wave mechanics. The DS experiment touches on particle-like attributes because
the detectors in the screen S respond in a discrete yes/no way characteristic of
particle impacts, while on the other hand the distribution P A +P B +I AB is char-
acteristic of wave dynamical processes. The mystery only arises when theorists
suppose that there are objects with particle and wave properties simultaneously
and fail to recognize empirical context as a critical factor in the experiment. In
actuality, each aspect (particle or wave) is significant within its own particular
empirical context, and there are no real paradoxes in the laboratory. The so-
called wave–particle paradox occurs only because of the way humans generally
choose to interpret their experiments.
d
i |Ψ, t = H|Ψ, t, (10.1)
dt
is the free particle Hamiltonian given by
where H
H = p · p . (10.2)
2m
Relative to a standard improperly normalized particle position basis
{|x, x ∈ E3 }, the state vector |Ψ, t is given by
%
|Ψ, t = d3 xΨ(x, t)|x. (10.3)
The wave-function part Ψ(x, t) of the solution to Eq. (10.1) is readily found by
standard methods to be given by
%
Ψ(x, t) = d3 yF (x, t; y, t0 )Ψ(y, t0 ), t > t0 , (10.4)
&
where the propagator F (x, t; y, t0 ) ≡ x|e−iH(t−t0 )/ |y is given by (Feynman
and Hibbs, 1965)
3/2 '
−im im(x − y)2
F (x, t; y, t0 ) = exp , t > t0 . (10.5)
2π(t − t0 ) 2(t − t0 )
This propagator will be valid for events (x, t) and (y, t0 ) in the same information
void region. We shall consider what happens in region V1 and then in region V2 .
In the following, we take standard Cartesian coordinates x ≡ (x, y, z) with
origin at O in Figure 10.1, x-axis along the beam direction, y-axis transverse
to the beam, and z-axis in the direction from B to A. For simplicity, we shall
suppress any transverse effects and assume that openings A and B are almost
point-like, with coordinates xA = (d, 0, a) and y B = (d, 0, −a), respectively,
where d is the distance of the wall W from the opening O and a is positive.
We shall consider what happens at a point C on the detecting screen S, with
coordinates xC = (d + D, 0, c), where D is the distance of the screen S from the
wall W .
For the rest of this section we use the notation x1 ≡ (x1 , t1 ), y2 ≡ (y 2 , t2 ),
and so on.
Region V1
We imagine that at the first stage Σ0 , a normalized pulse is emitted from O,
characterized by wave function Φ(x0 ). At a given event with coordinates x1 on
the wall W , the wave function Ψ(x1 ) after propagation through V1 is given by
%
Ψ(x1 ) = d3 x0 F (x1 ; x0 )Φ(x0 ), t 1 > t0 . (10.6)
Region V2
Equation (10.6) gives the wave function impacting on the wall W on the side
facing the first information void V1 . We need an expression for the wave function
on the other side of W that acts as an initial wave function propagating into
information void V2 and hitting the screen S. Stage Σ1 can be thought of in this
respect as a preparation stage.
To this end we introduce shape functions GA (x1 ) and GB (x1 ) that characterize
the openings A and B. Feynman and Hibbs (1965) discuss this calculation where
a Gaussian shape function is assumed.
A The prepared wave function on the V2
B
side of the wall is then given by G (x1 ) + G (x1 ) Ψ(x1 ).
Propagation through V2 follows the same pattern as through V1 . The final
stage wave function Ψ(x1 ) for an event on S at stage Σ2 is given by
%
Ψ(x2 ) = d3 x1 F (x2 ; x1 ) GA (x1 ) + GB (x1 ) Ψ(x1 ). (10.9)
What matters here is the squared modulus |Ψ(x2 )|2 , which according to the
Born interpretation (Born, 1926) is the probability density relevant to outcome
detection on the screen S. From (10.9) we find
|Ψ(x2 )|2 = P A (x2 ) + P B (x2 ) + I AB (x2 ), (10.10)
where
%
A
P (x2 ) = d3 x1 d3 y 1 F (x2 ; x1 )F ∗ (x2 ; y1 )GA (x1 )G∗A (y1 )Ψ(x1 )Ψ∗ (y1 ),
%
P B (x2 ) =d3 x1 d3 y 1 F (x2 ; x1 )F ∗ (x2 ; y1 )GB (x1 )G∗B (y1 )Ψ(x1 )Ψ∗ (y1 ),
% A '
∗ G (x1 )G∗B (y1 )+
AB 3 3
I (x2 ) = d x1 d y 1 F (x2 ; x1 )F (x2 ; y1 ) Ψ(x1 )Ψ∗ (y1 ).
GB (x1 )G∗A (y1 )
(10.11)
In these integrals, we may use (10.6) to work out the outcome probabilities
from a knowledge of the detectors in the screen S and the characteristics of
the preparation device P ; that is, we should be able to specify the initial wave
function Φ(x0 ) reasonably well.
Blocking off opening B corresponds to setting GB to zero, and then we see
from (10.11) that the interference term and P B vanish. A similar remark applies
the blocking off of opening A.
12
22
A 1
1 32
P 10 B 42 S
21 52
K2
S0 S1 S2
Stage Σ0
The initial normalized labstate Ψ0 prepared by apparatus P by stage Σ0 is
denoted in QDN by
Ψ0 = 10 , (10.12)
Stage Σ1
The QDN description of Ψ1 , the labstate at stage Σ1 , has a dual role. On
the one hand it represents what the observer could find (statistically) if they
looked at either or both openings A and B at that stage. Actually, doing this
would be part of the complex calibration processes involved in setting up the
experiment in the first place. On the other hand, Ψ1 should be regarded as the
initial labstate for propagation to the next stage, Σ2 , landing on the screen S.
If this latter alternative is chosen, then the observer should make no attempt to
extract information from Ψ1 .
The rules of QM lead us to assert that
1 + α2 A
Ψ1 = U1,0 Ψ0 = (α1 A 2 )01 , (10.13)
1 1
140 Double-Slit Experiments
Stage Σ2
The transition from stage Σ1 to stage Σ2 is handled as follows. We assert that
a 01 , a = 1, 2, on
there is a semi-unitary operator U2,1 such that for each term A 1
the right-hand side of equation (10.13),
K
a 01 =
U2,1 A j,a j
U2,1 A2 02 , a = 1, 2, (10.14)
1
j=1
assuming there are K detector sites on the detecting screen S. This process
U2,1presently. Conserva-
preserves signality. We shall comment on the nature of
j,a
tion of probability requires the complex coefficients U2,1 to satisfy the semi-
unitarity rule
K
U j,b∗ j,a ab
2,1 U2,1 = δ , (10.15)
j=1
It can be readily checked, using the normalization condition and the above semi-
unitarity rules that Ψ2 Ψ2 = 1.
We can now readily calculate all outcome probabilities, by choosing any of
the 3N maximal or partial questions. For example, the conditional probability
Pr(k2 |Ψ0 ) that the kth detector at stage Σ2 would be in its signal state is given
by Pr(k2 |Ψ0 ) = Ψ2 Pk Ψ2 , which readily evaluates to the value
2
10.7 Contextual Subspaces 141
2
Pr(k2 |Ψ0 ) = αa∗ αb U k,a∗ k,b
2,1 U2,1 , k = 1, 2, . . . , K. (10.17)
a,b=1
Exercise 10.1 Prove (10.17) and show that total probability is conserved,
that is,
K
Pr(k2 |Ψ0 ) = 1. (10.18)
k=1
In the case of the DS experiment, the contextual bases B1c , B2c are given by all
the respective signality-one states, so we have
B1c ≡ A 1 01 , A
2 01 , B c ≡ A i 02 : i = 1, 2, . . . , K . (10.22)
1 1 2 2
These define the contextual subspaces Qc1 and Qc2 . From (10.14), we can construct
the contextual evolution operator Uc2,1 and its retraction Uc2,1 . These are given
in the CBR by
2
K
j,a j−1 a−1
2
K
Uc2,1 ≡ U2,1 22 21 , Uc2,1 ≡ U j,a∗ a−1 j−1
2,1 22 21 (10.23)
a=1 j=1 a=1 j=1
and satisfy the required relation Uc2,1 Uc2,1 = Ic1 , where Ic1 is the stage-Σ1 contextual
identity
Ic1 = 11 11 + 21 21 . (10.24)
Remark 10.4 The real world of experience and the world of the theorist’s
imagination are each far too complex to understand fully. Experiments
attempt to limit the complexity of the former by focussing on a limited
number of detectors. Contextual subspaces implement that strategy as closely
as possible, limiting the amount of complexity that the theorist needs to face.
Usually it will be clear by context when we are dealing with contextual sub-
spaces, so we shall usually drop the superscript c in our notation.
12
A 22
11
32
P 10 42 S
B 52
21
K2
S0 S1 S2
Figure 10.3. The QDN explanation of the Sillitto-Wykes experiment result
is that stage Σ1 need not coincide with a hyperplane of simultaneity in the
laboratory, or indeed, in any other frame of reference.
think of that as having happened in any way; physicists are not in the business
of believing in unverified propositions.
It is that lack of entitlement that really upsets people, because it destroys the
comfortable belief that real things are going on, even when we are not observing
them. Physicists, being people, felt compelled to find out whether the path of
electron could be determined when an interference pattern on screen S was found.
The monitored DS experiment was devised in order to investigate this issue.
V V
DA
A C
z
P L
B
DB S
W
Path A
The electron passes through slit A and interacts with the light, with an amplitude
at C written as aφ1 , where a is a factor representing the electron–electromagnetic
field interaction.
Path B
The electron passes through slit B and interacts with the light, with an amplitude
at C written as bφ2 , where b is a factor representing the electron–electromagnetic
field interaction.
A2
B2
A 1 12
1
P 10 22 S
B
21 32
K2
S0 S1 S2
Figure 10.5. The QDN stage diagram for the monitored DS experiment.
K
a 01 =
U2,1 A j + V j,a A
(U j,a A j A
A j,a j B j A
A2 A2 + X j,a A A B
2 2 +W 2 2 A2 )02 . (10.27)
1
j=1
2
(a) (b) (c) (d)
The four terms on the right-hand side represent the following alternatives:
(a) U j,a is the signality-one amplitude that an electron has passed through
opening a and has landed on the screen at site j and no signal is detected in
either DA or DB .
(b) V j,a is the signality-two amplitude that an electron has passed through
opening a and has landed on the screen at site j and a signal is detected in
DA and not in DB .
(c) W j,a is the signality-two amplitude that an electron has passed through
opening a and has landed on the screen at site j and a signal is detected in
DB and not in DA .
(d ) X j,a is the signality-three amplitude that an electron has passed through
opening a and has landed on the screen at site j and a signal is detected in
DA and a signal is detected in DB . This possibility is not considered in the
calculation of FLS.
The labstate Ψ2 is given by
2
K
j + V j,a A
αa (U j,a A j A
A j,a j B j A
A2 A2 + X j,a A A B
Ψ2 = 2 2 2 +W 2 2 A2 )02 . (10.28)
a=1 j=1
From this we can readily determine all the various probabilities of interest by
asking the appropriate questions. For example, if we ask for the probability
that the electron has landed on the kth detector site, and there is a signal in
DA , and there is no signal in DB , we need to calculate the expectation value
k P
Ψ2 P A B
2 2 P2 Ψ2 . To evaluate this, we first note that the signal algebra gives
2
P A B
k P kAA
2 2 P2 Ψ2 = αa V k,a A 2 2 02 , (10.29)
a=1
10.9 The Monitored Double-Slit Experiment 147
k PA P
Similarly, the probability Ψ2 P B
2 2 2 Ψ2 , that the electron landed on the kth
screen site and there was no signal in DA and there was a signal in DB , is
given by
k PA P
Ψ2 P B
2 2 2 Ψ2 = |α W + α2 W k,2 |2 .
1 k,1
(10.31)
The QDN calculation allows greater flexibility in the architecture of the exper-
iment than that assumed in the FLS calculation, but the price is that more
assumptions have to be made about the various amplitudes.
11
Modules
11.1 Modules
In this chapter, we discuss important modules such as Wollaston prisms, Newto-
nian prisms, beam splitters, phase shifters, mirrors, and null modules. Modules
are pieces of apparatus placed at various strategic positions in the informa-
tion void and are designed to influence labstate amplitudes at signal detectors.
Modules are therefore a critical component in the quantized detector network
(QDN) approach to quantum mechanics (QM).
Modules are not classified as either real or virtual detectors, because no infor-
mation is extracted from them. Their function is solely to influence quantum
amplitude propagation between stages in specific ways. Even empty space (the
vacuum) can be regarded as a module, for the propagation of signals through
nominally “empty” space is a fundamental subject in its own right. A particularly
important example illustrating how nontrivial that can be is the Hubble–Doppler
red shift of light from distant galaxies.
In the QDN description of the double-slit (DS) experiment, shown in Figure
10.2, the stage Σ1 detectors labeled 11 and 21 may be considered as on the V2
side of the wall W , where V2 is the information void between stages Σ1 and Σ2 .
Viewed in this way, the wall may be interpreted as a part of the appara-
tus that is positioned in the information void region V1 so as to influence the
signals obtained at detectors 11 and 21 . Virtually all experiments have similar
components of apparatus in the information void. We shall call any such piece of
apparatus a module. Modules are necessary to the architecture of the experiment,
are situated in the information void, and therefore are not detectors.
Examples of modules are Stern–Gerlach (SG) inhomogeneous field magnets,
Wollaston prisms, mirrors, phase changers, and so on. Even empty space, other-
wise known as the vacuum, should be regarded as a module.
Each module has its own physical properties. These include the number of
detectors feeding amplitudes into that module and the number of detectors that
11.2 The Vacuum 149
amplitudes are fed out from. We shall discuss a number of modules relevant to
the experiments discussed in this book, starting with the vacuum.
0
1
The question of what the best model of the “internal” vacuum of the RIC
could be is a crucial one, because there are many contexts where the answer will
influence the calculation of the quantum signal amplitudes of relevance to QDN.
For example, we may well need to take spacetime curvature into account if we are
sending signals across vast distances. Such a scenario is found in all observations
in astronomy and cosmology.
We have indicated previously that QDN has nothing to say about the infor-
mation void or the vacuum, precisely because QDN models what is happening
at the signal detectors. Therefore, we will need to bring in standard physics
theory, such as relativistic quantum field theory, to help us write down signal
amplitudes between those detectors. Should that appear to make QDN rather
limited, we point out that standard theories do have their limitations as well,
such as the appearance of infinite renormalization constants in quantum field
theory. We interpret that as the same problem seen from the opposite direction.
It will probably be only by a judicious combination of QDN (or whatever should
replace it) on the apparatus side of the physics coin and of standard relativistic
quantum field theory on the reductionist side that we will find a better approach
to empirical physics than with just either alone.
In QDN notation, signal amplitude propagation across the information void is
represented by featureless lines, as in Figure 11.1.
1
As stated in a previous chapter, we drop the superscript “c” denoting “contextual.”
152 Modules
In this particular case, these POVMs satisfy the relations E i1,0 E j1,0 = δ ij E i1,0 (no
sum over i) and
E 11,0 + E 21,0 = |s10 s10 | + |s20 s20 | ⊗ 10 10 = IH ⊗ Ic0 , (11.9)
the identity operator for the initial contextual Hilbert space H ⊗ Qc0 . From these
operators we find the conditional outcome rates
Pr(11 |Ψ0 ) ≡ T r E 11,0 0 = |α|2 , Pr(21 |Ψ0 ) ≡ T r E 21,0 0 = |β|2 , (11.10)
2
Note that this expression is based on the ancient and misleading paradigm that observers
look “at” objects.
11.4 The Newtonian Prism 153
S P i P i
P P
S K K
Figure 11.3. (a) Newtonian prism P 1 splits an incoming beam of white light
from source S (the Sun) into subbeams associated with primary colors. Prism
P 2 does not split any one of those subbeams further. (b) The QDN schematic
of the same process.
Hilbert space containing free photon states3 and Qn is the apparatus quantum
register at that stage. Relevant states in HnEM are constructed in terms of a
suitable orthonormal set {|in : i = 1, 2, . . . K}, with orthonormality condition
in |jn = δ ij .
With reference to Figure 11.3(b), the stage-Σ0 normalized incident state |Ψ0 )
is defined as
K
K
|Ψ0 ) ≡ ψ0i |i0 ⊗ 10 , |ψ0i |2 = 1. (11.11)
i=1 i=1
which models the splitting up of the original beam into its primary colour
constituents.
Index Labels
The electromagnetic state indices i0 and i1 have a temporal subscript that labels
stages only and does not affect the “value” of the index. So, for example, we take
i1 = i0 ≡ i.
Discreteness
Contrary to what we might anticipate from experience with RQFT, the elec-
tromagnetic index in in the above is discrete, not continuous, because we can
3
Any reference to photons is for convenience. The “basis” states here form a discrete set and
therefore are not plane wave solutions but approximate them suitably.
154 Modules
only ever have a discrete set of detectors. This overturns the usual approach
to RQFT, which presupposes a spectrum of photon states with a continuous
frequency range. We do not need to deal with any supposed continuity in HnEM ,
because the only parts of it that we can ever deal with are found at the finite,
discrete detector end. Everything done here is contextual: it is the presupposition
that an unobserved (and unobservable) continuum of states should play a role
in, for example, Feynman graph integrals that contributes to the divergence of
renormalization constants in RQFT. In the laboratory, all signals are finite.
The effect of the first prism P 1 on the incident beam is modeled by the action
of a contextual semi-unitary evolution operator U1,0 , given by
K
U1,0 ≡ i 01 10 ,
|i1 i0 | ⊗ A (11.13)
1
i=1
With reference to Figure 11.3(a), we see that only one chosen subbeam, labeled
i in Figure 11.3(b), enters the second prism P 2 , all the other subbeams being
blocked off. Then we may write
i 01 ≡ |i2 ⊗ A
U2,1 |i1 ⊗ A i 02 . (11.14)
1 2
Hence we deduce
K
U2,1 = j 02 01 Aj + |i2 i1 | ⊗ A
|j2 j1 | ⊗ A i 02 01 Ai . (11.16)
2
1
2
1
j=1,
j=i null tests through P2
Although the two terms on the right-hand side of this expression look similar,
they are very different contextually. A practical difference would be that |j2 is
an exact copy (in H2EM ) of |j1 (in H1EM ), for j = i, whereas |i2 would not be
an exact copy of |i1 because of, say, attenuation effects as that subbeam passed
through P 2 .
The point here is that context underpins the significance of everything, and
that context is determined by the observer. If they decide to block off all sub-
beams except the one labeled i, and then actually do that, then that action
(not the decision alone) automatically brings in empirically significant context
that involves separate dynamics and apparatus. It is the essence of Wheeler’s
participatory principle quoted on Chapter 1 that action be actually carried out
when discussing quantum processes.
We see from this that the freedom that the observer has to choose how to
rearrange apparatus creates a contextuality. This is emphasized in the next part
of our description of the Newtonian prism, the recombination of a split spectrum
back into the original beam.
S P LP
K
S L
P P
Figure 11.4. (a) Newton used lens L and prism P 2 to recombine the spectrum
from prism P 1 back into a beam of white light. (b) In the QDN schematic of
the same process, the lens and second prism constitute module LP .
Stage Σ0
The initial labstate is
K
K
|Ψ0 ) ≡ ψ0i |i0 ⊗ 10 , |ψ0i |2 = 1. (11.17)
i=1 i=1
Stage Σ0 → Σ1
The evolution operator U1,0 that models the action of prism P 1 is given by
K
c
U1,0 = i 01 10 .
|i1 i0 | ⊗ A (11.18)
1
i=1
Stage Σ1 → Σ2
The evolution operator U2,1 that models the undoing action of the module LP
consisting of the lens L and the prism P 2 is given, by inspection, by
K
U2,1 = 1 02 01 Ai .
|i2 i1 | ⊗ A (11.20)
2 1
i=1
K
|Ψ2 ) ≡ U2,1 |Ψ1 ) = ψ0i |i2 ⊗ 12 . (11.21)
i=1
Comparing this with the original labstate (11.17), we see that the final total
state |Ψ2 ) is a persistent image of |Ψ0 ). Therefore, the combination of prism P 1
followed by prism P 2 is equivalent to a null test of the original beam.
It might be believed that we have encountered an example of semi-unitary
evolution where the initial quantum register had a larger rank than the final
quantum register. In fact, that is not the case. The action of the lens between
the prisms is designed to focus on a one-dimensional subspace of the stage-Σ1
register, a subspace containing the image of the original beam. It is that subspace
alone that is then mapped by the lens and second prism into the one-dimensional
subspace representing the outgoing recombined beam.
When Newton’s recombination experiment is examined in fine detail, only then
can it be appreciated how extraordinarily difficult it is to model what happens
with some degree of correctness. The reader is invited to model this experiment
using only quantum electrodynamics (QED) in its standard formulation, and
they will then see what we mean.
In stage diagrams such as Figure 11.5, where the square representing the beam
splitter is rotated as shown to face the two input ports 10 and 20 , we shall
adopt the convention that 21 and 11 are the transmitted and reflected beams,
respectively, associated with input port 10 , while they are the reflected and
transmitted beams, respectively, associated with input port 20 . This convention
is used with Form 2 above throughout this book in the encoding of our computer
algebra program MAIN, discussed in the next chapter.
In many experiments discussed in the√literature, Form 2 is assumed, with
t and r taken equal, that is, t = r = 1/ 2. In our computer algebra program
MAIN discussed in the next chapter, we do not do this. We use Form 2, but it is
generally most instructive to leave the transmission and reflection coefficients
arbitrary up to the required constraints. For instance, ti and ri will be the
transmission and reflection coefficients for beam splitter B i and will be arbitrary,
apart from the condition that (ti )2 + (ri )2 = 1.
Signality-Two Input
On occasion, the possibility arises that a signality-two labstate or its equivalent
is incident on a beam splitter. We shall refer to this as beam splitter saturation.
The dynamics for such a scenario has to be decided contextually. One possibility
is that the two photons (speaking loosely) do not emerge and are absorbed by
the device. In such a case, the output channels of the beam splitter remain in
their ground states. On the other hand, we could decide to model what happens
as a case of transparency, such that the output labstate is a signality-two state
with both output detectors registering a signal.
11.6 Mirrors
Mirrors are important modules in many experiments. The basic action of a mirror
is to deflect electromagnetic radiation, that is, change path direction.
Depending on the physics, mirrors can also change the phase of electromagnetic
waves. According to Fresnel’s laws of optics, light incident from air on a mirror
undergoes a phase shift of π, if reflecting from the front surface of a mirror, while
there is no phase shift on rear surface reflection.
Naturally occurring mirrors can have a severe influence on signal detection
amplitudes. For example, television signals received at an aerial are built up
from the superposition of all the signals that have followed separate paths from
the transmitting station to that aerial. If a transmitted signal has one path
that goes in line of sight from transmitter to detector and another path that
goes from transmitter, is reflected from the surface of the sea, and then arrives
at the detector, then destructive interference can occur, thereby degrading the
overall detected signal (Laven et al., 1970). Moreover, if the sea level changes
due to tidal action, then the interference varies during the day and cannot be
eliminated easily.
In stage diagrams, mirrors are labeled M .
12.1 Introduction
The quantized detector network (QDN) formalism was designed from the outset
to deal with detector networks of arbitrary complexity and rank. However, there
is a price to be paid. A qubit register of rank r is a Hilbert space of dimension 2r ,
so the dimensionality of quantized detector networks grows exponentially with
rank. For example, a relatively small system of, say, 10 detectors involves a
Hilbert space of dimension 210 = 1, 024.
Even such a relatively small system cannot be dealt with easily by manual cal-
culation, because quantum mechanics (QM) involves entangled states. Labstates
in QDN are far more complex (in the sense of having far more mathematical
structure) than the corresponding states in a classical register of the same rank.
Some of the mathematical entanglement and separability properties of quantum
labstates are discussed in Chapters 22 and 23.
The quantum entanglement structure of labstates poses a ubiquitous and
serious problem, for both theorists and experimentalists. At this time, there
is significant interest in quantum entanglement, particularly regarding its use
in quantum computing, but theoretical understanding of entanglement is still
surprisingly limited.
On the experimental side, quantum computers with 2000 qubits are currently
being developed (D-Wave Systems, 2016). There is scope here for the application
of QDN to networks of rank going into the many thousands. For those sorts of
systems under observation (SUOs), quantum entanglement makes calculations
by hand far too laborious to be of any practical use.
Fortunately, three factors come to our aid here, making the application of QDN
to large-rank networks a potentially viable proposition.
12.2 Program MAIN 163
Modularization
We saw in previous chapters that QDN deals with discrete aspects of observa-
tion, despite the fact that standard QM and relativistic quantum field theory
(RQFT) deal with continuous degrees of freedom. This discreteness comes in
three forms, referred to us as stages, nodes, and modules, and all of that is due
to three inescapable physical facts. First, all real observations take time and
involve discrete signals in finite numbers of detectors.1 Second, real apparatus is
constructed from atoms, not continua. Third, these atoms form finite numbers
of well-characterized modules, as discussed in the previous chapter.
Contextuality
A critically helpful fact here is that in any particular experiment, the contextual
subspaces that the observer needs to deal with will usually be of significantly
lower dimensions than that of the full quantum registers involved. For example,
while the register ground state 0n at stage Σn is an indispensable component
of the QDN formalism, there are few situations that will involve the completely
saturated labstate 2r − 1n ≡ A 2
1 A r
n n . . . An 0n . This will certainly be the case
when the rank r is large, say, of the order of hundreds or even possibly thousands.
Computerization
While QDN calculations can often be done by hand for small rank calculations
involving a small number of stages, the use of computer algebra (CA) software,
such as Maple, Mathlab, and Mathematica, allows problems with much greater
rank registers running over many more stages to be dealt with. All but the
simplest network calculations in this book were done using a computer algebra
program developed by us called Program MAIN, for example.
1
Technically, the signals are not discrete or continuous. It is the observer who interprets
whatever they find at a detector as a discrete bit of information.
164 Computerization and Computer Algebra
M B
S W Rq
Figure 12.1. The Wollaston interferometer. A monochromatic, unpolarized
beam of light from source S is passed through Wollaston prism W . Emerging
polarized beam 11 is deflected by mirror M onto beam splitter B, while
emerging polarized beam 21 has its polarization rotated by angle θ through
module Rθ , before hitting B. Signal detection is at detectors 13 and 23 .
into the other input channel of B. The two beam splitter outcome channels are
monitored by final stage detectors 13 and 23 . The angle θ is chosen by the observer
before the preparation switch 10 is thrown.
If the transmission t and reflection r coefficients of the beam splitter B are
known, then the observed outcome frequencies at the final stage detectors will
depend on θ, and this gives information about the relative magnitudes of the two
polarizations in the initial beam.
For applying QDN to the WI and all other experiments we have a single CA
program, referred to here as MAIN . This program consists of two sections, A
and B. In Section A, we input the details of the particular experiment we want
to calculate, such as how many stages the experiment involves, the rank of each
stage quantum register, and so on.
In practice, MAIN carries full details of all the experiments we have investi-
gated, with a path parameter that allows us to select any experiment of current
interest. For example, the path parameter for WI is 44.
Given the path parameter, MAIN jumps to Section A, initializing the variables
from the relevant data there. Then MAIN jumps to Section B, automatically
calculating everything of interest, such as the outcome probabilities at each final
labstate detector.
The power of this approach is that all the work is done in encoding Section A.
In general, this does not take long and is straightforward, given a stage diagram
for the process and the module rules discussed in the previous chapters. Section B
is universal, that is, is the same for all experiments. This gives a very economical
and powerful approach indeed, allowing a great range of questions to be posed
and answered rapidly. The only real theoretical problem is in deciding the stage-
to-stage evolution operators Un+1,n in the encoding of Section A. This part of
12.3 The Wollaston Interferometer 165
the process is currently where the computer does not help and the theorist has to
decide what goes in. However, it should be possible to automate this part as well,
if stage diagrams could be encoded first. We comment on this further, below.
where |san
is an “internal” SUO state, carrying information such as polarization,
and ian is a computational basis representation (CBR) labstate element carrying
information about the signal state of the detectors.
Encoding
Our QDN notation is transcribed into program MAIN as follows. Although the
observer’s focus is the signality of the signal detector amplitudes, the internal
calculations of MAIN are written in terms of the CBR. The reason is that
i ,
labstates of signality greater than one are nonlinear in the signal operators An
whereas the CBR is transparent to signality.
Our encoding takes the following form:
Here the symbols a, A, s, and S were arbitrarily chosen, with the rule that
lowercase letters represent quantum “ket” states and capital letters represent
their duals.
The power of the CA approach comes in at this point in that we can use
differentiation to implement “inner product” rules. For example,
∂a[j, n] ∂s[q, n]
in j n → = δ ij , spn |sqn → = δ pq , (12.3)
∂a[i, n] ∂s[p, n]
which is readily encoded in CA.2
In the following, the asterisk symbol ∗ denotes ordinary commutative multipli-
cation in the CA program. It is an important and helpful fact that our encoding
does not need any noncommutative product operation, although CA is capable
of handling those when required.
2
Differentiation is a good example of the dichotomy between the continuous and the discrete.
Mathematicians frequently deal with continuous variables, but mathematicians themselves
operate on discrete principles: all of mathematics is done line by line, symbol by symbol, in
a discrete way, very much like the way experimentalists have to operate in the real world.
166 Computerization and Computer Algebra
If rn is the rank of the stage-Σn qubit register and dn is the dimension of the
internal SUO Hilbert space, also at that stage, then total states will be of the
form
n −1 d
2r n
|Ψn ) ≡ n |sn ⊗ in
cp,i p
i=0 p=1
n −1 d
(12.4)
2r n
Transcription Phase
Given the stage diagram, the next step is to read from it the following information
and encode it into MAIN:
1. N , the number of stages, not counting the initial stage, which is labeled
Σ0 . There is no limit to the size of N apart from the capacity limits of the
12.3 The Wollaston Interferometer 167
Stage Σ0 → Σ1
1 00 = |s1 ⊗ A
U1,0 |s10 ⊗ A 1 01 ,
0 1 1
1 00 = |s2 ⊗ A
U1,0 |s20 ⊗ A 2 01 . (12.9)
0 1 1
Stage Σ1 → Σ2
1 01 = |s1 ⊗ A
U2,1 |s11 ⊗ A 1 02 , (12.10)
1 2 2
2 01 = (cos θ|s2 + sin θ|s1 ) ⊗ A
U2,1 |s21 ⊗ A 2 02 . (12.11)
1 2 2 2
Stage Σ2 → Σ3
1 02 = |s1 ⊗ tA
U3,2 |s12 ⊗ A 2 03 + irA
1 03 ,
2 2 3 3
2 02 = |s1 ⊗ tA
U3,2 |s12 ⊗ A 1 03 + irA
2 03 ,
2 2 3 3
2 02 = |s2 ⊗ tA
U3,2 |s22 ⊗ A 1 03 + irA
2 03 . (12.12)
2 2 3 3
That is all the theoretical input needed. The next step is to transcribe it
into Section A code.
6. To transcribe the above information, we first use contextual completeness to
write down U1,0 , U2,1 , and so on, and then use the rules (12.2). For instance,
from (12.9) this gives
1 01 s1 | ⊗ 01 A1 + |s2 ⊗ A
U1,0 = |s11 ⊗ A 2 01 s2 | ⊗ 01 A1
1 0 0 1 1 0 0
= |s11 s10 | ⊗ 11 10 + |s21 s20 | ⊗ 21 ⊗ 10
→ U [1, 0] ≡ s[1, 1] ∗ S[1, 0] ∗ a[1, 1] ∗ A[1, 0]+
s[2, 1] ∗ S[2, 0] ∗ a[2, 1] ∗ A[1, 0], (12.13)
Evaluation Phase
Once Section A has been initialized, Section B goes into action, employing several
key procedures (subroutines):
1. Given U [n + 1, n] and U [n + 2, n + 1], then the operator Un+2,n ≡ Un+1,n+1
Un+1,n is given by the rule Un+2,n → U [n + 2, n], where
−1 d[n+1]
2r[n+1] ∂ 2 U [n + 2, n + 1] ∂ 2 U [n + 1, n]
U [n + 2, n] ≡ ∗ .
i=0 p=1
∂S[p, n + 1]∂A[i, n + 1] ∂s[p, n + 1]∂a[i, n + 1]
(12.14)
The overall contextual evolution operator U [N, 0] is evaluated by iterating
this process.
2. The retraction operator U N,0 is calculated from U [N, 0] by complex conjuga-
tion of coefficients and the interchange s ↔ S, a ↔ A. Complex conjugation
is readily handled in CA.
3. The detector POVM operators EN,0 i
→ E[i, N ] discussed in Chapter 9 are
calculated by the rule
d[N ]
∂ 2 U [N, 0] ∂ 2 U [N, 0]
E[i, N ] ≡ ∗ . (12.15)
p=1
∂S[p, N ]∂A[i, N ] ∂s[p, N ]∂a[i, N ]
−1
2r[N −1
]
d[0] 2r[0]
E[i, N ] = s[p, N ] ∗ S[p, N ] a[i, 0] ∗ A[i, 0], (12.16)
i=0 p=1 i=0
Answers
Answers can be either in numerical form or in the far more useful algebraic form.
For example, the output listing for the Wollaston interferometer is
1 03 |Ψ0 ) = |c1 |2 r2 + |c2 |2 t2 + i(c2∗ c1 − c1∗ c2 )rt sin θ,
P (A 3
2 03 |Ψ0 ) = |c1 |2 t2 + |c2 |2 r2 − i(c2∗ c1 − c1∗ c2 )rt sin θ.
P (A (12.17)
3
These sum to unity, as expected. The implication here is that MAIN has found
1A
the other two potential probabilities P (03 |Ψ0 ) and P (A 2
3 3 03 |Ψ0 ) to be zero.
Signal Decomposition
The CBR is excellent for CA but not so helpful with understanding signality.
To assist in the interpretation, MAIN converts CBR probabilities into a listing
of total signal probability for individual final-stage detectors, excluding any
that have probabilities of zero. This gives valuable insight into the patterns
of information flow. With some experience, it becomes clear why the photon
concept has become so popular: signality is conserved in many quantum optics
experiments unless some specific module such as a nonlinear crystal that creates
photons is used, as in parametric down-conversion.
Partial Questions
MAIN answers the full set of maximal questions. We have stated before that this
set of answers allows all partial questions to be answered. Therefore, CA gives
us the possibility of answering all possible questions in the context of QDN.
This becomes important in several complex experiments, such as the two-photon
interferometer of Horne, Shimony, and Zeilinger (Horne et al., 1989) and the
double-slit quantum eraser of Walborn et al. (Walborn et al., 2002), experiments
covered in detail in Chapter 14.
Rank r Tr (seconds)
10 0.137
12 0.515
14 2.609
16 11.73
18 40.92
20 192.3
22 1284
Analysis of this table suggests that the time Tr for MAIN to complete a DS
simulation with a rank-r screen register is approximately of the form Tr = A +
B2r , where A and B are constants. It is clear that even a rank-100 simulation
could not be completed on this basis.
12.5 Prospects 171
12.5 Prospects
Our current CA program, MAIN, requires the user to transcribe “by hand” the
information contained in a stage diagram into CA form. This process could be,
and perhaps should be, fully automated, opening the door to potentially vast
applications. Ideally, this would utilize the currently accessible technology of
touch screens, whereby the user used electronic graphics pens to sketch stage
diagrams in freehand. These would then be automatically transcribed into the
information currently fed in by hand in Section A of MAIN.
We envisage the development of user-friendly graphical software that would
allow the user to select all the necessary modules that were of interest, using
palettes of symbols representing real and virtual detectors, modules of all sorts,
and lines representing transmission through various media. With the right infor-
mation, it should be possible to encode different material properties, so that
quantized detector networks representing crystals, glasses, and other complex
systems such as neural networks and retinas could be dealt with in a systematic
and coherent way.
There is little doubt in our mind that such an approach to complex prob-
lems involving quantum mechanical effects in engineering, materials science, and
medical science will one day become commonplace.
13
Interferometers
13.1 Introduction
The discussion of modules and computerization in previous chapters gives us the
means to consider empirically useful quantized detector networks (QDN). In this
chapter we shall focus on a particular class of network known as interferometers.
Interferometers play a crucial role in quantum mechanics (QM) because they
demonstrate the “paradox” of wave–particle duality in a direct way. On the one
hand, discrete signals are detected by the observer and those signals are usually
interpreted as particles or quanta. On the other hand, the observed frequencies
built up over many runs show effects interpreted as due to the interference
of waves.
In the previous chapter, we used computer algebra (CA) to discuss the Wol-
laston interferometer (WI), a relatively simple interferometer. We shall use the
same approach in the following discussion of the Mach–Zehnder interferometer
(MZI), apparatus that allows the observer to investigate optical transmission
through various materials.
M f B
S B M
Figure 13.1. The Mach–Zehnder interferometer: S is a source of monochro-
matic, unpolarized light, B 1 and B 2 are beam splitters, M 1 and M 2 are
mirrors, and φ contains a medium that changes the phase of the light by φ.
Parameters
For this experiment, the number of stages N is N = 3. From Figure 13.1, the
rank at each stage is seen to be r0 = 1, r1 = r2 = r3 = 2. Since polarization is not
a factor in this version of the experiment, we take the dimensions of the internal
(photonic) space at the four stages to be given by d0 = d1 = d2 = d3 = 1.
Stage Σ1 to Stage Σ2
1 01 = eiφ |s2 ⊗ A
U2,1 |s1 ⊗ A 1 02 ≡ eiφ |s2 ⊗ 12 ,
1 2
2 01 = |s2 ⊗ A
U2,1 |s1 ⊗ A 2 02 ≡ |s2 ⊗ 22 . (13.2)
1 2
Stage Σ2 to Stage Σ3
1 02 = t2 |s3 ⊗ A
U3,2 |s2 ⊗ A 2 03 + ir2 |s3 ⊗ A
1 00 ≡ |s3 ⊗ (t2 23 + ir2 13 ),
2 3 3
2 02 = ir2 |s3 ⊗ A
U3,2 |s2 ⊗ A 2 03 + t|s3 ⊗ A 1 00 ≡ |s3 ⊗ (ir2 23 + t13 ).
2 3 3
(13.3)
Here we have ignored any phase changes at the mirrors and characterized each
beam splitter separately. In the above, all superscripts are labels, not powers.
174 Interferometers
This is all the input information required for our CA program MAIN, as
discussed in the previous chapter. The nonzero conditional outcome probabilities
are found to be
1 03 |Ψ0 ) = −2r1 r2 t1 t2 cos(φ) + (r1 r2 )2 + (t1 t2 )2 ,
Pr(A 3
2 03 |Ψ0 ) = +2r1 r2 t1 t2 cos(φ) + (r1 t2 )2 + (t1 r2 )2 .
Pr(A (13.4)
3
B B
S W R
Figure 13.2. Brandt’s network: source S prepares an unpolarized, monochro-
matic beam of light that passes through Wollaston prism W . This splits the
beam into two orthogonally polarized components. These are passed through
beam splitters B 1 and B 2 as shown. Module R rotates the polarization of one of
the polarized beams into that of the other, prior to it being passed through B 2 .
13.3 Brandt’s Network 175
Stage Σ0 to Stage Σ1
1 00 ≡ |s1 ⊗ A
U1,0 |s10 ⊗ A 1 01 = |s1 ⊗ 11 ,
0 1 1 1
U1,0 |s0 ⊗ A
2 00 ≡ |s ⊗ A
1 2 01 = |s2 ⊗ 21 .
2
(13.6)
0 1 1 1
Stage Σ1 to Stage Σ2
1 01 ≡ t1 |s1 ⊗ A
U2,1 |s11 ⊗ A 1 01 + ir1 |s1 ⊗ A
2 02 = |s1 ⊗ t1 11 + ir1 22 ,
1 2 2 2 2 2
2 01 ≡ −|s1 ⊗ A
U2,1 |s21 ⊗ A 3 02 = −|s1 ⊗ 42 , (13.7)
1 2 2 2
Stage Σ2 to Stage Σ3
2 02 ≡ |s1 ⊗ A
U3,2 |s12 ⊗ A 2 03 = |s1 ⊗ 13 ,
2 3 3 3
2 02 ≡ t2 |s1 ⊗ A
U3,2 |s12 ⊗ A 3 03 + ir2 |s1 ⊗ A
2 03
2 3 3 3 3
= t |s3 ⊗ 43 + ir |s3 ⊗ 23 ,
2 1 2 1
U3,2 |s12 ⊗ 3 02
A 03 + t2 |s1 ⊗ A
≡ ir |s3 ⊗ A
2 1 3 2 03
2 3 3 3
assuming perfect efficiency and wave-train overlap. When the reflection and
transmission coefficients are chosen as by Brandt (Brandt, 1999), these rates
agree with his precisely.
176 Interferometers
f
S C B B
Parameters
We take N = 3, d[0] = d[1] = d[2] = d[3] = 1, r[0] = 1, r[1] = r[2] = r[3] = 4.
Initial State
The preparation switch is given by
1 00 .
|Ψ, 0) = |s10 ⊗ A (13.10)
0
Evolution
Stage Σ0 to Σ1
1 00 ≡ √1 |s1 ⊗ A
U1,0 |s10 ⊗ A 3 01 + A
1A 4 01 = √1 |s1 ⊗ {51 + 10 } .
2A
0 1 1 1 1 1 1 1
2 2
(13.11)
13.4 The Two-Photon Interferometer 177
Stage Σ1 to Σ2
U2,1 |s11 ⊗ A 3
1A iφ1 1 3
1A iφ1 1
1 1 01 ≡ e |s2 ⊗ A 2 2 02 = e |s2 ⊗ 52 , (13.12)
U2,1 |s11 ⊗ A 4
2A 2
4
2A 2
1 1 01 ≡ e |s12 ⊗ A |s12 ⊗ 102 .
iφ iφ
2 2 02 = e (13.13)
Stage Σ2 to Σ3
U3,2 |s12 ⊗ A 3
1A 1 4 1 3 1 + ir2 A
2
2 2 02 = |s3 ⊗ t A3 + ir A3 t2 A
1
3 3
= |s13 ⊗ t1 t2 93 + ir1 t2 53 + ir2 t1 103 − r1 r2 63 ,
U3,2 |s12 ⊗ A 4
2A 2 2 2 1 3 + ir1 A
4
2 2 02 = |s3 ⊗ t A3 + ir A3 t1 A
1
3 3
= |s13 ⊗ t1 t2 63 + ir2 t1 53 + ir1 t2 103 − r1 r2 93 .
(13.14)
With this information, program MAIN gives the following nonzero correlation
probabilities:
1A
3 2 4 1 1 2 2 1 2 1 2
Pr(A 3 3 03 |Ψ0 ) = Pr(A3 A3 03 |Ψ0 ) = (r t ) + (r t ) + r r t t cos(φ − φ ),
1 2 1 2 1 2
2 2
3
2A 1 4 1 1 2 2 1 2 1 2
Pr(A 3 3 03 |Ψ0 ) = Pr(A3 A3 03 |Ψ0 ) = (r r ) + (t t ) − r r t t cos(φ − φ ).
1 2 1 2 1 2
2 2
(13.15)
Partial Questions
The above signal pair correlations show dependency on the phases φ1 , φ2 , whereas
as pointed out by HSZ, the answers to the partial questions involving single
detectors only have no such dependence. For instance, the probability Pr(13 |Ψ0 )
that stage Σ3 detector 13 has fired is given by
3
1A 1 4 1
Pr(13 |Ψ0 ) = Pr(A 3 3 03 |Ψ0 ) + Pr(A3 A3 03 |Ψ0 ) = , (13.16)
2
which is independent of the phases. The same result holds for the other single
detectors.
Nonlocality
The QDN formalism shows that the phase change φ1 applied to component 11
could be applied to component 31 instead with no change in physical predictions;
this is evident from the fact that the phase factor exp(iφ1 ) multiplies the product
1A
A 3 2
2 2 in Eq. (13.12). A similar remark applies to the other phase change φ . Since
the component beams are spatially separated when this phase change is applied,
this means that quantum states are inherently nonlocal in character.
178 Interferometers
14.1 Introduction
In recent years, numerous quantum optics experiments have demonstrated novel
quantum effects based on quantum amplitude superposition. Some of these
empirical observations raise disturbing questions about the nature of reality,
particularly concerning quantum nonlocality. In this chapter we discuss some
experiments that suggest (to us, at least) that the information void, that
uncharted regime between signal preparation and outcome detection, is not well
modeled by classical spacetime. Something strange seems to be lurking there.
For the purposes of exposition, the sequence of experiments we discuss in this
chapter does not follow the historical sequence of those experiments, We discuss
first the delayed-choice quantum eraser experiment of Kim et al. (Kim et al.,
2000), referred to here as KIM.1 This leads us to define a heuristic measure
of which-path information that is central to the theme of this chapter. This
leads to a discussion of Wheeler’s thought experiments on the conundrum of
delayed choice in physics, particularly evident in the phenomenon of galactic
lensing (Wheeler, 1983), and its empirical implementation, the delayed-choice
interferometer experiment of Jacques et al., referred to as JACQUES. Our final
experiment in this sequence is the double-slit quantum eraser of Walborn et al.
(Walborn et al., 2002), referred to as WALBORN. That experiment gives a
significant challenge for quantized detector networks (QDN) to reproduce its
empirical results.
Such experiments have led to suggestions that interference patterns formed
by particles impacting on a screen may be influenced in some way by decisions
made long after those particles had landed on that screen. Our objective in
this chapter is to show by a detailed QDN analysis of those experiments that
1
With profound apologies to co-authors, we will employ the convention that several
experiments are referred to by the capitalized surname of the first named author.
180 Quantum Eraser Experiments
2
The specific English translation of what Einstein wrote here has been disputed: mysterious
rather than spooky has been suggested by some authorities as closer to what Einstein
intended to say.
14.2 Delayed-Choice Quantum Eraser 181
B S
K K
Stage Σ0 to Stage Σ1
The jump to stage Σ1 is defined by the production of two correlated photon
pairs. Photon polarization is ignored here, but the formalism can readily deal
with any situation where this is not the case. Therefore, MAIN assumes a one-
dimensional internal Hilbert space in this experiment with normalized basis |sn
for the internal degrees of freedom at any given stage Σn .
The semi-unitary transformation from Σ0 → Σ1 is given by
U1,0 |s0 ⊗ A 1 00 = |s1 ⊗ αA 2
1A 3 4
0 1 1 + β A1 A1 01 , (14.1)
where |α|2 + |β|2 = 1. As we stated above we assume that photon spin is not
relevant to the issues being explored here.
Stage Σ1 to Stage Σ2
For this jump we have
K
U2,1 |s1 ⊗ A 2 01 = |s2 ⊗
1A i {t1 A
C iA 1 + ir1 A
2 }02 ,
1 1 2 2 2
i=5
K
U2,1 3 4
|s1 ⊗ A1 A1 01 = |s2 ⊗ i {t2 A
Di A 4 + ir2 A
3 }02 . (14.2)
2 2 2
i=5
Here, one component beam from each pair is focused onto the detecting screen
DS, while the other component is channeled onto either beam-splitter B 1 or B 2 ,
14.2 Delayed-Choice Quantum Eraser 183
as shown. The {C i } coefficients represent the amplitudes for landing on the screen
DS from 11 , while the {Di } are from 31 . The coefficients ti , ri are characteristic
transmission and reflection parameters associated with beam splitter B i , i =
1, 2, 3, and satisfy the rule (ti )2√
+(ri )2 = 1. It is useful not to set these parameters
to the conventional value 1/ 2 but to keep them open and available to be
changed. It is in these parameters that we encode the observer’s freedom of
choice in this particular experiment.
Stage Σ2 to Stage Σ3
The final transition from stage Σ2 to Σ3 involves four terms, each of which
involves null tests in one way or another. Most significantly for this discussion, it
is supposed that the DS screen detectors have registered their signals irreversibly
by stage Σ2 . Therefore, this information is carried to stage Σ3 by null tests, shown
by the horizontal lines on the right-hand side of Figure 14.1.
We write, for i = 5, 6, . . . , K:
U3,2 |s2 ⊗ A i A 1 02 = |s3 ⊗ A 1
i A
2 2 3 3 03 ,
U3,2 |s2 ⊗ A i A2 i 3 3 3 2
2 2 02 = |s3 ⊗ A3 {t A3 + ir A3 }03
U3,2 |s2 ⊗ A i A3 i 3 2 3 3
2 2 02 = |s3 ⊗ A3 {t A3 + ir A3 }03 (14.3)
U3,2 |s2 ⊗ A i A4 i 4
2 2 02 = |s3 ⊗ A3 A3 03 .
This is all the information needed for our CA program MAIN to answer all
possible maximal questions. MAIN gives four sets of nonzero maximal question
answers, equivalent to two-site correlations:
1A
Pr(A i
3 3 03 |Ψ0 ) = |t C α| ,
1 i 2
2A
Pr(A i
3 3 03 |Ψ0 ) = |r r C α − ir t D β| ,
1 3 i 2 3 i 2
3A
Pr(A i
3 3 03 |Ψ0 ) = |r t C α + ir r D β| ,
1 3 i 2 3 i 2
A
Pr(A 4 03 |Ψ0 ) = |t D β| ,
i 2 i 2
i = 5, 6, . . . , K. (14.4)
3 3
have been registered on the screen DS during any given run of the experiment.
In other words, choices can be made at B 1 , B 2 , and B 3 that affect various
incidence rates. The question is, does any change made by the experimentalist
at any beam splitter affect anything that has been measured before that
change was made? In particular, can any change in B 3 affect what has already
happened on the screen?
By inspection of (14.3), we see that no change in t3 or r3 , subject to (t3 )2 +
(r ) = 1, has any effect whatsoever on Pr(A i
1A 4 i
3 3 03 |Ψ0 ) or Pr(A3 A3 03 |Ψ0 ).
3 2
3
In ordinary usage, the term post-selection means selection occurring after some given
process. In probability theory, given an event A, then the probability of some event B
conditional on that given is a conditional probability, denoted P (B|A). Either way, the
concern is with what happens after something has been done or occurred.
14.3 Which-Path Measure 185
determining with certainty full path information from a single detected “indi-
cator” photon, that is, from a single signal in a specific set of detectors. We
define
For example, in the case of the double-slit experiment discussed in Chapter 10,
the detecting screen contains the indicator detectors. Then with both slits open,
we find ΦDS = 0. When one of the slits is open, then ΦDS = 1. In the case of
the delayed choice quantum eraser discussed above, the indicator detectors are
13 and 43 , so we define
WHEELER-1
Figure 13.1 is relevant to WHEELER-1. Specifically, WHEELER-1 is concerned
with stage-Σ1 nodes 11 and 21 and their relationship to stage-Σ2 nodes 12 and 22 .
Suppose the laboratory time between stage Σ1 and stage Σ2 is significantly long
enough for anything done to one of the beams, at say module φ, to be long after
the action of beam splitter B 1 in any reasonable sense of the word. Then there
appears to be a clash of particle and wave concepts.
To be specific, consider a pulse of light 10 coming from a distant galaxy billions
of years ago at stage Σ0 being split by gravitational lensing (equivalent to B 1 )
14.4 Wheeler’s Double-Slit Delayed Choice Experiment 187
into components 11 and 21 . These two components follow very different paths
across the Universe until an observer here on Earth observes stage-Σ3 signals at
detectors 13 or 23 . Now two kinds of process could affect outcome probabilities at
those detectors: (1) there could be some galactic cloud affecting, say, the optical
path from 11 to 12 , equivalent to module φ, and (2) the observer could choose to
do something significant at B 2 so that either an interference pattern is built up
or no such pattern is built up.
The conventional picture here is of a particle known as a photon traveling
through space. Intuitively, we would like to think of such objects. But it seems
incredible that the presence of some disruptive process φ on the potential path
11 → 12 could affect a photon traveling from 21 to 22 . Yet that is just what the
photon paradigm requires us to contemplate, particularly if the observer decides
to arrange the apparatus at B 2 to detect an interference pattern at the detectors
13 and 23 , or not.
Of this scenario, Wheeler wrote:
The problem as we see it lies not with the quantum physics but with mental
imagery associated with the particle-wave concept. There is, for instance, a clas-
sically conditioned vacuous assertion glaringly obvious in Wheeler’s statement
above: that “the photon has already passed that galaxy billions of years before
we made our decision.” Proof? There can be none, but of course, we are all
strongly conditioned to believe this counterfactual, vacuous statement and many
like it. According to quantum physics principles, as we see them in this book, we
have no direct empirical entitlement to make that assertion. In quantum physics,
188 Quantum Eraser Experiments
WHEELER-2
Wheeler’s delayed choice experiment can be discussed as a double-slit experiment
with a modified screen. Some of the detectors on the screen can receive quantum
signals from both slits, while the others can receive a signal from only one of
the slits. The interest in this arrangement comes from the possibility that the
observer can decide in principle which detectors receive which signal(s) long after
light has left the two slits.
An idealized version of WHEELER-2 is shown in Figure 14.2. The details are
much the same as the double-slit experiment studied in Chapter 10, but with the
difference that now there are three groups of detectors on the screen. Detectors
12 to R2 can receive a quantum amplitude from 11 only, (R + 1)2 to (R + S)2
can receive quantum amplitudes from 11 and from 21 , and (R + S + 1)2 to K2
can receive an amplitude from 21 only. Here, K = R + S + T .
Following Wheeler, we imagine that the experimentalist can shuffle the values
of R, S, and T during any given run, after they are sure that light has left the two
slits 11 and 22 and before any impact on the final detecting screen at stage Σ2 .
Any actual experiment would require analysis of the final data, post-selecting
signals corresponding to equivalent values of R, S, and T .
S
R S
R S
Stage Σ0 to Stage Σ1
The evolution of the preparation switch is given by
1 00 = |s1 ⊗ αA
U1,0 |s0 ⊗ A 1 + βA
2 01 , (14.9)
0 1 1
Stage Σ1 to Stage Σ2
The evolution of the two separate components at stage Σ1 is given by
R
R+S
01 = |s2 ⊗
U2,1 |s1 ⊗ A1
RA
i 02 +
i
S A
i,1 02 ,
i
1 2 2
i=1 i=R+1
Region R
Region S
R+S K
2 01 = |s2 ⊗
U2,1 |s1 ⊗ A i 02
S i,2 A + i 02
T iA . (14.10)
1 2 2
i=R+1 i=R+S+1
Region S Region T
Here the coefficients {Ri }, {S i,1 , S i,2 }, and {T i } satisfy the semi-unitary condi-
tions
R
R+S
K
R+S
R+S
|Ri |2 + |S i,1 |2 = |T i |2 + |S i,2 |2 = 1, S i,1∗ S i,2 = 0.
i=1 i=R+1 i=R+S+1 i=R+1 i=R+1
(14.11)
The outcome probabilities are found to be
Pr(Ai2 02 |Ψ0 ) = |αRi |2 , 1iR
= |αS i,1 + βS i,2 |2 , R<iR+S (14.12)
= |βT i |2 R + S < i K ≡ R + S + T.
These probabilities sum to unity as required.
From this we find the which-path parameter ΦW 2 for W 2 to be the sum of the
probabilities over regions R and T , that is,
R
K
ΦW 2 = |α|2 |Ri |2 + |β|2 |T i |2 . (14.13)
i=1 i=R+S+1
This reduces to unity when S = 0 as expected and to zero when both R and T
are zero.
relevant stage diagram being the same as Figure 13.1. In this experiment, referred
to here as JACQUES, the final beam-splitter B 2 could be removed while the light
was on its way from the first beam-splitter B 1 .
Although this experiment is of type WHEELER-1, it is also equivalent to the
above WHEELER-2 scenario with K = 2, because a Mach–Zehnder experiment
represented by stage diagram Figure 13.1 is equivalent to a double-slit experiment
with just two detectors in the final stage-detecting screen. In JACQUES, the
configuration with the second beam splitter B 2 removed corresponds to taking
R = T = 1, S = 0 in WHEELER-2, while that with B 2 in operation corresponds
to R = T = 0, S = 2.
Stage Σ0 to Stage Σ1
Evolution from Σ0 → Σ1 is given by
1 00 = |s1 ⊗ A
U1,0 |s0 ⊗ A 2
1A
0 1 1 01 . (14.14)
Stage Σ1 to Stage Σ2
Evolution from Σ1 → Σ2 is given by
U2,1 |s1 ⊗ A 2 01 = |s2 ⊗ αA
1A 2 + βA
1A 3 02 ,
1A (14.15)
1 1 2 2 2 2
Stage Σ2 to Stage Σ3
The final stage transition Σ2 → Σ3 is given by
K
U3,2 |s2 ⊗ A 2 02 = |s3 ⊗
1A 1A
C iA i
2 2 3 3 03 ,
i=2
K
U3,2 |s2 ⊗ A 3
1A 1A
i
2 2 02 = |s3 ⊗ Di A 3 3 03 , (14.16)
i=2
where the screen consists of K −1 detectors and the {C i }, {Di } coefficients satisfy
the usual semi-unitarity conditions.
The coincidence rates Pr(A i
1A
3 3 03 |Ψ0 ) are given by
1A
Pr(A i
3 3 03 |Ψ0 ) = |αC + βD | ,
i i 2
i = 2, 3, . . . , K, (14.17)
and these are the same as the single site detection rates Pr(A i 03 |Ψ0 ). These
3
results demonstrate double-slit interference because the detection of the 13 signal
provides no which-way information, so that this version of the experiment is
equivalent to a standard double-slit experiment.
S P
Spin Bases
The first orthonormal photon spin basis B HV ≡ {|H, |V } consists of a hori-
zontal polarization state |H and a vertical polarization state |V ; the second
photon spin basis B LR ≡ {|R, |L} consists of a right circularly polarized state
|R and a left circularly polarized state |L; and the third photon spin basis is
B W P ≡ {|P , |N }, a conceptual “which-path” basis defined by Walborn et al.
(Walborn et al., 2002). For clarity, our |P and |N states are the same as |+
and |− states, respectively, used in Walborn et al. (2002).
These bases are related as follows:
1 1−i
|H = √ {|P + |N } , |R = {|P + i|N } ,
2 2
(14.18)
1 1−i
|V = √ {|P − |N } , |L = {i|P + |N } .
2 2
Double-Slit Polarizers
The two modules P 2 and P 3 have the following active4 actions on their respective
input beams:
⎧ 2 s
⎨ P |H2 →2 |Ls3 = √12 {|H3s + i|V3s } ,
P
beam 22 (14.20)
⎩ P 2 |V2s → i|R3s = √1 {i|H3s + |V3s } ,
2 P 2
⎧ 3 s
⎨ P |H2 →3 |R3s = √12 {|H3s − i|V3s } ,
P
beam 32 (14.21)
⎩ P 3 |V2s → −i|Ls3 = √1 {−i|H3s + |V3s } .
3P 2
Stage Σ0 to Stage Σ1
⎧ ⎫
⎪
⎨ ⎪
⎬
1 1A 2 01
U1,0 |Ψ0 ) = √ |H1p |V1s + |V1p |H1s ⊗ A . (14.22)
⎩
⎪
2⎪ ⎭ 1
1
s[2,1] s[3,1] a[20 +21 ,1]≡a[3,1]
We show in (14.22) how the various terms are transcribed into the notation used
in program MAIN.
Stage Σ1 to Stage Σ2
U2,1 |H1p |V1s ⊗ A 2 01 = √1 |H p |V s ⊗ A
1A 2 02 + A
1A 3 02 ,
1A
1 1 2 2 2 2 2 2
2
1
2
1A 1 2 1 3
U2,1 |V1p |H1s ⊗ A 1 1 01 = √ |V2 |H2 ⊗ A2 A2 02 + A2 A2 02 .
p s
(14.23)
2
Stage Σ2 to Stage Σ3
Here the modules P 2 and P 3 take effect:
U3,2 |H2p |V2s ⊗ A 2 02 = √1 {i|H p |H s + |H p |V s } A
1A 2
1A
2 2 3 3 3 3 3 3 03 ,
2
1
2
1A 1 2
U3,2 |V2p |H2s ⊗ A 2 2 02 = √ {|V3 |H3 + i|V3 ⊗ |V3 } A3 A3 03 ,
p s p s
2
1
3
1A 1 3
U3,2 |H2p |V2s ⊗ A 2 2 02 = √ {−i|H3 |H3 + |H3 ⊗ |V3 } A3 A3 03 ,
p s p s
2
U3,2 |V2p |H2s ⊗ A 3 02 = √1 {|V p |H s − i|V p ⊗ |V s } A
1A 1A
3
2 2 3 3 3 3 3 3 03 . (14.24)
2
The point here, as stressed by Walborn et al., is that all four photon polariza-
tion states on the right-hand side of (14.24) are mutually orthogonal. Therefore,
4
Here “active” means that the changes are physically observable.
194 Quantum Eraser Experiments
Stage Σ3 to Stage Σ4
No polarizations are affected on passage through the double slit, so we have
K
U4,3 |si3 ⊗ A 2 03 =
1A 1A
C j |si4 ⊗ A j
3 3 4 4 04 ,
j=2
K
U4,3 |si3 ⊗ A 3
1A 1A
Dj |si4 ⊗ A j
3 3 03 = 4 4 04 , (14.25)
j=2
The above information was encoded into program MAIN, with the following
results for the nonzero maximal question answers:
1 1 j2
Pr A j
1A
4 4 04 |Ψ0 = |C | + |D | ,
j 2
j = 2, 3, . . . , K. (14.26)
2 2
This confirms that QDN reproduces the empirical results of Walborn et al.
(Walborn et al., 2002). Essentially, placing P 2 and P 3 in front of their respective
slits destroys the lack of which-way information observed in the unpolarized
double-slit experiment WALBORN-1 discussed
above.
MAIN also confirms that Pr A j 04 |Ψ0 = Pr A j
1A
4 4 4 04 |Ψ0 , j = 2, 3, . . . , K,
that is, that the single signal pattern on the screen shows no interference.
S P
Although this looks like a passive basis change, module X has the property of
splitting the beam into two separately observable components and this is critical
to the experiment. Module X is really an active transformation, with stage Σ3
output beams denoted 13 and 23 .
Stage Σ2 to Stage Σ3
Here the pattern of information flow is intricate, requiring great care in the
programming. The difficulty of correctly encoding what the experimentalists had
done turned out to generate the most significant test of the QDN formalism: a
single sign error would easily invalidate the whole calculation. This underlines
what we said at the end of Chapter 12, that there is an obvious need for a more
sophisticated process of transcribing stage diagrams into CA code. We find the
following transition rules:
196 Quantum Eraser Experiments
1A
{|P3p |P3s + i|P3p |N3s } A 3
2
1A (1+i) 3 3 03 +
U3,2 |H2p |V2s ⊗ A 2 2 02 = √ ,
2 2
{|N p |P p + i|N p |N s } A 3 03
2A
3 3 3 3 3 3
(14.30)
1A
{i|P3p |P3s + |P3p |N3s } A 4
p 1 3 (1+i) 3 3 03 +
U3,2 |H2 |V2 ⊗ A2 A2 02 = − 2√2
s
,
{i|N3p |P3p + |N3p |N3s } A 2A4
3 3 03
(14.31)
{i|P p
|P s
+ |P p
|N s
} 1A
A 3 03 −
p 1A
U3,2 |V2 |H2s ⊗ A 2 (1−i)
√
3 3 3 3 3 3
2 2 02 = 2 2 ,
{i|N p |P p + |N p |N s } A 2A 3 03
3 3 3 3 3 3
(14.32)
1A
{|P3p |P3s + i|P3p |N3s } A 4
3 02 =
1A (1−i) 3 3 03 −
U3,2 |V2p |H2s ⊗ A √ .
2 2 2 2
{|N3p |P3p + i|N3p |N3s } A 4
2A
3 3 03
(14.33)
Stage Σ3 to Stage Σ4
The evolution is given by (14.25), taking into account that detector 24 is not
associated now with the double-slit detecting screen but is part of the X module
output channel detectors.
The above information when fed into program MAIN gives the results:
1 1 j 2 i j j∗
Pr A j
1A
4 4 04 = |C | + |D | + C D − C j∗ Dj ,
j 2
4 4 4
1 1 i j j∗
Pr A j
2A
4 4 04 = |C | + |D | − C D − C j∗ Dj ,
j 2 j 2
j = 3, 4, . . . , K,
4 4 4
(14.34)
15.1 Introduction
In this chapter we discuss experiments where the run architecture is significantly
different from that of standard in-out experiments such as particle scattering.
We apply the quantized detector network (QDN) formalism to particle decays,
the ammonium molecular system, Kaon-type regeneration decay experiments,
and quantum Zeno experiments. In all of these experiments, the problem is the
modeling of time, which conventionally is taken to be continuous. In QDN, time
is treated in terms of stages, which are discrete. We show how the QDN formalism
deals with such experiments.
In standard quantum mechanics (QM), time is assumed to be continuous. That
is a legacy from classical mechanics (CM), which does not concern itself in general
with the processes of observation. CM assumes systems under observation (SUOs)
“have” physical properties that are independent of how they are observed. In
contrast, QM cannot be considered without a discussion of the processes of
observation. On close inspection of any process of observation, as it is actually
carried out in the laboratory and not how it is modeled theoretically, the continuity
of time does not look quite so obvious.
The problem is that there are two mutually exclusive views about the nature
of observation in physics. These were discussed in detail by Misra and Sudarshan
(MS) in an influential paper on the quantum Zeno effect (Misra and Sudarshan,
1977). On the one hand, no known principle forbids the continuity of time, so
the axioms of QM are stated implicitly in terms of continuous time. When the
Schrödinger equation is postulated to be one of them (Peres, 1995), temporal
continuity is assumed explicitly. On the other hand, it is an empirical fact that
no experiment can actually monitor any SUO in a truly continuous way. All
references to continuous time measurements are invariably based on statistical
modeling of complex processes, with the continuity of time having much the same
status as that of temperature. Such effective parameters are extremely useful in
15.1 Introduction 199
P (t|Ψ)
This question asks for the probability that an unstable system prepared at time
zero in state Ψ has decayed sometime during the interval [0, t].
Q(t|Ψ)
This question asks for the probability that the prepared state has not decayed
during this interval.
R(t1 , t|Ψ)
This question asks for the probability that the state has not decayed during the
interval [0, t1 ], where 0 < t1 < t, and has decayed during the interval [t1 , t].
Here we come across an example where mathematics and logic cannot be used
to explore physics. We pointed out in Section 2.10 that the validation1 of the
negation ¬P of a physical proposition P cannot always be undertaken by the
same apparatus that is used to validate P . The point is that suppose we had used
apparatus AP to answer MS’s question P (t|Ψ) by looking for decay products of
an unstable SUO and had found no such decay products over any given interval
of time. We could not conclude that the SUO was absolutely stable; there could
be decay products that our apparatus could not detect. At best we could only
say that the SUO was stable relative to AP .
In physics, therefore, we cannot simply assert Q(t|Ψ) = 1 − P (t|Ψ) as an
empirical fact, because as we stated in Chapter 2, what are important in physics
are generalized propositions, and these require full specification of apparatus. In
order to answer Q(t|Ψ), we would have to use apparatus AQ , which could be
very different from AP .
Likewise, in order to answer MS’s question R(t1 , t|Ψ), we would have to use
apparatus AR .
The point made by MS is not quite the same as what we have just made. Our
concern is about apparatus, theirs was about time. MS stressed that P (t|Ψ),
Q(t|Ψ), and R(t1 , t|Ψ) are not what quantum mechanics normally calculates,
1
Our convention is that validation means the attempt to establish the truth of a proposition.
200 Particle Decays
which is the probability distribution of the time at which decay occurs, denoted
by T . The difference as they saw it is that the P , Q, and R questions involve
a continuous set of observations (according to the standard paradigm), or the
nearest practical equivalent of it, during each run of the experiment, whereas
T involves a set of repeated runs, each with a one-off observation at a different
time to determine whether the particle has decayed or not by that time. Because
P , Q, and R involve an experimental architecture different from T , it should be
expected that empirical differences might be observed.
Note that the observations referred to by MS to can have negative outcomes;
i.e., a failure to detect an expected particle decay in an experiments counts as an
observation. The correct statement of such an observation is not that the particle
is stable, but that that particular experiment has failed to detect any decay.
MS emphasized the limitations of QM, stressing that although it works
excellently in many situations, QM does not readily give a complete picture of
experiments probing questions such as P , Q, and R. They concluded that “there
is no standard and detailed theory for the actual coupling between quantum
systems and the classical measuring apparatus” (Misra and Sudarshan, 1977).
We fully agree. QDN is a relatively simplified attempt to move toward such a
theory.
Our first task in this chapter is to apply QDN to the simplest idealized decay
process, a particle decaying via a single channel. The quantum Zeno effect is
then discussed. That effect demonstrates that the answer as to whether a system
decays while it is being monitored or whether it remains in its initial state
depends on the experimental context, i.e., the details of the apparatus and
the measurement protocol involved. We follow this by applying QDN to more
complex phenomena such as the ammonium molecule and neutral Kaon decay.
We show how QDN can readily provide the empirical architecture to describe
the spectacular phenomenon of Kaon decay regeneration, originally discussed by
Gell-Mann and Pais in standard QM (Gell-Mann and Pais, 1955).
It will be shown that for all of these phenomena, QDN incorporates probability
conservation at all levels of the discussion and therefore does not require the
introduction of any ad hoc imaginary terms in any energies or the use of non-
Hermitian Hamiltonians.
only. Should such aspects be required, the formalism can readily deal with them
by introducing an “internal” Hilbert space associated with the SUO states.
The run architecture follows the pattern used throughout this book, with
labstate preparation for each run being completed by an initial stage denoted as
Σ0 , and referred to as stage zero. All subsequent stages in that run are counted
from stage zero, so Σ1 is the first stage (after stage zero).
By stage Σ0 of any given run, the observer will have contextual evidence that
they have prepared an X-particle state, in the language of standard QM. This
is represented in QDN by the normalized labstate Ψ0 ≡ A X 00 , which we have
0
previously designated a preparation switch.
Now consider the first stage, Σ1 , at which the observer has the means to
detect any decay. Suppose by that stage, the labstate is now represented by Ψ1
and given by
X + βA
Ψ1 ≡ U1,0 Ψ0 = (αA Y1 )01 , |α|2 + |β|2 = 1. (15.1)
1 1
Here the first term on the right-hand side (RHS) represents the possibility that
the particle has not decayed, whereas the second term, involving Y , represents
the possibility that a decay has occurred.
It is part of the underlying philosophy of QDN that the term in Y in Eq. (15.1)
does not model specific details of the Y state. It models a yes/no possibility that
something has happened. It is an example of a virtual detector rather than a real
detector. A virtual detector is informational in character, not necessarily directly
identifiable with a specific, real detector in the laboratory, although such things
are necessary to establish the context for the labstate Y .
To clarify this point further, suppose that the multiparticle state Y consisted
of N identifiable particles. We have not modeled here the stage Σ1 labstate
Y 1AY 1AY 1 . . . A
Y 1 01 , where for example A Y 1
1 2 3 N i
by a term in (15.1) such as β A 1 1 1 1 1
would create a signal in a specific detector for decay component particle Y i at
stage Σ1 . We could do that, if we wanted to, however. That would undoubtedly
add to the complexity of a problem that already has some degree of complexity
in its architecture, so that is a scenario where a computer algebra approach to
QDN would be most suitable.
The modeling of the labstate Ψ1 given by (15.1) does however include some
desirable features that we have put in “by hand.” We exclude from the RHS of
(15.1) the possibility that we find no signals whatsoever at stage Σ1 ; that is, we
exclude the signal ground state 01 . This means that the apparatus is what we
have referred to before as calibrated .
In the same spirit, we exclude the signality-two state A XA Y 1 01 , on the grounds
1 1
that any run with a labstate consisting of the original particle and its decay prod-
uct would be discounted by the observer as contaminated by external influences
(as happens in real experiments).
From (15.1), the amplitude A(X1 |X0 ) for the particle not to have decayed by
stage Σ1 is given by
202 Particle Decays
A(X1 |X0 ) ≡ 01 AX
1 Ψ1 = α, (15.2)
while the amplitude A(Y1 |X0 ) for the particle to have made the transition to
state Y by stage Σ1 is given by
A(Y1 |X0 ) ≡ 01 AY1 1 Ψ1 = β. (15.3)
X , AY ] = 0, and so on.
Here we have used the fact that [A n n
From the above, we see that
|A(X1 |X0 )|2 + |A(Y1 |X0 )|2 = 1, (15.4)
so total probability is conserved.
The above probabilities can also be calculated directly as expectation values
of partial questions. For the probability Pr(X1 |X0 ) of no decay by stage Σ1 , we
find Pr(X1 |X0 ) ≡ Ψ1 P X Ψ1 = |α|2 , while the probability Pr(Y1 |X0 ) of decay
1
into Y by stage Σ1 is given by Pr(Y1 |X0 ) ≡ Ψ1 P Y1 Ψ1 = |β|2 . Note that these
1
are contextual probabilities: we noted at the previous section that these are
statements valid only relative to the detectors used, which are assumed suitable
for what they are supposed to detect.
On the RHS of (15.3), the label is Y1 ; that is, the decay state label is itself
labeled by a temporal subscript, in this case the number 1, which is the stage
Σ1 at which the amplitude is calculated for. This label of a label is significant.
It registers the fact that when a detector is triggered, it does so irreversibly. The
stage at which this happens is a crucial feature of the analysis, being directly
related to the measurement issues discussed by MS (Misra and Sudarshan, 1977).
Our architecture is based on monitoring the state of the SUO as much as possible,
that is, attempting to perform as good an approximation to continuous-in-time
monitoring as our equipment allows.
The above process conserves signality one, so the dynamics can be discussed
economically in terms of the evolution of the signal operators rather than the
labstates. For instance, evolution from stage Σ0 to stage Σ1 can be given in the
form
A X U1,0 = αA
X → U1,0 A X + βA
Y1 , (15.5)
0 0 1 1
where U1,0 is a semi-unitary operator satisfying the rule U1,0 U1,0 = I0 , with
I0 being the identity for the initial lab register Q0 ≡ QX 0 and U1,0 being the
retraction of U1,0 .
Process (15.5) involves a change in rank, since Q1 ≡ QX Y1
1 Q1 , but not in
signality. Because dim Q1 > dim Q0 , the evolution operator is properly semi-
unitary, that is, satisfies the condition U1,0 U1,0 = I1 , which is a statement of
irreversibility relative to the observer. This is a critical feature of the experiments
discussed in this chapter, apart for the ammonium molecule, and is the reason
for the apparent loss of probability in conventional Schrödinger wave mechanics
descriptions of unstable particles. In those descriptions, a common strategy is
to consider only the Hilbert space of the original SUO and add an imaginary
15.2 One Species Decays 203
term −iΓ to energies, thereby forcing wave functions to fall off with time, with
the interpretation that this represents particle decay. Where this probability loss
goes is left unstated.
In relativistic quantum field theory, such as quantum electrodynamics (QED),
the architecture is usually different. There, the Hilbert space is big enough to
accommodate particles (such as muons) and their decay products. Decays are
treated as scattering problems, with initial undecayed particles coming in at
remote negative (infinite past) time and decay products going out at remote
positive (infinite future) time. The extraction of decay lifetimes then is usually
done in a heuristic manner, usually involving manipulation with symbols that are
nominally divergent, such as dividing an amplitude by the four-volume measure
of Minkowski spacetime in order to determine a flux decay rate. Our ambition
in QDN is to avoid such manipulations while retaining probability conservation.
The QDN description of the next stage of the process, from stage Σ1 to stage
Σ2 , is more subtle and involves a null test. Considering the labstate of the above
decay process at stage Σ1 , there are now two terms to consider.
No Decay
The first term on the RHS in (15.1), αA X 01 , corresponds to a no decay outcome
1
by stage Σ1 . This potential outcome can now be regarded as a preparation, at
stage Σ1 , of an initial X state that could subsequently decay into a Y state
or not, with the same dynamical characteristics as for the first temporal link
of the run, held between stages Σ0 and Σ1 . If the measured laboratory time
interval τ10 ≡ t1 − t0 between stages Σ0 and Σ1 is the same, within experimental
uncertainty, as the measured laboratory time interval τ21 ≡ t2 − t1 between
stages Σ1 and Σ2 , and so on for subsequent links, then spatial and temporal
homogeneity may be assumed, if the apparatus has been set up in the laboratory
carefully enough. This will be a physically reasonable assumption in the absence
of gravitational fields and in the presence of suitable apparatus.
Decay
The second term, β A Y1 01 , in (15.1) corresponds to decay having occurred during
1
the first time interval. Such an outcome is irreversible in this example, but this
is not an inevitable assumption in general. Situations where the Y state could
revert back to the X state are more complicated but of empirical interest, such
as in the ammonium maser and Kaon and B meson decay. These scenarios are
discussed later sections in this chapter.
state Y at that time. Now there is nothing that requires this information to be
extracted precisely at that stage. The experimentalist could choose, or indeed
be required, to delay information extraction until some later stage, effectively
placing the decay product observation “on hold.” The inherent irreversibility of
signal detectors means that, as a rule, such signals on hold are not lost.
As stated above, this may be represented in QM by passing a state through
a null test, which does not alter it. In QDN this is represented by Eq. (15.7).
Essentially, quantum information about a decay is isolated from the rest of the
experiment and passed forward in time until it is physically extracted.
The lab register Q2 at stage Σ2 has rank three, being the tensor product
Q2 ≡ Q X Y1 Y2
2 Q2 Q2 . Semi-unitary evolution from stage Σ0 to stage Σ2 is still of
signality one and is given by
X → U2,1 U1,0 A
A X U1,0 U2,1 = α2 A
X + αβ A
Y2 + β A
Y1 , (15.8)
0 0 2 2 2
with the various probabilities being read off as the squared moduli of the corre-
sponding terms.
The temporal architecture of this process is given in Figure 15.1. It will be
apparent from a close inspection of (15.8) that what appears to look like a space-
time description with a specific arrow of time is being built up, with a memory
of the change of rank of the lab register at stage Σ1 being propagated forward in
time to stage Σ2 . This is represented by the contribution involving A Y1 , which
2
is interpreted as a potential decay process that may have occurred by stage Σ1
and contributing to the overall labstate amplitude at stage Σ2 .
X Y
X Y
Y Y
where Un,0 ≡ Un,n−1 Un−1,n−2 . . . U1,0 is semi-unitary and satisfies the constraint
Un,0 Un,0 = I0 .
The amplitude A(X, n|X, 0) that the original state has not decayed by stage
Σn can be immediately read off the RHS of (15.9) and is given by
A(X, n|X, 0) = 0n AX X
n Un,0 A0 00 = α .
n
(15.10)
Commentary
Figure 15.1 does not reveal the full complexity of what is going on. That will be
appreciated by the observation that the labstate has signality one at every stage.
This means that at any stage, either the original state has not decayed or it has
decayed once, either at the stage being examined or prior to that stage.
This stage diagram reinforces the message that the Block Universe picture
of reality is too simplistic, because that picture is a classical record of what
was actually observed and cannot include the future of whatever “now” is being
discussed (which is stage Σ0 here), unless the vacuous assumption is made that
the future is single valued and predetermined. That would not be compatible
with quantum principles as we know them, however. Figure 15.1 refers to the
future of stage Σ0 and the probability outcomes predicted for the observer by
QM, for that stage only; it is not a valid stage diagram for any process time stage
after stage Σ0 . The contextuality of stage diagrams underlines the message that
physics is contextual, never absolute.
In the following, we will assume that the parameter α in the one-particle decay
discussed above satisfies |α| < 1, because the case |α| = 1 corresponds to a
stable particle, which is of no interest here.
Consider the physics of particle decay. Calculated probabilities should be
functions of labtime, the clock time used by the observer in the laboratory.
Labtime is not assumed here to be a continuous variable on the microscopic
level. Instead, it is linked to the scale of time associated with successive stages,
and this is determined by the apparatus used. The temporal subscript n in our
concept of stages will, when it is so arranged, correspond to a physical interval of
time τ , where τ is some reasonably well-defined time scale characteristic of the
apparatus.
Certainly, stages need not be strictly regulated in terms of being equally spaced
out in time. But in the sort of experiments relevant to this chapter, there will
be such an interval τ , and it will be typically a minute fraction of a second, but
certainly nowhere near the Planck time scale of 10−44 second. Indeed, the conjec-
ture that there is such a Planck time scale remains conjectural and has received
some meaningful criticism (Meschini, 2007). The smallest interval currently that
has been measured empirically is of the order 10−23 second, which is on the
shortest hadronic resonance scale, comparable with the time light takes to cross
a proton diameter. More realistic measurement scales that could be involved in
our discussion directly would probably be electromagnetic in origin, in the range
of 10−9 to 10−18 second. For instance, the shortest controllable time is about 10
attoseconds, that is, about 10−17 second (Koke et al., 2010). Experimentalists
would generally have a good understanding of what their relevant τ is.
Suppose first that we have some reason to believe that we can relate the
transition amplitude α to the characteristic time τ by the rule |α|2 ≡ e−Γτ ,
where Γ is a characteristic inverse time introduced to satisfy this relation. Then
the survival probability P (tn ) is given by P (t) ≡ Pr(X, n|X, 0) = e−Γt , which
is the usual exponential decay formula. No imaginary term proportional to Γ
in any supposed Hamiltonian or energy has been introduced in order to obtain
exponential decay.
A subtlety may arise here, however. Exponential decay implies that |α|2 is an
analytic function of τ with a Taylor expansion of the form
|α|2 = 1 − Γτ + O τ 2 , (15.12)
i.e., one with a nonzero linear term. Under such circumstances, the standard
result limn→∞ (1 − x/n)n = e−x leads to the exponential decay law. The possibil-
ity remains, however, that the dynamics of the apparatus is such that the linear
term in (15.12) is zero, so that the actual expansion is of the form
|α|2 = 1 − γτ 2 + O τ 3 , (15.13)
where γ is a positive constant (Itano et al., 1990). Then in the limit n → ∞,
where nτ ≡ t is held fixed, the result is given by
15.4 Matrix Analysis 207
n
lim 1 − γτ 2 + O τ 3 = 1, (15.14)
n→∞, nτ =t fixed
which gives rise to the quantum Zeno effect scenario. An expansion of the ampli-
tude of the form a = 1+iμτ +ντ 2 +O(τ 3 ) is consistent with (15.13), for example,
if μ is real and μ2 + ν + ν ∗ < 0.
To understand properly what is going on, it is necessary to appreciate that
there are two competing limits being considered: one where an SUO is being
repeatedly observed over an increasingly large macroscopic laboratory time scale
t ≡ nτ , and another one where more and more observations are being taken
in succession, each separated on a time scale τ that is being brought as close
to zero as possible by the experimentalist. In each case, the limit cannot be
achieved in the laboratory. The result is that in such experiments, the specific
properties of the apparatus and the experimental protocol may play a decisive
role in determining the results. If the apparatus is such that (15.12) holds, then
exponential decay will be observed, whereas if the apparatus behaves according
to the rule (15.13), or any reasonable variant of it, then approximations to the
quantum Zeno effect should be observed.
From the QDN perspective, the quantum Zeno effect can be understood from
the architecture of decay observation as follows. Looking at Figure 15.1, we see
that there is one channel, denoted by circles with an X, that runs across all
stages. That channel is the “no decay” channel. If during a run involving a great
number of stages the net probability of any of the other outcomes being detected
is sufficiently low, then it would appear that the original system was stable.
However, given enough stages with a fixed duration τ between each, the decay
outcomes would eventually win out. The quantum Zeno effect therefore relies on
having as brief a duration τ as possible and finding the critical time scale over
which the apparent effect could be observed.
Another way of understanding the quantum Zeno effect is in terms of envir-
onment. For instance, a free neutron will decay with a mean lifetime of about
880 seconds, whereas inside a nucleus, neutrons are generally stable. We can
understand the quantum Zeno effect as the effect of the detection environment
on an otherwise unstable system.
The squared modulus of the first component of this column vector gives the
same survival probability |α|2n as before. It is also easy to read off all the other
transition amplitudes and from them determine discrete time versions of the
P , Q, and R functions discussed by MS (Misra and Sudarshan, 1977).
Although the QDN analysis gives results that look formally like the standard
decay result, the scenario involved is equivalent to that discussed by MS; namely,
there is a constant questioning (or its discrete equivalent) by the apparatus as
to whether decay has taken place or not. In this case the results are simple. For
Kaon and B meson decays, the results are more complicated.
15.5 The Ammonium System 209
Then relative to this representation, the Hamiltonian for the system is given by
the Hermitian matrix
e f
H= , (15.20)
R f∗ e
where e is real and f can be complex. When f is zero, the two states are
degenerate energy eigenstates and so are stable. This possibility is of no in interest
here, so we shall assume that f = |f |eiφ , where |f | is nonzero and φ is a constant
phase.
The two eigenvalues of H are E ± ≡ e ± |f | with corresponding normalized
energy eigenstates
1 eiφ
|± = √ , (15.21)
R 2 ±1
210 Particle Decays
where we have set the arbitrary phases to zero for convenience. Hence an arbitrary
normalized solution to the Schrödinger equation
d
i |Ψ, t = H|Ψ, t (15.22)
dt
has matrix representation
−
|Ψ, t = ae−iE |+ + be−iE
+
t/ t/
|−, (15.23)
R
Pr(X, t|Ψ, 0) = 1
2 + |ab| cos(β − α + 2|f |t/), (15.24)
Pr(Y, t|Ψ, 0) = 1
2 − |ab| cos(β − α + 2|f |t/). (15.25)
X X X X
Y Y Y Y
Figure 15.2. The temporal architecture of the ammonium molecule, with the
two orthogonal states denoted X and Y .
15.5 The Ammonium System 211
X 0n , A
for the contextual subspace Qcn with orthonormal basis {A Y 0n }.
n n
The form (15.27) is particularly suitable for finding the evolution operator
UN,0 taking states of the SUO from stage Σ0 to some final stage ΣN . We find
UN,0 = ΦTN U N Φ0 , N = 0, 1, 2, . . . (15.32)
N
The problem therefore reduces to finding U , which we do as follows.
As discussed in Section 11.5, a unitary matrix U such as (15.29) can always
be put in standard form, that is,
a −b∗
U = eiη , (15.33)
b a∗
where η is real and a and b satisfy the condition |a|2 + |b|2 = 1.
We now state without proof that matrix U can be written in the form
U = eiη V ΛV † , (15.34)
where matrix V is a unitary matrix given by
u −v ∗
V ≡ , (15.35)
v u∗
212 Particle Decays
X
X
X
X Y
Y
Y Z
Y Z
Z Z
Z
Z
S S S S
flow from the X and Y states and so these eventually disappear. Before that
occurs, however, there will be back-and-forth transitions between the X and Y
states that give rise to the phenomenon of regeneration.
In actual Kaon decay experiments, pure K 0 states can be prepared via the
0
strong interaction process π − +p → K 0 +Λ, while pure K states can be prepared
0
via the process π + + p → K + + K + p. In our notation, these preparations
X 0n and A
correspond to initial labstates A Y 00 , respectively. In practice, super-
0 0
0
positions of K 0 and K states may be difficult to prepare directly, but the analysis
of Gell-Mann and Pais shows that such states could in principle be prepared
indirectly (Gell-Mann and Pais, 1955). Therefore, labstates corresponding to X
and Y superpositions are physically meaningful and will be used in the following
analysis.
Consider an initial labstate of the form Ψ0 ≡ x0 A X + y0 A
Y 00 , where
0 0
|x0 |2 + |y0 |2 = 1. Matrix methods are appropriate here. The dynamics of the
system will be discussed in terms of the initial column vector Ψ0 ≡ [ x0 y0 ]T ,
equivalent to the statement that each run of the experiment starts with the
rank-two lab register Q0 ≡ QX Y
0 Q0 . The dynamical rules (15.42)–(15.44) map
X Y Z1
labstates in Q0 into Q1 ≡ Q1 Q1 Q1 , so there is a change of rank from two to
three. The transition is represented by the semi-unitary matrix
⎡ ⎤
α u
U1,0 ≡ ⎣β v ⎦ , (15.46)
γ w
which subsequently generalizes to
⎡ ⎤
α u 0Tn
⎢ ⎥
⎢β v 0Tn ⎥
Un+1,n ≡⎢
⎢
⎥,
⎥ n > 0, (15.47)
⎣γ w 0Tn ⎦
0n 0n In
where In is the n×n identity matrix and 0n is a column of n zeros. The observer’s
detector array increases rank by one over each time step. The state at stage
Σn is represented by a column vector Ψn with n + 2 components, given by
Ψn = Un,n−1 Un−1,n−2 . . . U2,1 U1,0 Ψ0 . Overall probability is conserved, because
of the semi-unitarity of the transition operators.
As before, the key to unraveling the dynamics is linearity, which is guaranteed
by the use of semi-unitary evolution operators. Suppose the state Ψn at time n
is represented by
components, xn and yn , behave as if they were. The dynamics gives the relations
n+1
xn+1 = αxn + uyn = λxn , yn+1 = βxn + vyn = λyn , and zn+1 = γxn + wyn .
Experimentalists will be interested principally in survival probabilities for the
X and Y states, so the dynamics of Z states will be ignored here; i.e., the behavior
of the components znk for k < n will not be discussed. Clearly, however, the QDN
formalism is capable of giving much more specific details about the process than
just the X and Y survival probabilities.
It will be seen from the above that λ is an eigenvalue of the matrix
α u
,
β v
which means that in principle there are two
solutions, λ+ and λ− , for the
eigenmode values, given by λ± = 12 {α + v ± (α − v)2 + 4βu}. It is expected
that these will not be mutual complex conjugates in actual experiments, because
if they were, the analysis could not explain observed Kaon physics. Therefore, the
coefficients α, β, u, and v will be such that the above two eigenmode values are
complex and of different magnitude and phase, giving rise to two decay channels
with different lifetimes, as happens in neutral Kaon decay. In the quantum
mechanics analysis of neutral Kaon decays, Gell-Mann and Pais described the
neutral Kaons as superpositions of two hypothetical particles known as K10
and K20 , which are charge-parity eigenstates and have different decay lifetimes
(Gell-Mann and Pais, 1955). The K10 decays to a two-pion state with a lifetime
of about 0.9 × 10−10 second, while the K20 decays to a three-pion state with a
lifetime of about 0.5 × 10−7 second.
Semi-unitarity guarantees that
given |xn |2 + |yn |2 > 0. From this and the conditions xn = λn x0 , yn = λn y0 , the
eigenmode values can be written in the form λ2 ≡ λ+ = ρ1 eiθ1 , λ2 ≡ λ− = ρ2 eiθ2 ,
where 0 < ρ1 , ρ2 < 1 and θ1 and θ2 are real. The eigenmodes at time t = 0
corresponding to λ1 and λ2 will be denoted by Λ1,0 and Λ2,0 respectively, i.e.
Λ1,0 = [ a1 b1 ]T , Λ2,0 = [ a2 b2 ]T , and then the evolution rules give
⎡ ⎤ ⎡ ⎤
λn1 a1 λn2 a2
⎢ λn1 b1 ⎥ ⎢ λn2 b2 ⎥
⎢ ⎥ ⎢ ⎥
⎢ ⎥ ⎢ ⎥
Λ1,n = ⎢ cn,n ⎥ , Λ2,n = ⎢ dn,n ⎥ , (15.51)
⎢ . ⎥ ⎢ . ⎥
⎣ .. ⎦ ⎣ .. ⎦
c1,n d1,n
216 Particle Decays
where the coefficients {ck,n } , {dk,n } can be determined from the dynamics. The
initial modes Λ1,0 and Λ2,0 are linearly independent provided λ1 and λ2 are
different. Given that, then any initial labstate Ψ0 can be expressed uniquely as
a normalized linear combination of Λ1,0 and Λ2,0 , i.e., Ψ0 = μ1 Λ1,0 + μ2 Λ2,0 , for
some coefficients μ1 and μ2 . This is the analogue of the decompositions
√
|K 0 = {|K10 + |K20 }/ 2,
0 √
|K = {|K10 − |K20 }/ 2 (15.52)
16.1 Introduction
Our concern in this chapter is locality in quantum mechanics (QM). Locality is
a heuristic physics principle based on the following propositions.
No Action-at-a-Distance
All the evidence points to the principle that physical actions, taken within
restricted (localized) regions of space and time by observers or other agencies
such as systems under observation (SUOs), do not cause instantly observable
effects on other SUOs at large distances. This does not apply to mathematical/
metaphysical concepts such as quantum wave functions or correlations, as these
are conceptual objects (Scarani et al., 2000). Statements about instantaneous
wave function collapse are vacuous (have no empirical significance) and are
therefore not an issue of significance in physics. Such statements are an issue
to theorists who objectivize wave functions, as in Hidden Variables (HV) theory.
Action-at-a-distance is generally regarded as anathema by most physicists. For
example, Newton’s law of universal gravitation is well known for mathematically
encoding action-at-a-distance. There is direct evidence, however, in the form of
a letter written by Newton to Bentley, that Newton believed that gravity acting
“at a distance through a vacuum without the mediation of anything else” was
an absurdity (Newton, 2006).
The no action-at-a-distance principle is encoded in quantized detector net-
works (QDN) by the requirement that labstate preparation and consequent signal
detection never occur at the same stage.
Causal Transmission
All physically observable consequences of local actions taken by an observer or
SUO are transmitted by identifiable physical processes, such as electromagnetic
waves or neutrinos. There is no such thing as magic or action-at-a-distance.
218 Nonlocality
In QDN this proposition is taken into account implicitly in the labstate out-
come amplitudes at each stage, as these model how information is propagated
from stage to stage. When necessary, the information void can be modeled
as if there were fields and/or particles propagating through it, giving scope
for different mathematical models, such as Euclidean space, curved spacetime,
noncommuting spacetimes, and so on. The structure of the information void
is essentially a discussion of whatever modules have to be taken into account
between labstate preparation and signal detection.
No Superluminosity
According to the standard principles of relativity, the speed of transmission of
any observed physical effect is never greater than the local speed of light, as
measured in a standard localized laboratory. To date, no particles or signals that
travel faster than the speed of light (tachyons) have been observed.
A necessary, but not sufficient, condition for QM to respect the no-
superluminosity principle is that the no-communication theorem holds. This
theorem in standard QM and in QDN is discussed at the end of this chapter.
The no-communication theorem is insufficient in relativistic QM because it
does not mention the speed of light. In relativistic quantum field theory (RQFT),
the theorem is replaced by the above-mentioned no-superluminosity principle
directly, which involves lightcones. These have a special place in physics, having
both emergent and reductionist aspects.
Lightcones are obviously emergent structures because they define macroscopic
subsets of spacetime consisting of events that are all either time-like or space-like
relative to a given event (identified with the vertex of a given lightcone). On the
other hand, lightcones are intimately involved in the reductionist formulation of
RQFT, such as the postulate that operators representing observables at relatively
spacelike intervals have to commute. Interestingly, RQFT places no such restric-
tion on unobservables such as Dirac fields, which obey anticommutation relations.
In Chapter 24, we discuss the construction of fermionic quantum fields from
a QDN perspective, based on Jordan and Wigner’s nonlocal quantum register
approach, a manifestly emergent formulation (Jordan and Wigner, 1928).
QDN is not a reductionist approach to QM, as it deals principally with appa-
ratus sitting on the interface between relative internal context (the world of
SUO states) and relative external context (the environment in which that appa-
ratus and the observer are situated). The precise relationship between QDN
and lightcone structure is not clear at this time. This is consistent with the
general situation at this time that a proper quantum theory of spacetime, so-
called quantum gravity, has not been established. All attempts to do so from a
reductionist approach have failed, to date.1 It is not even clear at this time what
has to be “quantized.”
1
Failure from the point of view of proving empirically vacuous.
16.1 Introduction 219
We shall show in the last section of this chapter that QDN does obey the
no-communication theorem, which is certainly necessary for lightcone physics
to work. However, that theorem alone is not sufficient to establish lightcone
structure.
Shielding
QDN actually goes further than the no-superluminosity principle, in that it allows
for shielding. This is the empirical possibility of bypassing lightcone causality, the
standard relativistic assumption that if event B is inside the forward lightcone
of event A, then A can be the location of processes that “cause” effects to
be observed at B. In practice it is possible, and often necessary, to materially
isolate detectors so that whatever happens at one detector does not affect others,
even when they are time-like separated and could in principle interfere with each
other. Lest this be thought contrary to standard physics, we point out that all
experiments are done on this basis. For example, neutrino detectors are located
in deep mines, to filter out noise.
The shielding concept is really what underpins the stages concept. Our discus-
sion of the double-slit experiments in Chapter 10 illustrates the point well: when
looking at a screen for signals, an observer can do so over extended periods of real
laboratory time, provided the screen is not interfered with by processes external
to the experiment. The final stage of such a run, then, need not be identified
with a definite instant of labtime. This is a form of loss of absolute simultaneity,
different from the one discussed in special relativity (SR).
The above criteria are based on current empirical evidence and are subject
to constant empirical reexamination: experimentalists continue to search for
tachyons, for example. The hard fact is, however, that there have been no
observed violations of the locality principle to date. Therefore, since QDN is
greatly concerned with signal preparation and outcome detection, the locality
principle is one that should be respected by QDN. There are several aspects to
be discussed here regarding this.
Apparatus Nonlocality
A critical attribute of observation that cannot be ignored here concerns the spa-
tial distribution of the observer’s apparatus. No apparatus is perfectly localized
in space or in time. All equipment consists of vast numbers of atoms, which
have spatial extent, and all observations take time. We have in previous chapters
formalized this latter fact, that observations take time, into the stage concept.
Correlations
Apparatus nonlocality plays a critical role in quantum correlations. The super-
luminal transmission of certain types of information, interpreted as correlations,
has been investigated and speeds in excess of 10,000 times the speed of light
reported (Scarani et al., 2000). One of our aims in this book is to demystify
such phenomena. QDN interprets quantum nonlocality as originating from the
fact that apparatus is invariably nonlocal, as are the processes of extracting
information from it, rather than reflecting strange, nonclassical properties of
SUOs. Since apparatus has to be constructed before a quantum state or wave
function can be given any meaning, or a correlation measured, it is then obvious
that nonlocality is built into quantum physics from the word go, in the form of
correlations arising from empirical context.
We may summarize these comments by saying that relativity deals with relative
external context (REC), whereas QM discusses relative internal context (RIC).
Two historical category errors, the quantum gravity program and the Multiverse
paradigm, appear to have been made here. Quantum gravity attempts to extend
GR into RIC, while Multiverse attempts to extend QM into REC. Both attempts
appear to be empirically vacuous at this time.
Our division of empirical context into REC and RIC is not clear cut but is
identified in QDN with Heisenberg’s cut:
The dividing line between the system to be observed and the measuring
apparatus is immediately defined by the nature of the problem but it obviously
signifies no discontinuity of the physical process. For this reason there must,
within certain limits, exist complete freedom in choosing the position of the
dividing line. (Heisenberg, 1952)
The Einstein principle affects the QDN formalism because relativity asserts
that detectors that are outside each other’s light cones cannot causally influ-
ence each other, yet QDN may give amplitude effects between those detectors.
Explaining how the classical Einstein locality principle can survive in QM, and
indeed in QDN, is a major challenge.
Active Transformations
An active transformation is something done to the original set Θ, replacing it
with a new set, Θ ≡ {θa }. We represent an active transformation by the rule
Θ → Θ = Θ, (16.1)
where the arrow → means “is replaced by.”An active transformation implies the
existence of some observer (that is, mathematician, experimentalist, or theorist)
who is making a specific change in, or of, a set of objects. Moreover, there is an
implicit assumption that some observer (who may be a “superobserver” playing
the role of a god) has a memory of the original set and can compare it with the
new set.
Passive Transformations
On the other hand, a passive transformation is merely a relabeling of the elements
in a set, with no actual change in the contextually significant properties of those
elements. For such a transformation, we write
Θ → Θ = Θ, (16.2)
where Θ ≡ {θa }. A passive transformation therefore concerns how a set is
described, which, again, implies the existence of some observer (that is, mathe-
matician, experimentalist, or theorist) who is changing their way of describing a
given set.
Construction of Apparatus
The construction of apparatus is the severest form of active transformation, as
this creates physical context. We discuss this form of active transformation in
Chapter 25.
Interframe Experiments
An interframe experiment is essentially an experiment involving two observers:
one of these is associated with state preparation and the other is involved with
outcome detection. Such experiments explore perhaps the most spectacular and
deepest issues in physics. Examples are the Doppler shift, the Unruh effect
(Unruh, 1976), and indeed, the notion that more than one observer is a meaning-
ful topic in physics. Questions about standardization of physics protocols (units
and such like), observation of observers, the constancy of physical “constants,”
Early Universe physics, and so on, come flying at us immediately.
Such experiments may be represented symbolically by
(S, A|Ω, F ) → (S , D |Ω , F ), (16.5)
224 Nonlocality
where A is the apparatus that creates the state S, relative to observer Ω and D
is the detecting apparatus relative to observer Ω .
We place a prime on the transformed state/proposition, S , on the right-hand
side in (16.5), for two reasons. First, what the detecting observer Ω detects will
appear to have properties different from those of the state that the preparing
observer Ω believes they have prepared. Second, that a quantum state means
the same thing to different observers is a notion that needs to be questioned
in several respects, concerning the standardization of physics and the exchange
of context. In QDN, for instance, we do not accept that quantum states are
objective “things.”
To illustrate our concerns, consider the detection by observer Ω of a single
photon signal. On what basis could Ω believe that they had detected a signal
sent from a distant galaxy? The only plausible scenario is that the observer had
already received sufficient contextual information about that galaxy to formulate
and justify such a belief.
We could take the view that such an exchange of sufficient contextual infor-
mation from the source observer to detecting observer is equivalent to having
a single observer, encompassing both source and detectors. While reasonable in
most cases, that point of view seems bizarre as far as intergalactic processes are
concerned.
Active transformations play a role in mathematics, where they are associated
with functions, maps, and operations. A function can be regarded as a form of
active transformation: given a set Θ, a function f maps elements of that set into
some other set, Θ . We can even think of this as defining a “mathematical arrow
of time.”
Passive transformations have played a role in physics in some important situ-
ations.
Space-Time Symmetries
Passive coordinate transformations played a critical role in the development of
modern physics. Aristotelian physics was firmly based on the proposition that
the Earth is an absolute frame of reference. After the works of Galileo and
Newton, observers discussed physics more carefully. The observer’s frame of
16.3 Local Operations 225
where L are Newton’s laws of motion in the nonrelativistic case or the laws of
relativistic mechanics in the relativistic case, and frames F and F are related
by either a Galilean or Poincaré transformation, as appropriate.
We note the absence of any relative internal context in (16.6), typical of both
nonrelativistic classical mechanics and relativistic mechanics. This contextual
incompleteness gives the generalized proposition classification of such theories
as two.
2
Note that the language here is imprecise. Experiments to detect changes in the remote
detectors would actually involve two ensembles of runs, comparing partial measurements
on apparatus evolving with the action of Ln+1,n with partial measurements on apparatus
evolving without it.
226 Nonlocality
where each underlined number in bold, such as 24, represents a single element
of the CBR, and a, b, c, d, and e are complex coefficients. Show that this state
is separable relative to the split Q[5] = Q[L] Q[R] , where Q[L] ≡ Q2 Q5 and
Q[R] ≡ Q1 Q3 Q4 .
Solution
The CBR is not best suited to discuss splits, so we need to find the equivalent
of the signal basis representation (SBR) of the state. We translate the above
CBR basis vectors into SBR counterparts as follows. By inspection, the binary
decomposition of the integers 6, 10, 15, 20, 24, and 29 is
Ψ = acA 5 0 + adA
4A 3A
1A 4A
5 0 + aeA 5 0 +
3A
2A
+ bcA 4 0 + bdA 2A
1A 3A
4 0 + beA 3 0.
2A (16.9)
This can now be interpreted as the tensor product of two subregister states;
5 +bA
that is, we may write Ψ = Ψ[L] Ψ[R] , where Ψ[L] ≡ (aA 2 )0[L] is in Q[L] ≡
2 5 [R] 4 1 3 4
Q Q , Ψ ≡ (cA + dA A A + eA )0 is in Q ≡ Q Q Q , 0 = 0[L] 0[R] ,
3 [R] [R] 1 3 4
and information theory. Typically we would call the local observer Alice, and the
remote observer Bob.
What needs to be addressed is the following: if Alice and Bob have no connec-
tion, meaning that they have no channels of communication between them, then
for whom is the combined scenario Alice and Bob meaningful? We have argued
elsewhere in this book that truth values are contextual. So if we want to discuss
Alice and Bob together, we have to specify the context in which we are doing so.
The only answer that makes empirical sense is that there must be (implicitly,
if not explicitly) some third observer Carol who has the contextual information
to know about Alice and Bob and what they are observing and the outcomes
that they have found.
In CM, such an overseeing observer is generally not specified, a factor that
contributes to the essential contextual incompleteness of that discipline. In QM,
we cannot allow such contextual incompleteness. Whatever is asserted must have
some empirical basis for its truth values. Quantum entanglement runs directly
into this issue.
This chain of reasoning leads us to the primary observer concept. A primary
observer is the overseer and custodian of all relevant context: the buck stops with
a primary observer, there is nothing behind them, in the given context. So when
we discuss Alice and Bob as local and remote observers respectively, they are by
implication not primary observers. We will refer to them as secondary observers,
or subobservers.
A fundamental difference in QM between a primary observer and any sec-
ondary observers is that dimensions of Hilbert spaces do not follow an additive
rule: if Alice thinks she is dealing with a p-dimensional Hilbert space and Bob
thinks he is dealing with a q-dimensional space, then primary observer Carol,
who is overseeing Alice and Bob, is dealing with a pq-dimensional space, not a
p + q-dimensional Hilbert space. What is additive is qubit register rank.
Exercise 16.3 If Alice models her experiment with a rank-a qubit quantum
register and Bob models his with a rank-b qubit quantum register, prove that
if Carol models both experiments by a rank c = a + b qubit quantum register
such that the dimension of Carol’s quantum register equals the sum of the
dimensions of Alice and Bob’s registers, then a = b = 1.
register, where 0 ≤ j < 2q , to what element kn of the CBR for the original
register does the tensor product i[L] n ⊗ j n correspond? It is easy to show that
[R]
Then Carol can describe what Alice and Bob are doing by the evolution operator
L −1 2rR −1
2r
Un,m ≡ U [L]
n,m ⊗ U [R]
n,m = n ⊗ kn Un,m Un,m j m ⊗ lm .
i[L] [R] [L]i,j [R]k,l [L] [R]
(16.12)
i,j=0 k,l=0
Two important conclusions can be drawn from this analysis. (1) It is consistent
to apply QM to parts of the universe, while ignoring the rest, even though all of
it is subject to the laws of QM, and (2) it is the possibility of isolating apparatus
that gives rise to the SUO concept in the first place
The QM Account
Consider two observers, Alice and Bob. Alice will be our local observer, conduct-
ing active transformations on locally prepared signal states, while Bob will be
our remote observer, conducting observations at remote detectors. The aim is to
see if anything Alice does can affect what Bob observes.
First, we put ourselves in the position of a primary observer, Carol, who has an
overview of what Alice and Bob do. They are now to be regarded as secondary
observers, relative to Carol, although each of them believes themself to be a
230 Nonlocality
primary observer. Suppose Alice models the states she can prepare by elements
of a Hilbert space denoted HA , and suppose Bob models the states he observes
by elements of a Hilbert space HB . We will suppose that Carol has enough
information to model the combined system by the tensor product space HC ≡
HA ⊗ HB .
In the following we shall deal with pure states only, assuming that mixed states
present no exceptional concerns. This is on account of the fact that mixed states
involve no more than the extra complication of classical probabilities, and these
do not interfere in the way that entangled quantum states do.
Suppose Carol arranges for an entangled state to be prepared, described by a
normalized element in HC given by
|ΨC ) ≡ α|ψ A ⊗ |ψ B + β|φA ⊗ |φB , (16.13)
where |ψ A and |φA are normalized elements in HA , |ψ B and |φB are nor-
malized elements in HB , and |α|2 + |β|2 = 1. The actions of Alice and Bob are
respectively as follows.
Alice
Alice performs a local active operation U A on states in HA that she has access
to, changing them according to the prescription
|ψ A → |ψ A ≡ U A |ψ A , |φA → |φA ≡ U A |φA . (16.14)
Bob
Bob performs a measurement of an observable OB on states in HB to which he
has access.
Carol
From Carol’s perspective, a holistic account has to be given, in terms of states
in, and operators over, the total Hilbert space HC . From her perspective, Bob’s
observable corresponds to the operator OC ≡ I A ⊗ OB , where I A is the identity
operator over HA , while Alice’s local operator corresponds to the operator U C ≡
U A ⊗ I B , where I B is the identity operator over HB .
Before Alice performs her operation, Carol calculates that Bob’s expectation
value OB will be given by
OB ≡ (ΨC |OC |ΨC )
= |α|2 ψ B |OB |ψ B + |β|2 φB |O|φB
+ α∗ βψ A |φA ψ B |O|φB + αβ ∗ φA |ψ A φB |OB |ψ B , (16.15)
where we note the presence of interference terms.
After Alice performs her operation, Carol calculates that Bob’s expectation
value OB will be given by OB ≡ (Ψ C |OC |ΨC ), where now
|ΨC ) ≡ U C |ΨC ) = α|ψ A ⊗ |ψ B + β|φ A ⊗ |φB . (16.16)
16.7 The No-Communication Theorem 231
Then we find
Comparing (16.15) and (16.17), it is easy to see that OB = OB , because
ψ A |φA = ψ A |U A† U A |φA = ψ A |φA .
The prediction, therefore, is that no active unitary transformations performed
by Alice would affect Carol’s calculation of Bob’s measured expectation value.
In other words, Alice could not transmit any signals to Bob using entanglement.
17.1 Introduction
From the dawn of the quantum age in 1900, there has been an ongoing debate,
often fierce and hostile, between the supporters of the classical world view and
those of the quantum world view. This debate eventually led to remarkable exper-
iments, the convincing and repeated results of which are claimed by quantum
theorists to support the predictions of quantum mechanics (QM) and not those of
classical mechanics (CM). However, old paradigms tend to linger on the shelves of
science long past their sell-by dates and there are extant schools of theorists that
strive to find loopholes in the above-mentioned experiments. That is a legitimate
activity up to a point, but the divisive nature of the debate requires commentary,
which is the subject matter of this chapter.
QM was discovered only by advanced technology, so it stands to reason that
any test of QM will require even more advanced technology. In the previous
chapter we stressed the differences between active and passive transformations.
The experiments discussed here rely on such technically difficult active trans-
formations that the Hidden Variables (HV) theorists (for that is really what
they are) opposing QM are frequently able to think of objections, known as
loopholes, to the empirical protocols employed by the experimentalists. Some
of these loopholes are reasonable but many are not. Closing those loopholes
convincingly is an ongoing important activity in experimental quantum physics.
A significant feature of the experiments discussed in this chapter is that they
go beyond a certain point of complexity in terms of the number of classical and
quantum degrees of freedom involved. Historically, before that point had been
reached, classical interpretations of quantum wave functions appeared viable and
perhaps even attractive, such as that proposed by Bohm (Bohm, 1952). We
shall refer to the collective of such interpretations as the hidden variables (HV)
paradigm and the point in question as the Heisenberg point.
17.1 Introduction 233
It seems obvious with hindsight that the Heisenberg point was reached almost
immediately after Schrödinger introduced his wave mechanics formulation of QM
in 1926 (Schrödinger, 1926). Certainly, the wave function Ψ(x, t) for a single-
particle system under observation (SUO) can be visualized as an objective wave
of sorts in physical space. By the term physical space, we mean the three-
dimensional space P3 (Λ) of extension, position, and distance, associated with
a real laboratory Λ,1 a concept that would surely have been understandable to
Aristotle, Galileo, Newton, Hamilton, and Lagrange. But the Schrödinger wave
function Ψ(x1 , x2 , . . . , xN ; t) for an SUO consisting of N particles is actually a
time-dependent complex function over a real 3N -dimensional space C 3N , known
as configuration space, which is quite different conceptually from P3 (Λ).
Humans are remarkably stable creatures, both physically and mentally. Good
health is measured in years and mental outlook is measured in decades. The phe-
nomenon of persistence, which we have attributed as the origin of objectivization,
seems to us responsible for the classical world view. We see objects apparently
unchanged over significant stretches of time, and we come to believe that those
objects have real identities. In fact, we are generally strongly conditioned men-
tally to think in such classical terms. We imagine ourselves as sitting in physical
space P3 (Λ) and we try to relate all experience to it. We may refer to this as
mental persistence, or equivalently, classical conditioning.
Absolute physical space and absolute time (Newton, 1687)2 form the con-
ceptual foundations on which Newtonian mechanics was built. We shall refer
to this mathematical model as space-time, noting the hyphen between “space”
and “time.” This hyphen is important: it marks the recognition that observers
operate in their physical space with a process view of time rather than a manifold
or Block Universe perspective. Indeed, QM seems to us empirically meaningful
only in the space-time perspective, simply because probability, as we know it,
makes no sense otherwise.3
Not only does the space-time model give a remarkably good account of many
physical phenomena such as planetary orbits, but it conforms excellently to our
inherited classical conditioning: we think in terms of objects moving around
physical space with the passage of absolute time.
The advent of relativity did little to change this in practice, for the basic
reason that the speed of light is so great in relative terms that the space-time
model is a good one for all practical purposes. Indeed, its replacement, the Block
1
We follow here Schwinger’s statement, quoted in Chapter 24, that space and time are
contextually defined by apparatus.
2
We refer the reader to the Principia for Newton’s defining comments on what he meant by
“absolute space” and “absolute time”.
3
There will be theorists who interpret QM and probability in terms of abstract
mathematical structures over Block Universe manifolds, with operator norms, C ∗ algebras,
and such like. Those approaches to QM equate empirical physics with mathematical physics
and generally neglect observers and apparatus, thereby usually having a generalized
propositional classification of one.
234 Bell Inequalities
Parameters
Parameters are used in the description of apparatus and observers, and they
are to be found in relative internal context and relative external context. Every
classical model, including that of Bell, relies on parameters, such as the masses
of the particles being observed, orientation of apparatus relative to the labora-
tory, and so on. Parameters may be physical constants determined by previous
experiments, such as particle masses, or they may be classical degrees of freedom
under the control of the observer, such as orientation angles of a Stern–Gerlach
(SG) main magnetization field.
Parameters should not be confused with dynamical variables, which are theo-
retical constructs related to states of SUOs. Parameters are generally expressed
in terms of classical real numbers, associated with agreed systems of units. Some
parameters are specific, meaning that they are assumed to be exact to within
4
Take no one’s word for it.
5
That which is asserted without proof can be dismissed without proof.
17.2 The Stern–Gerlach Experiment 235
Variables
Variables have to do with states of SUOs. As the word suggests, variables change
over the course of an experimental run. The point of doing experiments is to
determine to what extent observers understand those changes.
Model Limits
A good model will make predictions about the range over which the variables
can go, for a fixed set Θ of parameters. For example, Newtonian mechanics and
Newton’s law of gravitation predicts how far the Earth could go from the Sun
in its annual orbit, given the known parameters, such as the mass of the Earth
and of the Sun, and the current position and velocity of the Earth relative to
the Sun.
Bell showed, in a CM model of two spin-half particles based on “reasonable”
classical assumptions such as locality, that there was a limit Λ to a certain
empirically measurable function F of the parameters Θ,6 given in the form of an
inequality F (Θ) Λ. Such inequalities are now universally referred to as Bell
inequalities (Bell, 1988). They are in focus in this chapter, because they provide
an empirical test of CM predictions versus those of QM.
6
By empirically measurable, we mean fixing the parameters and then performing an
experiment to calculate a value for F .
7
We emphasize again that such descriptions express a classical interpretation of what
happens in the laboratory: Ted pushes buttons in preparation device T and Alice looks at
signals on the detector screen. Neither Ted nor Alice see “particles” in the way spectators
observe baseballs or cricket balls.
236 Bell Inequalities
T S
Electrodynamic Forces
As it passes through the SG module S a , each particle’s instantaneous magnetic
dipole moment μ interacts with the inhomogeneous magnetic field B in S a , an
interaction modeled by a term proportional to μ · B in the classical Hamiltonian.
This generates an additional contribution to the standard Lorentz force qv × B
on that particle, and it is this additional force that is interpreted classically as
responsible for what Stern and Gerlach observed. Here q is the electric charge of
the particle, v is its instantaneous velocity, and B is the effective magnetic field
in which the particle is moving.
17.2 The Stern–Gerlach Experiment 237
The fact that Stern and Gerlach observed two regions, labeled by us A+ 1
and A− 1 in Figure 17.1, is indisputable. Therefore, a CM calculation should
account for that splitting. We assume that that can be done. There are
good examples of such bifurcations in CM, such as comets either falling into
elliptic captured orbits around the sun or entering the solar system once and
then leaving forever on hyperbolic orbits. Another example is from the statistics
of single car accidents on icy roads: cars will veer off a road either to the left or
to the right.
Lacking more detailed information, particularly about any hidden variables
that could contribute additional forces in the SG experiment, we simply assume
that if we could complete such calculations, they would show the necessary
− 8
bifurcation into either A+1 or A1 .
Deterministic Outcomes
The classical electromagnetic forces guiding each particle through module S a
are deterministic, meaning the following. Suppose that at the start of a run,
one of the particles, #1, in 10 has an initial position r 0 and initial velocity v 0 .
Subsequently, it enters S a and ends up somewhere on the detecting screen. Now
a typical beam will consist of a vast number of particles. Suppose another one
of the particles, #2, started off in 10 at exactly the same initial position, the
same initial velocity, and with the same internal degrees of freedom (HVs), as
#1, although at some other time. Then it would subsequently follow exactly the
same spatial path and would end up in exactly the same spatial position on the
screen as #1.
Consider now a beam of many particles from 10 passing through the same
apparatus. There will be a spread of initial positions and initial momenta, with
most particle velocities approximately along the same common direction toward
S a . There may also be some additional hidden variables, such as those involved
with the internal charge structure of the particles. Suppose there are N particles
in such a beam. Then the ith particle starts in 10 with a set of initial variables
denoted θ i . This includes any hidden variables.
Now according to the deterministic principle outlined above, that particle will
−
certainly end up in A+ 1 or else certainly in A1 , assuming perfect transmission,
that is, with no outside interference and with completely efficient detection. It is
important to note that for given θ i , which of the two outcomes occurs is not the
question; what matters is the asserted fact that one of them will be definitely
forced to occur by the deterministic nature of the mechanics. Moreover, this is
not a random outcome: which of the two sites it will be is predetermined by θi .
8
The assumption is made that cases where the particle would end up in the middle of the
−
detecting screen between A+1 and A1 constitute a tiny proportion of the whole and can be
neglected. There must be, for instance, relatively few, if any, comets that are on genuinely
parabolic trajectories.
238 Bell Inequalities
It is also important to note that the question of whether we could calculate that
outcome is irrelevant to the discussion.9
The classical deterministic paradigm outlined
above leads naturally to the
following assertion. Consider the set Θ ≡ θ 1 , θ 2 , . . . , θ N of all the initial HVs
associated with a beam in a given run. Then this set can be regarded as the
union Θ = A+ ∪ A− of two disjoint subsets A+ and A− , where A+ is the subset
−
of HVs that send a beam particle into A+ 1 and A is the subset of HVs that send
−
a beam particle into A1 .
Classical Counterfactuality
So far, nothing controversial has been written. To understand the next step,
we need to make a small diversion. There is a piece of logic (or metaphysics,
if you prefer) that is at the heart of the problem in “understanding” QM. It is
the essential point on which Einstein, Podolsky, and Rosen pitched their famous
argument against standard QM (Einstein et al., 1935). It goes by the name of
counterfactuality. A basic definition is the following.
Definition 17.1 If P and Q are two propositions (that is, statements) and
P is known to be false, then the statement P implies Q, written P ⇒ Q, is
a counterfactual (statement) if it is logically true, despite the fact that P is
false.
There are serious questions about the above definition, particularly in view of
the fact that we cannot avoid employing counterfactual reasoning in physics. The
most obvious problem is that Definition 17.1 is contextually incomplete: there
is no reference in it to any observer for whom the “truth” concept is valid, nor
is there any statement of a method by which a “truth” value could be
established. This is not hair-splitting: it matters in physics. That is why we
have qualified the word truth in that definition with the adjective logical. Logic
is not physics.
When it comes to physics, CM adopts a modified form of counterfactual-
ity, referred to here as the principle of classical counterfactuality, also known
9
It is remarkable that QM does not even attempt any such calculation, because not only is it
regarded as a vacuous enterprise, but there is also no mechanism in QM to deal with
individual outcomes. QM is after all a theory about the statistics of observation. HV
theorists, on the other hand, do not consider it contradictory to assert (1) that there are
variables they cannot know anything about, but (2) if they did know about them, they
could predict everything about their behaviour, in principle.
17.2 The Stern–Gerlach Experiment 239
beam particle into B1− , if the magnetic field axis had been b and not a. Here B1+
and B1− are the two regions on the detecting screen observed with module S b .
Classical counterfactuality allows us to go further and to make the decompo-
sition
Counterfactual Probabilities
HV theorists address the problem of the meaning of the counterfactual intersec-
tion A+ ∩ B + by the following argument. Consider a very large number N a of
particles, prepared by Ted in a standard way, passed through S a . By counting the
number N (A+ 1 ) that land in A1 , we can estimate the probability PCM (A )
+ +
+ a + −
N (A1 )/N that a particle would land in A1 and not in A1 . Likewise, if the
axis were b, then we can estimate the probability PCM (B + ) N (B1+ )/N b that
a particle would land in B1+ and not in B1− .
Note that we are now discussing a probability measure on the set Θ of hidden
variables associated with a beam prepared in a standard way by Ted.
According to CM, the rules of classical probability apply, so the counterfactual
probability PCM (A+ ∩B + ) that a particle would land in A+ 1 if the magnetization
axis were a, but would have landed in B1+ if the magnetization axis were b, is
given by the rule
HV theorists simply cannot escape this assertion, unless they are prepared to
introduce novel possibilities such as nonlocality, contextual effects, and such like,
and these generally make their line of argument unappealing.
The problem is, we cannot do any experiments to measure PCM (A+ ∩ B + )
directly, or any of the other counterfactual probabilities directly, because as
17.3 Circumventing the Simultaneity Problem 241
stated, only one magnetization axis can be set up at a time. We shall call this
the simultaneity problem.
10
The angular momentum of an electron state is generally discussed in terms of /2.
11
Empirically identical, apart from being spatially displaced, so not identical in the sense of
Leibniz.
242 Bell Inequalities
A A
S S
A A
T T
A B
S S
A B
12
At the time, beta decay experiments could not account for a discrepancy between the
energy going into a beta decay process and that going out. Pauli proposed that there was
an unobserved particle, now known as the neutrino and subsequently detected by Cowan,
Reines, and collaborators in 1956 (Cowan et al., 1956), carrying off the energy discrepancy.
17.3 Circumventing the Simultaneity Problem 243
b
field axis to some different direction b (we will now refer to his module as S
a
rather than S ). The relevant stage diagram is now Figure 17.2(b). Then we can
give an operational definition of PrCM (A+ ∩ B + ) and the other counterfactual
probabilities as follows.
Consider a large number N of runs in the eSG experiment. Suppose Alice
and Bob count their respective outcomes, recording the time of each. Afterward,
they come together and compare data sets and times. Then for every outcome in
−
which Alice found her electron in A+ 2 and Bob found his electron in B 2 , in the
same run, then that is counted as originating from the subset A+ ∩ B + , as if all
that information came from the original DS experiment, with a single electron.
Essentially, the second electron observed by Bob is like a ghost version of the
electron observed by Alice, and both are observed in the same run. Of course,
care is taken to take the opposite spins into account. Given the total count, an
estimate for the counterfactual probability PrCM (A+ ∩ B + ) follows immediately.
With this procedure, it is clear that total probability is conserved, that is, we
have
A Bell Inequality
We are now in position to construct a Bell inequality for this experiment. The
first step is to direct the discussion from the set Θ of HV (which by definition we
have no knowledge about), to outcome probabilities, for which we have empirical
data. Suppose we passed a beam with a large number N of electrons through our
SG device, and repeated that run a large number of times. By counting electron
impacts on the detector screen, we would then determine outcome frequencies,
and finally we would be in position to discuss probabilities.
Given a probability measure P over Θ, then for any subsets A, B of Θ, we
have the rule
With this rule and what we know about the subsets involved, we can deduce the
following relations:
This and similar inequalities are the focus of interest in numerous experiments.
Exercise 17.3 Use (17.5) and other relevant information to prove (17.6),
(17.7), and (17.8). Hence prove (17.9).
It has to be pointed out that there are more serious questions about this
inequality than those we raised about PCM (A+ ∩ B + ). We used two-spin states
to circumvent that latter problem, but that approach cannot deal with three
simultaneous magnetization axes. Essentially, we have to perform separate subex-
periments to determine the empirical values PEM P (A+ ∩ B − ), PEM P (B + ∩
C − ), and PEM P (A+ ∩ C − ) separately, ensuring that standardization across all
three subexperiments makes the test of the inequality (17.9) beyond reasonable
doubt.
We note in passing that (17.9) looks like the triangle inequality d(a, b) +
d(b, c) d(a, c) defined for a metric space, where d(a, b) is the “distance” between
elements a and b of the metric space.
denoted | + k, while one entering K1+ will be denoted | − k. These two states
form a calibration basis for Q and satisfy the orthonormality conditions
+k| + k = −k| − k = 1, +k| − k = 0. (17.10)
In contrast to QDN, standard QM tends to work with a fixed Hilbert space over
any given run. Neither of these approaches is incorrect: they encode the same
information differently. We will use the calibration basis as a reference to describe
all state vectors and other bases in the following discussion.
Having calibrated his module T , Ted now uses it to create a beam of normalized
particle states, |Ψ, at stage Σ0 , given by
|Ψ = u| + k + v| − k, (17.11)
where |u|2 + |v|2 = 1.13
With reference to Figure 17.1, now suppose a beam of particles represented by
such a state is subsequently sent through SG module S a , where the main mag-
netic field direction a is given by a = (sin θ cos ψ, sin θ sin ψ, cos θ), where θ and
ψ are standard spherical polar coordinates relative to the standard Cartesians
referred to hitherto. Any beam passing through S a will in turn be split into two
−
components, denoted A+ 1 and A1 , as discussed above. In standard QM, each
of these components will be associated with orthogonal, normalized quantum
outcome states | + a and | − a, respectively, and these can be used to form an
orthonormal preferred basis for Q, associated with S a .
Standard quantum theory gives the following relations between the calibration
basis {| + k, | − k} and the preferred basis {| + a, | − a} (the outcome states),
up to inessential arbitrary overall phase factors:
sin θ sin θ
| + k = √ | + a + √ | − a,
2 − 2 cos θ 2 + 2 cos θ
(17.12)
(1 − cos θ) (1 + cos θ)
| − k = √ | + a − √ | − a.
2 − 2 cos θ 2 + 2 cos θ
From this we can find the amplitude A(+a| + k) ≡ +a| + k for outcome
state | + a given initial state |+k, and so on. Hence we can find the conditional
probabilities. We find, for example,
Pr(+a| + k) ≡ |A(+a| + k)|2 = cos2 ( 12 θ). (17.13)
Two-Spin States
Disregarding all factors inessential to the present discussion, a normalized two
half-spin state |Φ of zero total spin zero is given by
|Φ = √1 |
2
+ ka ⊗ | − kb − √1 |
2
− ka ⊗ | + kb , (17.14)
13
The reader will appreciate by now how difficult it is to describe such a process without
using suggestive, misleading language.
246 Bell Inequalities
where the superscripts label the two spin-half particles.14 We now imagine that
particle a is sent through Alice’s module S a and particle b is sent through Bob’s
module S b .
Using (17.12), we may write
| + ka = αa | + a + β a | − a, | + kb = αb | + b + β b | − b,
(17.15)
| − ka = γ a | + a + δ a | − a, | − kb = γ b | + b + δ b | − b,
where again, superscripts label particles and
sin θa sin θb
αa ≡ √ , αb ≡ √ , (17.16)
2 − 2 cos θa 2 − 2 cos θb
and so on. Note that here {| + a, | − a} is a preferred basis for Qa , the qubit
associated with Alice’s electron, and {| + b, | − b} is a preferred basis for Qb ,
the qubit associated with Bob’s electron.
Given (17.15), we readily find
{(αa γ b − γ a αb )| + a ⊗ | + b + (αa δ b − γ a β b )| + a ⊗ | − b
|Φ = √1 .
2 +(β a γ b − δ a αb )| − a ⊗ | + b + (β a δ b − δ a β b )| − a ⊗ | − b}
(17.17)
PQM (+a, +b|Φ) + PQM (+b, +c|Φ) − PQM (+a, +c|Φ) 0. (17.18)
The first term in this expression is just the squared modulus of the coefficient of
the tensor product term | + a ⊗ | + b in (17.17). Hence we deduce
where θab is the angle between a and b. A similar calculation for the other two
terms in (17.15) gives
14
We can ignore the fact that electrons are identical and obey Fermi–Dirac statistics,
because the spreading of the beam prior to the electrons entering either SG(a) or else
SG(b) has introduced a form of classical labelling.
17.5 The QDN Calculation 247
q bc
qab
Figure 17.3. Plot of the function f (θab , θbc ) over a suitable domain.
The problem is that we can find vectors a, b, and c for which (17.21) is wrong.
For example, take these vectors to lie in a plane, with θab = π/3, θbc = π/3,
θac = θab + θbc = 2π/3. Then
In Figure 17.3 we show a plot of the function f (θab , θbc ) ≡ sin2 ( 12 θab ) +
sin2 ( 12 θbc ) − sin2 ( 12 θab + 12 θbc ) over a range of possibilities. There are two distinct
regions where the function value is negative, while the HV calculation predicts
that there should be no such regions.
There have been many experiments related to the one discussed here that
have shown violations of Bell’s inequalities, a frequently quoted one being that
of Aspect and others using photons (Aspect et al., 1982).
There is now not much doubt among the majority of physicists that classical
counterfactuality has been shown empirically to be a false principle in physics.
It remains an excellent principle as far as relative external context (the wider
Universe) is concerned: we can usually go to work and remain confident that our
house will still be there when we get back in the evening.
There remains a relatively small group of committed HV theorists who continue
to probe this issue and have come up with classically based possible explanations
for the observed empirical violations of Bell’s inequalities. Experimentalists con-
tinue to test these loopholes, and have reached the point where the HV classically
motivated “explanations” seem more unpalatable than the quantum theory they
are trying to circumvent.
Sa Sb
It was found that the total state at stage Σ1 , corresponding to the entangled
state (17.14) needs to be described as a state in the tensor product of a four-
dimensional internal spin space and a rank-four quantum register.
Stage Σ0
The preparation switch stage Σ0 creates a beam of spin-zero, two electron states,
1 00 . Here, |s1 represents a normal-
represented by the total state |Ψ0 ) ≡ |s10 ⊗ A 0 0
ized state in the one-dimensional Hilbert space of two-electron spin zero states.
Stage Σ0 to Stage Σ1
By the first stage, Σ1 , the beam has split into two entangled spin-half subbeams.
This stage is before any of the subbeams enter their respective SG apparatus.
Bitification requires each of these sub-beams to be associated with two signal
qubits,15 so we require a rank-four quantum register at that stage, as stated
above.
The dynamics is given by the rule
1 | + k a
⊗ | − k b
⊗ 1A
A 3−
1 00 = √
U1,0 |s0 ⊗ A1 1 1 1 1
01 . (17.23)
0
2 | − k1 ⊗ | + k1 ⊗ A
a b 2A4
1 1
Stage Σ1 to Stage Σ2
Using (17.15) and from Figure 17.4 we have
U2,1 | + ka1 ⊗ | − kb1 ⊗ A 3 01 = α a | + a 2 ⊗ A
1A 1 + β a | − a2 ⊗ A
2
1 1 2 2
b 3 b
× γ | + b2 ⊗ A2 + δ | − b2 ⊗ A42 02 ,
U2,1 | − ka1 ⊗ | + kb1 ⊗ A 4 01 = γ a | + a 2 ⊗ A
2A 1 + δ a | − a2 ⊗ A
2
1 1 2 2
b
× α | + b2 ⊗ A2 + β | − b2 ⊗ A
3 b 4 02 .
2
(17.24)
15
This is on account of the fact that entanglement can be detected, given the right apparatus.
17.5 The QDN Calculation 249
This is all that is needed to run our computer algebra program MAIN, which
gives us much more information than we need here. Our interest originated in
PCM (+a, −b). Context tells us that we need to calculate the probability that
detectors 12 and 32 are each in their respective signal state at stage Σ2 . Program
MAIN gives us the answer
1A
Pr(A 3
2 2 02 |Ψ0 ) = 2 |α γ − γ α | ,
1 a b a b 2
(17.25)
which is precisely (17.19), the result of the standard QM calculation. The same
argument applies to the other terms in the Bell inequality (17.18).
Our conclusion is that QDN gives the same results as standard QM, and hence
the same prediction of violations of Bell inequalities.
18
Change and Persistence
18.1 Introduction
Our interest in this chapter is in change, which is another way of discussing
persistence, or the apparent endurance over intervals of time of spatially extended
structures.
Persistence is the phenomenon that underpins all human activity. Given the
rule that we may think of as the first law of time, the dictum of Heraclitus
that everything changes, persistence is our name for those remarkable processes
that appear to circumvent that law and give stability to our lives. We should be
interested, for instance, in the fact that when we wake up each morning, we feel
that we are the same individuals who went to sleep the previous night. Indeed,
without persistence in one form or another, nothing would make sense, including
logic, rational thought, and mathematics, for these depend on comparisons of
standards and rules that persist in memory with information that we acquire in
process time.
Persistence is necessary for physics to make sense. It is emphatically not a
metaphysical topic but of the greatest relevance to science, including quantum
mechanics (QM). Persistence is implicit in Wheeler’s dictum that only acts of
observation are meaningful in physics, because observers and their apparatus
have to endure long enough to make observations. In quantized detector networks
(QDN), the persistence of observers and their apparatus long enough to perform
experiments should be regarded as axiomatic.
The subject of persistence is a deep and complex one, and will probably never
be fully understood. Several reasons contribute to this.
pragmatic view is the one held throughout this book, that the laws of physics are
contextual to observers. Without observers, science means nothing in a literal
sense. But according to the first law of time, observers themselves change over
time. This includes their memories and the conditioning that they have; these
are carried over from one stage to another. If the processes transporting those
memories and belief structures change those memories and belief structures,
then the laws of physics as understood by the observers at the time change as
well. Five hundred years ago, science knew little of Newton’s laws of motion; two
hundred years ago, science did not know that there was such a thing as quantum
mechanics.
We deal with the variability of observers by recognizing that truth values of
propositions, including the laws of physics, are contextual to observers at the
time of observation only. That the laws of physics themselves may change as the
Universe expands is not a fanciful idea: the possibility that the so-called constants
of nature have changed has been looked at by theorists such as Dirac (1938b)
and Magueijo (2003). A complete theory of observation would take into account
dynamical interaction between systems under observation, observers, and the
Universe that contains those observers.
Sheer Complexity
Another crucial factor associated with persistence is that observers and SUOs
are hideously complex phenomena. Indeed, the concepts of science are attempts
to organize that complexity into sufficiently simple forms that the brain can
interpret relatively quickly and easily.
Example 18.2 The human eye has a detecting screen known as the
retina, over which are detectors known as rods and cones. There are perhaps
120 million rods and 6 million cones in the typical human eye. When light from
suitable sources strikes the retina, many of these detectors will not register
252 Change and Persistence
a signal. Of those that do, complex chemical processes in the rods and cones
occur, resulting in signals being passed into specialized nerve layers below the
retina. These raw signals from the rods and cones are then processed in those
nerve layers before more complex signals are passed further on into the brain,
where much more complex processes of comparison and pattern recognition
must be occurring. The templates that the brain uses in these comparisons
appear to be relatively stable themselves, so much so that the brain does its
best to match the incoming signals to what it, the brain, has already prepared
and expected for. When this process fails in some way, optical illusions can
occur, such as a failure to recognize individuals whom we have met before.
The simplicity that we think we see around us is a total illusion.
1
In this context, stage Σn later than stage Σm always means n > m.
254 Change and Persistence
inherently dissimilar. This dimensionality issue does not arise in the case of
Riemannian geometry because the dimension of each tangent space in a manifold
is precisely the same as the dimension of the manifold over which they are defined.
We shall henceforth assume that when an observer is measuring the change
in a state of an SUO, it will be under carefully controlled conditions such that
dim Qn = dim Qm ≡ r. For instance, we might be interested in some branch
of macroscopic quantum mechanics, such as superfluidity or superconductivity.
Changes in quantum register dimension in that context would correspond to
fluctuations in the numbers of electrons or other particles in the SUO, which
would be a manifest breakdown of persistence.
This will have the critical role, rather like the metric connection in GR, of defining
a concept of parallel transport in QDN, as we see from the following argument.
Given an initial labstate
r
2 −1
Ψm ≡ Ψim im , (18.2)
i=0
we find
r
2 −1
Ψ̃n ≡ Nn,m Ψm = Ψim in , (18.3)
i=0
18.2 Comparisons
The discussion now reduces to a comparison of the naturally evolved labstate
Ψn ≡ Un,m Ψm and the persistent image state Ψ̃n ≡ Nn,m Ψm . The problem
is that there is no natural measure of difference between two quantum register
states in the same register, no measure of distance, that survives all criticism.
The following are some possibilities.
9
Example 18.3 Consider a rank-billion quantum register Q[10 ] . The states
Ψ≡A 2 . . . A
1A 109 0 and Φ ≡ A
999999999 A 2 . . . A
1A 999999999 0 have zero inner
product, but differ in only one signal out of a billion. By any conventional,
heuristic measure of similarity, these two states would be regarded as very
similar though not identical, but the Born measure of similarity gives them as
totally different.
We conclude that the Born measure of similarity is not good enough, in the
context of this chapter and the next, as a measure of similarity.
Equal-Length Strings
In his study of the transmission of information over telephone networks, Ham-
ming discussed the problem of identifying and then correcting errors in transmis-
sion (Hamming, 1950). This scenario is very much like a quantum experiment,
with essentially the same architecture, including an information void. There are
observers (the speaker and the listener ) and their preparation devices and final
state detectors (the telephones). The information void here consists of extensive
transmission lines and modules, such as telephone exchanges, signal amplifiers,
and so on.
Suppose S ≡ s1 s2 . . . sr is the message that is sent (the prepared state) and
T ≡ t1 t2 . . . tr is the message that is actually received (the outcome state). A
natural question is: how different are these two strings? Hamming devised a
geometrical method of quantifying the difference between two character strings
of equal length. He defined a distance, or metric, dH (S, T ) between S and T
according to the rule
dH (S, T ) ≡ number of matched pairs (si , ti ), i = 1, 2, . . . , r, for which si = ti .
(18.4)
The interpretation of the Hamming distance between two equal length strings
is that it is the minimum number of single character replacements in one string
that would convert it into the other. In Example 18.4, we can convert S into T
by the four replacements α → a, Q → P , 5 → 6, and {→ [.
Exercise 18.5 Prove that the Hamming metric is a true metric in the sense
of a metric space.
2
Note that the transmitted state is not an actual outcome (which is classical information),
but a quantum state immediately before it is looked at for an outcome, and is therefore best
thought of as a different form of information, referred to by us as quantum information.
258 Change and Persistence
0 ≡ 000 0 1 1 2 1 2 2 3
1 ≡ 100 1 0 2 1 2 1 3 2
2 ≡ 010 1 2 0 1 2 3 1 2
3 ≡ 110 2 1 1 0 3 2 2 1
4 ≡ 001 1 2 2 3 0 1 1 2
5 ≡ 101 2 1 3 2 1 0 2 1
6 ≡ 011 2 3 1 2 1 2 0 1
7 ≡ 111 3 2 2 1 2 1 1 0
where the coefficients i[k] , j [k] , k = 1, 2, . . . , r, are each either zero or one. Next,
we form the strings S i ≡ i[1] i[2] . . . i[r] , S j ≡ j [1] j [2] . . . j [r] . Finally, we calculate
the Hamming distance dH (S i , S j ).
Table 18.1 gives the Hamming distance between pairs of elements in the
preferred basis for a rank-three quantum register.
We note the following:
1. The maximally saturated state 7 ≡ 111 is furthest away in Hamming distance
terms from the ground state 0 ≡ 111,
2. All elements of the same signality σ are a Hamming distance σ from the signal
ground state.
3. For any two register basis elements, the Hamming distance between them
defines what can be thought of as a relative signality: if any one of these
elements were chosen to be the new ground state, then the other state would
have signality equal to its Hamming distance from that new ground state. This
underlines the fact that the ground state in a QDN register is not intrinsic to
the apparatus but defined contextually by the observer. The choice of signal
ground state is not dictated by lowest energy state.
The power of Hamming’s approach is that his distance rule is a genuine metric,
so that all the theorems of metric spaces can be applied. Lest this appear trivial,
we should consider that quantum registers modeling real situations may have
immense rank, such as that modeling a superconducting quantum interferometer.
In such situations, the Hamming metric distance approach may well prove useful.
The above approach is classical, in that it covers classical register states com-
pletely. QM, however, involves superpositions of register basis states, so we are
faced with the problem of defining the equivalent of a Hamming distance between
arbitrary, normalized quantum register states in the same register.
where dH (i, j) is the Hamming distance between elements i and j. Then using
completeness, we have the representation
r
2 −1
H= idH (i, j)j. (18.7)
i,j=0
We would in the first instance like to use this operator to attempt several
definitions of “distance” between more general quantum register states. Examples
are the following.
Neither (18.8) nor (18.9) is a genuine metric, because they do not return zero
in general when Φ = Ψ. However, that is not necessarily a bad thing. After all,
quantum states are not objective things. The fact that neither H1 (Ψ, Ψ) nor
H2 (Ψ, Ψ) is not zero in general reflects the fact that labstates are not classical.
If they were, they would be represented by a single element of the CBR, and in
that case, both measures of dissimilarity vanish.
Correlations in Statistics
The problem of comparing two data sets occurs frequently in statistics. Suppose
we have two sets of data, X and Y , each consisting of n real numbers, such that
260 Change and Persistence
the ith element of X is xi and the ith element of Y is y i . The question often
arises: how “close” are the two sequences {x1 , x2 , . . . , xn } and {y 1 , y 2 , . . . , y n }?
This question is a variant of the one originally addressed by Hamming. Statisti-
cians have developed the concept of correlation in a variety of forms that attempt
to give a number, usually between −1 and +1, that gives the degree of similarity,
dependence, or correspondence between the two data sets (or sequences). A
well-known correlation coefficient is the Pearson correlation coefficient, which
in the case we are discussing is known as a sample correlation coefficient CX,Y
defined by
n
(xi − x)(y i − y)
CX,Y ≡ / i=1 n , (18.10)
n
j=1 (x − x) k=1 (y − y)
j 2 k 2
where x and y are the respective sample averages. If a sample correlation coeffi-
cient has value +1, 0, or −1, the two samples are said to be perfectly correlated,
uncorrelated, or perfectly anticorrelated, respectively.
Detector Correlations
In our case, our interest is directly related to detector physics. The “data” values
in which we are interested, that is, the analogues of the xi and y i values in the
above, are truth values, suitably modified to give correlations. What greatly helps
here is that the values are binary. Suppose we look at two detectors and compare
their signal status. If they both register a signal, or if they both register ground
(no signal), then we can say they are perfectly correlated. Otherwise, they are
perfectly anticorrelated.
More commonly, we might take many such joint readings. Suppose the prob-
ability of finding perfect correlation is p. Therefore, the probability of anticorre-
lation is 1 − p. Assigning a value +1 for every case of perfect correlation and a
value of −1 for every case of perfect anticorrelation, we deduce that the average
correlation C will be given by C = 2p − 1.
U1,0 00 = a01 − b∗ 11 ,
U1,0 10 = b01 + a∗ 11 , (18.12)
We now come to a critical point, one that impinges on the quantum Hamming
measures of dissimilarity discussed above. Consider a persistent array of detectors
from stage Σm to stage Σn , where n > m, such that detector im at stage Σm is
regarded as the same as in at stage Σn . Given the labstates Ψm and Ψn , there
are two very different quantities that could be measured:
Correlation of Probabilities
i
Suppose the observer measured Pm , the probability of a positive signal in im
i
and, separately, Pn , the probability of a signal in in . These two quantities give
important information about what goes on at detector i at two separate stages.
But there is no direct correlation between those two pieces of information. Every
observation of im at stage Σm cannot possibly be influenced by what could
happen at in at stage Σn , and vice-versa, simply because each observation in
any given run necessarily precludes the other observation, in that run.
The argument goes as follows. If the observer looks at stage Σm to see what
the signal status of detector im is, then that stops the run and so in cannot be
investigated during that run. Likewise, if the observer decides to see what the
signal status is of detector in at stage Σn , that means they cannot have looked
at im at stage Σm , because that would have stopped that run immediately. In
brief, a comparison of the signal status im and in in any given run is ruled out.
i
The only information the observer has are the two probabilities Pm and Pni , and
these have been measured separately over many different runs.
If Pm
i
≡ 1 − Pmi
, Pni ≡ 1 − Pni , we define the correlation of probabilities Pn,mi
as follows:
i
Pn,m ≡ Pni Pm
i
+ Pni Pm
i
− Pni Pm
i
− Pni Pm
i
= (2Pni − 1)(2Pm
i
− 1) = Cni Cm
i
, (18.13)
Probability of Correlation
What the observer is really after is the probability of finding a signal at in
knowing for sure that there was a signal in im . This is then a genuine temporal
i
correlation, denoted Cn,m . In the following, we simplify the discussion by taking
the register to consist of just one detector.
262 Change and Persistence
Definition 18.6 Given initial state Ψm ≡ α0m +β1m , where |α|2 +|β|2 = 1,
and the evolution operator is Un,m , the temporal correlation Cn,m is given by
We point out that the probabilities Pr(0m |Ψm ) ≡ |α|2 and Pr(1m |Ψm ) ≡ |β|2
are found during the calibration process, prior to the experiment proper starting.
19.1 Introduction
Our aim in this chapter is to extend the Bell inequality discussion in Chapter 17
to its temporal analogue, known as the Leggett–Garg (LG) inequality.
Bell inequalities involve spatial nonlocality, that is, signal observations dis-
tributed over space. It was shown by Leggett and Garg that analogous inequalities
involving temporal nonlocality can be formulated (Leggett and Garg, 1985). We
shall discuss one of these, known as the LG inequality.
We saw in Chapter 17 that Bell inequalities are based on certain classical
mechanics (CM) assumptions about the nature of reality. Likewise, the LG
inequality is based on two CM principles that are not incorporated into quantum
mechanics (QM).
This principle is subtle. We experience its apparent validity all the time in our
ordinary lives: we look at objects and they appear not to change by those acts
of observation. However, quantum physics tells us otherwise, for all information
comes to us via quantum processes, and those involve quanta.1 Newton’s third
law, action and reaction are equal and opposite and act on different bodies, really
does have its quantum counterpart: no observation leaves the observed completely
unchanged.
Change Is Contextual
Up to now, our exposition of QDN has emphasized that everything changes.
But on reflection, that is a vacuous inconsistency, for change can be measured
1
We have not been much concerned with Planck’s constant so far in this book, but it
comes in here. Changes occurring to states of SUO when they are observed are quantified
by that unit of action, which is relatively negligible on our ordinary real-life (emergent)
scales of measurement.
19.3 The Classical Leggett–Garg Inequality 265
Our interest will be in the average correlation C 1,0 . From the information given
above, we find
N up N down 2N up
C 1,0 = − = − 1. (19.1)
N N N
By inspection, it is easy to see that the average correlation satisfies the constraints
−1 C 1,0 1.
signal status at Σ0 up up up up
signal status at Σ1 up up down down
signal status at Σ2 up down up down
i
C1,0 1 1 −1 −1
i
C2,1 1 −1 −1 1
i
C2,0 1 −1 1 −1
Ki 1 1 −3 1
Table 19.1 shows the possible up or down states of that bit at each stage, the
correlations associated with those bit states, and the LG correlation for that bit.
By inspection of Table 19.1, we see that the LG correlation K i ≡ C1,0 i i
+ C2,1
− C2,0 for the ith bit lies in the interval [−3, 1]. This is perfectly general, being
1
valid for any interaction whatsoever between the bits in that SUO. Since this is
true for any bit in the SUO, we conclude that the average K satisfies the same
condition. Hence we arrive at the LG condition
− 3 K 1. (19.3)
There is no way that the LG inequality could ever be violated classically, given
the principles of macrorealism and noninvasive measurability.
U U
1.5
K
–1
–2
–3 –2
–3 –2
–1 0 2
0
q2
1
q1 2 3
20.1 Introduction
Experimental violations of the Bell and Leggett–Garg inequalities studied in
Chapters 17 and 19, respectively, show that quantum states of systems under
observation (SUOs) cannot be interpreted classically. Interpretations that claim
to do this require contextually incomplete modifications of classical principles,
such as a breakdown of classical locality. Such modifications merely serve to make
standard quantum mechanics (QM) more appealing.
However, that is only one side of the observer–SUO fence. In this chapter,
we start to explore the other side of that fence, where the observer and their
apparatus live. Our focus is a thought experiment proposed by Franson (Franson,
1989), which suggests that apparatus cannot always be treated classically. Three
scenarios are discussed, each involving different time scales, with corresponding
different outcomes.
M j M S M j M
B B B B
phase φ and a time delay ΔT . The time delay ΔT is assumed the same for each
interferometer in any given run. The phase changes φ1 , φ2 associated with the
different interferometers can be altered by the observer, but are fixed before and
during each run.
The experiment hinges on the relationship between three characteristic times
(Franson, 1989).
Coherence Time τ1
In quantum optics experiments, a finite electromagnetic wave train of length L
moving at the speed of light c takes a time τ ≡ L/c to pass a given point. Such
a time is known as a coherence time. In the Franson experiment, the coherence
time τ1 is that associated with the production of the photon pair by stage Σ1 .
It is a characteristic of the photon pair source S and of the collimation proce-
dures applied subsequently. While τ1 cannot be altered, it can be determined
empirically. We shall assume that τ1 is the same for each photon.
Emission Time τ2
The second characteristic time is τ2 , the effective time interval within which
both photons in a pair can be said to have been emitted. This can be measured
during calibration by coincidence observations of detectors 13 and 23 with all
beam-splitters removed. It is assumed τ2 can be determined empirically and
that τ2 τ1 . This last inequality is crucial to FRANSON because when this
inequality holds, the observer has no way of knowing when a photon pair was
created during the relatively long time interval τ1 .1 It is this lack of knowledge
that leads to quantum interference in the FRANSON-3 scenario discussed below.
The spectacular aspect of FRANSON is that unlike the double-slit experiment,
1
Of course, “photon creation” is a vacuous metaphysical picture extrapolated from
observations done after the source has been triggered. There is no evidence, based on the
given apparatus, for the belief that anything has been “created” at S.
20.3 FRANSON-1: ΔT τ2 273
where the observer does not know from which point in space a photon came, here
the observer does not know at which point in time the photon pair was produced.
20.3 FRANSON-1: ΔT τ2
The relevant figure for this scenario is Figure 20.1. We discuss the experiment in
terms of its stages, as follows.
Stage Σ0
The initial total state is
1 00 ,
|Ψ0 ) ≡ |s0 ⊗ A (20.1)
0
Stage Σ0 to Stage Σ1
By stage Σ1 , the initial state has split into a correlated pair of photons moving
in opposite directions. The creation of this pair by this stage is represented by
the action of the contextual evolution operator U1,0 :
1A
|Ψ1 ) ≡ U1,0 |Ψ0 ) = |s1 ⊗ A 2
1 1 01 , (20.2)
where |s1 represents the combined spin state of the photon pair at this stage.
Stage Σ1 to Stage Σ2
The stage-Σ1 photons pass through beam splitters B 1 and B 2 as shown. One
output channel from each beam splitter leads directly to a final beam splitter,
while the other output channel is deflected by a mirror through a phase changer
before being deflected onto that final beam splitter. The four beam splitters B i ,
i = 1, 2, 3, 4, are parametrized by real transmission and reflection coefficients ti ,
ri respectively, according to the prescription given by Eq. (11.28).
274 The Franson Experiment
Stage Σ2 to Stage Σ3
There are four terms to consider in the transition from Σ2 to Σ3 :
U3,2 |s2 ⊗ A 2
1A 3 1 3 3 2 + ir4 A 4 03 ,
2 2 02 = |s3 ⊗ t A3 + ir A3 t4 A 3 3
U3,2 |s2 ⊗ A 2 02 = |s3 ⊗ t3 A
3A 3 + ir3 A
1 t 4 2
A + ir 4 4
A
2 2 3 3 3 3 03 ,
U3,2 |s2 ⊗ A 4
1A 3 1 3 3 4 + ir4 A 2 03 ,
2 2 02 = |s3 ⊗ t A3 + ir A3 t4 A 3 3
U3,2 |s2 ⊗ A 4
3A 3 3 3 1 4 + ir4 A 2 03 . (20.4)
2 2 02 = |s3 ⊗ t A3 + ir A3 t4 A 3 3
This is all the information needed for our computer algebra program MAIN
to evaluate the answers to all maximal questions. These answers turn out to
be complicated, long polynomials in the ti and ri parameters, so are not √ listed
here. However, setting them to the empirically useful value ti = ri = 1/ 2, i =
1, 2, 3, 4, as assumed by Franson (Franson, 1989), gives the relative coincidence
rates
Pr A 1A 2 03 |Ψ0 = sin2 1 φ1 sin2 1 φ2 ,
3 3 2 2
Pr A A
3 3 03 |Ψ0 = cos
3 2 2 1 2
1
2φ sin 12 φ2 ,
Pr A 1A4
3 3 03 |Ψ0 = sin
2 1 1 2 1 2
2 φ cos 2 φ ,
Pr A 3A4
3 3 03 |Ψ0 = cos
2 1 1
2φ cos2 12 φ2 . (20.5)
20.4 FRANSON-2: τ1 ΔT
In this variant of the Franson experiment, the photon wave trains 32 , 42 reflected
at B 1 and B 2 , respectively, travel along the long arms of their respective inter-
ferometers at the speed of light or less, depending on the medium through which
20.4 FRANSON-2: τ1 ΔT 275
M j M M j M
B S B
B B B B
they move. Because now the travel time difference ΔT is very much greater than
the coherence time τ1 , these wave trains arrive at B 3 and B 4 long after the
transmitted wave trains 12 and 22 have impinged on B 3 and B 4 , respectively. In
consequence, no local or global interference can take place. In fact, the observer
can now obtain total information concerning the timing of each coincidence
outcome in every run of the experiment and know precisely what path was taken
by each photon.
Under this circumstance, the four original detectors 13 , 23 , 33 , and 43 used
in FRANSON-1 now have to be regarded as eight separate detectors, i3 , i =
1, 2, . . . , 8, as shown in Figure 20.2. The first four of these register photon clicks
from short-path photons, while the last four signal clicks from those that have
traveled the long paths. This information is specific to each photon and does not
involve any photon pairs.
Significantly, the final-stage quantum register involved in this scenario and
the next one, FRANSON-3, is 256-dimensional. However, our computer algebra
program MAIN has no difficulty dealing with this because it has been encoded
to process only the relevant contextual Hilbert spaces, and these are of greatly
reduced dimensions.
This demonstrates a fundamental point about apparatus. In the conventional
usage of apparatus, experimentalists tend to regard their equipment as having
some sort of “transtemporal” identity, or persistence. In Figure 20.2, for example,
beam splitters B 3 and B 4 would most likely persist in the laboratory as material
objects, during the long interval ΔT between their interaction with wave-trains
12 and 22 and with the delayed wave-trains 32 and 42 . Even classically, however,
this need not be the case. It is conceivable that ΔT could be so long, such as
several years, that the beam splitters could be destroyed and rebuilt at leisure
between the observation of any short-arm photons and any long-arm photons.
276 The Franson Experiment
Whatever the actuality in the laboratory, from a quantum point of view, the
beam splitters receiving short- and long-arm photons should be considered as
completely separate pieces of equipment in this scenario (but not in the next).
In other words, apparatus and how it is used is time dependent. The analysis in
the next section shows that the rules for doing this can be quite nonclassical and
appear to violate the ordinary rules of causality.
For the FRANSON-2 scenario, τ1 ΔT , the dynamics follows the same rules
as in the previous section up to the transition from stage Σ2 to stage Σ3 . At this
point, the transformation rules have to take into account the possibility that the
observer could know the timings of all events. The rules for this transition are now
U3,2 |s2 ⊗ A 2
1A 3 1 3 3 2 + ir4 A 4 03 ,
2 2 02 = |s3 ⊗ t A3 + ir A3 t4 A 3 3
U3,2 |s2 ⊗ A 2
3A 3 5 3 7 2 + ir4 A 4 03 ,
2 2 02 = |s3 ⊗ t A3 + ir A3 t4 A 3 3
U3,2 |s2 ⊗ A 4 02 = |s3 ⊗ t3 A
1A 1 + ir3 A
3 t 4 6
A + ir 4 8
A
2 2 3 3 3 3 03 ,
U3,2 |s2 ⊗ A 4
3A 3 5 3 7 6 + ir4 A 8 03 .
2 2 02 = |s3 ⊗ t A3 + ir A3 t4 A 3 3
(20.6)
which should be compared with (20.4).
In this scenario, we find sixteen nonzero coincidence rates, each of the form
Pr(A i A j
3 3 03 |Ψ0 ), where i = 1, 3, 5, 7 and j = 2, 4, 6, 8. All of them are independent
of φ1 and of φ2 . For example, Pr(A 1A6
3 3 |Ψ0 ) = (t√
1 2 3 4 2
r t t ) , and so on. In the case of
i i
symmetrical beam-splitters, where t = r = 1/ 2, all 16 rates are equal to 1/16.
For FRANSON-2, the detectors behave in a manner consistent with the notion
that photons are classical-like particles propagating along definite paths.
We should comment on the choice of stage diagram for FRANSON-2, as
this impacts significantly on the encoding of program MAIN. Our choice of
introducing detectors 53 , 63 , 73 , and 83 seems inevitable but puzzling, because it
could be claimed that in a real experiment, there would only be four final stage
detectors, not eight. However, our observation above that the long-arm photon
signals could occur perhaps many years after the short-arm signals addresses that
point: the observer will have real evidence that one set of signals has arrived
long before the other set. The fact that the same atoms could persist in the
configuration of detectors 13 , 23 , 33 , and 43 until they did service as detectors
53 , 63 , 73 , and 83 is irrelevant.
A related point is that, for simplicity, we chose to model all of the eight final-
stage detectors to be associated with stage Σ3 , but that is not necessary. It would
perhaps have been more logical to assign the long-arm detectors to a separate,
later stage, but that would have been an inessential complication in programming
that would make no difference to the calculations or to the conclusions drawn
from them.
FRANSON-2 is an important illustration that the observer–apparatus side of
quantum physics can contain real surprises. The next variant, FRANSON-3, is
even more surprising.
20.5 FRANSON-3: τ2 ΔT τ1 277
M j M M j M
B S B
B B B B
20.5 FRANSON-3: τ2 ΔT τ1
This is the actual scenario discussed by Franson (Franson, 1989). In the following,
S stands for “short path” and L for “long path.” The fundamental change
induced by the observer’s setting of ΔT such that τ2 ΔT τ1 is that,
unlike the previous scenario, the observer cannot now use individual times of
detector clicks to establish which of the coincidences S–S or L–L has occurred
in a given run of the experiment. This presumes that a given run lasts at least
as long as the coherence time, and that coincidence observations are being made
at times greater than a critical time τc ≡ τs + ΔT into any given run. Here τs is
the earliest time that a wave train following an S path could take to a stage Σ3
detector. Coincidence observations of interference involving any L path at any
time earlier than τc into a given run will not occur, simply because any wave
train taking such a path would still be on its way to stage Σ3 detectors.
FRANSON-3 is discussed and modeled assuming observations are made at
times later than τc . In that regime, it will then not be possible for the observer to
use detection times to distinguish between S–S correlations and L–L correlations.
However, the observer will be able to filter out the two separate S–L correlations.
The relevant diagram is Figure 20.3, which is identical to Figure 20.2 except
now 53 is replaced by (3 ∨ 5)3 , 73 is replaced by (1 ∨ 7)3 , 63 is replaced by (4 ∨ 6)3 ,
and 83 is replaced by (2 ∨ 8)3 , where for example 3 ∨ 5 means “3 or 5.” Which
alternative is taken depends on the contextual information available in principle
to the observer.
The dynamics for this scenario is identical to that for the previous one, except
for the last equation in (20.6), which is replaced by
3A
U3,2 |s2 ⊗ A 4 3 3 3 1 4 + ir4 A
2 03 .
2 2 02 = |s3 ⊗ t A3 + ir A3 t4 A 3 3 (20.7)
278 The Franson Experiment
With this amended information, program MAIN gives the following nonzero
coincidence rates:
Pr A 2
1A 3 4
3 3 03 |Ψ0 = Pr A3 A3 03 |Ψ0 = 4 cos
1 2 1 1 1 2
2 φ + 2 φ ,
Pr A 2A
3 1 4
3 3 03 |Ψ0 = Pr A3 A3 03 |Ψ0 = 4 sin
1 2 1 1 1 2
2φ + 2φ , (20.8)
i i
√ eight others that each have value 1/16, for the symmetric case t = r =
and
1/ 2, i = 1, 2, 3, 4.
These results are a spectacular demonstration of temporal nonlocality as well
as spatial non-locality: interference is occurring between S–S and L–L outcomes.
Note that in actual FRANSON-3 type experiments, the observer would have
to measure the times at which coincidence clicks were obtained during each run
and then post-select, that is, filter out, those coincidences corresponding to the
{S–S, L–L} processes and those corresponding to {S–L, L–S}.
20.6 Conclusions
Since the publication of Franson’s original paper, there has been great interest
in empirical confirmation of FRANSON-3 predictions. While there is still some
room for debate concerning the interpretation of the experiment, the results
of Kwiat et al. (1993) vindicate Franson’s prediction, which corresponds to
2
1A
Pr(A 3 3 03 |Ψ0 ) = 4 cos ( 2 φ + 2 φ ) in our approach.
1 2 1 1 1 2
21.1 Introduction
In this chapter, we discuss some of the differences between apparatus and systems
under observation (SUOs). The discussion is phrased in terms of self–intervening
networks, wherein partial quantum outcomes in an early stage of an apparatus
can alter future configurations of that apparatus.
To explain what we mean, we introduce the following classification of experi-
ments. Note that reference to apparatus here includes real and virtual detectors,
and real and virtual modules. Virtual detectors and modules are mathematical
fictions that are introduced into the formalism for convenience.
Type-Zero Experiments
Many important experiments involve signal propagation through “empty space.”
We classify these as type zero (T0). The feature of T0 experiments qualifying
for this classification is that the observer has no control of the modules in the
information void. Examples are experiments in astrophysics, where the observer
can choose which source to observe and how they observe it, but has no influence
on the physical properties of whatever lies between source and detector. If there
are gas clouds between source and detector, the observer can only recognize that
fact after the experimental data are analyzed. An important modern variant of
T0 experiments involves map location via the Global Positioning System (GPS):
signals sent from Earth to geostationary satellites and received by mobile devices
back on Earth have traveled through Earth’s atmosphere and through empty
space, where special relativistic and general relativistic effects have to be taken
into account (Ashby, 2002).
Before the rise of experimental science, most “experiments” were based on
visual observations, which are principally of T0 classification. The advent of the
telescope and the microscope did not change things in this respect. Indeed, it
has been suggested that science did not progress in antiquity precisely because
of the philosophical principle that the only valid experiments could be of T0, as
280 Self-intervening Networks
Type-One Experiments
Many quantum experiments involve time-independent modules, introduced by
an observer into an information void, which appear to persist unchanged in the
laboratory over the course of each run of an experiment. By this we mean that
for each run of such an experiment, the apparatus that prepares the initial state,
shields it from the environment during that run, and detects the outcome state
is considered fixed during each stage of a run. In other words, such apparatus
persists over time scales significantly greater than the time scale of each run. We
classify such experiments as type one (T1). These differ from T0 experiments in
that the observer explicitly introduces real modules, such as beam splitters and
mirrors, into the information void.
The illusion of persistence is a convenient mental device, because all experi-
ments are done in process time and everything changes. However, the concept
of persistence is a potent and necessary one that is needed to make sense of
what science is all about. It is formalized in quantized detector networks (QDN)
by the use of null evolution, as discussed in Section 7.11 and Chapter 18. In
standard quantum mechanics (QM), T1 experiments are described in terms of
time-independent Hamiltonian operators in Schrödinger equations.
In practice, all experiments have some degree of T1 behavior. For instance,
the very action of constructing apparatus before an experiment starts carries the
hidden assumption that the constructed apparatus will persist long enough to be
useful. That assumption has some amusing aspects. For example, no one would
consider constructing parts of their apparatus out of ice, unless the environment
made that feasible, as in the Antarctic.
In quantum optics, the double-slit (DS) experiment, Mach–Zehnder interfer-
ometer experiments, and quantum eraser experiments (Walborn et al., 2002) are
all of this type. So too are high-energy particle-scattering experiments such as
those conducted at the Large Hadron Collider at CERN.
After T0, T1 experiments are technically the easiest to construct, and many
historical experiments were of this type. T1 experiments allow the focus of
attention to be entirely on the dynamical evolution of the states of an SUO,
because such states are conventionally regarded as the only objects of interest in
physics.
21.1 Introduction 281
Type-Two Experiments
An important class of experiment, referred to here as type two (T2), involves
some time dependence in the apparatus that is controlled by factors external
to the apparatus, either by the observer directly or by factors external to the
laboratory. When controlled by the observer, such experiments require a more
advanced technological level than in T1 experiments, such as advanced electronics
and computerization. In standard quantum mechanics (QM), T2 experiments
are described in terms of time-dependent Hamiltonian operators in Schrödinger
equations.
Spin-echo magnetic resonance experiments are of this type, because the exper-
imentalist arranges for certain magnetic fields to be rotated precisely, while addi-
tionally, the sample environment introduces random external influences related to
local temperature. An example of random changes controlled by the experimen-
talist are the delayed-choice experiments such as that of Jacques et al. (2007),
discussed in Section 14.5, where carefully arranged random changes are made
during each run. QM typically describes such experiments via time-dependent
Hamiltonian that may have stochastic elements.
T2 experiments are always going to be more interesting than T1 experiments
for two reasons: T1 experiments can be regarded as limiting cases of T2 exper-
iments where time dependence is turned off, and because T2 experiments have
the potential to reveal more information about the dynamics of systems under
observation than T1. Schwinger’s source theory in relativistic quantum field
theory shows that, in principle, T2 experiments allow for the extraction of all
possible information about quantum systems (Schwinger, 1969, 1998a,b,c).
T1 and T2 experiments may be collectively labeled as exophysical, because they
involve classical apparatus interventions that are external in origin to the SUO.
In such experiments, the apparatus is classically well defined at each instant
of time during each run, even in those situations where it changes randomly.
Therefore, a classical Block Universe (Price, 1997) account of apparatus during
each run of a T1 or T2 experiment may be possible.
Type-Three Experiments
In this chapter, we explore a third type of quantum experiment, referred to as
type three (T3). In such an experiment, the apparatus is modified internally by
the quantum dynamics of the SUO, rather than externally by the observer or the
environment.
There is an interesting point to be made here about experiments conducted
at high-energy particle-scattering laboratories such as the Large Hadron Collider
(LHC). Such experiments have fixed sources and detectors and do not have real,
physical modules in the interaction region. From that perspective they are T0
experiments. However, the observers can arrange for their particle beams to inter-
sect and allow particle collisions, so from that perspective, such experiments are
of type T1. However, the interactions that occur are entirely quantum processes,
282 Self-intervening Networks
so they are of also type T3 in an essential way. Indeed, that is the whole point
of such experiments.
In Chapter 25 we discuss such experiments as processes where virtual modules
are created in the information void by the dynamics of the SUOs. For exam-
ple, two-particle to two-particle scattering amplitudes discussed in high-energy
particle physics have all the characteristics of beam splitters.
Type-Four Experiments
In recent years, there has been interest in experiments that contain T2 and
T3 aspects. We have in mind experiments of the Eliztur–Vaidman bomb-tester
variety, discussed in Chapter 25. Such experiments can be classified as type-four
(T4) experiments
A question that we address toward the end of this chapter is whether T3
experiments can always be given a classical Block Universe account or whether
something analogous to superpositions of different apparatus has to be envisaged.
This is not to be confused with the superposition of labstates that we have
discussed up to now.
This question is related to the rules of quantum information extraction as they
are currently understood in QM. These rules state that quantum interference can
occur in the absence of classical which-path information, the most well-known
example of this being the DS experiment.
The question here is what precisely does a lack of which-path information
mean? If such as thing as a photon actually passed through one of the slits,
would it leave any trace in principle? Even if it did, it would have to be essentially
unobservable, at least far below the scales of classical mechanical detection. In
that case, an observer of the interference pattern would indeed be unaware of
such an interaction. In essence, such a scenario would be welcomed by supporters
of the Hidden Variables (HV) interpretation of empirical physics.
This supposition seems wrong to us for the following reason. In the DS exper-
iment, there are three cases: (1) a photon actually goes through one of the slits
but leaves no trace whatsoever; (2) a photon actually goes through one of the
slits and leaves a trace that cannot be detected; or (3) we avoid the use of the
photon concept.
Our inclination is to discount case 1 on the grounds that any assertion that
something “exists” or has occurred, but no evidence for it can ever be found, is
vacuous and can be dismissed by Hitchens’ razor.1
As for case 2, the idea that it is the mere lack of suitable photon-slit interaction
detection technology itself that produces interference patterns on a remote screen
seems be wrong on two counts. First, there is now sufficient evidence against the
1
“What can be asserted without evidence can be dismissed without evidence,” a modern
version of the Latin proverb “Quod gratis asseritur, gratis negatur ” (what is freely asserted
can be freely deserted).
21.1 Introduction 283
notion that photons are particles in the conventional sense (Paul, 2004). Second,
the idea that interference patterns could occur when one observer was unaware
of actual which-path information while another observer was aware of it seems
inconsistent.2 It seems likely that there has to be something deeper in the origin
of quantum interference. We do not believe it lies in the hidden variables direction
because of its contextual incompleteness.
Neutron interference experiments (Greenberger and YaSin, 1989) explore this
question in that they are moving toward larger scales of interaction between
SUOs and apparatus. In their experiment, the movement of mirrors involved
in their quantum erasure scheme involves macroscopic numbers of atoms and
molecules (Becker, 1998). In this case, the dynamical effects of the impact of
a particle on a mirror is reversed by a second impact. What is amazing is the
idea that all possible traces of the first impact could be completely erased, even
though there could (in principle) be time for information from the first impact
to be dissipated into the environment, thereby rendering the process irreversible.
This raises the question of what irreversibility actually means. Is a process
irreversible in some absolute sense, or is it contextual and dependent on the
observer? We have previously suggested that there are no absolutes in physics.
Therefore, we should examine the possibility that what is irreversible to one
observer need not be irreversible to another. This does not seem inconsistent or
unreasonable, given that irreversibility and observers are emergent processes, the
laws of which have not yet been understood in any significant way.
We focus exclusively on linear quantum evolution, i.e., quantum state evolution
conforming to the principles of QM as discussed for example in Peres (1995),
rather than appeal to any form of nonlinear quantum mechanics to generate
self-intervention effects. We explore a number of type-three thought experiments
involving photons, which act as either quantum or classical objects at various
times. As quantum objects they pass through beam-splitters and suffer random
outcomes as a result. As classical objects they are used to trigger the switching
on or off of macroscopic apparatus, a switching that determines the subsequent
quantum evolution of other photons.
We shall not discuss the nature of photons per se, except to say that they
are referred to as particles for convenience only: our ideology and formalism
treats them as signals in detectors (Jaroszkiewicz, 2008a). Everything is idealized
here, it being assumed that all detectors operate with 100 percent efficiency and
that photon polarizations and wavelengths can be adjusted wherever necessary
to make the scenarios discussed here physically realizable. The experiments we
discuss are not necessarily based on photons: other particles such as electrons
could be used in principle.
2
There is a possible loophole here: our comment involves two observers with access to
different contextual information. What that means physically is a nontrivial question for
any theory of observation. In the absence of any proper theory of such a scenario, we can at
best express our opinion on the matter.
284 Self-intervening Networks
S B
Note that in the program MAIN, null evolution carries 22 to 23 and 32 to 33 , and
the program reports detector status at stage Σ3 . The predicted nonzero outcome
probabilities are as expected:
Pr(A 3
1A 2 4
3 3 03 |Ψ0 ) = r , Pr(A3 A3 03 |Ψ0 ) = t .
2 2
(21.3)
21.3 Experiment SI-2: Double Self-intervention 285
B
B
S
B
2A
Pr(A 3 1A4
3 3 03 |Ψ0 ) = (r r ) , Pr(A 3 3 03 |Ψ0 ) = (t t ) ,
1 2 2 1 2 2
A
Pr(A A
3 3 03 |Ψ0 ) = (t r r ) , Pr(A 3 3 03 |Ψ0 ) = (t r t ) ,
3 5 1 2 3 2 3 6 1 2 3 2 (21.4)
4A
Pr(A 7 03 |Ψ0 ) = (r1 t2 r4 )2 , 4A
Pr(A 8 03 |Ψ0 ) = (r1 t2 t4 )2 .
3 3 3 3
Individual detector signal probabilities can be worked out from these correlations.
For example, Pr(13 ), the probability that detector 13 will be in its signal state
rather than its ground state at the final stage Σ3 , is given by
Pr(13 ) ≡ Pr(A 3
2A 3 5 3 6
3 3 03 |Ψ0 ) + Pr(A3 A3 03 |Ψ0 ) + Pr(A3 A3 03 |Ψ0 ) = (r ) , (21.5)
2 2
and so on.
There are no interference effects predicted in this experiment.
S B B
signality one is conserved, then the only scenario permitted is that a signal is
detected in 34 and in 44 . In that case, even though beam splitter B 4 is triggered,
no signals are detected at 54 or 64 .
The beam splitter saturation scenario shows that there is a fundamental dif-
ference between photon self-interference, which is a signality-one process, and
two-photon scattering, which is a signality-two process. The former is a mani-
festation of the quantum superposition principle, whereas the latter depends on
the details of photon–matter coupling. For instance, the lowest order amplitude
for photon–photon scattering in quantum electrodynamics is proportional to
α2 , where α is the fine structure constant. We mention in passing that the
correct calculation of the photon–photon scattering cross section requires delicate
treatment of divergent integrals, in order to bring the predictions in line with
cosmological observations (Liang and Czarnecki, 2011). This is a good example
of the need to factor in detector physics properly in relativistic quantum field
theory.
The dynamics for experiment SI-3 is handled in the usual way in program
MAIN, with the following predictions:
Pr(A 2
1A 3
1A
4 4 04 |Ψ0 ) = (t t ) , Pr(A 4 4 04 |Ψ0 ) = (t r r ) ,
1 2 2 1 2 3 2
Pr(A 5
4A 2 3
4 4 04 |Ψ0 ) = (t r t r + r t r t ) , Pr(A4 A4 04 |Ψ0 ) = (r t t ) , (21.6)
1 2 3 4 1 2 3 4 2 1 2 3 2
Pr(A 6
4A 3 4
4 4 04 |Ψ0 ) = (t r t r − r t r t ) , Pr(A4 A4 04 |Ψ0 ) = (r r ) .
1 2 3 4 1 2 3 4 2 1 2 2
1935). In brief, the architecture is this. A living cat is placed in a sealed, isolated
box along with a flask of poison and a mechanism that contains a radioactive
sample. The mechanism will release the poison if it is triggered by the decay of
any one of the atoms in the sample. An observer external to the box closes the
box and leaves it alone for 1 hour. At that time, the observer opens the box and
ascertains whether or not the cat is still alive or now dead.
The issue here is that according to standard quantum wave mechanics, the
wave function for the combined system of cat plus radioactive sample will develop
from a separable state of a {live cat and undecayed sample} to a entangled
superposition of a {live cat and undecayed sample} with a {dead cat and decayed
sample}. This prompts questions, such as “How can a cat exist in a superposition
of a live state and a dead state?” and so on, which have, over the decades since
Schrödinger’s article, generated a plethora of confusing views about the nature
of reality.
This scenario is one we can label T3. It is a complex example of a self-
intervening network, presented in simple terms.
For us, such questions are vacuous. All of them. If context is taken properly
into account, then no such questions need be raised. It suffices to look at what
the external observer can do, what they are actually doing, and when they
actually do it. The QDN prescription is clear. The observer prepares the combined
{cat/box/poison sample} SUO at stage Σ0 . On the basis of their contextual
information, such as the decay probability associated with the radioactive sample,
the observer can, just after closing the box, use standard quantum theory to
estimate the probability of the cat being alive when the box is opened at stage
Σ1 , that is, 1 hour later. What the observer eventually believes has actually
happened inside the box depends on the classical information that is allowed to
accumulate there during the hour. For instance, if a decay occurs, not only could
it trigger the poison to be released to kill the cat, but the lab time at which this
occurred could be registered. When the observer finally opened the box, if they
found the cat dead, then the registered time of death would be available.
It is such examples that make it clear that “the” information void is not
an absolute concept. It is contextual. In this case, the external observer has no
information about what is going on inside the box during the given hour that the
experiment runs. Therefore, a quantum description has to suffice. If after opening
the box, sufficient classical information is recovered from it to enable the observer
to reconstruct events inside that box during that hour, then a rather classical
description can be given.
At no point is the observer empirically justified in asserting that the cat is
in a quantum superposition of alive and dead. The observer may be justified in
imagining a mathematical description of the state of the system in such terms.
That is not the same as objective reality.
There are two further points to make here.
21.5 Schrödinger’s Cat 289
3
It is such considerations that make it obvious to us that elementary particle physics is
really a branch of emergent physics and should be approached as such, rather than as a
reductionist theory from which emergence could be understood.
290 Self-intervening Networks
lost it; he is a good driver or he is a bad driver; the cat is alive or it is dead.
In reality, wars may be lost by both sides, a man may drive well in his village
but be reckless on the motorway; a cat may be in a vegetative state that is not
dead but is not really worth calling life.
All of these concerns should be addressed before simplistic arguments based
on contextually incomplete views of quantum mechanics are given.
22
Separability and Entanglement
22.1 Introduction
In this chapter, we discuss the structure of signal amplitudes in terms of the
separability and entanglement of quantum register states. We review the concepts
of subregisters, splits, partitions, factorizable Hilbert spaces, the separability of
states, separations, and entanglements.
All Hilbert spaces in this chapter are taken to be complex and finite dimen-
sional, and denoted by capital calligraphic Latin letters, such as H. We do not
restrict the discussion to quantum bits, but ultimately, it is those in which we
are most interested.
Suppose X and Y are two finite-dimensional Hilbert spaces with a surjective
linear map U from X to Y.1 Suppose further that U has the property that it
preserves norm, that is, if for any element x in X we have U xY = xX ,
√
where xX ≡ (x, x)X is the Hilbert space “length” of x in X and yY ≡
√
(y, y)Y is the Hilbert space “length” of y in Y. Then U is called an isometric
isomorphism. Under these conditions, it is necessarily the case that X and Y have
the same dimension and, as far as basic Hilbert space properties are concerned,
are identical. We denote this form of equality by X Y.
1
Here surjective means that U X = Y, where U X is the image of X in Y under the linear
map U . This condition is not imposed in our definition of semi-unitary operators (which are
linear maps).
292 Separability and Entanglement
subregisters can have any dimension, including one. In the following, we shall
rule out one-dimensional Hilbert spaces as being of no empirical interest to us.
where U (1) and U (2) are copies of U. Then X is a complex Hilbert space of
dimension 3 × 4 × 3 × 2 = 72.
where the symbol ⊕ denotes a direct sum of vector spaces. The direct sum of
two Hilbert spaces is a Hilbert space with dimension equal to the sum of the
two Hilbert spaces being summed. The group theory of such bound states and
their decomposition leads in this case to the rule 1 ⊕ 3 2 ⊗ 2 (Lichtenberg,
1970).
22.3 Splits 293
H[ab] Ha Hb . (22.3)
22.3 Splits
A split is any convenient way of grouping the subregisters in a quantum register
into two or more factor registers, or atoms, each of which is regarded for the
purposes of that split as a Hilbert space of rank one, that is, a Hilbert space not
itself split into two or more factor registers. For large-rank quantum registers,
very many different splits will be possible.
The number of ways of splitting a rank-n quantum register is the same as the
number of ways Bn of partitioning a given set of cardinality n, a historically
important topic in combinatorial mathematics. The Bn are called Bell numbers 2
and satisfy many curious and interesting relations in diverse fields, such as
probability, game theory, and number theory. For instance,
n
n!
Bn+1 = Bk , B0 = 1, (22.5)
k!(n − k)!
k=0
from which we find the sequence {Bn , n = 1, 2, . . .} = {1, 2, 5, 15, 52, . . .}. An
explicit formula for Bn is given by Dobinski’s formula,
∞
1 kn
Bn = . (22.6)
e k!
k=0
22.4 Partitions
Consider a set {Ha : a = 1, 2, . . . , r} of Hilbert spaces, denoting the dimension
of Ha by da . Then the rank-r quantum register H[r] ≡ H1 H2 . . . Hr is a vector
space of dimension d[r] ≡ d1 d2 . . . dr that contains both entangled and separable
states.
The classification of states in such a register into separable or entangled types
is too limited for us, so we introduce the more useful concepts of separations and
entanglements. We explain our terminology starting with the separable sets.
Separations
For any two subregisters Ha , Hb of the quantum register H[r] , such that a = b,
we define the rank-two separation Hab to be the subset of the tensor product
Ha Hb consisting of all separable elements in that tensor product, that is,
Hab ≡ φa ψ b : φa ∈ Ha , ψ b ∈ Hb . (22.7)
By definition, Hab includes the zero vector 0ab of the tensor product Ha Hb . Note
that Hab = Hba .
The separation concept readily generalizes to higher rank separations as fol-
lows. Pick an integer k in the interval [1, r] and then select k different elements
1 2 k
a1 , a2 , . . . , ak of this interval. Then the rank-k separation Ha a ...a is the subset
1 2 k
of Ha Ha . . . Ha given by
1 2 k
1 2 k i i
Ha a ...a ≡ ψ a ψ a . . . ψ a : ψ a ∈ Ha , 1 i k . (22.8)
2
After E. T. Bell (Bell, 1938).
22.4 Partitions 295
be Hilbert spaces because they do not contain entangled states, which all tensor
products of rank two or more necessarily do.
Rank-Two Entanglements
We can now construct the entanglements, which are defined in terms of com-
plements. Starting with the lowest rank possible, we define the rank-two entan-
glement Hab to be the complement of the separation Hab in the tensor product
Ha Hb , that is,
Hab ≡ Ha Hb − Hab . (22.9)
Note that Hab cannot be a vector space because it does not contain the zero
vector.
The original tensor product space considered as a set is therefore the union
Ha Hb = Hab ∪ Hab (22.10)
of the set Hab of all separable states and the set Hab of all entangled states.
These two subsets are disjoint and neither is a vector space.
Separation Products
The generalization of the above to larger rank entanglements is straightforward,
but first it will be useful to extend our notation to include the concept of
separation product.
Suppose A and B are subsets of Hilbert spaces Ha and Hb , respectively, where
a = b. We define the separation product A • B to be the subset of Ha Hb given by
A • B ≡ {ψφ : ψ ∈ A, φ ∈ B} . (22.11)
Properties of the separation product are:
1. The separation product is symmetric, that is, A • B = B • A. This means
that the separation product is not equivalent to the Cartesian product A × B,
which is the set of all ordered pairs of elements.
2. Hab = Ha • Hb . Note that this is not the same thing as Ha Hb , which in our
notation is the tensor product of Ha and Hb .
3. The separation product is associative, commutative, and cumulative, i.e.
a
H • Hb • Hc = Ha • Hb • Hc ≡ Habc
Hab • Hc = Habc . (22.12)
The separation product can also be defined to include entanglements. For
example,
Hab • Hc = φψ : φ ∈ Hab , ψ ∈ Hc . (22.13)
Other expansions are possible, given that separations such as Hef g can be
expanded further as separation products.
Habcd ≡ Ha Hb Hc Hd
−Habcd ∪ Hab•cd ∪ Hac•bd ∪ Had•bc ∪ Hbc•ad ∪ Hbd•ac ∪ Hcd•ab
∪Hab•cd ∪ Hac•bd ∪ Had•bc ∪ Ha•bcd ∪ Hb•acd ∪ Hc•abd ∪ Hd•abc .
(22.17)
22.4 Partitions 297
Entanglement Partitions
The decomposition of a quantum register H into the union of disjoint separations
and entanglements will be called an entanglement partition of H, and each
element of that partition will be referred to as a partition element. We can now
list the various entanglement partitions that we have encountered, along with
the number of partition elements in each:
number of
rank register
partition elements
1 Ha 1 = B1
(22.18)
2 Ha Hb 2 = B2
3 Ha Hb Hc 5 = B3
4 Ha Hb Hc Hd 15 = B4
A calculation gives 52 partition elements in the entanglement partition of a
rank-five quantum register, which we recognize as B5 , the 5th Bell number.
The conjecture, then, is that the entanglement partition of a rank-r quantum
register has Br partition elements. This was proved by Eakins (Eakins, 2004),
and shows that the entanglement structure of large-rank quantum registers will
be too complicated to deal with without computer assistance.
The relationships between separations and entanglements are subtle, as are the
relationships between splits and entanglement partitions. Although the number
of partitions in the entanglement partition of a rank-r quantum register is the
same as the number of splits and given by the Bell numbers, splits and partitions
cannot coincide for r > 1. Every factor register in a split is a vector space,
whereas no partition element is a vector space. Both splits and partitions are
essential and unavoidable features in quantum mechanics and hence in the QDN
account of quantum measurement and causal set structure.
A final simplification in this line of investigation is to use the above superscript
notation to label the various elements of entanglements and separations. So, for
example, Ψabc•de•f gh denotes a state in the partition element Habc•de•f gh and so
on. Translated into direct terms, this means the following. First, this state is an
element of the rank-eight quantum register
H[8] ≡ Ha Hb Hc Hd He Hf Hg Hh . (22.19)
Ψabc•de•f gh = ψ a ψ b ψ c ψ de ψ f gh , (22.20)
This example is reasonably simple, as the two labstates involved can be ordered
as shown. In general, more complex patters will occur, and then the correspond-
ing zip diagrams will be more complicated.
The separation and entanglement structure of labstates should be of impor-
tance in any experiment. A significant application of these concepts is to causal
sets, studied in the next chapter.
23.1 Introduction
In this chapter we apply the concepts presented in Chapter 22 to a topic that
has been of much theoretical interest in recent decades: causal set theory (CST).
It is interesting and useful to review the status of CST relative to other
foundational topics in mathematical physics that have been and remain of prime
significance to empirical physics, namely, special relativity (SR), general relativ-
ity (GR), and quantum mechanics (QM). Those great frameworks are built on
foundations that presuppose the continuity of space and time.
Despite this, it is an empirical fact that we are surrounded by discreteness in
the form of atoms, so it is natural to see how that discreteness fits in with
the continuity of space and time. We have attempted in this book to show
that the application of discrete principles to QM can go far beyond the basic
Planck quantum of action. We have gone as far as discretizing empirical time,
in the form of the stage concept. However, as far as relativity in either of its
manifestations is concerned, the case for any form of discreteness has not yet been
established empirically, although great efforts are being made in that direction
in the program known as quantum gravity. We have particularly in mind the
discretization approaches to GR of Regge (Regge, 1961) and Penrose (Penrose,
1971).
Regge’s approach is a classically based discretization of the Riemannian man-
ifold concept, so has at best a generalized propositional classification (GPC)
of two.1 Penrose’s spin network paradigm started as a quantum description
of spacetime and has led to much work in the area of spin networks in the
hope of unifying GR and QM. However, workers in spin networks and related
branches of quantum gravity have focued their attention on the mathematical
1
When Regge’s approach is used to regularize Feynman path integrals in quantum gravity,
its GPC drops to one, as it then becomes an exercise in mathematical physics.
300 Causal Sets
complexities and hardly any on the role of observers and their apparatus, leading
to a GPC classification of quantum gravity as one, that is, as mathematical
physics and not physics. Claims made that such programs represent empirical
physics have not been empirically substantiated to date; quantum gravity at this
time remains empirically vacuous, despite the best efforts of many theorists over
many decades.2
In various attempts to account for the existence of space, time, and matter
in the Universe, physicists often adopt one of two opposing viewpoints. These
may be labeled bottom-up and top-down, reflecting the basic difference between
reductionist physics and emergent physics. QM makes an appearance in both
approaches because while it really models the emergent side of observation, its
mathematical structure has a strong reductionist flavor.
A number of bottom-up approaches to cosmology proceed from the assertion
that at its most basic level, the Universe can be represented by a vast collection of
discrete events embedded in some sort of mathematical space. For example, the
pregeometric approach asserts that conventional classical time and space and the
classical reality that we appear to experience all emerge on macroscopic scales
due to the complex connections between certain fundamental microscopic, unob-
served and unobservable, pregeometric entities. This approach was pioneered by
Wheeler (Wheeler, 1980) and has received attention more recently by Stuckey
(Stuckey, 1999).
A bottom-up approach to cosmology that we can relate to in quantized detector
networks (QDN) is the causal set hypothesis, which asserts that spacetime itself
is discrete at the most fundamental level. In CST models it is postulated that
classical, discrete events are generated either randomly or through some agency,
though neither the nature of these events nor the mechanism generating them
is explained or discussed in detail. QDN comes in cost-free at this point. In
this chapter we show how the particular mathematical properties and dynamical
principles of QDN generates causal set structures naturally, without any extra
assumptions.
Although it seems natural to generate causal sets by the discretization of a
pseudo-Riemannian spacetime manifold of fixed dimension, as is done in lattice
gauge theories and Regge’s approach to GR (Regge, 1961), causal set events
need not in principle be regarded as embedded in some background spacetime
of fixed dimension d. One view taken by some theorists is that conventional
(i.e., physical) spacetime emerges in some appropriate limit as a consequence
of the causal set relations between discrete events. If so, then it is reasonable
to expect that in the correct continuum limit, metric structure should emerge
naturally (Brightwell and Gregory, 1991). An even more intriguing hypothesis
2
This is not the same thing as QM over classical curved GR background spacetimes, for
which there is some empirical evidence, such gravitational lensing and the need to correct
GPS signals because of gravitational curvature and time dilation (Ashby, 2002).
23.1 Introduction 301
is the suggestion that the dimension of this emergent spacetime might be scale
dependent (Bombelli et al., 1987), making the model potentially compatible with
GR in four spacetime dimensions, string theory and p-brane cosmology, and
higher dimensional Kaluza–Klein theories. Such a hypothesis fits in with the
general philosophy taken in this book that physics is contextual: different effective
theories may explain different experiments.
between this new event and those already in existence. Specifically, either the
new event y may be related to another event x as x ≺ y, or x and y are said
to be unrelated. Thus the ordering of the events in the causal set is as defined
by the symbol ≺, and it is by a succession of these orderings, i.e., the growth
of the causet, that constitutes the passage of time. The relation x ≺ y is hence
interpreted as the statement: “y is to the future of x.” Further, the set of causal
sets that may be constructed from a given number of events can be represented
by a Hasse diagram of Hasse diagrams (Ridout and Sorkin, 2000).
The importance of CST is that in the large-scale limit of very many events,
causal sets may yield all the properties of continuous spacetimes, such as metrics,
manifold structure, and even dimensionality, all of which should be determined
by the dynamics (Bombelli et al., 1987). For example, it should be possible to use
the causal order of the set to determine the topology of the manifold into which
the causet is embedded (Brightwell and Gregory, 1991). This is the converse
of the usual procedure of using the properties of the manifold and metric to
determine the lightcones of the spacetime, from which the causal order may in
turn be inferred.
Distance may be introduced into the analysis of causal sets by considering the
length of paths between events (Bombelli et al., 1987; Brightwell and Gregory,
1991). A maximal chain is a set {a1 , a2 , ..., an } of elements in a causal set C such
that, for 1 ≤ i ≤ n, we have ai ≺ ai+1 and there is no other element b in C such
that ai ≺ b ≺ ai+1 . We may define the path length of such a chain as n − 1.
The distance d(x, y) between comparable (Howson, 1972) elements x, y in C may
then be defined as the maximum length of path between them, i.e., the “longest
route” allowed by the topology of the causet to get from x to y. This implements
Riemann’s notion that ultimately, distance is a counting process (Bombelli et al.,
1987). For incomparable elements, it should be possible to use the binary relation
≺ to provide an analogous definition of distance, in much the same way that light
signals may be used in special relativity to determine distances between space-like
separated events.
In a similar way, “volume” and “area” in the spacetime may be defined in
terms of numbers of events within a specified distance. Likewise, it should be
possible to give estimates of dimension in terms of average lengths of path in a
given volume. An attractive feature of causal sets is the possibility that different
spatial dimensions might emerge on different physical scales (Bombelli et al.,
1987), whereas in conventional theory, higher dimensions generally have to be
put in by hand.
Our discrete set structure arises from the separation and entanglement properties
of labstates and modules.
Furthermore, in the QDN approach, various relations assumed in the “sequen-
tial growth” mentioned above must be interpreted carefully. In quantum physics,
past, present, and future can never have the status they have in Block Universe
models. At best we can only talk about conditional probabilities, such as asking
for the probability of a possible future stage if we assumed we were in a given
present stage. This corresponds directly to the meaning of the Born interpretation
of probability in QM, where all probabilities are conditional.
Another point is that the causal set relations discussed by Sorkin and others
imply that the various elements a, b, . . . , z have an independent existence outside
the relations themselves and that these relations merely reflect some existing
attributes. This is a vacuous Block Universe perspective that QDN does not
accept.
A further criticism of classical CST comes from dynamics. In some diagrams,
relatively space-like events with no previous causal connection are permitted
to be the parents of the same event. The question arises then, given two such
unrelated events at a given “time” n, how any information from either of them
could ever coincide, that is, be brought together to be used to create any mutual
descendants. The only sensible answer is that there must be some external agency
organizing the flow of that information. But the whole point of standard CST,
however, is that there is no external space in which these events are embedded, or
any external “memory,” observer, or information store correlating such informa-
tion. It is not clear then how such processes could be encoded into the dynamics.
In our QDN approach, however, we have no such issue: the primary observer is
such an agency, manipulating apparatus in order to produce such processes.
Time
Stages go up the page, starting from stage Σ0 .
Separations
These are represented by squares surrounding single integers identifying separa-
tions. As with detectors on a given stage in a stage diagram, labeling is arbitrary.
Our convention then is to label atoms left to right.
Entanglements
A rank-r entanglement is represented by a left to right rectangle equivalent to r
adjacent squares, labeled by the atoms involved in the entanglement.
23.5 Case Study 1: The Double-Slit Experiment 305
Lines
These connect elements of the causal set as required from the modules involved.
K K
Figure 23.3. The causal set diagram for the monitored DS experiment.
The stage Σ2 labstate requires some commentary also, because Figure 10.2 is
misleading in the same respect as at stage Σ1 . The final stage labstate Ψ2 is an
element of the entanglement Q123...K
2 , where K is the number of detectors in the
detecting screen S. The final labstate may be written as a sum of computational
basis representation states:
2
K
a j,a
Ψ2 = α U2,1 2j−1 2 . (23.3)
j=1 a=1
With this analysis of the entanglement structure of the DS experiment, the appro-
priate QDN causal set diagram is given by Figure 23.2. Temporally successive
stages run bottom to top. Separations are in square boxes, and entanglements
are in rectangular boxes, labeled by the atoms involved.
f
S C B B
Figure 23.4. The amended stage diagram for the two photon interferometer.
B B
f f V V
photon interferometer. This casual set diagram is entirely classical: there are
no entanglements.
The reader is invited to analyze the other experiments discussed in this book in
terms of their labstate and apparatus causal set structures. It will become appar-
ent that there are significant complexities in even relatively simple experiments.
24
Oscillators
24.1 Introduction
Our aim in this chapter is to show how quantized detector networks (QDN)
describe the quantized one-dimensional bosonic (harmonic) oscillator. The adjec-
tive bosonic here refers to the fact that in the standard quantum theory of
this system under observation (SUO), the phase space operators of position
and momentum satisfy commutation properties, in contrast to the fermionic
oscillator, whose corresponding variables satisfy anticommutation properties and
which is studied at the end of this chapter.
The one-dimensionality referred to here is somewhat misleading: it refers to
the classical mechanics (CM) theory of a single particle moving in one spatial
dimension under the influence of an attractive quadratic force potential. In the
quantized version of the same SUO, there are infinitely many degrees of freedom,
requiring the use of an infinite-rank quantum register. This is one of the few
occasions in QDN in which we refer to the concept of infinity.
Because the rank of the oscillator’s quantum register is infinite, we find that the
QDN representation comes with a tremendous amount of mathematical overkill
and redundancy. Specifically, the Hilbert space dimension of such a quantum reg-
ister is far bigger than that actually needed to describe a quantized bosonic oscil-
lator, the vast bulk of states in the register being what we shall call transbosonic.
As a Hilbert space, the infinite-dimension quantum register is nonseparable,
which means that it has no denumerable (that is, countable) basis, unlike the
Hilbert space HHO of energy eigenstates used to describe the standard QM
bosonic oscillator. While the improper position coordinate basis {|x : x real} is
nondenumerable, the normalized energy eigenstate basis for HHO is countable.
We will show how the QDN labstates that are the analogues of standard QM
oscillator states are restricted to a subspace of the register of measure zero and
remain there from stage to stage.
The degree of “overkill” in the QDN description of the oscillator is no different
in essence to that encountered in the quantum field theory (QFT) description
310 Oscillators
of the oscillator. QFT is a many particle theory that allows for the possibility
of more than one oscillator to be excited at the same time. This is in contrast
to Heisenberg’s matrix mechanics (Heisenberg, 1925) and Schrödinger’s wave
mechanics (Schrödinger, 1926), which are one-particle descriptions of the oscilla-
tor. Being based on signal detector principles, QDN can readily describe arbitrary
numbers of oscillator-type signals, over and above quantum superpositions of one-
particle signal states. This underlines the point that QDN looks much more like
a halfway house to QFT rather than another representation of standard QM.
The transbosonic states referred to above are really a reflection of that fact.
1
By this we mean what is observable, not what is theorized.
2
If we view reductionism as the counterbalance to emergence, then the highest empirical level
corresponds to the lowest reductionist level, where minimal mathematical details are given.
24.2 The Classical Oscillator Register 311
10 01 12 03 14 05 16 07 18 09 110 . . . (24.4)
24.2 The Classical Oscillator Register 313
Our conclusion is, therefore, that states in R[∞] cannot be classified by the
integers alone. Instead, we may use the sequence corresponding to each state as
an index. Specifically, if S is the binary sequence
S ≡ {sa : sa = 0 or 1 for a = 0, 1, 2, . . .} , (24.7)
then the corresponding state S in R [∞]
is given uniquely by the expression
∞
0
S ≡ s 0 s1 s2 . . . = sa . (24.8)
a=0
where the generalized Kronecker symbol δ ST takes the value unity if and only if
the binary sequences S and T are identical; otherwise, it is zero.
We take it as an axiom that, given two binary sequences S and T , the product
1
∞
sa ta is 1 if the sequences are identical and 0 otherwise.
a=0
24.3 Quantization
Quantization amounts to associating the set of states in R[∞] as a (preferred)
basis B [∞] for a nonseparable Hilbert space Q[∞] . States in Q[∞] will be called
quantum register states and are of the form
Ψ= Ψ (S) S, (24.10)
S
where the summation is over all possible infinite binary sequences S and the
coefficients Ψ (S) are complex. The Hilbert space inner product is defined in the
obvious way, that is,
ΨΦ = Ψ(S)S Φ(T )T
S T
= Ψ(S)Φ(T )
ST = Ψ∗ (S) Φ (S) . (24.11)
S T δ ST S
24.4 Bosonic Operators 315
As discussed above, finite countable sequences can be mapped into the integers
via the computational map (5.14). For a sequence S that maps into integer n,
we can use the notation n rather than S to denote the corresponding quantum
register state.
Pa a
P Aa a
A
Pa Pa 0 Aa 0
a
P 0 a
P 0 a
A
Aa 0 Aa 0 Pa
a
A a
A 0 a
P 0
316 Oscillators
Theorem 24.6 The bosonic filter operators satisfy the multiplicative rule
P b
a P ab a
B B = δ PB , a, b ∈ N ≡ {0, 1, 2, . . .}. (24.14)
The element 2n in Q[∞] corresponds to the energy eigenstate |n in the stan-
dard quantum theory of the oscillator.
The bosonic filter operator a should not be confused with the signal projec-
P
1 B a
≡
tion operators P a ∞ b
I P , where I b is the identity operator for qubit
b=a
The set of all bosonic states is denoted QB and is the QDN analogue of the
Hilbert space of quantum oscillator states in standard QM.
Examples of transbosonic states are the signal ground state 0 and all those
finite countable elements p of the computational basis B where p is not some
power of two. In fact, almost all elements in the quantum register are trans-
bosonic. It can be readily verified that linear superpositions of bosonic states are
always bosonic states, while the linear superposition of any transbosonic state
with any other state in the register is always transbosonic.
24.5 Quantum Register Oscillator Operators 317
The importance of the bosonic operators is that when they are applied to
bosonic states, the result is always a bosonic state, which can be easily proved.
where the states |n, n = 1, 2, 3, . . . are the usual orthonormal excited states of
the oscillator ground state |0. The key to the QDN description is the observation
that these states are identified one-to-one with the bosonic states 2n discussed
above, namely,
|n ↔ 2n , n = 0, 1, 2, . . . (24.20)
We note that
|0 ↔ 10 01 02 03 . . . = 1, |1 ↔ 00 11 02 03 . . . = 2, (24.21)
and so on. A particularly significant observation is that the quantum register
signal ground state 0 is not the oscillator ground state |0. This is one of the
reasons we felt it necessary to introduce nonstandard notation for labstates:
keeping a clear distinction between those and conventional QM states is more
than a matter of notion but reflects deeper interpretational issues.
To find a quantum register representation of the ladder operators, we first
introduce some auxiliary notation. We define the register operators
∞ ⎧ ⎫ ⎧ ⎫
0 ⎨0 ∞ ⎬ ⎨0 ∞ ⎬ a
P≡ P a , Aa ≡ I b Aa , A a ≡ Ib A . (24.22)
⎩ ⎭ ⎩ ⎭
a=0 b=a b=a
The operators A a can be used to generate bosonic states from the signal ground
state 0, which is transbosonic.3 Specifically, we have
a 0,
2a = A a = 0, 1, 2, . . . (24.23)
With these definitions we construct the operators
a ≡ A
B a PAa+1 , a+1 PAa .
Ba ≡ A (24.24)
Remarkably, these operators are bosonic, as can be readily proved from the fact
that
Aa IB = Aa . (24.25)
We find for a = 0, 1, 2, . . .
'
∞
Ba ≡ ⊗ P b Aa A a+1 = P 0 P 1 . . . P a−1 Aa A a+1 P a+2 P a+3 . . .
b=a,a+1
'
∞
a ≡
B ⊗ Pb A a Aa+1 = P 0 P 1 . . . P a−1 A a Aa+1 P a+1 P a+3 . . .
b=a,a+1
(24.26)
These operators satisfy the relations
a Bb = δ ab Pa+1 ,
B b = δ ab Pa .
Ba B (24.27)
Then the standard ladder operators a, a† take the quantum register
representation
∞
a ↔ aB ≡ 2 (n + 1)Bn ,
n=0
∞
a† ↔ a†B ≡ n.
2 (n + 1)B (24.28)
n=0
3
The signal ground state 0 is not the same as the oscillator ground state |0, which is
identified with the quantum register state 20 = 1 in the computational basis representation
and 10 01 02 03 . . . in the occupation basis representation.
24.6 Comparison with Quantum Field Theory 319
≡ 1 a† a +
The standard QM bosonic oscillator Hamiltonian operator H 1
has
2 2
quantum register representation
∞
↔ H B ≡ 1 a† aB + 1 IB =
H n+
1
PnB . (24.31)
2 B 2 2
n=0
This operator commutes with the bosonic identity IB , and therefore, any states
in the quantum register that start off bosonic at initial time remain bosonic, if
they evolve under this Hamiltonian.
that is, as the infinite direct sum of appropriately symmetrized tensor products
of copies of a single-particle Hilbert space H. Looking at the above presented
QDN version of the bosonic oscillator, we see that the QDN signal ground
state 0 corresponds to the QFT vacuum |0; the QDN bosonic state space
QB corresponds to H; and the transbosonic states in QDN correspond to the
multiparticle states in QFT.
The QDN formalism may be interpreted as an attempt to encode Fock’s vision
of QFT into a mathematical formalism that is based on detectors rather than
SUOs. In the next chapter, we go further in this respect by extending the QDN
formalism to allow for the possibility of constructing the signal ground state
0 itself through the creation of apparatus in the laboratory from a state of
nonexistence, denoted ∅, that corresponds to the information void that we have
focused on in other chapters. The information void represents the state of a
laboratory in which there are no detectors whatsoever. However, this certainly
does not mean that there is no observer or laboratory.
320 Oscillators
a , a = 1, 2, . . . , N :
Next, we define a set of nonlocal operators, αa , α
α1 ≡ A1 I 2 I 3 . . . I N ,
αa ≡ σ 1 σ 2 . . . σ a−1 Aa I a+1 . . . I N ,
1
α ≡A 1I 2I 3 . . . I N ,
a
α ≡ σ 1 σ 2 . . . σ a−1 A a I a+1 . . . I N , a = 1, 2, . . . , N. (24.34)
These are the required “fermionic field operators.” They satisfy the anticommu-
tation relations
αa αb + αb αa = 0, a α
α b + α
b α
a = 0,
b + α
αa α b αa = δ ab I, 1 a, b, N. (24.35)
Exercise 24.10 Use the signal bit algebra listed in Table 4.1 to prove
(24.35).
24.7 Fermionic Oscillators 321
αa 0 = 0,
a 0 = 0,
α
α
α 0 = 0, a = 1, 2, . . . , N.
a a
(24.36)
This means that we can reconstruct all the relevant structure of fermionic field
theory, the operators α a and αa playing the role of fermionic creation and
annihilation operators.
A significant feature of the above anticommutation relations is that they are
achieved via the use of the nonlocal operators and do not invoke Grassmannian
(anticommuting) numbers to do so.
25
Dynamical Theory of Observation
25.1 Introduction
In this chapter we discuss how quantized detector network (QDN) theory can
be extended to cover the creation and decommissioning of apparatus, from the
perspective of observers and their laboratories. This extension will be referred to
as extended QDN. It allows us to discuss bit power sets, laboratories, the universal
register, contextual vacua, and the creation of quantized detector apparatus.
This extended formalism is used to describe the Elitzur–Vaidman bomb-tester
experiment and the Hardy paradox experiment.
A central concept running through this book is that of the observer, the
enigmatic “I ” of I think therefore I am. The problem is that, despite the many
triumphs of quantum mechanics (QM), the physics of the observer and observa-
tion is still not well understood.
Regardless of how observers are defined and whether classical or quantum prin-
ciples are involved, physicists generally believe that classical information in some
form is extracted from systems under observation (SUOs) in actual physics exper-
iments. In all branches of science, their language reflects this belief: experimental-
ists talk of measuring an electron’s spin or the mass of a new particle, and so on.
The conceptual issues in quantum mechanics such as wave–particle duality,
quantum interference, and nonlocality gave the first indication that all might
not be well with this perspective. We need only to look at the photon concept to
appreciate some of the problems with the idea that photons are particles (Paul,
2004). There are experiments where a photon (that is, a signal) is detected from
a crystal, but the atom “from whence it came” cannot be identified, suggesting
that a cooperative process is involved, rather like the phonon concept in the
physics of crystals.
The problem we have with the photon-as-particle concept is the gap in logic.
If the particle interpretation is taken literally, then obvious questions about
the physical structure or its equivalent of such particles spring up. Does such
25.2 Power Bits 323
a particle have a “surface”? If it has a surface, what is that surface made of?
These and other simple (minded) questions quickly lead to the conclusion that
the particle concept is a convenient and economical objectification of context. In
other words, a useful illusion.
Some particle concepts appear better in this respect than others. For instance,
fermion number is conserved in astrophysical processes, but photon number is
not. Photons from the sun cannot be said to have traveled from the deep interior,
whereas we could say so about neutrinos.
So much for the signals that observers detect. What about the observers them-
selves? It is surely too great an assumption to think of them as a mere auxiliary
phenomenon in physics. Without observers, physics is a vacuous subject.
What seems to be missing is a comprehensive dynamical theory of observation
that would treat observers and SUOs more on the same footing. In such a
theory, observers would be subject to the same laws of physics as the SUOs
that they were observing. Such a theory would be capable of accounting for
the creation and annihilation of observers and their apparatus, as well as states
of SUOs, because in the real world, that is what goes on and nothing lasts
forever.
We do not have such a theory. That may come only once the physics of
emergence has been much better understood. However, we can make a start
with what we have at present in what we think may be a useful direction
(Jaroszkiewicz, 2010). In this chapter, we develop an extension of the binary
truth value concepts explored in earlier chapters, aimed at describing the possible
states of apparatus in a more general form than having just two states, yes or no.
P(B) = {0 , 1 , 2 , 3 },
∼ ∼ ∼ ∼
(25.1)
where we define
0 ≡ {0}, ∼
∼
1 ≡ {1}, ∼
2 ≡ {0, 1}, ∼
3 ≡ {∅}. (25.2)
324 Dynamical Theory of Observation
In this scheme, the set {∅} is a nontrivial element of P(B) and counts as one
element of the power set. We shall refer to the power set of a bit as a power bit.
In this chapter, we shall work in terms of the elements of P(B) rather than with
the elements of B itself, identifying elements ∼ 0 and ∼1 of P(B) as synonymous
with bit states 0 and 1 of B, respectively. The value of using P(B) rather than B
itself is that the elements of the former are sets, so we can use the set properties
of union and intersection to make propositions about power bits equivalent to
those in the algebra of sets (Howson, 1972).
Interpretation
Before we proceed further, however, we need to resolve the following problem:
the power set P(B) of a bit B appears to have too many elements for a physical
interpretation. Logic suggests that only the elements ∼0 and ∼
1 of the power set are
actually needed in an experiment. What could the elements ∼ 2 and ∼
3 represent?
It has been our experience that many such questions in QDN are answered
by looking at the situation in question and identifying any hidden assumptions.
Consider an experimentalist who is going into a laboratory in order to determine
the signal status of a certain detector. Even before they could get an answer, one
glaring fact has to be true: that the detector actually exists.
Now whether or not a detector actually exists in a laboratory (“existence”
being defined as a detector actually being found and recognized as such by an
observer coming into that laboratory) is surely an empirical question, the answer
to which the observer cannot assume is “yes, it exists,” before they enter the
laboratory. This then is where one of our two extra elements of the power set
P(B) finds a natural interpretation. We interpret the element ∼ 3 ≡ {∅} as a state
of physical nonexistence, in the laboratory, of the detector in question. We shall
call it the void state of a detector.
It will undoubtedly seem strange to assign a state in our formalism to a
condition of nonexistence of a detector. But if we want to construct a theory
where apparatus can be created or destroyed, then that is precisely what we
have to do.
As for the remaining element, ∼2 ≡ {0, 1}, of the power set P(B), the ambiguity
in the elements it has (0 and 1) fits in nicely with an obvious physical condition
that a detector could be in: a state where the apparatus exists physically but is
not operating normally, to the extent that it is not registering either a ground
state or a signal state. For instance, such a detector could be faulty. Or it could
have been deliberately taken off-line (or decommissioned). We shall call such a
condition of a detector the faulty state, the off-line state, or the decommissioned
state.
Given the above four states of a detector, we can find four natural power bit
∼
questions, denoted i , i = 0, 1, 2, 3. These are explained in Table 25.1, along with
the answers they induce when asked of each of the four power bit states.
25.4 Matrix Representation 325
∼
Is this detector in its ground state? 0 1 0 0 0
∼
Is this detector in its signal state? 1 0 1 0 0
∼
Is this detector off-line? 2 0 0 1 0
∼
Is this detector in its void state? 3 0 0 0 1
⎡ ⎤ ⎡ ⎤
1 0
⎢0⎥ ⎢1⎥
0 [0
∼ ∼
] ≡⎢ ⎥
⎣0⎦ , 1 [1
∼ ∼
] ≡⎢ ⎥
⎣0⎦ , and so on. (25.4)
0 0
O on ∼i by the action of a power bit
We represent the action of bit operator ···
matrix [O
···
] on a column matrix [∼i ], such that
O i [O
··· ∼
][ i ] ≡ [O
··· ∼
i ].
··· ∼
(25.5)
∼
In this matrix representation the dual elements i are represented by the row
matrices
∼ ∼
0 1 0 0 0 , 1 0 1 0 0 ,
∼ ∼
2 0 0 1 0 , 3 0 0 0 1 . (25.6)
This is consistent with the question and answer relations (25.3).
We can use the power bit matrix representation to define the operational
∼
meaning of the dyadics ∼i j . These can then serve as formal basis elements in
326 Dynamical Theory of Observation
the expansion of power bit operators. Given a power bit operator defined by
3 ∼
O = i=0 ···
(25.5), we can write it as the formal (dyadic) expression ··· O ∼i i .
Products of power bit operators are defined in the natural way: given power bit
operators M
···
,N
···
, we define their “product” M N by its action on any element ∼i
··· ···
rule is associative but not commutative, which can be seen from the matrix
representation.
a laboratory. Therefore, the bit operators we discuss in this section cannot map
elements of a bit power set to any state other than ∼ 0, ∼
1, ∼
2 , or ∼
3.
The operators we define below have been constructed on the basis of what
seems reasonable to us. The following bit operators seem physically reasonable,
practically useful, and unavoidable in the kind of formalism we are aiming for.
Identity ···
I
This operator maps every element back into itself, i.e., ···
I ∼i = ∼i . Its matrix
I ]ij = δij .
elements are given by the Kronecker delta, that is, [···
Annihilator ···
Z
This operator maps any element ∼i of the power set P(B) into the void state
3 ≡ {∅}, that is, ···
∼
Z ∼i = ∼
3 , i = 0, 1, 2, 3. Its matrix representation is
⎡ ⎤
0 0 0 0
⎢0 0 0 0⎥
[Z ] =⎢
⎣0
⎥. (25.7)
··· 0 0 0⎦
1 1 1 1
The annihilator has a fundamental role in our theory: it represents the process of
destroying and removing from the laboratory all traces of an already physically
existing detector.
P and ···
The Power Bit Signal Projection Operators ··· P
These operators have the action
P0 = ∼
··· ∼
0, P1 = ∼
··· ∼
3, P2 = ∼
··· ∼
3, P3 = ∼
··· ∼
3,
0 = 3, 1 = 1, 2 = 3, 3 = 3, (25.8)
P
··· ∼ ∼
P
··· ∼ ∼
P
··· ∼ ∼
P
··· ∼ ∼
25.5 Special Operators 327
A, ···
The Power Bit Signal Creation and Annihilation Operators ···
A
These operators are defined principally by their action on the normal signal states
0 and ∼
∼
1:
A0 = ∼
··· ∼
3, A1 = ∼
··· ∼
0, A2 = ∼
··· ∼
3, A3 = ∼
··· ∼
3,
0 = 1, 1 = 3, 2 = 3, 3 = 3, (25.10)
A
··· ∼ ∼
A
··· ∼ ∼
A
··· ∼ ∼
A
··· ∼ ∼
Comparison with Table 4.1 shows that these tables are isomorphic, provided the
Z in Table 25.2.
zero operator Z in Table 4.1 is identified with the annihilator ···
P &
P A &
A
··· ··· ··· ···
P
···
P
···
Z
··· ···
A Z
···
&
P Z &
P Z &
A
··· ··· ··· ··· ···
A
···
Z
···
A
··· ···
Z P
···
&
A &
A Z &
P Z
··· ··· ··· ··· ···
328 Dynamical Theory of Observation
two scenarios. If a detector is in its void state, then it does not exist, so the
action of the construction operator represents the physical construction of that
detector, set up in its ground state in the laboratory, prior to any experiment.
It is assumed that facilities exist in the laboratory for this. Alternatively, if the
detector already exists, then the construction operator resets it to its ground
state if it is normal or repairs it and sets it to its ground state if it is faulty.
The construction operator is represented by the matrix
⎡ ⎤
1 1 1 1
⎢ 0 0 0 0⎥
[C
···
] =⎢
⎣ 0 0 0 0⎦ .
⎥ (25.12)
0 0 0 0
Decommissioning Operator D
···
No Detector
The detector in question does not exist. The observer will see this; it is an
empirical fact that has physical significance. This absence is represented by
3.
extended labstate ∼
25.7 The Universal Register 329
Faulty Detector
The detector in question exists physically, but is faulty, off-line, or decommis-
sioned. This situation is represented by extended labstate ∼2.
Ground State
The detector in question exists physically, is in perfect working order, and is in
its ground state. This is represented by extended labstate ∼0.
Signal State
The detector in question exists physically, is in perfect working order, and is in
1.
its signal state. This is represented by extended labstate ∼
Remark 25.1 This is one of the very few places in QDN that we permit an
infinity to enter the discussion; it costs us nothing in material terms and is a
purely theoretical construct that nevertheless represents a very real state of
existence of a laboratory, before any detectors have been constructed in it.
The cardinality of the universal register is a measure of how many power sets
could belong to it. That depends on the imagination, and may be assumed to be
infinity. Precisely what sort of Cantorian cardinality it should be is not clear, but
that is a vacuous concept anyway. If we thought in terms of Λ sitting in continuous
space, then we expect at least the cardinality, c, of the continuum. However, that
is a metaphysical statement, because there would not be enough energy in the
universe to create a continuum of detectors.1 What helps us immeasurably here
is that real observers can only ever deal with finite numbers of detectors. We
can generally ignore all nonexistent potential detectors, but keep in mind that
an absence of a detector can be significant in some circumstances, such as in the
Renniger thought experiment (Renniger, 1953).
The product notation in (25.14) reflects the relationship between collections
of power sets and the tensor products of qubit spaces that we encounter in the
quantum version of this approach, discussed later. In these products, ordering is
not significant, since labels keep track of the various terms. An arbitrary classical
1∞ α
labstate Ψ∼
n in the universal register Ωn at stage Σn will be of the form α ψn , ∼
state Ψ
∼n
≡ α ψ n in Ωn according to the rule
∼
∞ ⎧ ∞ ⎫
0 ⎨0 ⎬ ∞
0
OnΨ
∼n
= Oα ψ βn ≡ Oα α
n ψn . (25.15)
··· ··· n ⎩ ∼ ⎭ ··· ∼
α β α
1∞
For every classical register state Ψ
∼
≡ α ψ α there will be a corresponding
∼
∼ 1∞ ∼ ∼
dual register state Ψ ≡ α ψ α , where ψ α is dual to ψ α . Classical register states
∼
1
In other words, the Universe could not observe itself completely.
25.8 Contextual Vacua 331
[r]
More generally, an extended labstate ∼ 0 n consisting of a number r of detectors
each in its signal ground state at stage Σn is given by
0 [r] ≡ ···
∼n
C 1n ···
C 2n . . . ···
C rn ∼
∅n , (25.18)
where without loss of generality we label the detectors involved from 1 to r. Such
a state will be said to be a rank -r ground state, or contextual vacuum state.
We can now draw an analogy between the vacuum of RQFT and the rank-r
ground states in our formalism. The physical three-dimensional space of con-
ventional physics would correspond to a contextual vacuum of extremely large
rank, if physical space were relevant to the experiment. This would be the case
for discussions involving particle scattering or gravitation, for example. For many
experiments, however, such as the Stern–Gerlach experiment and quantum optics
networks, physical space would be considered part of the relative external context
and therefore could be ignored for the purposes of those experiments. It all
depends on what the observer is trying to do.
In the real world there is more than one observer, so a theory of observation
should take account of that fact. That is readily done in extended QDN. For
example, the mutual contextual vacuum for two distinct observers A and B for
which some commonality of time had been established would be represented by
an element in Ωn of the form
0 A,B ≡ ···
∼n
C A,1 C A,2
n ··· C A,r
n . . . ··· n
A
C B,1 ···
··· n
C B,2 C B,r
n . . . ··· n
B
∅n ,
∼
(25.19)
and so on. If subsequent dynamics was such that the detectors of observer A
never sent signals to those of observer B and vice versa, then to all intents and
332 Dynamical Theory of Observation
purposes we could discuss each observer as if we were primary and they were
separate secondary observers. If on the other hand some signals did pass between
them, then that would be equivalent to having only one secondary observer.
If no commonality of time or other context has been established between
primary observers, then there can be no physical meaning to (25.19). This is
an important point in cosmology, where there are frequent discussions about
multiple universe “bubbles” beyond the limits of observation. The mere fact
that astronomers have received light from extremely distant galaxies establishes
a context between the signal sources in those galaxies and the detectors used
by the astronomers and validates the use of general relativity (GR) all the way
to those regions of spacetime. If, on the other hand, no such signals have been
received, then there is no such context. Therefore, relative to astronomers today,
the universe beyond the horizon of observation can be meaningfully represented
only by an information void, not a spatial vacuum. Something may be there, but
we should not discuss it as if we had access to any form of information about it,
such as its spacetime structure.
Much the same concern must be raised about black hole physics. That requires
careful analysis of the contextual relation between observers outside the critical
(Schwarzschild) radius and those who were assumed to be inside it, if any such
context can be found. An event horizon is a boundary between observers on one
side and observers on the other. For all intents and purposes, observers on one side
of such a horizon have to regard those on the other side as in an information void.
This raises deep questions and concerns about the use of coordinate patches
in general relativity and black hole physics that may be related to the problems
of “quantizing gravity.” We have asserted throughout this book that quantum
mechanics is much more like a theory of entitlement than about the structure of
SUOs. With this in mind, we should always ask the question: how do we know
this or that? Consider GR. It is standard to postulate a line element without
asking what could have been the source of that spacetime structure, or whether
there are physically enigmatic possibilities, such as closed time-like curves, as in
Gödel’s spacetime (Gödel, 1949). There may be event horizons in our solutions to
Einstein’s equations. Our point is this: what empirical entitlement do we have to
discuss two regions of spacetime separated by an event horizon, such as happens
in the case of the Schwarzschild metric? Contextual completeness should tell
us that such a scenario is vacuous. Trying to quantize any theory under such
circumstances seems to be asking for trouble.
25.9 Experiments
Long before any experiment can begin, an observer starts off with a laboratory Λ
∅ . Then at some stage long before any runs can be taken, specific
in its void state ∼
apparatus consisting of a finite number r of detectors has to be constructed
in Λ. We will assume without loss of generality that these are all functioning
25.9 Experiments 333
normally and in their ground state, so the labstate at that point is given by the
equivalent of the right-hand side of (25.18). All of this is necessary before state
preparation.
Modules
We have so far discussed only the detectors. Similar comments can be made about
the modules that sit in the information void. The difference is that labstates are
directly associated with detectors, both real or virtual, whereas modules are
almost invariably classical in their representation, at least in the formulation of
QDN that we have discussed in this book. Therefore, we have not invested much
effort in constructing a dynamical theory of modules, although that is expected
in a more complete theory.
There is an intriguing relationship here with Feynman diagrams in RQFT.
Consider quantum electrodynamics (QED) and the simplest Feynman diagram
for electron–proton scattering, Figure 25.1. In QED, the particle source and final
state detectors are taken to be at remote past and future times, respectively, and
propagation is in the information void (the vacuum). However, the quantum fields
are interacting in QED. This particular Feynman diagram can be interpreted as
a process where a virtual module (the photon, represented by the wavy line in
Figure 25.1) is created not by the observer, but by the quantum process itself.
This module has many of the characteristics of a beam splitter, discussed in
Chapter 11. Because this module is created via the quantum dynamics in the
information void, it is reasonable to classify the experiment as type T3, as defined
in Chapter 21.
Care has to be taken not to take Feynman diagrams too literally. They rep-
resent individual terms in infinite perturbation expansions or in asymptotic
expansions of complete amplitudes. Those complete amplitudes will satisfy all
the requirements of scattering matrix theory (S-matrix) (Eden et al., 1966), and
they will behave as virtual quantum modules.
from faulty detectors (assuming these have been identified) by postselecting only
those labstates that contain the normal bit states ∼ 0 or ∼1 . We shall confine our
attention to such normal labstates until we deal with applications to quantum
mechanics.
Given this condition, we can restrict our discussion at any given stage Σn to
the physical register R ∼ n
, a subset of the universal register Ωn . If there are rn
working detectors in the laboratory at stage Σn , then R ∼
n consists of 2
rn
normal
states. This is the extended QDN version of the classical registers Rn discussed in
earlier chapters. We may represent these normal states with the extended version
of the computational basis and signal basis representations discussed previously,
For example, a typical normal extended basis labstate is of the form
∼
i n ≡ ∼i 1n ∼i 2n . . . ∼i rnn , (25.20)
There are several interesting processes that can be described by these extended
A in to the rank-
register operators. For example, an application of the operator ···
[r]
0 n ≡ ···
r contextual vacuum ∼ C 1n ···
C 2n . . . ···
C rn ∼
∅ n gives a rank-(r − 1) contextual
vacuum:
···
C 1n ···
A in ··· C 2n . . . ···
C rn ∼
∅ n = ···
C 1n ···
C 2n . . . ···
C i−1 C i+1
n ··· C rn ∼
n . . . ··· ∅n . (25.23)
result Ain 0n = 0. In the extended QDN case, we do not get zero but the equivalent
25.11 The Elitzur–Vaidman Bomb-Tester Experiment 335
[r]
A in on the signal ground state (contextual vacuum) ∼
of it: the action of ··· 0 n of
R
∼
[r]
maps it into the ground state of a different physical register, one of rank
r − 1, a state orthogonal to every state in R
∼
[r]
.
25.10 Quantization
The signal operators and the corresponding projection operators ··· i can
P
P in , ··· n
be used to discuss all the quantum processes covered in this book, and will
be used in the discussions on the bomb-tester and Hardy paradox below. An
additional feature we make use of is that we can now factor into the mathematical
C in and the decommissioning operators
representation the construction operators ···
D in explicitly.
···
In the next two sections we show how the extended QDN formalism describes
experiments where the apparatus changes in one way or another during the
experiment. In particular, the faulty/off-line state plays an important role in
these experiments.
M B
B T
2
One of the reasons this experiment is significant and to some extent baffling is the strange
mix of classical and quantum counterfactuality. Analyzing it in terms of which-path
information seems the best approach, because such information is not controversial and its
extent can be established directly from a stage diagram in general.
336 Dynamical Theory of Observation
3
The choice of the words its and the in this sentence is deliberate. A void state is contextual,
whereas we can assume that there is just one enormous universal register that describes the
totality of possible contextual existence and nonexistence that could ever be imagined,
under all possible circumstances. This latter assumption costs us nothing.
25.11 The Elitzur–Vaidman Bomb-Tester Experiment 337
before stage Σ0 , that is, before the start of each run, so we would normally expect
the apparatus to exist and be in its quiescent, no-signal state,
∼
0 0 ≡ ···
C 10 ···
C 20 ···
C 30 ···
C 40 ···
C 50 ···
C 60 ···
C 70 ∼
∅0 . (25.24)
This deserves some explanation, because sites 10 , 21 , 31 , 42 , and 52 are not real
detectors. So the question arises, what does it mean to “create” them?
The answer is given by Schwinger, quoted in Chapter 24. The modules S,
B 1 , B 2 , M , and T really do “exist” relative to the observer, and it is that
very existence that creates, as it were, the context for virtual detector sites
10 , 21 , 31 , 42 , and 52 to be meaningful. We may refer to this as “Schwinger
counterfactual reality.” It is not metaphysics but actually part and parcel of the
process of running any experiment, as we have commented before. The observer
would have had to place real detectors in those sites in order to calibrate the given
modules, before the actual runs had started (and then taken those calibration
detectors away). Otherwise, there could be no degree of confidence in what the
observer believed about the labstates being studied.
In addition to the creation of real and virtual detector sites, the preparation
device S prepares a photon signal in 10 , so the initial extended labstate Ψ ∼
0 is
given by Ψ A 1 0 0 . This is the initial extended labstate for each of the two
0 ≡ ···
∼ 0∼
networks we now consider.
In the following, it is useful to label the respective transmission coefficient and
reflection coefficient associated with each beam splitter separately, in order to
see how information is flowing through the networks. We work in extended QDN
throughout.
Stage Σ1 to Stage Σ2
In this scenario, the bomb is a dud, so any signal entering module T from port
21 is transmitted directly to 52 . Taking the mirror M not to change any phase,
we then have
1 5 4 )0 2 .
Ψ
∼
2 ≡ ···
U 2,1 Ψ
∼
A 2 + ir1 ···
1 = (t ··· A 2 ∼ (25.26)
Stage Σ2 to Stage Σ3
Using
5 0 2 = (t2 A
U 3,2 ···
A 7 + ir2 A
6 )0 3 , (25.27)
··· 2∼ ··· 3 ··· 3 ∼
A 4 02
U 3,2 ··· = (t 2
A6 + ir 2
A 7 )0 3 , (25.28)
··· 2∼ ··· 3 ··· 3 ∼
338 Dynamical Theory of Observation
The significance of this result is√ that for matched symmetric beam splitters, for
which t1 = r1 = t2 = r2 = 1/ 2, the prediction is that Pr(73 |Ψ ∼
0 ) = 0. The
conclusion, that for symmetrical beam splitters a dud bomb never coincides with
a positive signal in detector 73 , is critical to the point of this experiment.
Stage Σ1 to Stage Σ2
In this scenario, the bomb explodes and decommissions the device T if a signal
enters from 21 . Hence we may write
Ψ ≡ ···
U 2,1 Ψ = (t1 ··· A 4 )0 2 .
D 52 + ir1 ··· (25.30)
∼2 ∼1 2 ∼
Stage Σ2 to Stage Σ3
The debris created by any bomb exploding at stage Σ2 will certainly be trans-
mitted on to stage Σ3 , so we have
···
D 52 ∼
U 3,2 ··· D 53 ∼
0 2 = ··· 03 ,
A 4 0 2 = (t2 A
U 3,2 ··· 6 + ir2 A
7 )0 3 , (25.31)
··· 2∼ ··· 3 ··· 3 ∼
Random Testing
Unfortunately, the observer does not know before each run whether a particular
bomb is active or a dud. Consider a sequence of runs such that there is a (classical)
25.12 The Hardy Paradox Experiment 339
A D
Here Ψ∼n
and Ψ∼n
are the extended labstates for the active and dud scenarios,
respectively, at stage Σn . It could be thought that this must be incorrect at stage
Σ0 , because the observer has no information as to which scenario is in place. But
in fact, that information could be established in principle before stage Σ1 by, for
example, triggering the bomb directly. It would explode in the active case and
A
that would inform the observer that the initial state should have been Ψ ∼n
in the
now aborted run. A similar remark holds for the dud case.
The overall probability of triggering detector 73 is given by
7 ,
P
Pr(73 ) = T r ρ 3 ··· (25.35)
3
···
B R B
B B
Case (1)
In case (1) and in conventional terminology, the annihilation effectively blocks
off one of the entry ports in each of those beam splitters. This is equivalent
to a virtual module in interaction region R, which would otherwise be a true
information void. In this case, the photons produced by the annihilation are
detected in 33 and 43 .
Case (2)
In case (2), the virtual detectors 32 and 42 transmit their amplitudes onto beam
splitters B 3 and B 4 , respectively, as shown in Figure 25.3. Interference occurs in
each of these beam splitters. In the symmetric case, a signal will be detected at
63 and not in 54 , and a signal will be detected in 73 and not in 83 .
Stage Σ0 to Stage Σ1
The electron–positron pair produced by source S is split by magnetic fields ready
to be deflected into beam splitters B 1 and B 2 as shown, giving extended labstate
Ψ 1A
≡ ···
A 2 01 .
∼1 1 ··· 1 ∼
Stage Σ1 to Stage Σ2
1 1 4 )(t2 A
2 + ir2 A
3 )0 2 .
Ψ
∼
2 ≡ ···
U 2,1 Ψ
∼
A 2 + ir1 ···
1 = (t ··· A 2 ··· 2 ··· 2 ∼
(25.37)
Stage Σ2 to Stage Σ3
The only novel factor concerns potential electron–positron annihilation, involving
32 and 42 . We will explore the possibility of partial annihilation, by taking the
evolution for this term to be given by
A4A
U 3,2 ··· 3 0 2 = αD 4 D 3 0 3 + β(t4 A
8 + ir4 A
7 )(t3 A
5 + ir3 A
6 )0 3 , (25.38)
··· 2 ··· 2 ∼ ··· 3 ··· 3 ∼ ··· 3 ··· 3 ··· 3 ··· 3 ∼
where |α|2 + |β|2 = 1. These coefficients reflect the fact that electron–positron
annihilation is a quantum process and does not always occur. Note the intro-
duction of the destruction operators here, signifying that photons have been
produced (these can be regarded as the debris from the annihilation of the virtual
electron–positron detectors at 33 and 43 ).
The relevant information when fed into program MAIN gives the following
correlations for the symmetric beam splitter case:
Pr(33 , 43 |10 ) = 14 |α|2 ,
Pr(53 , 73 |10 ) = Pr(53 , 83 |10 ) = Pr(63 , 83 |10 ) = 16 |1
1
− β|2 , (25.39)
Pr(63 , 73 |10 ) = 16 |3
1 2
+ β| .
When β is set to one, corresponding to no annihilation in region R, the predic-
tion is that the only outcome is that both 63 and 73 show a signal. However,
when annihilation is permitted, α = 0 and the outcomes show characteristics
of interaction-free measurement. This is deduced from conservation of electric
charge. The argument goes as follows. If a signal is detected at, say, 53 , then
annihilation cannot have occurred, because signality is conserved in this experi-
ment; annihilation would give a signal at 33 and at 43 , and so a signal elsewhere
would be ruled out. But a signal in 53 arises only if something is affecting the
signal from 32 , and the only factor that could affect that could be 42 . Therefore,
any signal in 53 is an “interaction-free” detection of the particle at 42 . A similar
remark holds for any signal observed at 83 .
342 Dynamical Theory of Observation
In the Preface to this book, we started with Alice and Bob having very different
views of their observations. Alice used an optical telescope and reported nothing
unusual about a distant galaxy that she could see. Bob, on the other hand,
detected intense radio activity in that galaxy. Our question was: who has the
true view of that galaxy?
We advised the reader that the answer is not Alice. Neither is it Bob. The
answer is not both of them, nor is it neither of them. It is not a trick question
either. So what is the answer that this book would supply?
Our answer will come presently.
While this book has set forth a definite mathematical perspective on empirical
physics, it has contained quite a lot of commentary that might be disparaged as
metaphysics or philosophy. Such commentary is generally frowned on in science,
because it has no empirical content. It is vacuous.
In our defense, we point out the obvious: science is not a robotic activity; it is
carried out by humans and these are driven by their metaphysical, philosophical,
and emotional imperatives. For instance, the hard-core scientific view that quan-
tum theory needs no interpretation but only application is itself a philosophy.
It is no more than a conditioned response, based on opinion and subscription
to current scientific norms. It is, indeed, a philosophy of how to do quantum
mechanics. So we all do it, in one way or another. Our concern in this book is
that the way that we do it should be based soundly on scientific principles. The
quantized detector approach that we describe and use in this book is our attempt
to do just that.
There is an excellent paper on all of this that has the provocative title “Quan-
tum Theory Needs No Interpretation,” by Fuchs and Peres (2000). It seems on
the face of it to dismiss any sort of “interpretation” of quantum mechanics. In
fact, close reading of it confirms (to us at least) the agenda that we have set out
in this book. Yes, quantum mechanics needs no interpretation, when it is being
applied to processes that take place in the information void.
344 Conclusions
But in our view, that is only half the story. We cannot exclude the observer and
what they are doing. The information void is contextual: it is defined by whatever
observer is involved, and by the apparatus that they are using. Schwinger said
just that (see his quote on this in Chapter 24). It has long seemed obvious to us
that the idea expressed by Schwinger really points to an extension of what physics
means. Physics cannot be just the study of systems under observation (SUOs).
The relationship of observers to those SUOs and to the Universe in which both
observers and SUOs are embedded should surely be just as important an issue
in physics as the perceived properties of SUOs.
That naturally leads to the realization that the physics of emergence should
be at least as important in physics as the reductionist agenda is currently. Our
explanation as to why it is not so regarded currently is that it is hard, much
harder than standard reductionist approaches to physics, which absolutely rely
on and utilize emergent concepts, but place them firmly outside the scope of
investigation.
Fortunately, human opinions on what is significant carry no weight in real
science: nature has a tendency to tell us when we are being complacent. Markers
of this are evident throughout quantum mechanics: the randomness of quantum
outcomes, the nonlocality of quantum correlations, the violations of Bell and
Leggett–Garg inequalities, quantum interference, state reduction, the Kochen–
Specker theorem (Kochen and Specker, 1967), entanglement, and more.
That brings us back to Alice and Bob. It was not a trick question: it was not
a proper question in the first place, because it is contextually incomplete.
As given, there was no indication for whom this question could have a truth
value. If you say that it is you, the reader, then our response is: what sort of
“truth” is it that deals with an invented scenario? Alice and Bob don’t exist. They
were invented in the Preface for the purposes of demonstrating a fundamental
point.
It may seem a glib, cheap point, but it is not. It points to the fact that “truth”
does not exist in a vacuum. It requires an observer for whom that “truth” is
empirically meaningful.
Contextual incompleteness is invariably the main ingredient in a number of
theoretical disciplines that purport to “explain” reality, but in fact are as fictional
in their scientific content as our image of Alice and Bob is. That is the essential
point behind the article of Fuchs and Peres, mentioned above. If we could, we
would retitle their paper: quantum theory needs no interpretation, it needs to be
extended to cover the observer. That is what this book is all about.
A final comment about Alice and Bob. Our answer is that each of them is
stating the truth, but only relative to their individual contexts. In that sense
they are both making contextually true statements. But that is not the same as
saying they both have the “true” view of the galaxy they are observing: there is
no “true” view of that galaxy in an absolute sense. Reality is far too deep for
that.
Appendix
Symbol Meaning
× Cartesian product, direct product
⊗ tensor product
Σn stage n
An apparatus at stage Σn
rn number of real or virtual detectors in
An = the rank of An
in the ith real or virtual detector in An
Qin the qubit representing in , i = 1, 2, . . . , rn
Q n ≡ Q n Q n . . . Qn
1 2 rn
the quantum register at stage Σn : a rank-rn
tensor product
In the observer’s information about Qn
Hn ≡ (Qn , In ) Heisenberg net at stage Σn
2 rn
dimension of Qn ≡ dim Qn
Bni the preferred basis for Qin , i = 1, 2, . . . , r
Rn ≡ Bn Bn . . . Bn
rn 1 2 rn
the preferred basis for Qn : a Cartesian product
kn kth element of computational basis Bn ,
k = 0, 1, 2, . . . , 2rn − 1
kn dual of kn
Ψn pure labstate at stage Σn : an element of Qn
Ain ith signal destruction operator at stage Σn
An i
ith signal creation operator at stage Σn
Pin ≡ Ain A i , ith no-signal projection operator
n
i ≡ A
P i Ai ith signal projection operator
n n n
i i i i
Sn ≡ {Pn , Pn , An , An } ith signal set
i
mn ≡ im j n
Tij transition operator
346 Appendix
A.4 Ensembles
The ensemble concept arises in physics for two reasons. First, no experiment
normally validates a generalized proposition in a single run, even in classical
physics, which deals with generalized propositions with a generalized proposition
classification (GPC) of 2. Such propositions are based on the classical assertion
that SUOs have precise qualities that can be quantified by exact measurement.
Regardless of their view of that assertion, experimentalists will know that, in
reality, experimental errors and inaccuracies are always present. To overcome
this, multiple runs of the basic experimental protocol are generally performed and
then averages and other statistical quantities established from the accumulated
data. These multiple runs constitute ensembles of one kind or another.
The second reason for the use of ensembles is that quantum physics asserts
that the outcome of any given run in an experiment is a random variable, so that
statistical analysis based on the Born rule is required as a matter of principle
and policy.
Ensembles come in several varieties, each with its particular spatiotemporal
architecture, and each characterized by generally unstated, implicit context.
Whichever kind of ensemble is chosen depends on several factors usually outside
the experimentalist’s control, such as limited resources and time. The following
are two important kinds of ensemble.
Spatial Ensembles
A spatial ensemble is a physical collection in a given laboratory of multiple,
mutually isolated (from each other) copies of a given SUO, such as atoms in
a crystal, such that a basic run is performed on each copy once. The statistics
for the experiment is then established by collecting the outcome data from each
copy and assuming that outcome frequencies can be attributed to probabilities
associated with the original SUO.
A.4 Ensembles 349
Temporal Ensembles
Some experiments are too costly to perform via spatial ensembles, so the standard
alternative is to use a temporal ensemble. In such an architecture, multiple
runs of the same basic protocol are implemented in temporal succession using
the same apparatus each time. In principle, an ideal temporal ensemble should
be equivalent to an ideal spatial ensemble, but that is an assertion that can
be challenged on the basis of cosmological evidence. It is now believed that the
universe is expanding in an irreversible way, relative to all endophysical observers.
Therefore, the environment around any laboratory is not quite the same during
any given run of an experiment as any other run. Of course, such discrepancies
are minute and could be laughed at as generating a pointless debate, were it not
for one glaring fact: the expansion does have observable effects, namely, the red
shift of light from distant galaxies.
The issue here is the relative scale of times: a comparison of the typical time
τ to complete a given run, the interval T to the next run, and the age A of
the Universe estimated from the observed Hubble constant. When τ and T are
both negligible compared with A, as is almost always the case, then temporal
ensembles should be as good as spatial ensembles.
350 Appendix
It is standard practice to use V to mean (V, F). The ground field F is generally
understood, but it is important to know whether it is R or C. In the former case
we say V is a real vector space, while in the latter case we say V is a complex
vector space. The space of three-vectors used to represent position in physical
space is a real vector space, while the Hilbert space of quantum state vectors is
a complex vector space.
It is easy to show that, ∀λ ∈ F, λ0V = 0V . Likewise, if 0F is the zero element
in F, then 0F a = 0V , ∀a ∈ V . It is important not to confuse the scalar zero 0F
and the vector zero 0V .
In practice, we do not bother to use a different symbol for vector addition, ,
in order to distinguish addition in the field, +. Henceforth, the same symbol, +,
will be used for both.
We often modify the notation involving the additive inverse −b, writing
a + (−b) = a − b, (A.9)
thereby suggesting a new binary process known as subtraction. This is not nec-
essary, but it is useful and should always be interpreted in terms of the addition
of vectors.
What is immensely astounding is that the above theory, which may appear no
more than a mathematician’s game, seems necessary to describe empirically val-
idated quantum physics, with the additional surprise that the field F is required
to be C and not R.
Spanning Sets
Suppose S ≡ {v 1 , v 2 , . . . , v k } is a set of vectors in some vector space V with
ground field F. Let M be the set of all vectors of the form x1 v 1 +x2 v 2 +· · ·+xk v k ,
where x1 , x2 , . . . , xk ∈ F. Then M is a subspace of V , spanned by S. We write
M ≡ [v 1 , v 2 , . . . , v k ] . (A.10)
Linear Independence
An expression of the form x1 v 1 +x2 v 2 +· · ·+xk v k , where v 1 , v 2 , . . . , v k ∈ V and
x1 , x2 , . . . , xk ∈ F is called a linear combination of the vectors v 1 , v 2 , . . . , v k .
352 Appendix
If the xi are not all zero, then it is called a nontrivial linear combination.
Otherwise it is called trivial.
A set of vectors {v 1 , v 2 , . . . , v k } is linearly dependent if there exists a nontrivial
linear combination equal to the zero vector 0V . In other words, {v 1 , v 2 , . . . , v k }
is linearly dependent if the equation
x 1 v 1 + x 2 v 2 + · · · + xk v k = 0 V (A.11)
Theorem A.2 Any vector space that has a finite spanning set contains a basis.
A vector space is finite dimensional if it has a finite basis (i.e., one consisting
of a finite number of vectors). Hence, every vector space spanned by a finite
spanning set is finite dimensional.
v = x 1 e 1 + x 2 e 2 + · · · + xn e n = x i e i (A.13)
All bases of a finite-dimensional vector space have the same number of ele-
ments. The dimension dim V of a finite-dimensional vector space V is the number
of elements of a basis.
Linear Transformations
Let U and V be two vector spaces, not necessarily of the same dimension, over
the same field F. A linear transformation (or linear mapping) T of U into V is
a mapping that assigns to every u ∈ U a unique vector T (u) ∈ V , such that
T (u1 + u2 ) = T (u1 ) + T (u2 ) , ∀u1 , u2 ∈ U,
(A.14)
T (λu) = λT (u) , ∀u ∈ U, ∀λ ∈ F.
T (u) is the image of u under T .
The set of all linear mappings of U into V is denoted L (U, V ) .
A.5 Vector Spaces 353
Given a linear transformation T (U, V ), the set of all vectors u ∈ U such that
T u = 0V (A.15)
If dim ker T is called the nullity of T and dim T (U ) is called the rank of T ,
then the above theorem can be stated as
Linear Functionals
A linear functional f˜ is a linear mapping of a vector space V into its ground field
F, such that
One-Forms
Given any two linear functionals f˜, g̃ ∈ L (V, F), define the linear combination
αf˜ + βg̃, where α,β ∈ F by the rule
αf˜ + βg̃ (v) ≡ αf˜(v) + βg̃ (v) , α, β ∈ F, v ∈ V. (A.19)
With this rule, L (V, F) is itself a vector space, known as the dual vector space
and denoted by V ∗ . Elements of this vector space will be called one-forms.
The convention we shall follow as much as possible is that vectors will be
represented by symbols in bold, such as v, while one-forms will be denoted by
symbols with a tilde, or a bar, such as ω̃ or ω.
The one-form/vector relation is employed in QDN in our representation of
questions and answers, as discussed in Chapter 2.
An important fact is that when V is finite dimensional, then V ∗ has the same
dimension, namely, dim V ∗ = dim V .
354 Appendix
Dual Basis
Suppose V is an n-dimensional vector space with basis B (V ) ≡ {ea : a = 1,
2, . . . , n}. Then an arbitrary vector v ∈ V can be written in the form
n
v= v a ea , (A.20)
a=1
Bracket Notation
Given a vector v ∈ V and one-form ω̃ ∈ V ∗ , the bracket notation ω̃, v ≡ ω̃ (v)
is often used. In quantum mechanics, vectors are often written as kets, such as
|ψ, and dual vectors as bra-vectors, such as φ|, a notation used extensively by
Dirac (Dirac, 1958). Then the “inner product” φ|ψ is called the bracket of |ψ
and φ|.
Quantum mechanics vector spaces use a complex-valued field and the following
rule is imposed: φ|ψ∗ = ψ|φ.
Example A.5 Consider the vector space of all real 2 × 2 matrices M (2, R) and
the vector space of all real 3×3 matrices M (3, R) . Then elements of the Cartesian
product M (2, R) × M (3, R) are given by Kronecker products of matrices. For
example, if A ≡ [Aab ] ∈ M (2, R) and B ≡ [Bij ] ∈ M (3, R), then the Kronecker
product A ⊗ B ≡ [Cai,bj ] is an array with double matrix indices, such that
Unfortunately, direct product vector spaces are not vector spaces themselves,
which can be readily seen by considering linear combinations of arbitrary
elements.
Because we need vector addition to represent superposition in QM, and we find
ourselves dealing with tensor products, we overcome this problem by extending
U × V to a larger space, the tensor product of U and V , denoted by U ⊗ V. This
tensor product space is defined to contain all linear combinations of elements of
the Cartesian product space U × V and satisfies all the axioms of a vector space
A.5 Vector Spaces 355
over the common ground field F. Elements of U ⊗ V are either of the form u ⊗ v
(which is in U × V ) or linear combinations of such direct products, which may
or may not be in U × V .
Elements of U ⊗ V that are of the form u ⊗ v for u ∈ U , v ∈ V are called
separable. Elements of U ⊗ V that are not separable are called entangled.
Entanglement is of great significance in quantum mechanics. The physically
observable properties of entangled quantum states lie at the heart of the problems
with the interpretation of quantum mechanics.
As with ordinary arithmetic, the tensor product operation ⊗ takes precedence
over the vector summation operation + in U ⊗V , so we may leave out the brackets
and just write
Similarly,
0U ⊗ v = u ⊗ 0V = 0U ⊗V . (A.25)
0F (u ⊗ v) = 0U ⊗V . (A.26)
Rank
The rank of a tensor product space is the number of vector spaces in the product.
For example, U ⊗ V is a rank-two tensor product space; U 1 ⊗ U 2 ⊗ · · · ⊗ U n is
of rank n.
Elements of a given tensor product have the rank of that tensor product space.
Hence scalars have rank zero, while vectors and one-forms have rank one.
Given a number of vectors spaces V 1 , V 2 , . . . , V r , then the rank-r tensor
product space V 1 ⊗ V 2 ⊗ · · · ⊗ V r has dimension equal to the product of all
the individual dimensions, that is,
dim V 1 ⊗ V 2 ⊗ · · · ⊗ V r = dim V 1 dim V 2 . . . {dim V r } . (A.27)
Separable Bases
If {ua : a = 1, 2, . . . , dim U } is a basis for U and {v b : b = 1, 2, . . . , dim V } is a
basis for V , then a frequently useful basis for U ⊗ V is given by {ua ⊗ v b : a = 1,
2, . . . , dim U , b = 1, 2, . . . , dim V }. Every element of this basis is a separable
element of the tensor product space U ⊗ V , so we call this a separable basis.
From this we immediately conclude that
Hilbert spaces
Hilbert spaces are finite or infinite dimensional vector spaces with a complete
inner product (Streater and Wightman, 1964).
References
Kraus, K. 1974. Operations and Effects in the Hilbert Space Formulation of Quantum
Theory. Berlin: Springer, pages 206–229.
Kraus, K. 1983. States, Effects, and Operations: Fundamental Notions of Quantum
Theory. Lecture Notes in Physics (190). Berlin: Springer-Verlag.
Kwiat, P. G., Steinberg, A. M., and Chiao, R. Y. 1993. High-Visibility Interference
in a Bell-Inequality Experiment for Energy and Time. Physical Review A, 47(4),
R2472–R2475.
Kwiat, P., Weinfurter, H., Herzog, T., Zeilinger, A., and Kasevich, M. 1994. Exper-
imental Realization of “Interaction-Free” Measurements. In: K. V. Laurikainen,
C. Montonen, and Sunnarborg (eds.), Symposium on the Foundations of Modern
Physics 1994. Editions Frontières.
Larmor, J. J. 1897. On a Dynamical Theory of the Electric and Luminiferous Medium,
Part 3, Relations with Material Media. Phil. Trans. Roy. Soc., 190, 205–300.
Laven, P. A., Taplin, D. W., and Bell, C. P. 1970. Television Reception over Sea Paths:
The Effect of the Tide. BBC Research Department Report, 1–25.
Leech, J. W. 1965. Classical Mechanics. Methuen and Co.
Leggett, A. J., and Garg, A. 1985. Quantum Mechanics versus Macroscopic Realism:
Is the Flux There When Nobody Looks? Phys. Rev. Lett., 54(9), 857–860.
Lehmann, H., Symanzik, K., and Zimmermann, W. 1955. Zur Formulierung Quan-
tisierter Feldtheorien. Il Nuovo Cimento, 1(1), 205–225.
Lewis, G. N. 1926. The Conservation of Photons. Nature, 118, 874–875.
Liang, Y., and Czarnecki, A. 2011. Photon–Photon Scattering: A Tutorial.
arXiv:1111.6126v2 [hep-ph], 1–9.
Lichtenberg, D. B. 1970. Unitary Symmetry and Elementary Particles. Academic
Press.
Lorentz, H. A. 1899. Simplified Theory of Electrical and Optical Phenomena in Moving
Systems. Proc. Acad. Science Amsterdam, I, 427–443.
Ludwig, G. 1983a. Foundations of Quantum Mechanics I. New York: Springer.
Ludwig, G. 1983b. Foundations of Quantum Mechanics II. New York: Springer.
Magueijo, J. 2003. New Varying Speed of Light Theories. Rept. Prog. Phys., 66(11),
2025–2068.
Mardari, G. N. 2005. What Is a Quantum Really Like? In: G. Adenier, A. Y. Khren-
nikov, and T. M. Nieuwenhuizen (eds.), Quantum Theory: Reconsideration of
Foundations–3. AIP Conference Proceedings, vol. 81.
Markopoulou, F. 2000. Quantum Causal Histories. Class. Quant. Grav., 17,
2059–2072.
Mars Climate Orbiter Mishap Investigation Board. 1999. Phase I Report. 1–44.
Martin, J. L. 1959a. The Feynman Principle for a Fermi System. Proc. Roy. Soc., A251,
543–549.
Martin, J. L. 1959b. Generalized Classical Dynamics and the “Classical Analogue” of
a Fermi Oscillator. Proc. Roy. Soc., A251, 536–542.
Merli, P. G., Missiroli, G. F., and Pozzi, G. 1976. On the Statistical Aspect of Electron
Interference Phenomena. Am. J. Phys., 44(3), 306–307.
Meschini, D. 2007. Planck-Scale Physics: Facts and Beliefs. Found. Science, 12(4),
277–294.
Minkowski, H. 1908. Space and Time. A translation of an Address delivered at the 80th
Assembly of German Natural Scientists and Physicians, at Cologne, 21 September,
75–91. Reprinted in H. A. Lorentz, A. Einstein, H. Minkowski, and H. Weyl, The
Principle of Relativity. Dover.
362 References
Misra, B., and Sudarshan, E. C. G. 1977. The Zeno’s Paradox in Quantum Theory.
J. Math. Phys., 18(4), 756–763.
Newton, I. 1687. The Principia (Philosophiae Naturalis Principia Mathematica).
University of California Press, 1999. New translation by I. B. Cohen and Anne
Whitman. University of California Press, 1999.
Newton, I. 1704. Opticks. 1952 ed. Dover Publications.
Newton, I. 2006. Original letter from Isaac Newton to Richard Bentley. The Newton
Project (online).
Nielsen, M. A., and Chuang, I. L. 2000. Quantum Computation and Quantum
Information. Cambridge University Press.
Paris, M. G. A. 2012. The Modern Tools of Quantum Mechanics (A Tutorial on
Quantum States, Measurement, and Operations). Eur. Phys. J. Special Topics,
203, 61–86.
Paul, H. 2004. Introduction to Quantum Optics. Cambridge University Press. Translated
from the German Photonen. Eine Einführung in die Quantenoptik, 2. Auflage
(1999).
Penrose, R. 1971. Angular Momentum: An Approach to Combinatorial Spacetime. In:
T. Bastin (ed.), Quantum Theory and Beyond. Cambridge University Press.
Peres, A. 1995. Quantum Theory: Concepts and Methods. Kluwer Academic.
Petkov, V. 2012. Space and Time: Minkowski’s Papers on Relativity. Minkowski
Institute Press, pages 1–37.
Planck, M. 1900a. On the Theory of the Energy Distribution Law of the Normal
Spectrum. Verhandl. Dtsch. Phys. Ges., 2(17), 237–245.
Planck, M. 1900b. Ueber eine Verbesserung der Wien’schen Spectralgleichung [On an
Improvement of Wein’s Equation for the Spectrum]. Verhandl. Dtsch. Phys. Ges.,
2(13), 202–204.
Poincaré, H. 1890. Sur Le Problème Des Trois Corps et Les Équations de la Dynamique.
Acta. Math., 13, 1–270.
Price, H. 1997. Time’s Arrow. Oxford University Press.
Procopio, L. M., et al. 2016. Experimental Test of Hyper-Complex Quantum Theories.
arXiv:1602.01624v2 [quant-ph].
Regge, T. 1961. General Relativity Without Coordinates. Il Nuovo Cimento, 19(3),
558–571.
Renniger, M. 1953. Zum Wellen-Korpuskel-Dualismus. Zeits. für Physik, 136,
251–261.
Requardt, M. 1999. Space-Time as an Orderparameter Manifold in Random Networks
and the Emergence of Physical Points. gr-qc/99023031, 1–40.
Ridout, D. P., and Sorkin, R. D. 2000. A Classical Sequential Growth Dynamics for
Causal Sets. Phys. Rev., D61, 024002. arXiv: gr-qc/9904062.
Rosa, R. 2012. The Merli–Missiroli–Pozzi Two-Slit Electron-Interference Experiment.
Phys. Perspect., 14, 178–195.
Scarani, V., Tittel, W., Zbinden, H., and Gisin, N. 2000. The Speed of Quantum
Information and the Preferred Frame: Analysis of Experimental Data. Phys. Lett.,
A276, 1–7.
Schrödinger, E. 1926. Quantisierung als Eigenwertproblem (Erste Mitteilung). Ann.
Phys., 384(4), 361–376.
Schrödinger, E. 1935. The Present Situation in Quantum Mechanics. Naturwis-
senschaften, 23(48), 807–812. Original title: Die gegenw artige Situation in der
References 363