(Ebook) Image Statistics in Visual Computing by Tania Pouli, Erik Reinhard, Douglas W. Cunningham ISBN 9781439874905, 9781568817255, 1439874905, 1568817258 Full Chapters Instanly
(Ebook) Image Statistics in Visual Computing by Tania Pouli, Erik Reinhard, Douglas W. Cunningham ISBN 9781439874905, 9781568817255, 1439874905, 1568817258 Full Chapters Instanly
https://ebooknice.com/product/image-statistics-in-visual-
computing-4747968
★★★★★
4.6 out of 5.0 (88 reviews )
DOWNLOAD PDF
ebooknice.com
(Ebook) Image Statistics in Visual Computing by Tania Pouli,
Erik Reinhard, Douglas W. Cunningham ISBN 9781439874905,
9781568817255, 1439874905, 1568817258 Pdf Download
EBOOK
Available Formats
https://ebooknice.com/product/biota-grow-2c-gather-2c-cook-6661374
https://ebooknice.com/product/matematik-5000-kurs-2c-larobok-23848312
https://ebooknice.com/product/sat-ii-success-math-1c-and-2c-2002-peterson-
s-sat-ii-success-1722018
(Ebook) Master SAT II Math 1c and 2c 4th ed (Arco Master the SAT
Subject Test: Math Levels 1 & 2) by Arco ISBN 9780768923049,
0768923042
https://ebooknice.com/product/master-sat-ii-math-1c-and-2c-4th-ed-arco-
master-the-sat-subject-test-math-levels-1-2-2326094
(Ebook) Cambridge IGCSE and O Level History Workbook 2C - Depth
Study: the United States, 1919-41 2nd Edition by Benjamin
Harrison ISBN 9781398375147, 9781398375048, 1398375144,
1398375047
https://ebooknice.com/product/cambridge-igcse-and-o-level-history-
workbook-2c-depth-study-the-united-states-1919-41-2nd-edition-53538044
https://ebooknice.com/product/high-dynamic-range-imaging-acquisition-
display-and-image-based-lighting-2011388
https://ebooknice.com/product/introduction-to-visual-computing-core-
concepts-in-computer-vision-graphics-and-image-processing-6992588
The authors keep the material accessible, providing mathematical definitions where
appropriate to help you understand the transforms that highlight statistical regularities
present in images. The book also describes patterns that arise once the images are
transformed and gives examples of applications that have successfully used statistical
regularities. Numerous references enable you to easily look up more information
about a specific concept or application. A supporting website also offers additional
information, including descriptions of various image databases suitable for statistics.
Tania Pouli
Collecting state-of-the-art, interdisciplinary knowledge in one source, this book Erik Reinhard
explores the relation of natural image statistics to human vision and shows how natural
image statistics can be applied to visual computing. It encourages you to develop novel Douglas W. Cunningham
insights and applications in all disciplines that relate to visual computing.
K13473
Computer Graphics
Image Statistics
in Visual Computing
i i
i i
i i
i i
Image Statistics
in Visual Computing
Tania Pouli
Erik Reinhard
Douglas W. Cunningham
AN A K PETERS BOOK
CRC Press
Taylor & Francis Group
6000 Broken Sound Parkway NW, Suite 300
Boca Raton, FL 33487-2742
This book contains information obtained from authentic and highly regarded sources. Reasonable efforts have been
made to publish reliable data and information, but the author and publisher cannot assume responsibility for the valid-
ity of all materials or the consequences of their use. The authors and publishers have attempted to trace the copyright
holders of all material reproduced in this publication and apologize to copyright holders if permission to publish in this
form has not been obtained. If any copyright material has not been acknowledged please write and let us know so we may
rectify in any future reprint.
Except as permitted under U.S. Copyright Law, no part of this book may be reprinted, reproduced, transmitted, or uti-
lized in any form by any electronic, mechanical, or other means, now known or hereafter invented, including photocopy-
ing, microfilming, and recording, or in any information storage or retrieval system, without written permission from
the publishers.
For permission to photocopy or use material electronically from this work, please access www.copyright.com (http://
www.copyright.com/) or contact the Copyright Clearance Center, Inc. (CCC), 222 Rosewood Drive, Danvers, MA 01923,
978-750-8400. CCC is a not-for-profit organization that provides licenses and registration for a variety of users. For
organizations that have been granted a photocopy license by the CCC, a separate system of payment has been arranged.
Trademark Notice: Product or corporate names may be trademarks or registered trademarks, and are used only for
identification and explanation without intent to infringe.
Visit the Taylor & Francis Web site at
http://www.taylorandfrancis.com
i i
i i
i i
i i
i i
i i
i i
i i
i i
Contents
I Background 1
1 Introduction . . . . . . . . . . . . . . . . . . 3
1.1 Statistics as Priors . . . . . . . . . . . . . . . 6
1.2 Statistics as Image Descriptors . . . . . . . . . . 7
1.3 Statistical Pipeline . . . . . . . . . . . . . . 10
1.4 Natural Images . . . . . . . . . . . . . . . . 11
1.5 Discussion . . . . . . . . . . . . . . . . . 11
2 The Human Visual System. . . . . . . . . . . . . . 15
2.1 Radiometric and Photometric Terms . . . . . . . . . 15
2.2 Human Vision . . . . . . . . . . . . . . . . 17
2.3 The Eyes . . . . . . . . . . . . . . . . . . 18
2.3.1 Optical Media and the Retina . . . . . . . . 19
2.3.2 Photoreceptors . . . . . . . . . . . . . 20
2.3.3 Horizontal and Bipolar Cells . . . . . . . . 25
2.3.4 Amacrine and Ganglion Cells . . . . . . . . 26
2.4 The Lateral Geniculate Nucleus and Cortical Processing . . 27
2.5 Implications of Human Visual Processing . . . . . . . 29
2.5.1 Visual Acuity . . . . . . . . . . . . . 30
2.5.2 Temporal Resolution . . . . . . . . . . . 31
2.5.3 Contrast . . . . . . . . . . . . . . . 31
vii
i i
i i
i i
i i
viii Contents
II Image Statistics 69
4 First-Order Statistics . . . . . . . . . . . . . . . 71
4.1 Histograms and Moments . . . . . . . . . . . . 72
4.1.1 Image Moments and Moment Invariants . . . . . 74
4.1.2 Histogram Adjustments . . . . . . . . . . 75
4.2 Moment Statistics and Average Distributions . . . . . . 77
4.3 Material Properties . . . . . . . . . . . . . . 81
4.4 Nonlinear Compression in Art . . . . . . . . . . . 83
4.5 Dark-Is-Deep Paradigm . . . . . . . . . . . . . 85
4.6 Summary . . . . . . . . . . . . . . . . . 86
5 Gradients, Edges, and Contrast . . . . . . . . . . . . 89
5.1 Real-World Considerations . . . . . . . . . . . . 89
5.1.1 Perceptual Consequences . . . . . . . . . 90
5.1.2 Image Space Consequences . . . . . . . . . 90
i i
i i
i i
i i
Contents ix
5.2 Gradients . . . . . . . . . . . . . . . . . 90
5.2.1 The Forward Difference Method . . . . . . . 91
5.2.2 The Backward Difference Method. . . . . . . 92
5.2.3 The Central Difference Method . . . . . . . 93
5.2.4 The Söbel Operator . . . . . . . . . . . 94
5.2.5 Second Derivative Methods . . . . . . . . . 95
5.2.6 Gradient Magnitude . . . . . . . . . . . 96
5.2.7 Gradient Statistics . . . . . . . . . . . . 97
5.2.8 Single-Image Gradient Statistics . . . . . . . 97
5.3 Edges . . . . . . . . . . . . . . . . . . . 99
5.3.1 Definition of an Edge . . . . . . . . . . . 100
5.3.2 Edge Detection Processes . . . . . . . . . 100
5.3.3 Edge Statistics . . . . . . . . . . . . . 103
5.4 Linear Scale Space . . . . . . . . . . . . . . 104
5.4.1 The N-Jet and Feature Detectors . . . . . . . 108
5.4.2 Implications for Human Perception . . . . . . 109
5.4.3 Scale-Space Statistics . . . . . . . . . . . 110
5.5 Contrast in Images . . . . . . . . . . . . . . 111
5.6 Image Deblurring . . . . . . . . . . . . . . . 114
5.7 Superresolution . . . . . . . . . . . . . . . 116
5.8 Inpainting . . . . . . . . . . . . . . . . . 116
6 Fourier Analysis . . . . . . . . . . . . . . . . . 119
6.1 Autocorrelation . . . . . . . . . . . . . . . 121
6.2 The Fourier Transform . . . . . . . . . . . . . 123
6.3 The Wiener-Khintchine Theorem . . . . . . . . . . 128
6.4 Power Spectra . . . . . . . . . . . . . . . . 128
6.4.1 Slope Computation . . . . . . . . . . . 129
6.4.2 Spectral Slope Analysis . . . . . . . . . . 132
6.4.3 Dynamic Range. . . . . . . . . . . . . 137
6.4.4 Dependence on Image Representation . . . . . 139
6.4.5 Angular Dependence . . . . . . . . . . . 140
6.4.6 Temporal Dependence . . . . . . . . . . 140
6.4.7 1/f Failures . . . . . . . . . . . . . . 141
6.5 Phase Spectra . . . . . . . . . . . . . . . . 142
6.6 Human Perception . . . . . . . . . . . . . . 144
6.7 Fractal Forgeries . . . . . . . . . . . . . . . 145
6.8 Image Processing and Categorization . . . . . . . . 145
6.9 Texture Descriptors . . . . . . . . . . . . . . 146
6.10 Terrain Synthesis . . . . . . . . . . . . . . . 147
6.11 Art Statistics . . . . . . . . . . . . . . . . 148
i i
i i
i i
i i
x Contents
i i
i i
i i
i i
Contents xi
i i
i i
i i
i i
xii Contents
i i
i i
i i
i i
Foreword
To me, the term natural image statistics does not merely refer to a collection
of statistics one can collect from images but signifies a paradigm shift in the
way we reason about images of natural scenery. Instead of modeling them as
a deterministic outcome of a physical process using explicit geometric entities
and illumination laws, we started considering natural images as random variables
characterized by unique statistical regularities.
In fact, these two modeling approaches, integrated in the Bayesian framework,
have proved themselves most effective for a wide range of image restoration tasks.
For example, when removing noise or blur from an image, the signal-to-noise
level becomes very low across the high-frequency band and therefore a large num-
ber of unknowns must be recovered. State-of-the-art denoising and deblurring
methods fill in this missing data based on the sparse distribution that derivative
and wavelet coefficients exhibit in natural images. Similarly, when filling holes
in an image or increasing its resolution, millions of pixels must be determined.
Non-parametric patch-based statistical models show a remarkable ability to re-
cover this seemingly missing data by exploiting various self-similarities found
within and across natural images.
The statistical investigation of natural images has also proved itself valuable
for studying the human visual system, which was exposed and adapted to this
stimulus throughout its evolution. Researchers showed, for example, that im-
age transformations based on some optimal statistical criterion, such as compact
representation, provide important insights over the response of various sensory
neurons.
These practical and theoretical implications make the search for new and more
accurate natural image priors a very active line of research in image understand-
ing, processing, and neuroscience.
The book Image Statistics in Visual Computing offers a delightful coverage
of this fascinating subject. It starts by describing fundamental concepts of human
visual perception, as well as important calibration aspects behind the collection of
natural image databases. The book thoroughly covers key image transformations
xiii
i i
i i
i i
i i
xiv Foreword
Raanan Fattal
School of Computer Science and Engineering
The Hebrew University of Jerusalem
i i
i i
i i
i i
Preface
Natural images exhibit statistical regularities that differentiate them from random
collections of pixels. Moreover, the human visual system appears to have evolved
to exploit such statistical regularities. The field of natural image statistics is tradi-
tionally concerned with trying to understand how human vision operates by iden-
tifying statistical regularities in natural images that may be used in some way.
There are some amazing examples of how the human visual system is beauti-
fully optimized to observe and interpret natural images. One example that put us
on the trail of natural image statistics is described in detail in Chapter 10: Take
a large set of pixels from a collection of natural images and transform them to
cone response space. If these three color channels are then rotated using principal
components analysis, the result is a new color space that is formally decorrelated.
As it happens, the color space created this way strongly resembles a color
opponent space. It is known from many sources that the human visual system
also employs a color opponent space to encode the visual signal prior to sending
it from the eye to the brain. The beauty of this is, therefore, that human vision
actively decorrelates its visual input, provided it is observing natural images. In
practice it appears that the three decorrelated channels are closer to independence,
which is an even stronger statistical claim.
This has subsequently given rise to applications in image processing that treat
the three color channels in color opponent space as independent, simplifying an
otherwise more difficult three-dimensional problem into three one-dimensional
problems. Color transfer algorithms form a good example of a class of algorithm
in which this appoach is successfully explored.
The lesson learned from this example—and the premise for this book—is that
the field of natural image statistics is not only helpful in the study of human vi-
sion but may also create opportunities that can be exploited in visual engineering
disciplines such as computer vision, computer graphics, and image processing.
Thus, as many areas of visual computing are concerned either with producing im-
agery for observation by humans or with processing images to obtain information
xv
i i
i i
i i
i i
xvi Preface
Cover Image
One of the most fascinating aspects of natural image statistics is their ability to de-
construct complex images of natural environments into compact descriptors. The
seemingly endless variety of nature leads to striking—and repeatedly occurring—
regularities. The image used in the cover tries to show exactly this. Starting from
a photograph of a natural scene, the maple leaves are slowly transformed to pro-
gressively simpler shapes, eventually converging to circles arranged according to
the “Dead leaves model,” discussed in Chapter 11.
Acknowledgments
We are grateful for the help offered by many friends and colleagues who have
assisted either directly or indirectly with the production of this book. In particular,
i i
i i
i i
i i
Preface xvii
we would like to thank A K Peters, the Max Planck Institute for Informatics,
and the Brandenburg Technical University for their support. We would also like
to thank Ayan Chakrabarti, Mark Fairchild, Bill Geisler, Jan-Mark Geusebroek,
and Dale Purves for allowing us to use their images. Further, we would like to
thank Karol Myszkowski, Patrick Perez, and Hans-Peter Seidel. We are also very
grateful for the great foreward that Ranaan Fattal has written for us.
Many researchers have made their work available online, and this has proved
to be a real help, allowing us to reproduce their results on our own images.
For instance, for the production of several figures we have adapted Stanford’s
Wavelab package,1 which is a suite of MATLAB tools for experimenting with
wavelets. The Wavelab team and list of contributors consists of David Donoho,
Arian Maleki, Morteza Sharam, Jon Buckheit, Maureen Clerc, Jerome Kalifa,
Stephane Mallat, Thomas Yu, Mark Reynold Duncan, Xiaoming Huo, Ofer Levi,
Shaobing Chen, Iain Johnstone, Jeffrey Scargle, Rainer von Sachs, Thomas Yu,
Jeffrey Scargle, and Eric Kolaczyk. We gratefully acknowledge their efforts.
Stéphane Mallat’s book A Wavelet Tour of Signal Processing2 is accompanied
on the internet by a large set of numerical experiments that can be used in the
classroom.3 These are designed and maintained by Gabriel Peyré. We thank him
for providing us with the answers to these excercises, significantly simplifying
our experiments and the production of further figures for our book. The “Dead
leaves” figures in Chapter 11 (and indeed part of the front cover) were made using
Gabriel’s MATLAB “Toolbox Image.”
We would also like to thank Ivan Selesnick, Shihua Cai, Keyong Li, Levent
Sendur, and A. Farras Abdelnour at the Department of Electrical and Computer
Engineering of Brooklyn Poly for making available on the internet a collection of
MATLAB routines to experiment with several wavelet transforms.4 In particular,
we have used their dual-tree complex wavelet transform implementation for ex-
perimenting with correlations across scale of the magnitude of complex wavelets.
Some of the ICA related figures in Chapter 7 were created by modifying a
version of the FastICA code that is kindly made available by Aapo Hyvärinen,
Jarmo Hurri, and Patrik Hoyer as part of their book on natural image statistics,5 a
book that served as an inspiration for ours, albeit with a different focus.
The texture synthesis examples were generated with the code kindly made
available by Javier Portilla and Eero Simoncelli.6 This code relies on the multi-
scale image processing toolbox made available by Eero Simoncelli.7
1 www-stat.standord.edu/~wavelab
2 www.ceremade.dauphine.fr/~peyre/wavelet-tour/
3 www.ceremade.dauphine.fr/~peyre/numerical-tour/
4 eeweb.poly.edu/iselesni/WaveletSoftware/index.html
5 www.naturalimagestatistics.net
6 www.cns.nyu.edu/lcv/texture
7 www.cns.nyu.edu/~eero/software.php
i i
i i
i i
i i
xviii Preface
Chapter 9 (“Markov random fields”) contains some figures that were gener-
ated with code kindly made publicly available by Uwe Schmidt, Qi Gao, and
Stefan Roth.8
We would like to also thank Christian Winger, a student at the Brandenburg
Technical University in Cottbus for allowing us to use his implementation of the
inpainting algorithm by Crimini et al. [134] for the figure in Chapter 5. This
method was implemented and extended as part of Christian’s bachelor’s thesis.
Our gratitude also goes to all the great people at A K Peters and CRC Press
for their help and guidance throughout the publishing process. Our editors, Sarah
Chow and Charlotte Byrnes, have been a pleasure to work with, and their ideas
and corrections have helped make the book better. Finally, we’d like to especially
thank Alice and Klaus Peters as well for signing us up in the first place; we hope
they’re enjoying their well-earned retirement.
8 www.gris.tu-darmstadt.de/research/visinf/software/index.en.htm
i i
i i
i i
i i
Part I
Background
i i
i i
i i
i i
i i
i i
i i
i i
Chapter 1
Introduction
For many reasons, including basic survival, we continuously sense the environ-
ment around us. Light is perhaps the premier carrier of information about our
environment. The signal it conveys is also the most complex to observe and in-
terpret. It should therefore not come as a surprise that a significant portion of the
human brain is dedicated to processing the light reaching our eyes (although there
appear to be significant differences between individuals in terms of how much of
the brain is devoted to the processing of visual stimuli [17]). Images—both still
and moving—contain a wealth of information: a quick look through the nearest
window is enough to tell us what the weather is like, whether we are in the coun-
tryside or the city, and what time of the day it might be. Movement in a scene
might tell us of an approaching predator or that the bus is approaching, while
color and texture may inform us about the state of ripeness of fruit or whether our
loaf of bread has gone off again.
Considering that human vision takes as input at any point in time an image
of the world through each eye—effectively two projections of the world onto the
retina—it is remarkable that we are able to infer so much of the world around us.
These two streams of flat images are sufficient to allow us to navigate complex en-
vironments, meaningfully respond to events, and recognize individuals. Anyone
who has tried to implement an algorithm to reconstruct a three-dimensional (3D)
environment from a pair of two-dimensional (2D) images knows how impossibly
difficult this is. Yet, human vision appears to accomplish this effortlessly.
Thus, human vision has the task of inferring at least some aspects of the 3D
configuration of our world, given 2D image inputs. This is known to be an un-
derconstrained problem: in essence, human vision has to invent one dimension
which has gone missing as a result of the projection of light on the retina. It
has been repeatedly shown that human vision employs many heuristics, some of
which are known. As an example, human vision generally expects a single light
source [598] that comes from above [407]. Face recognition, for instance, be-
comes significantly impaired if a human face is illuminated from below.
i i
i i
i i
i i
4 1. Introduction
There is a long history of research on the processes taking place within the
human (and animal) visual system that enable us to perceive and interpret im-
ages. Vision scientists typically study such processes through carefully con-
structed psychophysical experiments that measure overt behavior to systematic
changes in simple light patterns [498, 554, 138] or by directly measuring electro-
physiological responses while the retina is stimulated with a simple light pattern
[354].1 In the latter case, the light pattern that maximally stimulates a given cell
is known as that cell’s preferred stimulus, and it carries considerable information
about the structure and size of the cell’s receptive field. Alternatively, one could
measure behavior or cell responses while the retina is stimulated with natural im-
ages, since that is what the retina usually processes [445, 616, 151, 30, 764].
Naturally, the ability of the visual system to analyze information is very much
related to the information that is there to be processed, which just happens to
be images. The field of natural image statistics has focused on exactly this rela-
tionship. Initially the field focused on describing images such as television sig-
nals [168, 424], but it has since expanded into explicitly examining the many
links between the structure of a wide variety of image types on the one side
and the workings of the visual system on the other. In general, natural image
statistics research collects a set of natural images, transforms them according to
a statistical optimization criterion, and then evaluates whether the result in some
sense forms a reasonable description of the response properties of some neurons
[645, 234, 26, 546, 691].
The argumentation is roughly as follows. The retina has a large number of
discrete photoreceptors, so that the projection of light is spatially discretized.
Equivalently, photographs contain a large number of discrete pixels.2 The val-
ues that these elements can take is not fully random. This can, for instance, be
seen in Figure 1.1, where at the top a natural image is displayed. Below, we have
shuffled the order of the pixels. All pixels in the top image are also present in the
bottom images. However, the bottom images do not carry meaningful informa-
tion. Moreover, human vision is not well equipped to discern differences between
images such as these.
As the images at the bottom of Figure 1.1 are effectively recombinations of the
pixels of the image at the top, one may ask how many combinations of pixels there
exist for every random image [637]. Assuming that we have a small-ish image of 4
Mpix = 4 × 106 pixels, then the number of ways these pixels could be recombined
would be (4 × 106 )!, which for practical purposes approaches infinity. This means
that the number of natural images that we might encounter in real life is infinitely
smaller than then number of possible combinations in which the photoreceptors
in the retina could be stimulated [395, 215, 154, 639].
1 See hubel.med.harvard.edu.
2 We are assuming for this discussion, as well as for the remainder of the book, that images are
digitally stored and represented, and that they are therefore spatially discretized in a pixel grid.
i i
i i
1. Introduction 5
Original image
i i
6 1. Introduction
ate them from random images? Furthermore, in what ways does the human visual
system expect “naturalness”? In this book we look at these and similar questions,
which are core to this exciting area of research called natural image statistics.
Beyond helping to understand how the human visual system works, the regu-
larities in natural images are very valuable to disciplines that rely on images, such
as computer graphics, computer vision, or image processing. Similar in spirit to
the processing blocks within the visual system, computational models can take
advantage of the regularities in natural images to produce more plausible results
or to infer information that may otherwise be lost.
If, for instance, the aim of a particular technique is to reconstruct—or in-
paint—areas of an image that may be hidden, statistical information from nearby
regions can be used to create a patch that looks similar, yet not identical, to its
neighbors [568] (see Section 5.8). In another scenario, statistical regularities
computed over a collection of images can serve as priors, guiding (optimization)
algorithms [211, 453, 667]. The distribution of edge magnitudes in natural im-
ages, for example, can aid algorithms aiming to deblur images (see Section 5.6).
In effect, this produces an expectation of how edges are typically distributed in
images, helping the deblurring algorithm in question produce a solution that con-
forms to that expectation. Likewise, an analysis of art has shown that paintings
often have the same statistics as real-world images, and thus a number of exist-
ing techniques can be used to analyze, recover, or otherwise alter paintings (for
reviews, see [279, 602, 709]).
Although the distinction is not always as clear cut, images can be analyzed
in ensembles or individually. In the first case, the statistical analysis can provide
insights about regularities and properties of general scene types and situations,
which can in turn be applied when manipulating further images of that type. In
the latter case, the information acquired by looking at an individual image is only
useful for dealing with that particular image. However, this approach has shown
to provide powerful statistics internal to the image itself [829]. Statistics are of
course also commonly used in a third way; namely, for analyzing the results of
studies or experiments often performed for assessing imaging algorithms [138],
although this is somewhat outside the scope of this book. The following sec-
tions will briefly introduce the possible applications of statistics in the context of
imaging disciplines.
i i
i i
i i
i i
Sections 5.6–5.8). The most direct way in which this may happen is in optimization-
type algorithms.
An optimization algorithm requires an objective function—the function that is
minimized—which encodes a set of opposing characteristics that the result ought
to have. The algorithm typically begins by guessing an initial solution. The qual-
ity of the solution is then measured by inserting it into the objective function.
Then iteratively a new candidate solution is guessed and compared against the
previous best solution. There are several more or less standard choices for op-
timization processes, defining how the next guess is selected, but the main chal-
lenge is in the design of an appropriate objective function.
Objective functions usually consist of multiple terms. The first term is usually
a norm that compares the current guess against some desired goal. The closer the
image is to the goal, the smaller the number that this term returns. As this term
will admit many solutions that are mathematically equally good, extra conditions
are normally added to steer the optimization into a direction that is good according
to some other, higher-level criteria.
Thus, such additional terms are designed to penalize wrong solutions and pro-
mote more appropriate ones respectively. They act as priors in the sense that they
encode expectations regarding the mathematical shape of the solution. Often such
priors take the form of smoothness terms. When the goal of the optimization al-
gorithm is to filter or generate an image, natural image statistics have proven to
be a good source of priors, effectively allowing some statistical form of natural-
ness to be encoded. An example is the deblurring techniques mentioned earlier:
the average distribution of gradients in images can serve as a prior to suppress
spurious edges that might arise from the deblurring process [211, 667].
In the particular case of image gradients, they are found to often have small
values and only sometimes have large values, as shown in Figure 1.2. In other
words, the distribution of gradients is heavy-tailed rather than Gaussian. Note
that although here we show the results computed on a single image, this trend
holds in general for natural images. Thus, if we were to compute the gradient
distribution over a large set of natural images, we would find very similar results,
as we will see in Chapter 5. This means that it is possible to use this distribution as
a general prior in optimization problems, whenever we wish our resulting image
to have a gradient distribution that is in some sense “natural.” Accounting for the
non-Gaussian distribution of gradients has helped better solve several problems,
including image deblurring [211], image denoising [631, 686], superresolution,
and demosaicing [726].
i i
i i
8 1. Introduction
7 6
Bin count [5
ï ï ï ï
Gradient Gradient
Figure 1.2. Image gradients are heavy-tailed, as shown in this example. The
image at the top was first converted to log space, and then the horizontal gradients
were computed. They are plotted in a linear plot (bottom left) and for visualization
in a log-linear plot (bottom right). Note that most gradients are small, leading to a
heavy-tailed distribution. (Monument Valley, USA, 2012)
graph, for instance, clues as to how the new object would look if it were actually
placed in the real scene can be obtained by analyzing the image itself. This can,
for instance, provide information regarding color and illumination (mis-)matches.
Conversely, if we wish to remove an object from an image, it is possible to
analyze regions that neighbor the pixels to be removed. Here, statistics may help
to infer what would be the most likely scene that lies behind the removed object.
In this case, no general statistical assumptions are made about the input image,
i.e., it is not desirable to try and infer a specific background based on ensemble
statistics.
Instead, statistical tools are used to analyze the scene and purely based on
those, the missing parts can be reconstructed. For instance, in the image shown in
Figure 1.3, we may want to remove the camel without making it obvious that the
resulting image was manipulated in any way. Areas surrounding the camel con-
tain pixels similar to what would likely be behind it, and as such they can be used
to estimate the missing information. Since only a single image is available, an ac-
curate reconstruction would not be possible, but relying on statistical similarities
1.2. Statistics as Image Descriptors 9
Figure 1.3. Inpainting example. Note that this is only an illustration, created
semi-automatically with Photoshop. See Section 5.8 for statistics-based inpaint-
ing. (Giza, Egypt, 2007)
Figure 1.4. An example of a new texture (right) that was synthesized based on the
example texture given on the left. The texture on the right was tiled nine times.
i i
10 1. Introduction
of that image to that effect [69]. The statistical assumption made in general about
scenes is that the average reflectance of all materials is achromatic (the gray-world
assumption). This means that if we illuminate a scene with a colored light source,
the light reflected off objects (and subsequently captured in an image) would be
white only if the color of the illuminant were white. If the average color of an
image therefore deviates from gray, that difference is an indication of the color of
the light source.
In this case, the expectation that images will average to middle gray serves as
a global prior, albeit in a simple sense. On the other hand, the specific statistics
of each image, namely, the average of all its pixel values, describes that partic-
ular image and how it differs from the expected value. This technique for white
balancing, as well as variants thereof, are discussed further in Chapter 10.
i i
i i
i i
i i
their nature are rich in data. Second, the statistical analysis can aid in deriving
models for the data.
1.5 Discussion
We have found that most transforms that could be applied to pixel data and which
have some bearing on human vision all more or less point into the same direction,
which is that of sparseness. Statistical distributions of transformed data tend to
have high kurtosis, which is another way of saying that the data tend to have most
i i
i i
Other documents randomly have
different content
Welcome to our website – the ideal destination for book lovers and
knowledge seekers. With a mission to inspire endlessly, we offer a
vast collection of books, ranging from classic literary works to
specialized publications, self-development books, and children's
literature. Each book is a new journey of discovery, expanding
knowledge and enriching the soul of the reade
Our website is not just a platform for buying books, but a bridge
connecting readers to the timeless values of culture and wisdom. With
an elegant, user-friendly interface and an intelligent search system,
we are committed to providing a quick and convenient shopping
experience. Additionally, our special promotions and home delivery
services ensure that you save time and fully enjoy the joy of reading.
ebooknice.com