6.4 Best Approximation; Least Squares
Given a linear system Ax = b of m equations in n unknowns, find a vector x that minimizes ‖b − Ax‖ with respect to the
Euclidean inner product on Rᵐ. We call such an x a least squares solution of the system, we call b − Ax the least squares
error vector, and we call ‖b − Ax‖ the least squares error.
The term “least squares solution” results from the fact that minimizing ‖b − Ax‖ also minimizes
‖b − Ax‖² = e₁² + e₂² + ⋯ + eₘ²
where e = (e₁, e₂, …, eₘ) = b − Ax is the least squares error vector.
Best Approximation
Suppose that b is a fixed vector in Rⁿ that we would like to approximate by a vector w that is required to lie in some subspace W
of Rⁿ. Unless b happens to be in W, any such approximation will result in an “error vector” b − w that cannot be made equal
to 0 no matter how w is chosen (Figure 6.4.1a). However, by choosing w = proj_W b, we can make the length ‖b − w‖ of the
error vector as small as possible (Figure 6.4.1b).
Figure 6.4.1
These geometric ideas suggest the following general theorem.
THEOREM 6.4.1 Best Approximation Theorem
If W is a finite-dimensional subspace of an inner product space V, and if b is a vector in V, then proj_W b is the best
approximation to b from W in the sense that
‖b − proj_W b‖ < ‖b − w‖
for every vector w in W that is different from proj_W b.
Proof For every vector w in W we can write
b − w = (b − proj_W b) + (proj_W b − w)    (1)
But proj_W b − w, being a difference of vectors in W, is itself in W; and since b − proj_W b is orthogonal to W, the two terms on the
right side of 1 are orthogonal. Thus, it follows from the Theorem of Pythagoras (Theorem 6.2.3) that
‖b − w‖² = ‖b − proj_W b‖² + ‖proj_W b − w‖²
Since w ≠ proj_W b, it follows that the second term in this sum is positive, and hence that
‖b − proj_W b‖² < ‖b − w‖²    (2)
Since norms are nonnegative, this implies that ‖b − proj_W b‖ < ‖b − w‖, which completes the proof.

Least Squares Solutions of Linear Systems
It follows from Theorem 6.4.1 that if A is an m × n matrix and W is the column space of A, then ‖b − Ax‖ is as small as
possible when Ax is the orthogonal projection of b on W; that is, the least squares solutions of Ax = b are the exact solutions of
Ax = proj_W b    (3)
One way to find a least squares solution would be to compute proj_W b and then solve 3, but there is a better approach. If x
satisfies 3, then b − Ax = b − proj_W b. Since b − proj_W b is the component of b that is orthogonal to the column space of A, it
follows from Theorem 4.8.9b that this vector lies in the null space of Aᵀ, and hence that
Aᵀ(b − Ax) = 0
Thus, 3 simplifies to the equation
AᵀAx = Aᵀb    (4)
This is called the normal equation or the normal system associated with Ax = b. When viewed as a linear system, the individual
equations are called the normal equations associated with Ax = b.
THEOREM 6.4.2
For every linear system Ax = b, the associated normal system
AᵀAx = Aᵀb    (5)
is consistent, and all solutions of 5 are least squares solutions of Ax = b. Moreover, if W is the column space of A, and x is
any least squares solution of Ax = b, then the orthogonal projection of b on W is
proj_W b = Ax    (6)
Solution
(a) It will be convenient to express the system in the matrix form , where
It follows that
Find the orthogonal projection of the vector on the subspace of spanned by the vectors
Solution We could solve this problem by first using the Gram–Schmidt process to convert the given spanning vectors into an
orthonormal basis for W and then applying the method used in Example 6 of Section 6.3. However, the following method
is more efficient.
Thus, if u is expressed as a column vector, and if A is the matrix having the given spanning vectors as its columns, then we can
find the orthogonal projection of u on W by finding a least squares solution of the system Ax = u and then calculating
proj_W u = Ax from the least squares solution. The computations are as follows: solve the normal system AᵀAx = Aᵀu for x,
and then compute proj_W u = Ax.
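For readers who want to experiment numerically, here is a minimal Python sketch of the procedure just described. The vectors below are placeholders (the example's actual data is not reproduced above); the point is only to show the normal-equation computation of proj_W u.

```python
import numpy as np

# Hypothetical spanning vectors for W and a vector u to project (placeholders,
# not the data from the worked example above).
v1 = np.array([1.0, 0.0, 1.0, 0.0])
v2 = np.array([0.0, 1.0, 1.0, 1.0])
v3 = np.array([1.0, 1.0, 0.0, 2.0])
u  = np.array([2.0, 1.0, 3.0, 4.0])

A = np.column_stack([v1, v2, v3])       # columns of A span W
x = np.linalg.solve(A.T @ A, A.T @ u)   # least squares solution of A x = u
proj_W_u = A @ x                        # orthogonal projection of u on W

print(proj_W_u)
print(A.T @ (u - proj_W_u))             # error vector is orthogonal to col(A): ~ zero
```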
THEOREM 6.4.3
If A is an m × n matrix, then the following are equivalent.
(a) A has linearly independent column vectors.
(b) AᵀA is invertible.
Proof We will prove that (a) ⇒ (b) and leave the proof that (b) ⇒ (a) as an exercise.
Assume that A has linearly independent column vectors. The matrix AᵀA has size n × n, so we can prove that this
matrix is invertible by showing that the linear system AᵀAx = 0 has only the trivial solution. But if x is any solution of this
system, then Ax is in the null space of Aᵀ and also in the column space of A. By Theorem 4.8.9b these spaces are orthogonal
complements, so part (b) of Theorem 6.2.4 implies that Ax = 0. But A is assumed to have linearly independent column vectors, so
x = 0 by Theorem 1.3.1.
The next theorem, which follows directly from Theorem 6.4.2 and Theorem 6.4.3, gives an explicit formula for the least squares
solution of a linear system in which the coefficient matrix has linearly independent column vectors.
THEOREM 6.4.4
If A is an m × n matrix with linearly independent column vectors, then for every m × 1 matrix b, the linear system Ax = b
has a unique least squares solution. This solution is given by
x = (AᵀA)⁻¹Aᵀb    (7)
Moreover, if W is the column space of A, then the orthogonal projection of b on W is
proj_W b = Ax = A(AᵀA)⁻¹Aᵀb    (8)
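Formulas 7 and 8 are straightforward to verify numerically. The following sketch (with an arbitrary small system, not one of the book's examples) computes the unique least squares solution two ways and checks that the error vector is orthogonal to the column space:

```python
import numpy as np

# Arbitrary 3x2 system A x = b with linearly independent columns (illustrative data).
A = np.array([[2.0, 0.0],
              [1.0, 1.0],
              [0.0, 2.0]])
b = np.array([1.0, 2.0, 3.0])

x = np.linalg.solve(A.T @ A, A.T @ b)    # Formula 7: x = (A^T A)^(-1) A^T b
proj_W_b = A @ x                         # Formula 8: proj_W b = A (A^T A)^(-1) A^T b

x_check, *_ = np.linalg.lstsq(A, b, rcond=None)   # library least squares solver
print(np.allclose(x, x_check))           # True
print(A.T @ (b - proj_W_b))              # ~ zero: b - Ax is orthogonal to col(A)
```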
OPTIONAL
The Role of QR-Decomposition in Least Squares Problems
Formulas 7 and 8 have theoretical use but are not well suited for numerical computation. In practice, least squares solutions of
Ax = b are typically found by using some variation of Gaussian elimination to solve the normal equations or by using
QR-decomposition and the following theorem.
THEOREM 6.4.5
If A is an m × n matrix with linearly independent column vectors, and if A = QR is a QR-decomposition of A (see Theorem
6.3.7), then for each b in Rᵐ the system Ax = b has a unique least squares solution given by
x = R⁻¹Qᵀb    (9)
A proof of this theorem and a discussion of its use can be found in many books on numerical methods of linear algebra. However,
you can obtain Formula 9 by making the substitution A = QR in 7 and using the fact that QᵀQ = I to obtain
x = ((QR)ᵀQR)⁻¹(QR)ᵀb = (RᵀQᵀQR)⁻¹RᵀQᵀb = (RᵀR)⁻¹RᵀQᵀb = R⁻¹(Rᵀ)⁻¹RᵀQᵀb = R⁻¹Qᵀb
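As a numerical illustration of Theorem 6.4.5 (with arbitrary data, not an example from the text), the sketch below computes the least squares solution from a QR-decomposition and checks it against the normal-equation formula. Note that numpy's reduced QR-decomposition may return an R with negative diagonal entries, which differs from the positive-diagonal convention of Theorem 6.3.7, but the computed least squares solution is the same.

```python
import numpy as np

A = np.array([[1.0, 2.0],
              [1.0, 0.0],
              [1.0, 1.0],
              [1.0, 3.0]])                 # linearly independent columns (arbitrary data)
b = np.array([1.0, 0.0, 2.0, 5.0])

Q, R = np.linalg.qr(A)                     # reduced QR-decomposition, A = QR
x_qr = np.linalg.solve(R, Q.T @ b)         # Formula 9: x = R^(-1) Q^T b
x_ne = np.linalg.solve(A.T @ A, A.T @ b)   # Formula 7 for comparison

print(np.allclose(x_qr, x_ne))             # True: both give the least squares solution
```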
DEFINITION 1
If W is a subspace of Rᵐ, then the linear transformation P: Rᵐ → Rᵐ that maps each vector x in Rᵐ into its orthogonal
projection proj_W x in W is called the orthogonal projection of Rᵐ on W.
It follows from Formula 7 that the standard matrix for the orthogonal projection P is
[P] = A(AᵀA)⁻¹Aᵀ    (10)
where A is any matrix whose column vectors form a basis for W.
In Section 4.9 we showed that
[P] = | cos²θ        sin θ cos θ |
      | sin θ cos θ  sin²θ       |
is the standard matrix for the orthogonal projection on the line W through the origin of R² that makes an angle θ with
the positive x-axis. Derive this result using Formula 10.
Solution The column vectors of A can be formed from any basis for W. Since W is one-dimensional, we can take the unit vector
w = (cos θ, sin θ) as the basis vector (Figure 6.4.2), so
A = | cos θ |
    | sin θ |
We leave it for you to show that AᵀA is the 1 × 1 identity matrix. Thus, Formula 10 simplifies to
[P] = AAᵀ = | cos²θ        sin θ cos θ |
            | sin θ cos θ  sin²θ       |
Figure 6.4.2
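The derivation above is easy to check numerically. A small sketch (θ = π/6 is an arbitrary test value) builds [P] from Formula 10 and compares it with the entries cos²θ, sin θ cos θ, sin²θ:

```python
import numpy as np

theta = np.pi / 6                             # arbitrary test angle
A = np.array([[np.cos(theta)],
              [np.sin(theta)]])               # basis vector of W as the single column of A

P = A @ np.linalg.inv(A.T @ A) @ A.T          # Formula 10 (here A^T A is the 1x1 identity)

expected = np.array([[np.cos(theta)**2,              np.sin(theta) * np.cos(theta)],
                     [np.sin(theta) * np.cos(theta), np.sin(theta)**2]])
print(np.allclose(P, expected))               # True
```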
For any vector x in Rⁿ and any vector b in Rᵐ we can write
x = proj_row(A) x + proj_null(A) x   and   b = proj_null(Aᵀ) b + proj_col(A) b
where proj_row(A) x and proj_null(A) x are the orthogonal projections of x on the row space of A and the null space of A, and the vectors
proj_null(Aᵀ) b and proj_col(A) b are the orthogonal projections of b on the null space of Aᵀ and the column space of A.
In Figure 6.4.3 we have represented the fundamental spaces of A by perpendicular lines in and on which we indicated the
orthogonal projections of x and b. (This, of course, is only pictorial since the fundamental spaces need not be one-dimensional.)
The figure shows proj_col(A) b as a point in the column space of A and conveys that proj_col(A) b is the point in col(A) that is closest to b. This
illustrates that the least squares solutions of Ax = b are the exact solutions of the equation Ax = proj_col(A) b.
Figure 6.4.3
The proof of part (u) follows from part (h) of this theorem and Theorem 6.4.3 applied to square matrices.
OPTIONAL
We now have all the ingredients needed to prove Theorem 6.3.3 in the special case where V is the vector space Rⁿ.
Proof of Theorem 6.3.3 We will leave the case where W = {0} as an exercise, so assume that W ≠ {0}. Choose any basis for W,
say one consisting of k vectors, and form the n × k matrix M that has these basis vectors as successive columns. This makes
W the column space of M and hence, by Theorem 4.8.9b, makes W⊥ the null space of Mᵀ. We will complete the proof by showing that every vector u in
Rⁿ can be written in exactly one way as
u = w₁ + w₂
where w₁ is in the column space of M and w₂ is in W⊥. However, to say that w₁ is in the column space of M is equivalent to saying
w₁ = Mx
for some vector x in Rᵏ, and to say that w₂ = u − Mx is in W⊥ is equivalent to saying that Mᵀ(u − Mx) = 0. Thus, if we can
show that the equation
Mᵀ(u − Mx) = 0    (11)
has a unique solution for x, then w₁ = Mx and w₂ = u − Mx will be uniquely determined vectors with the required properties. To
do this, let us rewrite 11 as
MᵀMx = Mᵀu
Since the matrix M has linearly independent column vectors, the matrix MᵀM is invertible by Theorem 6.4.6 and hence the
equation MᵀMx = Mᵀu has a unique solution, as required to complete the proof.
Concept Review
• Least squares problem
• Least squares solution
• Least squares error vector
• Least squares error
• Best approximation
• Normal equation
• Orthogonal projection
Skills
• Find the least squares solution of a linear system.
• Find the error and error vector associated with a least squares solution to a linear system.
• Use the techniques developed in this section to compute orthogonal projections.
• Find the standard matrix of an orthogonal projection.
Answer:
(a)
(b)
In Exercises 2–4, find the least squares solution of the linear equation Ax = b.
2. (a)
;
(b)
;
3. (a)
(b)
Answer:
(a)
(b)
4. (a)
(b)
In Exercises 5–6, find the least squares error vector b − Ax resulting from the least squares solution x and verify that it is
orthogonal to the column space of A.
Answer:
(a)
(b)
7. Find all least squares solutions of Ax = b, and confirm that all of the solutions have the same error vector. Compute the least
squares error.
(a)
;
(b)
;
(c)
;
Answer:
8. Find the orthogonal projection of u on the subspace of spanned by the vectors and .
(a)
(b)
9. Find the orthogonal projection of u on the subspace of spanned by the vectors , , and .
(a) ; , ,
(b) ; , ,
Answer:
(a) (7, 2, 9, 5)
(b)
10. Find the orthogonal projection of on the solution space of the homogeneous linear system
11. In each part, find AᵀA, and apply Theorem 6.4.3 to determine whether A has linearly independent column vectors.
(a)
(b)
Answer:
12. Use Formula 10 and the method of Example 3 to find the standard matrix for the orthogonal projection onto
(a) the x-axis.
(b) the y-axis.
[Note: Compare your results to Table 3 of Section 4.9.]
13. Use Formula 10 and the method of Example 3 to find the standard matrix for the orthogonal projection onto
(a) the xz-plane.
(b) the yz-plane.
[Note: Compare your results to Table 4 of Section 4.9.]
Answer:
(a)
(b)
14. Show that if w = (a, b) is a nonzero vector, then the standard matrix for the orthogonal projection of R² on the line
W = span{w} is
[P] = 1/(a² + b²) | a²  ab |
                  | ab  b² |
Answer:
(a)
(b)
(c)
(d)
Let P be a point on l, and let Q be a point on m. Find the values of t and s that minimize the distance between the lines by
minimizing the squared distance ‖P − Q‖².
Answer:
18. Prove: If A has linearly independent column vectors, and if Ax = b is consistent, then the least squares solution of Ax = b and
the exact solution of Ax = b are the same.
19. Prove: If A has linearly independent column vectors, and if b is orthogonal to the column space of A, then the least squares
solution of Ax = b is x = 0.
20. Let P: Rⁿ → Rⁿ be the orthogonal projection of Rⁿ onto a subspace W, and let [P] be its standard matrix.
(a) Prove that [P][P] = [P].
(b) What does the result in part (a) imply about the composition P ∘ P?
(c) Show that [P] is symmetric.
21. Let A be an m × n matrix with linearly independent row vectors. Find a standard matrix for the orthogonal projection of
Rⁿ onto the row space of A. [Hint: Start with Formula 10.]
Answer:
True-False Exercises
In parts (a)–(h) determine whether the statement is true or false, and justify your answer.
Answer:
True
(b) If AᵀA is invertible, then A is invertible.
Answer:
False
(c) If A is invertible, then AᵀA is invertible.
Answer:
True
(d) If Ax = b is a consistent linear system, then AᵀAx = Aᵀb is also consistent.
Answer:
True
(e) If Ax = b is an inconsistent linear system, then AᵀAx = Aᵀb is also inconsistent.
Answer:
False
(f) Every linear system has a least squares solution.
Answer:
True
(g) Every linear system has a unique least squares solution.
Answer:
False
(h) If A is an m × n matrix with linearly independent columns and b is in Rᵐ, then Ax = b has a unique least squares solution.
Answer:
True
6.5 Least Squares Fitting to Data
In this section we will use results about orthogonal projections in inner product spaces to obtain a technique
for fitting a line or other polynomial curve to a set of experimentally determined points in the plane.
On the basis of theoretical considerations or simply by observing the pattern of the points, the experimenter
decides on the general form of the curve to be fitted. Some possibilities are (Figure 6.5.1)
(a) A straight line: y = a + bx
(b) A quadratic polynomial: y = a + bx + cx²
Because the points are obtained experimentally, there is often some measurement “error” in the data, making
it impossible to find a curve of the desired form that passes through all the points. Thus, the idea is to choose
the curve (by determining its coefficients) that “best” fits the data. We begin with the simplest and most
common case: fitting a straight line to data points.
Figure 6.5.1
Suppose that we want to fit a straight line y = a + bx to n experimentally determined points (x₁, y₁), (x₂, y₂), …, (xₙ, yₙ).
If the data points were collinear, the line would pass through all n points, and the unknown coefficients a and
b would satisfy the equations
y₁ = a + bx₁
y₂ = a + bx₂
⋮
yₙ = a + bxₙ
We can write this system in matrix form as
| 1  x₁ |         | y₁ |
| 1  x₂ | | a |   | y₂ |
| ⋮  ⋮  | | b | = | ⋮  |
| 1  xₙ |         | yₙ |
or more compactly as
Mv = y    (1)
where
M = | 1  x₁ |,   v = | a |,   y = | y₁ |
    | 1  x₂ |        | b |        | y₂ |
    | ⋮  ⋮  |                     | ⋮  |
    | 1  xₙ |                     | yₙ |    (2)
If the data points are not collinear, then it is impossible to find coefficients a and b that satisfy system 1
exactly; that is, the system is inconsistent. In this case we will look for a least squares solution
v* = (a*, b*)
We call a line y = a* + b*x whose coefficients come from a least squares solution a regression line or a
least squares straight line fit to the data. To explain this terminology, recall that a least squares solution of 1
minimizes
‖y − Mv‖    (3)
which, written in terms of components, is the square root of
‖y − Mv‖² = (y₁ − a − bx₁)² + (y₂ − a − bx₂)² + ⋯ + (yₙ − a − bxₙ)²    (4)
If we now let
d₁ = |y₁ − a − bx₁|,  d₂ = |y₂ − a − bx₂|, …,  dₙ = |yₙ − a − bxₙ|
then 4 can be written as
‖y − Mv‖² = d₁² + d₂² + ⋯ + dₙ²    (5)
As illustrated in Figure 6.5.2, the number dᵢ can be interpreted as the vertical distance between the line
y = a + bx and the data point (xᵢ, yᵢ). This distance is a measure of the “error” at the point (xᵢ, yᵢ)
resulting from the inexact fit of y = a + bx to the data points, the assumption being that the xᵢ are known
exactly and that all the error is in the measurement of the yᵢ. Since 3 and 5 are minimized by the same vector
v*, the least squares straight line fit minimizes the sum of the squares of the estimated errors dᵢ, hence the
name least squares straight line fit.
Figure 6.5.2 dᵢ measures the vertical error in the least squares straight line.
Normal Equations
Recall from Theorem 6.4.2 that the least squares solutions of 1 can be obtained by solving the associated
normal system
MᵀMv = Mᵀy
In the exercises it will be shown that the column vectors of M are linearly independent if and only if the n data
points do not lie on a vertical line in the xy-plane. In this case it follows from Theorem 6.4.4 that the least
squares solution is unique and is given by
v* = (MᵀM)⁻¹Mᵀy
This yields the following result.
Let (x₁, y₁), (x₂, y₂), …, (xₙ, yₙ) be a set of two or more data points, not all lying on a vertical
line, and let
M = | 1  x₁ |      y = | y₁ |
    | 1  x₂ |          | y₂ |
    | ⋮  ⋮  |          | ⋮  |
    | 1  xₙ |          | yₙ |
Then there is a unique least squares straight line fit
y = a* + b*x
to the data points. Moreover, the coefficient vector v* = (a*, b*) is given by
v* = (MᵀM)⁻¹Mᵀy    (6)
which expresses the fact that v* is the unique solution of the normal equations
MᵀMv* = Mᵀy    (7)
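In code, Formula 6 amounts to building M from the x-coordinates and solving the normal equations 7. A minimal Python sketch with made-up data points (placeholders for whatever data is to be fitted):

```python
import numpy as np

# Hypothetical data points (x_i, y_i); replace with the data to be fitted.
x = np.array([1.0, 2.0, 3.0, 4.0])
y = np.array([2.1, 2.9, 4.2, 4.8])

M = np.column_stack([np.ones_like(x), x])            # i-th row of M is (1, x_i)
a_star, b_star = np.linalg.solve(M.T @ M, M.T @ y)   # v* = (M^T M)^(-1) M^T y
print(f"least squares line: y = {a_star:.3f} + {b_star:.3f}x")
```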
Find the least squares straight line fit to the four points , , , and . (See
Figure 6.5.3.)
Figure 6.5.3
Solution We have
Hooke's law in physics states that the length x of a uniform spring is a linear function of the
force y applied to it. If we express this relationship as y = a + bx, then the coefficient b is
called the spring constant. Suppose a particular unstretched spring has a measured length of 6.1
inches (i.e., when ). Forces of 2 pounds, 4 pounds, and 6 pounds are then applied
to the spring, and the corresponding lengths are found to be 7.6 inches, 8.7 inches, and 10.4
inches (see Figure 6.5.4). Find the spring constant.
Figure 6.5.4
Solution The measurements give the data points (x, y) = (6.1, 0), (7.6, 2), (8.7, 4), (10.4, 6), so
M = | 1   6.1 |      y = | 0 |
    | 1   7.6 |          | 2 |
    | 1   8.7 |          | 4 |
    | 1  10.4 |          | 6 |
and
v* = (MᵀM)⁻¹Mᵀy ≈ | −8.6 |
                  |  1.4 |
where the numerical values have been rounded to one decimal place. Thus, the estimated value
of the spring constant is b* ≈ 1.4 pounds/inch.
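A quick numerical check of this example (a sketch only; the data are the four (length, force) pairs listed above):

```python
import numpy as np

x = np.array([6.1, 7.6, 8.7, 10.4])   # spring lengths in inches
y = np.array([0.0, 2.0, 4.0, 6.0])    # applied forces in pounds

M = np.column_stack([np.ones_like(x), x])
a_star, b_star = np.linalg.solve(M.T @ M, M.T @ y)
print(round(b_star, 1))               # spring constant, approximately 1.4 pounds/inch
```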
Historical Note On October 5, 1991 the Magellan spacecraft entered the atmosphere of Venus and
transmitted the temperature T in kelvins (K) versus the altitude h in kilometers (km) until its signal
was lost at an altitude of about 34 km. Discounting the initial erratic signal, the data strongly
suggested a linear relationship, so a least squares straight line fit was used on the linear part of the
data to obtain the equation
(8)
Least Squares Fit of a Polynomial
The technique described above for fitting a straight line generalizes easily to fitting a polynomial
y = a₀ + a₁x + ⋯ + aₘxᵐ    (9)
of any specified degree m to n points
(x₁, y₁), (x₂, y₂), …, (xₙ, yₙ)
Substituting these values of x and y into 9 yields a linear system that can be written as Mv = y, where
M = | 1  x₁  x₁² ⋯ x₁ᵐ |      v = | a₀ |      y = | y₁ |
    | 1  x₂  x₂² ⋯ x₂ᵐ |          | a₁ |          | y₂ |
    | ⋮  ⋮   ⋮      ⋮ |          | ⋮  |          | ⋮  |
    | 1  xₙ  xₙ² ⋯ xₙᵐ |          | aₘ |          | yₙ |    (10)
As before, the least squares solutions of this system are the solutions of the normal system MᵀMv = Mᵀy.
Conditions that guarantee the invertibility of MᵀM are discussed in the exercises (Exercise 7). If MᵀM is
invertible, then the normal equations have a unique solution v*, which is given by
v* = (MᵀM)⁻¹Mᵀy    (11)
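Formula 11 is the same normal-equation computation with M replaced by the n × (m + 1) matrix of powers of the xᵢ. A short sketch for a quadratic fit (m = 2) to made-up data:

```python
import numpy as np

# Hypothetical data points (placeholders for whatever data is to be fitted).
x = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
y = np.array([1.1, 1.9, 4.8, 9.2, 16.5])

m = 2                                        # degree of the fitting polynomial
M = np.vander(x, m + 1, increasing=True)     # i-th row of M is (1, x_i, x_i^2, ..., x_i^m)
v = np.linalg.solve(M.T @ M, M.T @ y)        # v* = (M^T M)^(-1) M^T y, Formula 11
print(v)                                     # fitted coefficients a_0, a_1, ..., a_m
```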
According to Newton's second law of motion, a body near the Earth's surface falls vertically
downward according to the equation
s = s₀ + v₀t + ½gt²    (12)
where
s = vertical displacement downward relative to some fixed point
s₀ = initial displacement at time t = 0
v₀ = initial velocity at time t = 0
g = acceleration of gravity at the Earth's surface
The constants s₀, v₀, and g can be estimated from Equation 12 by releasing a weight with unknown initial displacement and velocity and
measuring the distance it has fallen at certain times relative to a fixed reference point. Suppose
that a laboratory experiment is performed to evaluate g. Suppose it is found that at times
, and .5 seconds the weight has fallen , and 3.73
feet, respectively, from the reference point. Find an approximate value of g using these data.
(13)
If desired, we can also estimate the initial displacement and initial velocity of the weight:
In Figure 6.5.5 we have plotted the five data points and the approximating polynomial.
Figure 6.5.5
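Because the measured times and distances of this experiment are not reproduced above, the following sketch uses synthetic measurements (generated from Equation 12 with g = 32.2 ft/s² plus a little noise) purely to illustrate how g is recovered from the quadratic coefficient of the fit:

```python
import numpy as np

rng = np.random.default_rng(0)
t = np.array([0.1, 0.2, 0.3, 0.4, 0.5])          # hypothetical measurement times (seconds)
s = 0.2 + 1.0 * t + 0.5 * 32.2 * t**2            # synthetic data: s0 = 0.2 ft, v0 = 1 ft/s
s = s + rng.normal(scale=0.01, size=t.size)      # small measurement "error"

M = np.column_stack([np.ones_like(t), t, t**2])  # model: s = s0 + v0*t + (g/2)*t^2
s0, v0, half_g = np.linalg.solve(M.T @ M, M.T @ s)
print(f"estimated g = {2 * half_g:.1f} ft/s^2")  # close to 32.2
```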
Concept Review
• Least squares straight line fit
• Regression line
• Least squares polynomial fit
Skills
• Find the least squares straight line fit to a set of data points.
• Find the least squares polynomial fit to a set of data points.
• Use the techniques of this section to solve applied problems.
Answer:
2. Find the least squares straight line fit to the four points , , , and .
3. Find the quadratic polynomial that best fits the four points , , , and .
Answer:
4. Find the cubic polynomial that best fits the five points , , , , and
.
5. Show that the matrix M in Equation 2 has linearly independent columns if and only if at least two of the
numbers x₁, x₂, …, xₙ are distinct.
6. Show that the columns of the matrix M in Equation 10 are linearly independent if n > m and
at least m + 1 of the numbers x₁, x₂, …, xₙ are distinct. [Hint: A nonzero polynomial of degree m has at
most m distinct roots.]
7. Let M be the matrix in Equation 10. Using Exercise 6, show that a sufficient condition for the matrix
MᵀM to be invertible is that n > m and that at least m + 1 of the numbers x₁, x₂, …, xₙ are distinct.
8. The owner of a rapidly expanding business finds that for the first five months of the year the sales (in
thousands) are , and $8.0. The owner plots these figures on a graph and conjectures
that for the rest of the year, the sales curve can be approximated by a quadratic polynomial. Find the least
squares quadratic polynomial fit to the sales curve, and use it to project the sales for the twelfth month of
the year.
9. A corporation obtains the following data relating the number of sales representatives on its staff to annual
sales:
Explain how you might use least squares methods to estimate the annual sales with 45 representatives, and
discuss the assumptions that you are making. (You need not perform the actual computations.)
10. Pathfinder is an experimental, lightweight, remotely piloted, solar-powered aircraft that was used in a series
of experiments by NASA to determine the feasibility of applying solar power for long-duration, high-
altitude flight. In August 1997 Pathfinder recorded the data in the accompanying table relating altitude H
and temperature T. Show that a linear model is reasonable by plotting the data, and then find the least
squares line of best fit.
Table Ex-10
11. Find a curve of the form that best fits the data points , , by making the
substitution . Draw the curve and plot the data points in the same coordinate system.
Answer:
True-False Exercises
In parts (a)–(d) determine whether the statement is true or false, and justify your answer.
(a) Every set of data points has a unique least squares straight line fit.
Answer:
False
(b) If the data points are not collinear, then 1 is an inconsistent system.
Answer:
True
(c) If y = a* + b*x is the least squares straight line fit to the data points (x₁, y₁), (x₂, y₂), …, (xₙ, yₙ), then
|yᵢ − (a* + b*xᵢ)| is minimal for every i = 1, 2, …, n.
Answer:
False
(d) If y = a* + b*x is the least squares straight line fit to the data points (x₁, y₁), (x₂, y₂), …, (xₙ, yₙ), then
(y₁ − a* − b*x₁)² + (y₂ − a* − b*x₂)² + ⋯ + (yₙ − a* − b*xₙ)² is minimal.
Answer:
True
6.6 Function Approximation; Fourier Series
In this section we will show how orthogonal projections can be used to approximate certain types of functions by
simpler functions that are easier to work with. The ideas explained here have important applications in
engineering and science. Calculus is required.
Best Approximations
All of the problems that we will study in this section will be special cases of the following general problem.
APPROXIMATION PROBLEM
Given a function f that is continuous on an interval [a, b], find the “best possible approximation” to f
using only functions from a specified subspace W of C[a, b].
Measurements of Error
To solve approximation problems of the preceding types, we first need to make the phrase “best
approximation over [a, b]” mathematically precise. To do this we will need some way of quantifying the
error that results when one continuous function is approximated by another over an interval [a, b]. If we
were to approximate f(x) by g(x), and if we were concerned only with the error in that approximation at a
single point x₀, then it would be natural to define the error to be
error = |f(x₀) − g(x₀)|
sometimes called the deviation between f and g at x₀ (Figure 6.6.1). However, we are not concerned simply
with measuring the error at a single point but rather with measuring it over the entire interval [a, b]. The
problem is that an approximation may have small deviations in one part of the interval and large deviations in
another. One possible way of accounting for this is to integrate the deviation |f(x) − g(x)| over the interval
and define the error over the interval to be
error = ∫ₐᵇ |f(x) − g(x)| dx    (1)
Geometrically, 1 is the area between the graphs of f(x) and g(x) over the interval [a, b] (Figure 6.6.2); the
greater the area, the greater the overall error.
Figure 6.6.2 The area between the graphs of f and g over [a, b] measures the error in approximating f
by g over [a, b]
Although 1 is natural and appealing geometrically, most mathematicians and scientists generally favor the
following alternative measure of error, called the mean square error:
mean square error = ∫ₐᵇ [f(x) − g(x)]² dx
Mean square error emphasizes the effect of larger errors because of the squaring and has the added advantage
that it allows us to bring to bear the theory of inner product spaces. To see how, suppose that f is a continuous
function on [a, b] that we want to approximate by a function g from a subspace W of C[a, b], and suppose
that C[a, b] is given the inner product
⟨f, g⟩ = ∫ₐᵇ f(x)g(x) dx
It follows that
‖f − g‖² = ⟨f − g, f − g⟩ = ∫ₐᵇ [f(x) − g(x)]² dx = mean square error
so minimizing the mean square error is the same as minimizing ‖f − g‖². Thus the approximation problem
posed informally at the beginning of this section can be restated more precisely as follows.
LEAST SQUARES APPROXIMATION PROBLEM
Let f be a function that is continuous on an interval [a, b], let C[a, b] have the inner product
⟨f, g⟩ = ∫ₐᵇ f(x)g(x) dx
and let W be a finite-dimensional subspace of C[a, b]. Find a function g in W that minimizes the mean square error
‖f − g‖² = ∫ₐᵇ [f(x) − g(x)]² dx
Since ‖f − g‖ and ‖f − g‖² are minimized by the same function g, this problem is equivalent to looking for a
function g in W that is closest to f. But we know from Theorem 6.4.1 that g = proj_W f is such a function
(Figure 6.6.3).
Figure 6.6.3
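For a concrete feel for the mean square error, the sketch below approximates ∫ₐᵇ [f(x) − g(x)]² dx numerically on [0, 2π]; both f and g are arbitrary illustrative choices, not functions taken from the text.

```python
import numpy as np

a, b = 0.0, 2.0 * np.pi
f = lambda x: x                           # function to approximate (illustrative choice)
g = lambda x: np.pi - 2.0 * np.sin(x)     # a candidate approximation (illustrative choice)

x = np.linspace(a, b, 20001)
mse = np.trapz((f(x) - g(x))**2, x)       # mean square error = integral of [f(x) - g(x)]^2
print(mse)
```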
THEOREM 6.6.1
If f is a continuous function on [a, b], and W is a finite-dimensional subspace of C[a, b], then the function g in W that
minimizes the mean square error
∫ₐᵇ [f(x) − g(x)]² dx
is g = proj_W f, where the orthogonal projection is relative to the inner product
⟨f, g⟩ = ∫ₐᵇ f(x)g(x) dx
The function g = proj_W f is called the least squares approximation to f from W.

Fourier Series
A function of the form
c₀ + c₁ cos x + c₂ cos 2x + ⋯ + cₙ cos nx + d₁ sin x + d₂ sin 2x + ⋯ + dₙ sin nx    (2)
is called a trigonometric polynomial; if cₙ and dₙ are not both zero, then the polynomial is said to have order n. For
example,
2 + cos x − 3 cos 2x + 7 sin 4x
is a trigonometric polynomial of order 4, since c₄ and d₄ are not both zero.
It is evident from 2 that the trigonometric polynomials of order n or less are the various possible linear
combinations of
1, cos x, cos 2x, …, cos nx, sin x, sin 2x, …, sin nx    (3)
It can be shown that these 2n + 1 functions are linearly independent and thus form a basis for a
(2n + 1)-dimensional subspace of C[0, 2π].
Let us now consider the problem of finding the least squares approximation of a continuous function f(x)
over the interval [0, 2π] by a trigonometric polynomial of order n or less, that is, by a function from the
subspace W of C[0, 2π] spanned by the functions in 3. As noted above, the least squares
approximation to f from W is the orthogonal projection of f on W. To find this orthogonal projection, we must
find an orthonormal basis g₀, g₁, …, g₂ₙ for W, after which we can compute the orthogonal projection on W
from the formula
proj_W f = ⟨f, g₀⟩g₀ + ⟨f, g₁⟩g₁ + ⋯ + ⟨f, g₂ₙ⟩g₂ₙ    (4)
(see Theorem 6.3.4b). An orthonormal basis for W can be obtained by applying the Gram–Schmidt process to
the basis vectors in 3 using the inner product
⟨f, g⟩ = ∫₀^{2π} f(x)g(x) dx
This yields the orthonormal basis
g₀ = 1/√(2π),  g₁ = (1/√π) cos x, …, gₙ = (1/√π) cos nx,  gₙ₊₁ = (1/√π) sin x, …, g₂ₙ = (1/√π) sin nx    (5)
If we introduce the notation
a₀ = (2/√(2π))⟨f, g₀⟩,  a₁ = (1/√π)⟨f, g₁⟩, …, aₙ = (1/√π)⟨f, gₙ⟩,  b₁ = (1/√π)⟨f, gₙ₊₁⟩, …, bₙ = (1/√π)⟨f, g₂ₙ⟩    (6)
then on substituting 5 into 4, we obtain
proj_W f = a₀/2 + [a₁ cos x + ⋯ + aₙ cos nx] + [b₁ sin x + ⋯ + bₙ sin nx]    (7)
where
a₀ = (2/√(2π))⟨f, g₀⟩ = (1/π)∫₀^{2π} f(x) dx
aₖ = (1/√π)⟨f, gₖ⟩ = (1/π)∫₀^{2π} f(x) cos kx dx,  k = 1, 2, …, n
bₖ = (1/√π)⟨f, gₙ₊ₖ⟩ = (1/π)∫₀^{2π} f(x) sin kx dx,  k = 1, 2, …, n
In short,
a₀ = (1/π)∫₀^{2π} f(x) dx,  aₖ = (1/π)∫₀^{2π} f(x) cos kx dx,  bₖ = (1/π)∫₀^{2π} f(x) sin kx dx    (8)
The numbers a₀, a₁, …, aₙ, b₁, …, bₙ are called the Fourier coefficients of f.
Solution
(a)
(9a)
(9c)
The graphs of and some of these approximations are shown in Figure 6.6.4.
Figure 6.6.4
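The coefficient formulas in 8 can be evaluated numerically for any continuous f. As an illustration (the choice f(x) = x is an assumption made here purely for demonstration), the sketch below approximates the Fourier coefficients by the trapezoid rule; for f(x) = x on [0, 2π] the exact values are a₀ = 2π, aₖ = 0, and bₖ = −2/k.

```python
import numpy as np

def fourier_coefficients(f, n, num=20001):
    """Approximate a_0, a_k, b_k of Formula 8 on [0, 2*pi] with the trapezoid rule."""
    x = np.linspace(0.0, 2.0 * np.pi, num)
    a0 = np.trapz(f(x), x) / np.pi
    a = [np.trapz(f(x) * np.cos(k * x), x) / np.pi for k in range(1, n + 1)]
    b = [np.trapz(f(x) * np.sin(k * x), x) / np.pi for k in range(1, n + 1)]
    return a0, a, b

f = lambda x: x                        # illustrative choice of f
a0, a, b = fourier_coefficients(f, n=3)
print(round(a0, 4))                    # ~ 6.2832 (that is, 2*pi)
print([round(v, 4) for v in a])        # ~ [0, 0, 0]
print([round(v, 4) for v in b])        # ~ [-2, -1, -0.6667]
```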
It is natural to expect that the mean square error will diminish as the number of terms in the
least squares approximation
proj_W f = a₀/2 + Σₖ₌₁ⁿ (aₖ cos kx + bₖ sin kx)
increases. It can be proved that for functions f in C[0, 2π], the mean square error
approaches zero as n → +∞; this is denoted by writing
f(x) = a₀/2 + Σₖ₌₁^∞ (aₖ cos kx + bₖ sin kx)
The right side of this equation is called the Fourier series for f over the interval [0, 2π].
Such series are of major importance in engineering, science, and mathematics.
Historical Note Fourier was a French mathematician and physicist who discovered
the Fourier series and related ideas while working on problems of heat diffusion. This
discovery was one of the most influential in the history of mathematics; it is the
cornerstone of many fields of mathematical research and a basic tool in many branches
of engineering. Fourier, a political activist during the French revolution, spent time in
jail for his defense of many victims during the Terror. He later became a favorite of
Napoleon and was named a baron.
[Image: The Granger Collection, New York]
Concept Review
• Approximation of functions
• Mean square error
• Least squares approximation
• Trigonometric polynomial
• Fourier coefficients
• Fourier series
Skills
• Find the least squares approximation of a function.
• Find the mean square error of the least squares approximation of a function.
• Compute the Fourier series of a function.
Exercise Set 6.6
1. Find the least squares approximation of over the interval by
(a) a trigonometric polynomial of order 2 or less.
(b) a trigonometric polynomial of order n or less.
Answer:
(a)
(b)
3. (a) Find the least squares approximation of x over the interval by a function of the form .
(b) Find the mean square error of the approximation.
Answer:
(a)
(b)
4. (a) Find the least squares approximation of over the interval by a polynomial of the form
.
(b) Find the mean square error of the approximation.
5. (a) Find the least squares approximation of over the interval [−1, 1] by a polynomial of the form
.
(b) Find the mean square error of the approximation.
Answer:
(a)
(b)
6. Use the Gram–Schmidt process to obtain the orthonormal basis 5 from the basis 3.
7. Carry out the integrations indicated in Formulas 9a, 9b, and 9c.
8. Find the Fourier series of over the interval .
9. Find the Fourier series of and , over the interval .
Answer:
True-False Exercises
In parts (a)–(e) determine whether the statement is true or false, and justify your answer.
(a) If a function f in C[a, b] is approximated by the function g, then the mean square error is the same as the
area between the graphs of f(x) and g(x) over the interval [a, b].
Answer:
False
(b) Given a finite-dimensional subspace W of C[a, b], the function g = proj_W f minimizes the mean square
error.
Answer:
True
(c) {1, cos x, sin x, cos 2x, sin 2x, …, cos nx, sin nx} is an orthogonal subset of the vector space C[0, 2π] with respect to the
inner product ⟨f, g⟩ = ∫₀^{2π} f(x)g(x) dx.
Answer:
True
(d) {1, cos x, sin x, cos 2x, sin 2x, …, cos nx, sin nx} is an orthonormal subset of the vector space C[0, 2π] with respect to the
inner product ⟨f, g⟩ = ∫₀^{2π} f(x)g(x) dx.
Answer:
False
(e) {1, cos x, sin x, cos 2x, sin 2x, …, cos nx, sin nx} is a linearly independent subset of C[0, 2π].
Answer:
True
Chapter 6 Supplementary Exercises
1. Let have the Euclidean inner product.
(a) Find a vector in that is orthogonal to and and makes equal
angles with and .
(b) Find a vector of length 1 that is orthogonal to and above and such that the
cosine of the angle between x and is twice the cosine of the angle between x and .
Answer:
(a) with
(b)
Answer:
(a) The subspace of all matrices in with only zeros on the diagonal.
(b) The subspace of all skew-symmetric matrices in .
x is a solution of this system if and only if the vector x is orthogonal to every row vector
of A with respect to the Euclidean inner product on Rⁿ.
5. Use the Cauchy–Schwarz inequality to show that if a₁, a₂, …, aₙ are positive real numbers, then
(a₁ + a₂ + ⋯ + aₙ)(1/a₁ + 1/a₂ + ⋯ + 1/aₙ) ≥ n²
6. Show that if x and y are vectors in an inner product space and c is any scalar, then
7. Let have the Euclidean inner product. Find two vectors of length 1 that are orthogonal to all three of
the vectors , , and .
Answer:
Answer:
No
10. If u and v are vectors in an inner product space V, then u, v, and u − v can be regarded as sides of a
“triangle” in V (see the accompanying figure). Prove that the law of cosines holds for any such triangle;
that is,
‖u − v‖² = ‖u‖² + ‖v‖² − 2‖u‖‖v‖ cos θ
where θ is the angle between u and v.
Figure Ex-10
11. (a) As shown in Figure 3.2.6, the vectors (k, 0, 0), (0, k, 0), and (0, 0, k) form the edges of a cube in
R³ with diagonal (k, k, k). Similarly, the vectors
(k, 0, 0, …, 0), (0, k, 0, …, 0), …, (0, 0, 0, …, k)
can be regarded as edges of a “cube” in Rⁿ with diagonal (k, k, k, …, k). Show that each of the above
edges makes an angle of θ with the diagonal, where cos θ = 1/√n.
(b) (Calculus required) What happens to the angle θ in part (a) as the dimension of Rⁿ approaches ∞?
Answer:
(b) θ approaches π/2
13. Let u be a vector in an inner product space V, and let {v₁, v₂, …, vₙ} be an orthonormal basis for V.
Show that if αᵢ is the angle between u and vᵢ, then
cos²α₁ + cos²α₂ + ⋯ + cos²αₙ = 1
14. Prove: If ⟨u, v⟩₁ and ⟨u, v⟩₂ are two inner products on a vector space V, then the quantity
⟨u, v⟩ = ⟨u, v⟩₁ + ⟨u, v⟩₂ is also an inner product.
15. Prove Theorem 6.2.5.
16. Prove: If A has linearly independent column vectors, and if b is orthogonal to the column space of A, then
the least squares solution of Ax = b is x = 0.
17. Is there any value of s for which x₁ = 1 and x₂ = 2 is the least squares solution of the following linear
system?
Answer:
No
18. Show that if p and q are distinct positive integers, then the functions f(x) = cos px and g(x) = cos qx are
orthogonal with respect to the inner product
⟨f, g⟩ = ∫₀^{2π} f(x)g(x) dx
19. Show that if p and q are positive integers, then the functions f(x) = cos px and g(x) = sin qx are
orthogonal with respect to the inner product
⟨f, g⟩ = ∫₀^{2π} f(x)g(x) dx