Defence of empirical evidence, reply to Shimshon Bichler and Jonathan Nitzan
Paul Cockshott, Allin Cottrell, Alejandro Valle Baeza December 25, 2010
Bichler and Nitzan argue against the ndings of close correlations between labour content and monetary value added on two main grounds: 1. That the empirical studies do not use labour time to estimate labour content but instead use monetary data from the input output tables. They claim that the evidence for the labour theory of value is thus an artefact of circular reasoning. Those of us who have published evidence for close correlations between labour values and prices are guilty of circular reasoning since we presume what we must show: that it is possible to work backwards from money to labour time. 2. That the correlations we observe are essentially supurious since they do not take into account industry scale. They provide an interactive spreadsheet that allegedly shows how such spurious correlations can arise. We respond to both of these arguments below.
1 Product ows: Quantities versus monetary magnitudes
The claim that all empirical studies of labour value to price correlations have relied on monetary input output tables is false. The Swedish input output tables give labour inputs not in money but in person years. Zachariah[8] has published price/labour value comparisons for a number of economies including Sweden. The Swedish data, which uses actual person years of labour input shows the same strong correlations that have been observed in other studies that started with wage expenditures as their measure of labour input. Cockshott, Cottrell and Michaelson [7] had earlier used independent data from the New Earnings Survey on average hourly wage rates in each industry to work back from the wage totals given in the UK input output tables to the number of person hours of labour that these represent. When this was done the correlation between labour values and prices remained as strong as when wage expenditure was used as a surrogate for labour input.
The use of monetary magnitudes rather than in-kind product ows is in part a result of the degree of aggregation of the actually available inputoutput tables for capitalist economies. input-output table That is, in order to construct a meaningful
in natura it is necessary that the data be fully disaggregated
There can be no meaningful number for
by product, but many of the industries as dened in the actual tables produce a wide range of dierent products. the
quantity of output of Aircraft and Parts or Electronic Components and
Accessories, or for the in-kind ow of the product of the latter industry into the former. In a planned economy it would be possible to construct material ows in terms of unique identiers for each type of product - using bar codes for instance. Since this information is not available to national oces of statistics in capitalist economies the practical solution is to present the aggregate monetary values of ows between. But this does not create a problem, if one is interested in comparing the aggregate monetary value of the output of the industries with the aggregate labour-value of those same outputs. The point is this:
The vector of aggregate sectoral labour values calculated from a monetary table will agree with the vector calculated from a physical table, up to a scalar, regardless of the price vector and the (common) wage rate used in constructing the monetary table. Or in
other words, the vector of sectoral labour values obtained is independent of the price vector used. One might just as well (if it were practically possible) use an arbitrary vector of accounting prices or weights to construct the monetary table. The fact that actual prices are used in the published data does not in any way contaminate the value gures one obtains; no spurious goodness of t between values and prices is induced. We provide a proof of this in section 3.1. Correlation coecients between two vectors do not change under scalar multiplication of one of the vectors. A correlation between rainfall and temperature for example is not aected by whether temperature is measured in Farenheit or Centigrade, nor whether the rainfall is measured in centimeters or inches. Thus since the aggregate sectoral values obtained from the monetary data agreeup to a scalar, namely
w,
the common money wage ratewith those that would
be obtained from the data
in natura, it follows that the correlation coecients
obtained this way will be the same as those that would be obtained with in natura data. The sole source of variation would be the assumption that wage rate was the same accross industries. But as we have said above, tests have already been performed with correcting for diering wage rates accross industries and the correlation coecient remains very strong. In the UK case [7] found a correlation coecient of 98% using an assumption of uniform wage rates versus 96% after adjusting for wage rates.
2 Spurious correlation
Bichler and Nitzan[3] argue that the correlations we observe are essentially supurious since they do not take into account industry scale. They provide an in-
teractive spreadsheet that allegedly shows how such spurious correlations can arise. This is continuation of the argument advanced by Kliman [6]. We will respond to this strand of their argument in 3 ways: 1. By focusing on what the hypothesis being tested in our studies was. 2. By showing that their demonstration of spurious correlation is based on what in computer science is called a type error, and what in physics is called a dimensional error. This argument we owe to Valle and Frhlich [2, 5]. 3. By citing additional empirical data that show the observed correlations between labour content and monetary value are not spurious.
2.1
The basic hypothesis being tested
There seems to be a certain irony to Bichler and Nitzan's opposition to the labour theory of value. The idea, on which they found their work, that capital is power has a respectable classical pedigree, Adam Smith had long ago written that monetary wealth was power. But he was specic, moetary wealth was the
power to command the labour of others. If Bichler and Nitzan were to seek a
measurable correlate to power they would do well to follow Smith. But in doing so they would have to abandon their opposition to the labour theory of value, since in Smith's case the power is a power over labour and there is a direct correlation between amount of money and amount of labour commanded. This correlation, Bichler and Nitzan deny, thus depriving their theory of the realistic foundation that Adam Smith had. The purpose of the empirical studies that we and others have done on price/labour correlations has been to verify this basic proposition on which classical economics stood: that labour is the source of commodities exchange value. We got into this because we wanted to analyse national income in marxist categories - rate of surplus value, organic composition of capital etc. We were turned down by reviewers on the grounds that we were using monetary quantities to measure what should have been labour value ratios. In order to establish the validity of doing this we responded by showing that even if you break the economy down in much ner detail, using input output tables there was a close correlation between labour value magnitudes and monetary magnitudes, and that it was thus valid to use monetary data to work out ratios like them organic composition of capital or the rate of exploitation. Marx's analysis of capitalist exploitation rests on the hypothesis that embodied labour is the source of monetary value. To establish the validity of his hypothesis and analysis of exploitation that stems from it, it is sucient to break down the economy into a large number of sectors and show that the monetary value of the gross output of these sectors correlates closely with the labour expended to produce that gross output. This in turn requires that you compute two vectors
1. A vector of monetary ows of output indexed by sector, each element of this vector is of dimension Million per year. 2. A vector of the number of people whose annual labour was directly or indirectly embodied in this monetary output, the dimension of each element of this vector is a number of persons since person hours per annum reduces to dimension persons. This, incidentally is exactly the format used by the Swedish input output tables mentioned in section 1. If a strong correlation exists between the two vectors we can say that the data are consistent with the hypothesis that labour is the source of value. It must be emphasised that this method directly examines what we want to test : whether monetary value is proportional to labour used. The argument by Kliman, Bichler and Nitzan that the correlations observed are spurious depends on the idea that there could be an independent 3rd factor that is the cause of the variation both in the persons vector and the monetary ow vector. Any correlation observed in science could potentially be spurious, so this is always a possibility. But for an allegation of spurious correlation to be born out, one must both identify this third factor and also show that it actually does induce the correlations observed. What could this 3rd factor be? Kliman suggests that it is industry size. Big industries employ more people and also sell more output, so that the correlation arises just because of this fact. But for a 3rd factor to be the common cause of the variation in our two vectors that third factor must itself be quantiable. How do you measure industry size? The obvious measures of an industry's size : how many people it employs, or its turnover are ruled out, since we are looking for something independent. Kliman, Bichler and Nitzan suggest that there is some third form of industry size causes the variations in both employment and turnover. There certainly are other possible measures : the area of land an industry occupies, the number of tons of output it produces, the number of kilowatts hours of energy it uses. In principle any of these could be the 3rd factor that detemined both the labour used and the turnover of an industry, but we merely have to list them to see how implausible it is that land or tonnage is an appropriate third source of variation. Agriculture is by far the largest industry in the UK in physical terms. proportional to its 'size' in these terms. The water supply industry is the largest in terms of kilograms delivered, but again, its postion in terms of turnover and employment falls far short. Energy input is a more plausible 3rd factor, and we examine this in detail in section 2.3. Bichler and Nitzan dont propose any of these, instead they suggest that the common cause of variation is the number of units of output produced. Bichler and Nitzan produce a spreadsheet showing that if you have two vectors It occupies the most space, but its employment and turnover are far in no way
a, b that
are uncorrrelated and that if we multiply these by a 3rd random vector
c,
then
ac will be correlated with bc.
the vectors
Mathematically this ne but it has no relevance
to the question under dispute unless some economic meaning can be given to
a, b, c.
Putting headings at the top of the columns like 'Unit Price',
'Number of units sold' does not give their mathematical example any economic grip unless they can explain what these 'Units' are in the context of the input output tables used in our study. If we look at the UK industrial subdivisions what are the units of output? For industry 30, Footware, it is presumably pairs of shoes. But what is the unit of output for industry 47 Rubber Products, or industry 50 Ceramic goods? For industry 67, Weapons and Amunition, is the unit of output a bullet, a tank or an atom bomb? There is simply no practical way of measuring the scale of these industries in terms of 'units'. Unless Bichler and Nitzan can:
say how to measure the number of units produced by these, and around 100 hundred other industries in the I/O tables; show that the vector of numbers of units produced is in fact highly correlated with both the monetary turnover vector and the embodied labour vector;
their claim that a spurious correlation is induced by the number of units produced will remain no more than idle speculation. We will now go on to argue both that the very idea of correlating price vectors with labour content vectors, as they purport to do in their example spreadshee (section 2.2), and to show that a number of plausible potential sources of spurious correlation have already been tested and found not to operate (section 2.3).
2.2
A type error.
The spreadheet provided by Bichler and Nitzan purports to show an example in which prices and values are uncorrelated but, once multiplied by the output vector, they become correlated. We have made some criticisms of this above, but now we will focus on another part of their argument : the part that shows the initial correlations between prices and values is insignicant. This part relies on the library correlation function built into Excel. Now Excel is an untyped computer maths package. It does not check that the mathematical operations one is performing make sense since it knows nothing about what the numbers in a spreadsheet represent. More rigourous programming systems like Fortress [1] or Vector Pascal [4] allow the user to specify the units being used for variables so that dimensional analysis can be applied. Had this been done the computer would have warned Bichler and Nitzan of the mistake that they were making. Dimensional analysis is a set of rules to verify basic aspects of mathematical models, it contains necessary but not sucient conditions for the validity of a model. Variables, in general, are ordered pairs: a magnitude
and a unit of
measurement
[m],
by example oil price is 90
[U Sdollar/barrel].
Basic rules of
dimensional analysis are: 1.Any mathematical expression must be dimensionally consistent, i.e. units of left side of the expression must be equal to units of the right side of the expression. 2.Addition or subtraction of magnitudes with same units is allowed:
x[m] + y[m] = (x + y)[m]
3.Addition or subtraction of magnitudes with dierent units is not allowed:
x[m] + y[t]
is impossible.
4.Multiplication of variables with dierent units is allowed The value of commodity labor time and
5.Division of variables with dierent units is aceptablex[m]/y[t]
isi [tl/ui ] and it's price
x[m]y[t] = (xy)[mt] = (x/y)[m/t] pi [$/ui ] where tl means
ui
the physical unit of merchandise
i.
Does the average price of oil and a pencil means anything? Of course do not. In the same vein, average price of all commodities is nonsense. gously,the average value of dierent commodities is not denable. sional analysis corroborate this because averaging oil price AnaloDimenwith
po [$/barrel]
pp [$/pencil]
is forbidden by rule 3.
The correlation coecient between two vectors uct of the normalised vectors
a, b is the inner or dot prod-
(a, b) = N (a).N (b)
Where the normalisation mean of
and divides by
N (x) (x) the
function for a vector standard deviation of
x x
subtracts :
(x)
the
N (x) =
(x (x)) (x)
Since normalisation depends on computing the mean of a vector and since computing the mean depends on addition, it is clear that normalisaation is only dened on vectors of homogenous dimension. applies to vectors of homogenous dimension. Value price correlation would be dened by: ( ) ( ) (, p) = N ().N (p) = (()) . (p(p)) () (p) where Hence correlation likewise only
()
and
(p)
are simple means of values and prices; and
(),(p)
are standard deviations of
and
p.
However, averaging prices and values is an
invalid operation as is taking the standard deviation since this in turn depends on the mean. Hence the attempt to compute It is because vectors not
(p, )
is a big mistake, but why?
and
are not dimensionally homogenous, they are
elements of two variables but two vectors each of
dierent variables.
Sustainers of the spurious correlation criticism have confused the problem: there is no correlation of two variables with and labor values a third variable. The correlation of prices multiplied by gross production
n observations complicated by introducing pi xi
i xi
multiplied by gross production is well dened because each
pi xi
has dimension [$] and each
i xi
has dimension
[tl].
Each is of homogenous
dimension and thus correlation is well dened on them. According to above arguments the incorectly named spurious correlation is actually the right way to measure value price deviations and Bichler and Nitzan's assumed non spurious correlation is a total mistake. Also the correct approach allows us to see that if the basket used is changed the results cannot be compared. Exactly as when ination is measured: ination is the variation of the monetary value of a specic basket.
2.3
Poor correlations with other value bases
3 APPENDICES
3.1 Proof
Consider an economy characterized by the following arrays:
An
nn
matrix of intersectoral product
ows in kind, such that amount of industry put in industry
uij
represents the
j 's
output used as in-
i.
q l
An An
n1
vector of gross outputs of the
industries, in their natural units.
n1 vector of direct labour-hours pernn { qi Qij = 0
diagonal matrix for for
formed in each industry. It will be useful also to dene an
such that
i=j i=j
the
The standard calculation of labour-values proceeds as follows. First calculate n n matrix of technical coecients as A = Q1 U and the n-vector of 1 direct labour input per unit of physical output as = Q l. The n-vector of unit values (vertically integrated labour coecients) is then given by
v = (I Q1 U )1 Q1 l = (I A)1
and the
n-vector
of
aggregate values of the sectoral outputs is
V = Qv = Q(I A)1 n-vector p
such that represent the prices of the commodities and the scalar
(1)
We now construct the monetary counterpart to the above arrays. Let the (common) money wage rate.
Let us also dene an
denote the
nn
diagonal matrix
Having addressed the issue of intersectoral wage dierentials above, we abstract from it here.
{ Pij =
pi 0
for for
i=j i=j
Corresponding to each of the initial real arrays above there is a monetary version as follows:
U = UP q = Pq = wl l nn
Matrix of money-values of intersectoral product ows Vector of money-values of gross outputs Vector of industry wage-bills
From these we can construct counterparts to the derived real arrays. First the diagonal matrix
Q,
whose diagonal elements are
p i qi ,
is given by
{ Qij =
qi 0
for for
i=j = QP i=j
(2)
The counterpart to the matrix of technical coecients is
A = Q1 U = (QP )1 U P = P 1 Q1 U P = P 1 AP
The elements of is the (3)
represent the pounds' worth of input from sector
required
to produce a pound's worth of output in sector
i.
Finally, the counterpart to
n-vector
l = Q1 = (QP )1 wl = wP 1 Q1 l = P 1 w
(4)
whose elements represent the direct labour cost per pound's worth of output in each sector. Now here is the issue: suppose we are not privy to the information on product ows in kind and labour-hours, and have at our disposal only the information given in the monetary tables. On this basis we can calculate a vector
v,
v = (I A)1
While
vi
represented the vertically integrated labour hours per physical unit of
output of commodity i, the
vi
that we are able to obtain from the monetary ta-
bles represents the vertically integrated labour cost per pound's worth of output of commodity i. If we then multiply up by the money-value of the gross outputs of the industries we obtain the vector of vertically integrated labour costs for the industries.
v V = Q = Q(I A)1
values that could be obtained in principle from the data
(5)
We are interested in the relationship between (1), the aggregate sectoral
in natura, and (5), the
corresponding gures obtained by using the monetary data. On the basis of the correspondences (2), (3) and (4) we can rewrite (5) as
V = QP (I P 1 AP )1 P 1 w
Recall that (1) specied see that
(6)
V = wV
V = Q(I A)
Comparing these two equations we
on condition that
(I A)1 = P (I P 1 AP )1 P 1
sides of (7). On the left, we simply get
(7)
That this condition is indeed satised may be seen by taking inverses on both
(I A);
on the right we get
[P (I P 1 AP )1 P 1 ]1 = P (I P 1 AP )P 1 = (P AP )P 1 = I A
This means we have proved that
V = wV , which is to say that the aggregate
sectoral values obtained from the monetary data agreeup to a scalar, namely
w,
the common money wage ratewith those that would be obtained from the
data
in natura, if these were available. The aggregate value vector is independent
of the price vector used in forming the monetary tables. This proof was originally presented in a conference paper at the Easter Economics Association in 1997.
3.2
Formal treatment of dimensioned data
Dimensional analysis is familiar to scientists and engineers and provides a routine check on the sanity of mathematical expressions. Dimensions can not be expressed in the otherwise rigourous type system of standard Pascal, but they are a useful protection against the sort of programming confusion between imperial and metric units that caused the demise of a recent Mars probe. They provide a means by which oating point types can be specialised to represent dimensioned numbers as is required in physics calculations. For example:
kms =(mass,distance,time); meter=real of distance; kilo=real of mass; second=real of time; newton=real of mass * distance * time POW -2; meterpersecond = real of distance *time POW -1;
The identier must be a member of a scalar type, and that scalar type is then refered to as the
basis space of the dimensioned type. The identiers of the basis
space are refered to as the dimensions of the dimensioned type. Associated with each dimension of a dimensioned type there is an integer number refered to as the power of that dimension. This is either introduced explicitly at type declaration time, or determined implicitly for the dimensional type of expressions.
logd t of a dimend of type t is raised. Thus for t =newton in the example above, and d =time, logd t = 2 If x and y are values of dimensioned types tx and ty respectively, then the following operators are only permissible if tx = ty : +, - ,<, >, =, <=, >=. For
A value of a dimensioned type is a dimensioned value. Let sioned type
be the power to which the dimension
+ and -, the dimensional type of the result is the same as that of the arguments. The operations. The operations *, / are permited if the types the basis space of the other. The operation
tx and ty
share
the same basis space, or if the basis space of one of the types is a subrange of
POW
is permited between dimensioned types and integers.
Dimension deduction rules
1. If
x = y z for x : t1 , y : t2 , z : t3 logd t2 + logd t3 .
with basis space
B B
then
dB logd t1 = dB logd t1 = t2 , B
2. If
x = y/z for x : t1 , y : t2 , z : t3 logd t2 logd t3 .
then
with basis space
then
3. If
x = y POW z for x : t1 , y : t2 , z : integer dB logd t1 = logd t2 z .
with basis space for
References
[1] E. Allen, D. Chase, J. Hallett, V. Luchangco, J.W. Maessen, S. Ryu, G.L. Steele Jr, S. Tobin-Hochstadt, J. Dias, C. Eastlund, et al. language specication. The Fortress
Sun Microsystems, 139:140, 2005.
[2] Alejandro Valle Baeza. Dimensional analysis of price-value correspondence: a spurious case of spurious correlation. 2010. [3] P. Cockshott, S. Bichler, and J. Nitzan. Testing the Labour Theory of Value: An Exchange.
Investigacion economica, LXIX:274,
bnarchives.yorku.ca, 2010. SIGPLAN Not., 37(6):59
[4] Paul Cockshott. Vector pascal reference manual. 81, 2002.
[5] N. Frhlich. Dimensional Analysis of price-value deviations. 2010. [6] Andrew J. Kliman. The law of value and laws of statistics: sectoral values and prices in th us economy, 1977-97.
Camb. J. Econ., 26(3):299311, 2002.
10
[7] G. Michaelson, W. P. Cockshott, and A. F. Cottrell. Testing marx: some new results from uk data. [8] David Zachariah.
Capital and Class, pages 103129, 1995. Indian
Labour value and equalisation of prot rates.
Development Review, 4(1):121, 2006.
v
11