[Print] [Email] [Save] [Go To Full Text]

Title: Contingent valuation: Is some number better than no

number?

Subject(s): VALUATION -- United States

Source: Journal of Economic Perspectives, Fall94, Vol. 8

Issue 4, p45, 20p

Author(s): Diamond, Peter A.; Hausman, Jerry A.

Abstract: Criticizes the use of the contingent valuation

method in damage assessments and government decision

making. Inconsistency of the contingent valuation

method with economic theory; Judging surveys of

willingness-to-pay for public goods; Welfare

analysis; National Oceanic and Atmospheric

Administration panel's evaluation of the method.

AN: 9501060642

ISSN: 0895-3309

Full Text Word Count: 9909

Database: Academic Search Elite

Print: Click here to mark for print.

View Item: [Full Page Image] Full Page Image

 

-------------------

[Go To Citation]

 

 

CONTINGENT VALUATION: IS SOME NUMBER BETTER THAN NO NUMBER?

 

Most economic analyses aim at explaining market transactions. Data on

transactions, or potentially collectible data on transactions, are the

touchstone for recognizing interesting economic analyses. However loose the

connection between a theoretical or empirical analysis and transactions,

this connection is the basis of the methodology of judging the credibility

and reliability of economic analyses. Generally, individuals do not purchase

public goods directly. Lack of data on transactions implies that economists

must find other methods to assess surveys asking for valuations of public

goods.

 

To address this problem, we begin with a discussion of the methodology of

evaluating contingent valuation surveys. While there is some experimental

evidence about small payments for public goods, we work with the assumption

that we do not have data on actual transactions for interesting

environmental public goods to compare with survey responses of hypothetical

willingness-to-pay. This situation creates the need for other standards for

evaluating survey responses. Evaluation involves the credibility, bias (also

referred to as reliability in the literature), and precision of responses.

Credibility refers to whether survey respondents are answering the question

the interviewer is trying to ask. If respondents are answering the right

question, reliability refers to the size and direction of the biases that

may be present in the answers. Precision refers to the variability in

responses. Since precision can usually be increased by the simple expedient

of increasing the sample size, we will not discuss precision further in this

paper. Problems of credibility or of bias are not reduced by increases in

sample size. Thus credibility and bias must be evaluated when considering

the use of such surveys--in benefits cost analyses, in the determination of

damages after a finding of liability, or as general information to affect

the legislative process.[1]

 

We discuss how to judge the content in contingent valuation surveys together

with evidence from surveys that have been done. Surveys designed to test for

consistency between stated willingness-to-pay and economic theory have found

that contingent valuation responses are not consistent with economic theory.

The main contingent valuation anomaly that we discuss is called the

"embedding effect," and was first analyzed systematically by Kahneman and

Knetsch (1992).[2] The embedding effect is the name given to the tendency of

willingness-to-pay responses to be highly similar across different surveys,

even where theory suggests (and sometimes requires) that the responses be

very different.[3] An example of embedding would be a willingness-to-pay to

clean up one lake roughly equal to that for cleaning up five lakes,

including the one asked about individually. The embedding effect is usually

thought to arise from the nonexistence of individual preferences for the

public good in question and from the failure of survey respondents, in the

hypothetical circumstances of the survey, to consider the effect of their

budget constraints. Because of these embedding effects, different surveys

can obtain widely variable stated willingness-to-pay amounts for the same

public good, with no straightforward way for selecting one particular method

as the appropriate one.

 

In short, we think that the evidence supports the conclusion that to date,

contingent valuation surveys do not measure the preferences they attempt to

measure. Moreover, we present reasons for thinking that changes in survey

methods are not likely to change this conclusion. Viewed alternatively as

opinion polls on possible government actions, we think that these surveys do

not have much information to contribute to informed policy-making. Thus, we

conclude that reliance on contingent valuation surveys in either damage

assessments or in government decision making is basically misguided.

 

Judging Surveys of Willingness-to-Pay for Public Goods

 

A number of bases exist for forming judgments about whether particular

respondents are answering the right question and whether the response is

roughly correct. One widely accepted basis is by reaching the conclusion

that a particular response is simply not credible as an answer to the

question the interviewer is trying to ask. It is standard practice in the

contingent valuation literature to eliminate some responses as being

unreasonably large to be the true willingness-to-pay. Thus trimming

responses that are more than, say, 5 percent of income for an environmental

public good that contains only nonuse value may be criticized for having an

arbitrary cutoff, but not for omitting answers that are believed to be

credible. Similarly, it is standard practice to eliminate some responses of

zero on the basis that these are "protest zeros," that answers to other

questions in the survey indicate that individuals do put a positive value on

changes in the level of the public good, and thus zero is not a credible

answer.

 

A widely accepted incredibility test indicates that it is not automatic that

the response given is an answer to the question that the interviewer wants

answered. But we need to go further in considering how to form a judgment on

the survey responses; it is not adequate to assume that any response that is

not obviously wrong is an accurate response to the question the survey

designer had in mind.

 

A number of additional bases have been used by people arguing that responses

are or are not acceptable. The methods we shall discuss include verbal

protocol analysis, the patterns of willingness-to-pay responses across

individuals, and across surveys.

 

In considering the relevance of this evidence for the question of whether

survey responses are accurate measures of true preferences, it is useful to

have in mind some possible alternative hypotheses of how people respond to

such surveys, since the responses are not simply random numbers. Several

hypotheses have been put forward as alternatives to the hypothesis that the

responses are measures of true economic preferences. Individuals may be

expressing an attitude toward a public good (or class of public goods),

expressed in a dollar scale because they are asked to express it in a dollar

scale (Kahneman and Ritov, 1993). Individuals may receive a "warm glow" from

expressing support for good causes (Andreoni, 1989).[4] Individuals may be

describing what they think is good for the country, in a sort of casual

benefit-cost analysis (Diamond and Hausman, 1993). Individuals may be

expressing a reaction to actions that have been taken (for example, allowing

an oil spill) rather than evaluating the state of a resource.

 

Under all of these alternative hypotheses, responses are not an attempt by

an individual to evaluate his or her own preference for a public good. For

example, people doing casual benefit-cost analyses may be reflecting how

much they think people generally care about the issue. We think that

different hypotheses are likely to be appropriate for different people. Thus

the question is not whether the hypothesis of an accurate measurement of

preferences is the single best hypothesis, but whether the fraction of the

population for whom the hypothesis of accuracy is reasonable is sufficiently

large to make the survey as a whole useful for policy purposes.

 

All of these alternatives are based on what individuals are trying to do;

there are further questions of standard survey biases (such as interviewer

bias, framing bias, hypothetical bias) and whether people have enough

information to express a preference with any accuracy, even if they are

attempting to express a preference. Insofar as this understanding is faulty,

expressed preferences are not an expression of true economic preferences.

 

Verbal Protocol Analysis

 

For verbal protocol analysis, individuals are asked to "think aloud" as they

respond to a questionnaire, reporting everything that goes through their

minds. Everything the subjects say is recorded on audio tapes that are

transcribed and coded for the types of considerations being mentioned.

Schkade and Payne (1993) have done such an analysis using a contingent

valuation survey that asks for willingness-to-pay to protect migratory

waterfowl from drowning in uncovered waste water holding ponds from oil and

gas operations.

 

The transcripts show the inherent difficulty in selecting a

willingness-to-pay response and the extent to which people refer to elements

that ought to be irrelevant to evaluating their own preferences. If people

are trying to report a preference, we would expect them to consider inputs

into the forming of their preferences, such as how much they care about

birds, how important the number of killed birds are relative to the numbers

in the species. Conversely, we would not expect them to report a

willingness-to-pay just equal to what they think the program will cost.

Respondents verbalized many diverse considerations. Perhaps the most common

strategy involved first acknowledging that something should be done and then

trying to figure out an appropriate amount. About one-fourth of the sample

mentioned the idea that if everyone did his part then each household would

not have to give all that much. About one-sixth of the sample made

comparisons with donations to charities. About one-fifth of the sample said

they just made up a number or guessed an answer. Many respondents seemed to

wish to signal concern for a larger environmental issue. This pattern may

reflect the unfamiliarity of the task the respondents faced.

 

These findings strongly suggest that people are not easily in touch with

underlying preferences about the type of commodity asked about. The findings

do not lend support to the hypothesis that responses are an attempt to

measure and express personal preferences. To the extent that individuals

consider costs to everyone, the analysis supports the hypothesis of casual

benefit-cost analysis. To the extent that individuals look to their own

charitable contributions for a guide, the analysis is consistent with

hypotheses that explain actual contributions, such as the warm glow

hypothesis.

 

Variation in Willingness-To-Pay Across Individuals

 

If stated willingness-to-pay is a reflection of true preferences, then we

would expect certain patterns of answers across different individuals (other

things equal). We would expect self-described environmentalists to have

larger willingnesses-to-pay. We would expect individuals with higher incomes

to have larger willingnesses-to-pay. Both results do occur. However, such

results do not distinguish among the various hypotheses that were spelled

out above since we would expect roughly similar results from any of them.

Thus this potential basis for evaluation does not have much bite.[5] We do

observe that the income effects that are measured in typical surveys are

lower than we would expect if true preferences are measured, lower for

example than measured income elasticities for charitable giving.[6]

 

Variation in Willingness-To-Pay Across Surveys

 

Another approach to forming a judgment is to compare willingness-to-pay

responses to different questions, whether in the same or in different

surveys.

 

Multiple Questions. If a survey question reveals a true valuation, it should

not matter whether the question is asked by itself or with other questions,

nor if asked with any other questions, what the order of questioning is.

However, when Tolley et al. (1983) asked for willingness-to-pay to preserve

visibility at the Grand Canyon, the response was five times higher when this

was the only question, as compared to its being the third such question.

Attempts to claim this result to be consistent with preferences have relied

on income effects and substitution effects. Neither of these

rationalizations for the anomalous results is compelling, as we explain in a

moment.

 

The importance of question order was also shown in a study by Samples and

Hollyer (1990) asking for the values of preserving seals and whales. Some

respondents were asked for willingness-to-pay to preserve seals first,

followed by a question about whales. Others were asked for

willingness-to-pay in the reverse order. Seal value tended to be lower when

asked after whale value, while whale value was not affected by the sequence

of questions.[7] Thus the sum of willingness-to-pay depended on the sequence

of the questions asked. The authors offer an explanation (p. 189) "based on

debriefing sessions held with the interviewer."

 

Apparently, when respondents valued seals first, they used their

behavior in this market situation to guide their responses to whale

valuation questions. Since whales are generally more popular than

seals, respondents were reluctant to behave more benevolently toward

seals compared with humpback whales. Consequently, whale values were

inflated in the S-W questionnaire version to maintain a relatively

higher value for the humpbacks. This behavioral anchoring effect did

not exist in the W-S version, where whales were valued first.

 

To have the value of preserving both seals and whales depend on the sequence

in which the questions are asked is not consistent with the hypothesis that

stated willingness-to-pay accurately measures preferences. These results can

be interpreted in two ways. One interpretation is that contingent valuation

studies that ask two questions rather than one are unreliable. The other

interpretation is that the warm glow hypothesis is supported, since having

expressed support for the environment in the first question permits a sharp

fall in the second response. This effect is not present, however, when such

a response would seem illogical to the respondent. More generally, one needs

to decide whether a given pattern of responses is a result of survey design

issues or a result of the underlying bases of response. This distinction is

especially important when the pattern of results appears anomalous with or

contradictory to the hypothesis that preferences are accurately measured.

 

Single Questions and the Embedding Effect. Alternatively, one can ask a

single willingness-to-pay question each to different samples. For example,

assume that one group is asked to evaluate public good X; a second is asked

to evaluate Y; and a third is asked to evaluate X and Y. What

interpretations could we make if the willingness-to-pay for X and Y

(together) is considerably less than the sum of the willingness-to-pay for X

and the willingness-to-pay for Y?[8] One interpretation is that we are

seeing an income effect at work. That is, having "spent" for X, one has less

income left to purchase Y. Given that the stated willingness-to-pay amounts

are very small relative to income and that measured income elasticities are

very small, the attempted income effect argument does not explain the

differences found.

 

A second interpretation is to assume that individual preferences have a

large substitution effect between X and Y. In some settings the assumption

on preferences needed to justify the results is implausible. For example,

Diamond et al. (1993) asked for willingness-to-pay to prevent logging in

one, two, and three particular wilderness areas. Stated willingness-to-pay

to preserve two (and three) areas was less than the sum of

willingnesses-to-pay to preserve each of them separately.

 

At first look, this result appears to be an appropriate substitution effect,

since protecting one area results in being less willing to protect another.

However, preferences should be defined over wilderness remaining, not over

proposals for development that are defeated. If preferences are concave over

the amount of wilderness available (or, more generally, if different

wilderness areas are substitutes), then willingness-to-pay is larger the

smaller the quantity of wilderness remaining. This implies that the

willingness-to-pay to preserve two threatened areas should be larger than

the sum of willingness-to-pay to preserve each as the lone area threatened

with development.[9] Instead, stated willingness-to-pay was roughly the same

for preserving one, two or three threatened areas, making the amount for

several areas together significantly less than the sum of the amounts for

the areas separately. Note that these surveys vary both the number of areas

threatened and the number to be preserved. Neither the income effect nor the

substitution effect can plausibly explain the embedding effect in this

experiment. The hypothesis that this survey is eliciting individual

preferences is not consistent with individuals having reasonably behaved

preferences. However, from the point of view of the warm glow hypothesis,

this pattern makes sense. That is, the warm glow hypothesis is that

individuals are primarily reporting an expression of support for the

environment, an expression that does not vary much with small changes in the

precise environmental change being described.

 

A similar variation in responses across surveys appears in the study of

Desvousges et al. (1993). They described a problem killing 2000, 20,000 and

200,000 birds. The willingness-to-pay to solve this problem was roughly the

same in all three cases. Since the number of surviving birds is smaller the

larger the problem, concave preferences over surviving birds should have

resulted in more than a 100-fold variation in willingness-to-pay across this

range.[10] Thus this study shows a contradiction between stated

willingness-to-pay and the usual economic assumptions on preferences. Again,

the study is consistent with the hypothesis that the responses are primarily

warm glow, and so need not vary noticeably over moderate differences in the

resource.

 

Adding-up Test. One difficulty in the approach described above is that the

plausibility of the willingness-to-pay patterns depends on assumptions on

the plausible (concave) structure of preferences. Another approach to tests

of consistency that does not rely on an assumption of concave preferences is

to attempt to measure the same preference in two different ways. This test

can be constructed by varying the background scenario as well as varying the

commodity to be purchased. For example, assume that one group is asked to

evaluate public good X; a second group is told that X will be provided and

is asked to evaluate also having Y; and a third is asked to evaluate X and Y

(together). Now the willingness-to-pay for X and Y (together) should be the

same as the sum of the willingness-to-pay for X and the willingness-to-pay

for Y, having been given X (the same up to an income effect that can be

measured in the survey and that empirically is small).[11] Thus, Diamond et

al. (1993) varied the number of wilderness areas being developed as well as

the number that could be protected. In this way the sum of two areas

separately evaluated (with different degrees of development) should be the

same as the value of preserving two areas (apart from a very small income

effect). Again, the results of the survey are inconsistent with the

responses being a measure of preferences.[12]

 

Embedding still infects even very recent work done by experienced contingent

valuation analysts who were well aware of the problem. Schulze et al. (1993)

asked for willingness-to-pay for partial and complete cleanup of

contamination of the Clark Fork National Priorities List sites in Montana.

After removing protest zeros and high responses, the mean stated

willingness-to-pay for complete cleanup was $72.46 (standard error of $4.71)

while the mean response for a considerably smaller partial cleanup was

$72.02 (s.e. $5.10). As part of the survey, respondents were asked whether

their responses were just for this cleanup or partly to cleanup other sites

or basically as a contribution for all environmental or other causes (or

other). Only 16.9 percent reported their answers as just for this cleanup;

that is, a vast majority of respondents recognized an embedding effect in

their own responses. These respondents were asked what percentage of their

previous answer was for this cleanup, and the willingness-to-pay responses

were adjusted by these percentages. After this adjustment, the mean stated

willingness-to-pay for complete cleanup was $40.00 (s.e. $2.62) while the

mean response for partial cleanup was $37.15 (s.e. $2.71).

 

These numbers (and the large fraction of people recognizing that they are

embedding) support the hypothesis that the responses are dominated by a warm

glow. No reason is offered by the authors for the conclusion that the

adjustment they do removes the dominance of warm glow. Neither do they

perform an adding-up test such as that described above. This adding-up test

could have been done by asking a third sample for willingness-to-pay to

extend a "planned" partial cleanup to a complete cleanup. In short, the

embedding problem does not appear to be one that contingent valuation

practitioners know how to solve.

 

With a pattern of results that are inconsistent with the usual economic

assumptions, two interpretations are always possible: the surveys were

defective or the contingent valuation method as currently practiced does not

measure with accuracy. One should consider all the surveys that attempt to

test for consistency in order to judge which interpretation is likely to be

correct. The studies we have described have been criticized as not done well

enough to be an adequate test.[13] However, they are the only quantitative

tests we are aware of. No comparable comparison tests have been done by

proponents of the accuracy of contingent valuation, although the embedding

effect has long been recognized.

 

Differing Payment Vehicles. It is interesting to note what two contingent

valuation proponents, Mitchell and Carson (1989), have written about the

question that respondents are trying to answer. In discussing the

sensitivity of responses to the payment vehicle (the way in which the

hypothetical payment is to be collected), they write (pp. 123-24):

 

It was earlier assumed that only the nature and amount of the amenity

being valued should influence the WTP [willingness-to-pay] amounts; all

other scenario components, such as the payment vehicle and method of

provision, should be neutral in effect . . . More recently, Arrow

(1986), Kahneman (1986), and Randall (1986) have argued against that

view, holding that important conditions of a scenario, such as the

payment vehicle, should be expected to affect the WTP amounts. In their

view, which we accept, respondents in a CV [contingent valuation] study

are not valuing levels of provision of an amenity in the abstract; they

are valuing a policy which includes the conditions under which the

amenity will be provided, and the way the public is likely to be asked

to pay for it.

 

In other words, Mitchell and Carson appear to accept the idea (consistent

with the findings about some respondents by Schkade and Payne, 1993) that

individuals' responses arise from casual benefit-cost analyses, not solely

from an examination of their own preferences over resources. For welfare

analysis and damage measurement, benefit-cost studies may be different from

preferences. We will return to this issue.

 

Evaluation of Bias: Calibration

 

Surveys about behavior often have systematic biases relative to the behavior

they ask about. Thus, it is common to "calibrate" the responses--that is,

adjust for the biases--as part of using them for predictive purposes. In

particular, when using surveys to estimate demand for new products, it is

standard practice to use a calibration factor to adjust survey responses in

order to produce an estimate of actual demand (Urban, Katz, Hatch, and Silk,

1983). As Mitchell and Carson (1989, p. 178) have written: "Such

'calibration' is common in marketing designed to predict purchases. If a

systematic divergence between actual and CV [contingent valuation] survey

behavior existed and could be quantified, calibration of CV results could be

undertaken."

 

As some evidence on the need for calibration, comparisons of hypothetical

surveys and actual offers often find large and significant differences.

These comparisons have been done for private goods (Bishop and Heberlein,

1979; Dickie, Fisher and Gerking, 1987; Neill et al., 1993).[14] Comparisons

have also been done for charitable donations (Duffield and Patterson, 1992;

Seip and Strand, 1992). These studies find a need to calibrate, with

calibration factors involving dividing stated willingness-to-pay by a number

ranging from 1.5 to 10.

 

How this calibration should be extended to the public good context is

unclear, since the public good context includes both unfamiliar commodities

and unfamiliar transactions. But the lack of study of appropriate

calibration factors is not a basis for concluding that the best calibration

is one-for-one.[15]

 

Welfare Analysis

 

If an accurate measure of willingness-to-pay for the pure public good of the

existence of an environmental amenity were available, the measured

willingness-to-pay would belong in benefit-cost analysis, just like a pure

public good based on resource use. Similarly, the measure should be included

in the incentives government creates (through fines and damage payments) to

avoid damaging an environmental amenity. As we know from the pure theory of

public goods, we would simply add individual willingness-to-pay across the

population.[16] In this section, we consider the welfare implications of

using stated willingness-to-pay as if it were an accurate measure of

preferences in the case that the responses are generated by the alternative

hypotheses given above.

 

One set of problems arises even if willingness-to-pay is being measured

accurately, if measured willingness-to-pay contains an altruism component.

That is, individuals may be willing to pay to preserve an environmental

amenity because of their concerns for others (who may be users or also

nonusers). Consider what happens if society adds up everyone's

willingness-to-pay and compares the sum with the cost of some action. As a

matter of social welfare evaluation we might conclude that such altruistic

externalities are double counting, since a utility benefit shows up in the

willingness-to-pay of both the person enjoying the public good and the

people who care about that person. For example, consider the income

distribution problem in a three-person economy. If two of the people start

to care about each other, is this change in preferences a reason for a

government to increase the level of incomes allocated to the two of them?

Similarly, we can ask if the government should devote more taxes to cleaning

up lakes where neighbors are friendly with each other than to lakes where

neighbors do not know (or care about) each other.

 

Moreover, if altruistic externalities are thought to be appropriately

included in the analysis, it is necessary to include all such externalities

for accurate evaluation. In particular, if people care about each other's

utilities, they care about the costs borne by others as well as the benefits

received by others. An adjustment for altruism must include external costs

as well as external benefits if we are to avoid the possibility of a Pareto

worsening from an action based on a calculation that appears to be a Pareto

improvement (Milgrom, 1993).

 

A second general problem arises when stated willingness-to-pay may be a poor

guess, even though it may be the best guess individuals have of their true

willingness-to-pay. Individuals often face the problem of trying to form

judgments about the gains from a purchase in settings where the link between

the commodity and utility is hard to evaluate. One example is the grade of

gasoline to buy, assuming that one wants to minimize cost per mile. In the

case of environmental amenities, individuals may have a derived demand based

on their beliefs about the relationship between the amenity and variables

they really care about. For example, they may care about the survival of a

species and not know about the range of natural variation in population

size, about the probability of survival as a function of population size,

nor about the effect of environmental damage on population size. Such

derived preferences may be a poor guide to policy; it may be more

informative to have expert evaluation of the consequences of an

environmental change than to consult the public directly about environmental

damage.

 

The issues just discussed were based on the hypothesis that stated

willingness-to-pay is a measure of an individual preference over an outcome.

Under the hypothesis that responses reflect casual benefit-cost evaluations

rather than preferences, it would be inappropriate to add any other benefits

to those coming from a contingent valuation survey since such benefits are

presumably included by the respondents, however imperfectly, in their

benefit-cost analyses. But if contingent valuation is just a survey of

benefit-cost estimates, rather than preferences, it might be better to have

a more careful analysis done by people knowing more about environmental

issues and about the principles of benefit-cost analysis. Moreover, if

responses are benefit-cost estimates rather than preferences, they do not

measure a compensable loss in damage suits.

 

The embedding effect is supportive of the hypothesis that responses are

primarily determined by warm glow. If respondents get pleasure from thinking

of themselves as supportive of the environment, the willingness-to-pay for

this warm glow is not part of the gain from a particular environmental

project--unless there are no cheaper ways of generating the warm glow. That

is, if an individual wants to see the government do at least one

environmental project (or n projects) a year in order to feel

"environmentally supportive," the person would support one project, but not

any particular project. Moreover, if different samples are asked about

different projects, the responses will appear to support many projects, even

though the warm glow comes from the desire to support a single project.

 

An illustration of this view comes from the fact that when individuals are

asked simultaneously about many projects, stated willingness-to-pay is far

below the sum of stated willingnesses-to-pay from asking about the projects

separately. For example, Kemp and Maxwell (1993) asked one group for

willingness-to-pay to minimize the risk of oil spills off the coast of

Alaska, and found a mean stated willingness-to-pay of $85 (with a 95 percent

confidence interval of +/- $44). Then they asked a different sample for

willingness-to-pay for a broad group of government programs, followed by

asking these people to divide and subdivide their willingness-to-pay among

the separate programs. By the time they reached minimizing the risk of oil

spills off the coast of Alaska, they found a mean of $0.29 (with a 95

percent confidence interval of +/-$0.21).

 

These findings make little sense if responses are measures of preferences,

and considerable sense if the response is primarily a warm glow effect from

a desire to express support for protecting the environment. In the latter

circumstance, we would expect little warm glow for any single project in a

context where respondents are asked about many government projects affecting

the environment. Therefore warm glow may need to be purged from stated

willingness-to-pay even if (as witnessed by charitable contributions) people

really are willing to pay for some warm glow.[17]

 

A different complication arises if people do not really care about the

resource, but care about the activity that might harm a resource. For

example, the stated willingness-to-pay to clean up a natural oil seepage

might be zero while the stated willingness-to-pay to clean up a man-made oil

spill is positive. This outcome is the flip side of the "protest zero,"

where people state no willingness-to-pay to repair environmental damage that

they feel is someone else's responsibility. As noted earlier, it is standard

practice to consider this zero not to be an accurate measure of preferences,

on the assumption that people care about the resource.

 

Survey results suggest that many answers are heavily influenced by concern

about actions, not resources. For example, Desvousges et al. (1993) find a

large stated willingness-to-pay to save small numbers of common birds. The

finding seems much more likely to reflect a feeling that it is a shame that

people do things that kill birds rather than a preference over the number of

birds. Concern over the actions of others is different from concern about

the state of the environment. Concern about actions is conventionally part

of the basis of punitive damages, but not compensatory damages. That is,

deliberately or recklessly destroying the property of others opens one up to

liability for compensatory damages for the value of the property destroyed

and also punitive damages. On the other hand, the legal system does not

compensate people who are upset that others engage in actions such as

reading Lady Chatterley's Lover. When and how such concerns should affect

public policy is a complex issue, one not explored here.

 

One complication from the perspective of benefit-cost analysis is that

preferences over acts (as opposed to states of the world) do not provide the

consistency that is necessary for consistent economic policy. For example,

if people are willing to pay to offset an act, then proposing and not doing

an act appears to generate a welfare gain. For example, consider the warm

glow from blocking development of a wilderness area. If one proposes two

projects and has one blocked, are people better off (from the warm glow)

than if one project is proposed and happens? Does this imply that the

government would do good by proposing projects that it does not mind seeing

blocked? More generally, the relationship of benefit-cost analysis and

Pareto optimality has been developed and is understood in a setting where

preferences are defined over resources.

 

We note that under the hypothesis of Kahneman and Ritov (1993), responses to

contingent valuation surveys are expressions of attitudes toward public

goods that the respondents are required to state in dollar terms. Responses

are then not measures of willingness-to-pay and provide no quantitative

basis for estimates of environmental damages, although like polls generally,

they do alert the government about concerns of the public.

 

The "Some Number is Better than No Number" Fallacy

 

We began this essay by arguing that stated willingnesses-to-pay from

contingent valuation surveys are not measures of nonuse preferences over

environmental amenities. We then considered some of the welfare implications

of treating the responses as if they were a measure of nonuse preferences

when they were generated by different considerations. We concluded that such

welfare analysis would not be a guide to good policy. Our conclusion is

often challenged by the common Washington fallacy that even if stated

willingness-to-pay is inaccurate, it should be used because no alternative

estimate exists for public policy purposes. Put more crudely, one hears the

argument that "some number is better than no number."[18] This argument

leads to the claim that it is better to do benefit-cost studies with stated

willingness-to-pay numbers, despite inaccuracy and bias, rather than use

zero in the benefit-cost analysis and adjust for this omission somewhere

else in the decision-making process.

 

To evaluate this argument, one needs a model of the determination of

government policy.[19] Ideally, one would like to carry out a number of

government decisions twice: once using zero in the benefit-cost study, and a

second time using stated willingness-to-pay, with associated adjustments of

the decision process in recognition of the inclusion or omission of a

contingent valuation number. Such a comparison would recognize that much

more input goes into government decisions than just the benefit-cost study.

That is, the comparison is not between relying on contingent valuation and

relying on Congress, but between relying on Congress after doing a

contingent valuation study and relying on Congress without doing a

contingent valuation study.[20] Thus one is asking whether inclusion of such

survey results tends to improve the allocation process, even if the numbers

are not reliable estimates of the preferences called for by the theory.

Similarly, one can ask whether the combination of fines and damage payments

will result in more efficient decisions to avoid accidents with or without a

contingent valuation estimate of nonuse value.

 

Judge Stephen Breyer (1993) has recently reviewed government responses to

public perceptions of risk. Since he feels that public perceptions of risk

are inaccurate and that Congress is responsive to these public perceptions,

Breyer wants to increase the role of administrative expertise in designing

public policy to deal with risks. A similar situation seems to exist with

respect to contingent valuations of nonuse value. If we conclude that

contingent valuation is really an opinion poll on concern about the

environment in general, rather than a measure of preferences about specific

projects, public policy is likely to do better if the concern is noted but

expert opinion is used to evaluate specific projects and to set financial

incentives to avoid accidents. One could hope for a more consistent relative

treatment of alternative natural resources in this way.

 

In both economic logic and politics, we expect that using contingent

valuation in decision making about the environment would soon be extended to

other policy arenas where existence values are equally plausible. We do not

expect that policy would be improved by using contingent valuation to affect

the levels and patterns of spending for elementary school education, foreign

aid, Medicaid, Medicare, AFDC, construction of safer highways, medical

research, airline safety, or police and fire services. Yet people have

concerns for others in all of these areas that parallel their concern for

the environment.

 

Concern for other people naturally includes concern about their jobs. Thus,

in considering rules that limit economic activity to protect the

environment, it is as appropriate to include a contingent valuation of

existence value for destroyed jobs as the one for protection of the

environment. The fact that jobs may be created elsewhere in the economy does

not rule out concern about job destruction per se. These possible extensions

of the use of contingent valuation increase the importance of considering

the "some number is better than no number" fallacy.

 

Referenda

 

We have heard the argument that if referenda are legitimate, so too is

contingent valuation. That is, one can consider a contingent valuation

survey to be a forecast of how voters would respond to a binding referendum.

This perspective raises the same issues considered above. How should we

decide how to interpret the bases of how people vote in referenda? Since

different bases imply different appropriate uses of the responses, how

should voting responses be used for economic analysis? Moreover, the

necessity of calibration remains, since no obvious reason exists for people

necessarily to vote the same in binding and nonbinding referenda. And, as in

the previous section, we can ask whether we think we get better policies

with or without such surveys.

 

It is interesting to consider issues raised by polls about actual referenda,

as well as by the referenda themselves. Sometimes polls are accurate

predictors of voting outcomes; sometimes, they are not, even when they are

taken close to election day. Sometimes, repeated polls about the same

referendum find very large changes in expressed intentions as a referendum

campaign proceeds.

 

Magleby (1984) has analyzed statewide polls in California and Massachusetts

for which at least three separate surveys were done. In some cases, the

polls show roughly the same margin over time. Magleby calls these "standing

opinions" and believes that this stability comes from the deep attachment to

their opinions that voters hold on some controversial issues such as the

death penalty and the equal rights amendment. In some cases, the polls show

significant changes in the margin of preferences, but no change in the side

that is ahead. Magleby calls these "uncertain opinions." Examples of such

votes involve handgun registration and homosexual teachers. In some cases,

significant changes in voting intentions occur as the campaign proceeds,

with victory in the actual election going to the side that had at one time

been far behind. Magleby calls these outcomes "opinion reversals." For

example, in a referendum for flat rate electricity, a February poll showed

71 percent in favor, 17 percent opposed, and 12 percent undecided. The

actual vote was 23 percent in favor, 69 percent opposed and 7 percent

skipping this question. Other examples of such votes are a state lottery and

a tax reduction measure. In his analysis of 36 propositions in California,

Magleby found that on 28 percent of the issues, voters held standing

opinions, on 19 percent voters had uncertain opinions, and on 53 percent he

found opinion reversals. That is, in a majority of cases, early opinion

polls were not good predictors of election outcomes. Moreover, they were not

even good predictors of later opinion polls, after the campaign had run for

some time.

 

It seems to us that responses to contingent valuation questionnaires for a

single environmental issue are likely to be based on little information,

since there is limited time for presentation and digestion of information

during a contingent valuation survey. This conclusion suggests that the

results of such surveys are unlikely to be accurate predictors of informed

opinions on the same issues if respondents had more information and further

time for reflection, including learning of the opinions of others. Such

surveys are therefore unlikely to be a good basis for either informed

policy-making or accurate damage assessment.

 

Even if a contingent valuation survey were a good predictor of an actual

referendum, one can also question the use of actual referenda to obtain

economic values. Considerable skepticism exists about the extent to which

voting on a referendum represents informed decision making (see, for

example, Magleby, 1984). In the functioning of a democracy, it may be more

important to place some powers directly with the voters, rather than with

their elected representatives, than to worry about the quality of decision

making by voters.[21] However, incorporating contingent valuation survey

responses in benefit-cost analyses or judicial proceedings does not seem to

have a special role in enhancing democracy. In the looser context of

legislative debate, such opinion polls may have a role to play, although the

net value of that role is unclear.

 

NOAA Panel Evaluation of Contingent Valuation

 

In light of the controversy and the stakes involved, the National Oceanic

and Atmospheric Administration recently appointed a prestigious panel to

consider the reliability of contingent valuation studies of nonuse values in

damage suits.[22] The panel's Report (NOAA, 1993) begins with criticisms of

contingent valuation. In discussing the alleged inconsistency of some

results with rational choice, the Report states (p. 4604) that: "some form

of internal consistency is the least we would need to feel some confidence

that the verbal answers correspond to some reality." The Report also

addresses the need for rationality (p. 4604).

 

It could be asked whether rationality is indeed needed. Why not take

the values found as given? There are two answers. One is that we do not

know yet how to reason about values without some assumption of

rationality, if indeed it is possible at all. Rationality requirements

impose a constraint on the possible values, without which damage

judgments would be arbitrary. A second answer is that, as discussed

above, it is difficult to find objective counterparts to verify the

values obtained in the response to questionnaires.

 

In discussing "warm glow" effects, the Report recognizes the claim that

contingent valuation responses include a warm glow. They write (p. 4605):

"If this is so, CV [contingent valuation] responses should not be taken as

reliable estimates of true willingness to pay."

 

The Report states that the burden of proof of reliability must rest on the

survey designers. It states (p. 4609) that a survey would be unreliable if

there were "[i]nadequate responsiveness to the scope of the environmental

insult," as occurred in the embedding examples we have discussed.

Unfortunately, the Panel did not elaborate on how to test for

reliability.[23] We interpret the view they express to call for testing of

the internal consistency of responses to the same survey instrument with

different levels of environmental problem and policy successes. The Report

cites no existing study that has passed such internal consistency tests.

 

The Report presents a set of guidelines which would define an "ideal"

contingent valuation survey (and are summarized in Portney's paper in this

issue). The Report asserts (p. 4610) that studies meeting such guidelines

can produce estimates "reliable enough to be the starting point" of a

judicial process of damage assessment. The Report offers no reason for

reaching this conclusion, although the finding that surveys that do not meet

their guidelines may be biased is not a basis for concluding that surveys

that do meet their guidelines are not biased. In particular, they state no

reason for reaching the conclusion that following their guidelines implies

that responses are not dominated by a "warm glow." The Panel does not

explicitly call for testing whether a survey done according to their

guidelines is reliable. In particular, they do not mention a need to check

the internal consistency of responses. Nor do they explain their conclusion

that the inconsistencies between stated willingness-to-pay and economic

theory come from survey design issues and would go away if the survey had

followed their guidelines.

 

Conclusion

 

We believe that contingent valuation is a deeply flawed methodology for

measuring nonuse values, one that does not estimate what its proponents

claim to be estimating. The absence of direct market parallels affects both

the ability to judge the quality of contingent valuation responses and the

ability to calibrate responses to have usable numbers. It is precisely the

lack of experience both in markets for environmental commodities and in the

consequences of such decision that makes contingent valuation questions so

hard to answer and the responses so suspect.

 

We have argued that internal consistency tests (particularly adding-up

tests) are required to assess the reliability and validity of such surveys.

When these tests have been done, contingent valuation has come up short.

Contingent valuation proponents typically claim that the surveys used for

these tests were not done well enough. Yet they have not subjected their own

surveys to such tests. (We note that Hanemann does not address the question

of which split-sample internal consistency tests, if any, he thinks a

contingent valuation survey needs to pass.) There is a history of anomalous

results in contingent valuation surveys that seems closely tied to the

embedding problem. Although this problem has been recognized in the

literature for over a decade, it has not been solved. Thus, we conclude that

current contingent valuation methods should not be used for damage

assessment or for benefit cost analysis.

 

It is impossible to conclude definitely that surveys with new methods (or

the latest survey that has been done) will not pass internal consistency

tests. Yet, we do not see much hope for such success. This skepticism comes

from the belief that the internal consistency problems come from an absence

of preferences, not a flaw in survey methodology. That is, we do not think

that people generally hold views about individual environmental sites (many

of which they have never heard of); or that, within the confines of the time

available for survey instruments, people will focus successfully on the

identification of preferences, to the exclusion of other bases for answering

survey questions. This absence of preferences shows up as inconsistency in

responses across surveys and implies that the survey responses are not

satisfactory bases for policy.

 

* The authors want to thank Bernard Saffran and four editors for helpful

comments.

 

1 With two estimates of an economic value, one can analyze directly whether

one is a biased estimate of the other. With nonuse value, the lack of an

alternative direct estimate of willingness-to-pay makes it relevant to

consider credibility directly, as well as the differences between survey

results and behavior in other contexts where transactions data are

available.

 

2 Another failure of contingent valuation surveys to be consistent with

economic preferences is that stated willingness-to-pay is usually found to

be much less than stated willingness-to-accept. From economic theory,

willingness-to-pay differs from willingness-to-accept only by an income

effect. Thus, their values should be extremely close in typical contingent

valuation circumstances, where the stated willingness-to-pay is a small

share of the consumer's overall budget, and willingness-to-pay amounts show

a small income elasticity. For further discussion of this problem with

contingent valuation surveys and other problems, see Diamond and Hausman

(1993) and Milgrom (1993).

 

3 The term embedding came from the research approach of "embedding" a

particular good in a more inclusive good, and contrasting the stated

willingness-to-pay for the good with that obtained by allocating the

willingness-to-pay for the more inclusive good among its components

(Kahneman, personal communication).

 

4 This approach was developed for actual charitable contributions, not

survey responses. Kahneman and Knetsch (1992) call it the purchase of moral

satisfaction.

 

5 The importance of the lack of bite of such considerations comes, in part,

from the fact that the contingent valuation study of the Exxon Valdez spill

that was done for the state of Alaska (Carson et al., 1992) included such

analyses, but none of the more powerful split-sample consistency tests that

we discuss below.

 

6 The empirical finding of low income elasticities is also inconsistent with

the typical finding of a large divergence between willingness-to-pay and

willingness-to-accept, discussed in footnote 2.

 

7 Samples and Hollyer used dichotomous choice surveys. They estimated that

whales were valued at $125 when asked about first, and $142 when second.

Seals were valued at $103 when asked about first and $62 when second. When

they asked about both (together) in a single question, the estimated values

were $131 and $146 in two surveys.

 

8 This approach is similar to the work that was initiated by Kahneman (1986)

and done recently by Kahneman and Knetsch (1992), Kemp and Maxwell (1993),

Desvousges et al. (1993), Diamond et al. (1993), McFadden and Leonard

(1993), Loomis, Hoehn and Hanemann (1990).

 

9 For derivation of the convexity of willingness-to-pay when preferences are

concave and the scenario is varied in this way, see Diamond (1993). That

paper also contains a number of other implications of preferences for

willingness-to-pay that can be used for internal consistency tests.

 

10 Proponents of contingent valuation have made several critiques of this

study. One critique is that it was a mall stop survey. But similar results

followed when the questionnaire was used for the verbal protocol study cited

above, which involved subjects coming to be interviewed. Another criticism

is that in addition to the absolute numbers, the survey questions described

the number of birds at risk as "much less than 1%" of the population, "less

than 1%," and "about 2%." Thus, one can wonder whether respondents were

paying attention to the absolute numbers which varied 100-fold or the

percentages which varied from "much less than 1%" to "about 2%."

Interpreting "much less than" as less than half, about 2% is at least a

four-fold increase over less than half of 1%. If some people were paying

attention to the percentages and some to the absolute numbers, the range

should have been between four-fold and 100-fold. If, as Hanemann suggests,

respondents did not perceive any real difference between "much less than 1%"

and "about 2%," it is noteworthy that they perceived a large difference

between zero and "much less than 1%." Moreover, these percentages were

selected by the authors since they were the percentages in three actual oil

spills: Arthur Kill, Nestucca, Exxon Valdez. This pattern of results is

consistent with the responses being dominated by a "warm glow."

 

11 Willingness-to-pay is a function of the two vectors giving alternative

levels of public goods and the level of income. Thus the willingness-to-pay

to improve the environment from z to z" of someone with income I can be

written WTP(z, z", I). The change from z to z" can be broken into two

pieces, a change from z to z' and a change from z' to z". From the

definition of willingness-to-pay, one has WTP(z, z", I) = WTP(z, z', I) +

WTP(z', z", I-WTP(z, z', I)).

 

This adding-up test makes no use of an assumption on the magnitude or sign

of income or substitution effects. One could do an adding-up test without

the adjustment of income shown in the equation by comparing WTP(z, z", I)

with WTP(z, z', I) + WTP(z, z", I). This comparison would involve a

deviation from exact adding-up because of the income effect. With a

willingness-to-pay on the order of $30 and a household income level of

$30,000, even an income elasticity of one--higher than the elasticity

typically measured in contingent valuation surveys--would lead to a $.03

deviation from exact adding-up. For a formal derivation, see the revised

version of Diamond (1993).

 

12 In brief response to Hanemann's criticisms of our analysis, we note that

he does not address this adding-up test and seems comfortable accepting the

idea that the less wilderness preserved, the less people care about any

particular area of wilderness. These two tests do not rely on any assumption

of different wilderness areas being interchangeable, as indicated by the

vector interpretation of z in the previous footnote. In terms of Hanemann's

test mentioned in his note 25 of whether willingness-to-pay to protect each

of the areas is the same, we note that he did not do the statistical test

correctly. Moreover, this reference is an example of Hanemann's trait of

ignoring the central criticism while attacking a side issue. In Diamond et

al., the focus is on the adding-up test, not a scope test. The adding-up

test was clearly rejected.

 

13 One can ask whether the patterns of thought reflected in the responses to

the questions in any particular survey also occur in other survey settings.

Cognitive psychology has found a number of such patterns that are robust. We

think that the patterns reflected in these surveys are similarly robust.

 

14 On the Dickie, Fisher and Gerking (1987) study, see also the critique by

Hausman and Leonard (1992).

 

15 In its proposed rules for damage assessment, the National Oceanic and

Atmospheric Administration (1994) has proposed a default calibration of

dividing by two, in the absence of direct arguments by trustees of natural

resources for a different calibration factor.

 

16 For the correct use of a benefit-cost calculation, we need to be

considering the marginal project for finding the optimum. With many projects

under consideration, and a nonoptimal starting point, one does not get the

right answer by asking about many projects independently and carrying out

all that pass the test (Hoehn and Randall, 1989).

 

17 In the context of the bird study by Desvousges et al. (1993), Kahneman

(personal communication) has proposed to purge the warm glow by

extrapolating willingness-to-pay as a function of birds saved back to zero

and then subtracting this amount from the estimate of willingness-to-pay at

any particular level of birds. This approach involves a curve-fitting

extrapolation and the assumption that warm glow is totally insensitive to

the magnitude of the problem, an assumption that is probably not completely

correct.

 

18 The history of economic policy awaits an investigation, similar to the

famous study of the sociologist R. K. Merton on the history of Newton's "on

the shoulders of giants" remark, to trace the lineage of the "some number is

better than no number" fallacy.

 

19 One can also consider how a social welfare maximizing planner might use

the information in contingent valuation surveys. There is useful information

if people are expressing preferences that are not otherwise accessible to

the planner. However, if the other hypotheses are the correct description of

the bases of willingness-to-pay responses, then the planner would not be

receiving useful information. Treating the responses as measures of what

they do not measure would mislead such a planner.

 

20 The results of a contingent valuation survey are not binding. Thus a

respondent who was behaving strategically would select a response that

reflected his or her belief in how the results of the survey would affect

actual outcomes. Thus we do not understand how the NOAA Panel could conclude

that with a dichotomous choice question there is no strategic reason for the

respondent to do otherwise than answer truthfully.

 

21 The allocation of a decision directly to the voters, rather than

indirectly through the choice of elected representatives, and the form in

which referenda are put to voters are both methods of agenda control. In

many settings, design of the agenda has large effects on voting outcomes.

 

22 Kenneth Arrow (co-chair), Robert Solow (co-chair), Edward Learner, Paul

Portney, Roy Radner, and Howard Schuman.

 

23 Nor, we add, do Portney or Hanemann in this symposium.

 

References

 

Andreoni, James, "Giving with Impure Altruism:

Applications to Charity and Ricardian Equivalence,"

Journal of Political Economy, December 1989, 97, 1447-58.

 

Bishop, R. C., and T. A. Heberlein, "Measuring Values of

Extramarket Goods: Are Indirect Measures Biased?,"

American Journal of Agricultural Economics, December

1979, 61, 926-30.

 

Breyer, Stephen, Breaking the Vicious Circle: Toward Effective

Risk Regulation. Cambridge: Harvard University Press,

1993.

 

Carson, Richard T., et al., "A Contingent Valuation Study of

Lost Passive Use Values Resulting From the Exxon Valdez

Oil Spill," A Report to the Attorney General of the State

of Alaska, 1992.

 

Desvousges, W. H., et al., "Measuring Natural Resource Damages

with Contingent Valuation: Tests of Validity and

Reliability. In Hausman, J., ed., Contingent Valuation:

A Critical Assessment. Amsterdam: North Holland Press,

1993, 91-164.

 

Diamond, P. A., "Testing the Internal Consistency of

Contingent Valuation Surveys," working paper, MIT, 1993.

 

Diamond, P. A., and J. A. Hausman, "On Contingent Valuation

Measurement of Non-use Values" In Hansman, J., Ed.,

Contingent Valuation: A Critical Assessment. Amsterdam:

North Holland Press, 1993, 3-38.

 

Diamond, P. A., J. A. Hausman, G. K. Leonard, and M. A.

Denning, "Does Contingent Valuation Measure Preferences?

Experimental Evidence." In Hausman, J., ed., Contingent

Valuation: A Critical Assessment. Amsterdam: North

Holland Press, 1993.

 

Dickie, Mark, Ann Fisher, and Shelby Gerking, "Market

Transactions and Hypothetical Demand Data: A Comparative

Study," Journal of the American Statistical Association,

March 1987, 82, 69-75.

 

Duffield, John W., and David A. Patterson, "Field Testing

Existence Values: An Instream Flow Trust Fund for Montana

Rivers," mimeo, University of Montana, 1992.

 

Hausman, J. A., Contingent Valuation: A Critical Assessment.

Amsterdam: North Holland Press, 1993.

 

Hausman, J. A., and G. Leonard, Contingent Valuation and the

Value of Marketed Commodities. Cambridge: Cambridge

Economics, 1982.

 

Hoehn, John, and Alan Randall, "Too Many Proposals Pass the

Benefit Cost Test," American Economic Review, June 1989,

79, 544-51.

 

Kahneman, Daniel, "Comments on the Contingent Valuation

Method." In Cummings, Ronald G., David S. Brookshire, and

William D. Schulze, eds., Valuing Environmental Goods:

A State of the Arts Assessment of the Contingent

Valuation Method. Totowa: Rowman and Allanheld, 1986,

185-94.

 

Kahneman, Daniel, and Jack L. Knetsch, "Valuing Public Goods:

The Purchase of Moral Satisfaction," Journal of

Environmental Economics and Management, January 1992, 22,

57-70.

 

Kahneman, Daniel and Ilana Ritov, "Determinants of Stated

Willingness to Pay for Public Goods: A Study in the

Headline Method," mimeo, Department of Psychology,

University of California, Berkeley, 1993.

 

Kemp, M. A. and Maxwell, "Exploring a Budget Context for

Contingent Valuation Estimates," In Hausman, J., ed.,

Contingent Valuation: A Critical Assessment, Amsterdam:

North Holland Press, 1993, 217-70.

 

Loomis, John, John Hoehn, and Michael Hanemann, "Testing the

Fallacy of Independent Valuation and Summation in

Multi-part Policies: An Empirical Test of Whether 'Too

Many Proposals Pass the Benefit Cost Test,'" mimeo,

University of California, Davis, 1990.

 

Magleby, David B, Direct Legislation, Voting on Ballot

Propositions in the United States. Baltimore and London:

The Johns Hopkins University Press, 1984.

 

McFadden, Daniel, and Gregory K. Leonard, "Issues in the

Contingent Valuation of Environmental Goods:

Methodologies for Data Collection and Analysis." In

Hausman, J., ed., Contingent Valuation: A Critical

Assessment. Amsterdam: North Holland Press, 1993.

 

Milgrom, P., "Is Sympathy an Economic Value?," In Hausman, J.,

ed., Contingent Valuation: A Critical Assessment.

Amsterdam: North Holland Press, 1993, 417-42.

 

Mitchell, Robert Cameron and Richard T. Carson, Using Surveys

to Value Public Goods. Washington D.C.: Resources for the

Future, 1989.

 

National Oceanic and Atmospheric Administration, 1993, "Report

of the NOAA Panel on Contingent Valuation," Federal

Register, 1993, 58, 10, 4602-14.

 

National Oceanic and Atmospheric Administration, "Natural

Resource Damage Assessments; Proposed Rules," Federal

Register, 1994, 59, 5, 1062-191.

 

Neill, Helen, R., et al., "Hypothetical Surveys and Real

Economic Commitments," Economics Working Paper B-93-01,

Department of Economics, College of Business

Administration, University of South Carolina, 1993.

 

Samples, Karl C, and James R. Hollyer, "Contingent Valuation

of Wildlife Resources in the Presence of Substitutes and

Complements." In Johnson, Rebecca L., and Gary V.

Johnson, eds., Economic Valuation of Natural Resources:

Issues, Theory and Applications. Boulder: Westview Press,

1990, 177-192.

 

Schkade, D. A., and J. W. Payne, "Where do the numbers come

from? How people respond to Contingent Valuation

Questions." In Hausman, J., ed., Contingent Valuation:

A Critical Assessment. Amsterdam: North Holland Press,

1993, 271-304.

 

Schulze, William, D., et al., "Contingent Valuation of Natural

Resource Damages Due to Injuries to the Upper Clark Fork

River Basin," State of Montana, Natural Resource Damage

Program, 1993.

 

Seip, Kalle, and Jon Strand, "Willingness to Pay For

Environmental Goods in Norway: A Contingent Valuation

Study With Real Payment," Environmental and Resource

Economics, 1992, 2, 91-106.

 

Tolley, George S., et al., "Establishing and Valuing the

Effects of Improved Visibility in the Eastern United

States," Report to the U.S. Environmental Protection

Agency, Washington, D.C., 1983.

 

Urban, Glen L., Gerald M. Katz, Thomas E. Hatch, and Alvin J.

Silk, "The ASSESSOR Pre-Test Market Evaluation System,"

Interfaces, 1983 13, 38-59.

 

~~~~~~~~

 

by Peter A. Diamond and Jerry A. Hausman

 

Peter A. Diamond and Jerry A. Hausman are Professors of Economics,

Massachusetts Institute of Technology, Cambridge, Massachusetts.

-------------------

Copyright of Journal of Economic Perspectives is the property of American

Economic Association and its content may not be copied or emailed to

multiple sites or posted to a listserv without the copyright holder's

express written permission. However, users may print, download, or email

articles for individual use.

Source: Journal of Economic Perspectives, Fall94, Vol. 8 Issue 4, p45, 20p.

Item Number: 9501060642