VEGETABLES AND
SPECIALTIES
November 1999, ERS-VGS-279
Approved by the World Agricultural Outlook Board
---------------------------------------------------------------------------
Special article
---------------------------------------------------------------------------
Modeling the U.S. processing
tomato industry
Charles Plummer
Agricultural economist, Market and Trade Economics Division, Economic Research Service, U.S. Department of Agriculture.
Abstract:
The United States is the world's largest producer of processed tomato products. Over the last 20 years, average
production has increased 47 percent and the farm value of the crop is now around $700 million. Processing tomatoes are second
only to potatoes in terms of national per capita consumption. Given the significance of this industry, this article presents an
econometric model designed to provide short run projections of this industry's key variables, such as acreage, yield,
production, price, trade, and domestic use.
Keywords: Processing tomatoes, modeling, production, trade,
prices, supply, demand.
The U.S. processing tomato industry has realized substantial growth and change over the past 20 years. In the late 1970's
(1975-79), total processing tomato production averaged 7.3 million tons, with an average annual crop value of $463 million.
The average crop size for the past 5 years (1994-98) was 10.7 million tons (47 percent above the late 70's) with an average
annual crop value of $658 million (up 42 percent). The preliminary estimate for the 1999 crop is over 11.5 million tons,
just shy of the record set in 1994, and the crop value will likely top $700 million.
With such tremendous growth in the industry over the past two decades, older econometric forecasting models of the processing
tomato industry are unable to capture more recent structural and demand changes, and are therefore unlikely to yield useful
forecasts. The purpose of this article is to present an econometric model designed to provide short run projections of
the industry's key variables, such as acreage, yield, production, price, trade, and domestic use. An overview of industry
structure and domestic and export demand is followed by development of the model and model forecasts.
Industry Overview
Much of the increase in tomato production over the past 20-30 years can be attributed to improved yields and increased
efficiency at the grower and processor levels. In the late 1970's (1975-79), the average yield of processing tomatoes in the
United States was 22.1 tons/acre. By the late 1980's the average had risen to 28.5 tons/acre, and in the late 1990's the average
has been 33.2 tons/acre. This increase in yields is due to the steady development of higher-yielding hybrid varieties, improved
cultural practices such as increased use of transplanting, and a continued shift in production to
California, where average yields are currently about 30 percent higher than the average yield in
the rest of the United States. In the late 1970's, California accounted for about 84 percent of total U.S.
processing tomato production. At the end of the 1990's, California is averaging
approximately 94 percent of the total U.S. production.
In addition to improving raw tomato yields, hybrid varieties (along with advances in mechanization) have helped to improve
final product output. Hybrid tomato characteristics (such as skin thickness, solids content, etc.) have helped harvesting
equipment to become more efficient, and have helped improve processor recovery rates and product quality. Harvest has become
less labor intensive and processors are now more able to meet stringent output standards for final product.
These improvements have resulted in some overall structural change within the industry. Several major companies have
restructured to source a large portion of their product needs through other tomato processors who can meet their quality
standards. This restructuring has helped to improve overall industry efficiency, as processing capacity consolidates and
older, less efficient processing plants are taken out of production (Welty). For example, 1992 California tomato paste
processing capacity (tomato paste output) from direct marketers (i.e., not including remanufacturers) was 484,000 pounds/hour
from a total of 16 processing facilities. By 1998, total capacity had risen to 698,000 pounds/hour from a total of 13
facilities. Additionally, of the 16 facilities in operation in 1992, 8 had been built prior to 1976. By 1998, the number of
facilities in use built before 1976 had decreased to five (The Food Institute).
Growth in Domestic Demand
Tomatoes and tomato products are an important part of the American diet. After potatoes, tomatoes are the most widely
consumed vegetable in the United States. During the past 20 years, U.S. annual per capita use of tomatoes and tomato products
has increased by nearly 30 percent to a total fresh-weight equivalent of 93 pounds/person in 1998. Processed tomato
products, including sauces, ketchup, pastes, salsa, juice, etc., accounted for 81 percent of that total. Utilization
statistics by processed product type are unavailable. However, an estimate
based on limited data from the late 1980's suggests that the largest use is for sauces (35 percent), followed by paste (18
percent), canned tomatoes (17 percent), ketchup, and juice (each about 15 percent).
Domestic use of processed tomato products has increased substantially in the 1990's. During the 1980's, per capita use
of processed tomato products averaged 63.5 pounds. During the 1990's, processed use has averaged 75.5 pounds per person. The
increase is likely the result of continued expansion in food-service demand, especially for pizza, tacos, and other Italian
and Mexican foods.
Some of the increase may also be due to rising public awareness of the health benefits of processed tomato products in the diet.
Several studies this decade have linked diets rich in tomatoes and tomato products to reduced risk of various cancers and heart
disease. Tomatoes contain lycopene, a naturally occurring compound that, when ingested, acts as a powerful
anti-oxidant that helps protect human cells from the degenerative effects of
various free radicals. Some studies indicate that canned tomato products may contain higher concentrations of lycopene
than fresh tomatoes due to the heat processing used to create the product.
Although domestic consumption of processed tomato products has boomed in the 1990's, it appears that trend may be leveling
somewhat as the century closes. Per capita use has averaged just under 75 pounds for the past 5 years, with the preliminary
estimate for 1999 at 75.3. However, with continuing strong export potential in the coming decade, slow growth (or even a
slight decline) in domestic demand in the next few years does not necessarily translate into little or no growth in
domestic production.
Exports Could Drive Future Production
Even with domestic consumption of processed tomato products appearing to level somewhat, the outlook for continued growth in
U.S. production looks good as there appears to still be potential for continuing growth in exports. The United States is the
world's largest producer of processed tomato products, and exports have just recently become an increasingly important
outlet for U.S. producers. Prior to 1989, exports of processed tomato products rarely accounted for more than 1 to
2 percent of total processed tomato utilization (on a raw-equivalent basis).
However, since 1989, the importance of exports has steadily risen, and in 1998 exports accounted for 12 percent of total
utilization. In 1989, all U.S. processed tomato product exports were valued at $60.1 million. By
1998, total value had nearly quadrupled, rising to $237.1 million.
Such a dramatic increase can be partly accredited to increased access to the Canadian market as a result of the North American
Free Trade Agreement (NAFTA). In 1989, Canada accounted for 25 percent ($15.1 million) of total export value of all processed
tomato products. By 1998, Canada's take had risen to 50 percent ($119.3 million). NAFTA has also increased access to the Mexican
market, although exports to Mexico are still relatively small (accounting for 6 percent of total export
value in 1998). Exports to Mexico increased from $3.4 million in 1989 to $14.4
million in 1998.
Exports of processed tomato products have also increased to other regions of the world, although the growth rate has not been as
rapid as it has been to Canada and Mexico. Export value to Japan, the second largest U.S. export market for processed tomato
products, rose 251 percent from 1989 to 1998. Japan accounted for $27.3 million (12 percent) in processed tomato exports in
1998, of which, 22 percent was ketchup. Other Asian and Pacific Rim countries took nearly $23 million in
U.S. exports in 1998, with nearly 29 percent of that being ketchup. The rapid growth
of Western-style fast food chains in Japan and other Asian Pacific Rim countries over the last decade
has spurred the growth in U.S. processed tomato exports to the region. As Western-style
cuisine continues to increase in popularity around the world, the United States should remain well situated
to continue increasing exports of processed tomato products.
Model Development
An econometric model was developed to estimate U.S. processing tomato acreage, beginning
stocks, production, grower price, trade, and domestic utilization. The system of equations can be
found in table A1. The supply portion of the model consists of an equation to estimate beginning
stocks, an acreage response function, a yield equation, a multiplicative identity for
production, and an import equation. The demand portion of the model is determined by an export equation, with
domestic utilization being the residual (beginning stocks + domestic production + imports - ending
stocks - exports). Because nearly 100 percent of anticipated production is contracted between
processors and growers prior to planting, at an agreed upon price, acres planted is the best variable
(when combined with an autoregressive [AR] term) in forecasting grower price. The model
is estimated using annual data from 1970 to 1998. Endogenous and exogenous variables are identified in table A2.
The method used to minimize the annual variation of an endogenous variable is ordinary least
squares (OLS) regression. The explanatory variables are either economic variables logically
consistent with microeconomic theory, proxy variables such as trend, or related forecast variables
such as Economic Research Service (ERS) or National Agricultural Statistics Service (NASS)
estimates. In all but two of the regression equations, first-order serial correlation appeared to be a
significant problem (as indicated by a Durbin-Watson statistic below the critical value
at the 5 percent level). In these equations, autoregressive (AR) and/or moving average (MA) terms were
added to account for the problem. When current endogenous variables appear as independent
variables (right-hand side variables) in regression equations, OLS may produce biased and/or
inconsistent parameter estimates. However, since these problems didn't appear to be severe in any
of the equations, all published equations are OLS regression results.
Selected in-sample summary statistics are found in table A3.
2/ Two-stage least squares regressions were run on relevant
equations and the results were not significantly different from OLS results. Therefore, all published equations are OLS
regression results.
Forecasting Assumptions and Results
For any model to forecast successfully, some assumptions are necessary. First, the model must be correctly specified to
reflect market behavior. The in-sample summary statistics (table A3) partially confirm this assumption. Additionally, it must be
assumed that the behavior captured by the model must continue into the future. Naturally, this assumption weakens as the
forecasting horizon extends because the model is unable to capture out-of-sample (future) dynamics without
being re-estimated. Finally, several exogenous factors must be given future values for estimation.
For this model, the exogenous variables future values are forecast using AR and ARMA models.
Thus, caution must be used when interpreting the model's forecasts, as they are based on a set of
explicit assumptions that include forecasts of determinate variables. Changes in
these assumptions can have significant effects on the model's forecasts. The equations used to determine future values
of the exogenous variables are in table A1 and selected in-sample summary statistics can be found in table A3.
The best test of a forecasting model's accuracy is a comparison of forecasts with actual values. Because recent structural
changes have had such an impact on the processing tomato industry, the
model was estimated using data through 1998 in order to capture these changes in the explanatory
variables. This currently leaves 1999 as the only out-of-sample year for
which to evaluate the model's accuracy. Preliminary USDA estimates of 1999 processing tomato acreage, yield, and
production indicate fairly good agreement between model forecasts and actual (preliminary) values (table A4).
3/ Current USDA estimates are for contracted production only.
However, this typically accounts for 99 percent of the industry total, which will be published in January 2000.
For the 2000 simulation, the model was run using USDA's July and September preliminary estimates for 1999 values of contract
acreage, yield, and production. Because 1999 seems to be shaping up as a somewhat unusual trade year, trade volume for 1999 was
not estimated by model equations. Export volume in 1999 was estimated to be 15 percent below 1998 levels, and import
volume was estimated to be 20 percent above 1998. Model equations were
used to estimate the remaining 1999 variables and all year 2000 estimates. The results of this simulation indicate
declines of 9 percent in acreage, 6 percent in production (assuming trend
yields), and 4 percent in average grower prices (table A4).
The model forecasts a 9-percent decline in acreage next year although other market factors seem to indicate that a smaller
decline is possible. The 2000 season looks now as though it may have a
beginning very similar to 1995. It looks as though beginning stocks in the year 2000, although
significantly higher than 1999, will not be substantially higher than levels
experienced recently in the industry. As was the case in 1995, the 2000 season will follow 2 previous years of declining stocks,
and one year of relatively high production. In 1995, growers increased acreage under these similar circumstances, and
production was relatively high for a second consecutive year. This, subsequently, contributed to increased beginning stocks
again for 1996. Due to the relatively large stocks on hand, acreage was decreased in 1996. However, strong yields led to
increased production and record-high beginning stocks in 1997. This forced a large drop in acreage and production in 1997
followed by only a slight acreage increase in 1998. During this 3-year adjustment period (1996-1998), grower prices decreased
each year.
In order to avoid the potential of a similar build-up in stocks over the next several years, it is likely that acreage will
decrease in 2000, but perhaps not quite the 9-percent forecast by the model. With increased grower prices in 1999, and a good
export outlook in the coming years, a more gradual decline in acreage is more likely. If planted acreage decreased about 5
percent to 335,000 acres in 2000, average acreage abandonment and trend yields (which would approach record-high levels) would lead
to only a slight decline in production 4/---------. If only a slight decline in production is
realized next year, stocks would likely rise substantially in 2001 and trigger a larger decline in acreage. The forecasts for
the various scenarios can be seen in table A4.
4/ The 95 percent confidence interval for ACPL in the year 2000
is 277,250 to 356,526 acres.
Long-run Outlook
Long-run forecasts (from 2001 and beyond) with the model are somewhat impractical, as the potential for forecasting error
increases with the forecast horizon. Over time, current assumptions and the relationships estimated by the model will
likely change and new relationships, not captured by the existing model, may become important. However, the model can be used as a
reference point to speculate on long-run prospectus in the industry. For example, the model indicates that per
capita domestic use of processed tomato products may decline slightly over the next couple of
years and then increase slowly (table A5). Likewise, production is forecast to decline for the next 2
years before steadily increasing to nearly 13.1 million tons in 2010 (table A4).
Conclusions
As the model suggests, exports (EXPVOL) and cost-effective (CNPPI[-1]/PPITW[-1]) production are key variables in determining
planted acreage. Continued growth in export markets through a rapidly expanding food-service industry and increasing consumer
awareness about the effects of a healthy diet should both bode well for processing tomato demand in the years to come. If the
U.S. producers can remain relatively low-cost producers of high-quality processed tomato products, the
industry should be well situated to capitalize on increasing worldwide demand.
Conversely, slow growth in exports and/or increasing producer costs relative to returns would likely translate into little
or possibly even negative growth in the domestic industry.
Because beginning stocks, exports, and producer costs relative to returns are such important variables in forecasting planted
acreage, reliable forecasts of these variables are vital to forecasting acreage, production, and price. Perhaps the most
glaring weakness of this particular forecasting model is the lack of a theoretically satisfactory export volume equation. While
the simple AR process fits well statistically and accounts for the relatively recent explosion in exports, it does
nothing to account for potential changes in key variables which would likely
affect export volume, such as changes in price, exchange rates, and stocks on hand. An improvement in long-run export
forecasting would likely improve the model's other long-run projections.
References
Chern, W.S., and R.E. Just. "Econometric Analysis of Supply Response and Demand for Processing Tomatoes in California."
University of California, Berkley Agricultural Experiment
Station. Giannini Foundation Monograph No. 37, 1978.
Conway, Roger K. and John F. Yanagida. "Estimating U.S. Potato Demand: Structure and Forecasts." USDA, ERS. Vegetables and
Specialties Situation and Outlook Report, TVS-218 (1980): 31-40.
The Food Institute. "Food Markets in Review, 1998/99: Tomato Products." Fair Lawn, NJ, 1998, pp. 2-5.
Love, John M. and Lois S. Willet. "Modeling the U.S. Potato Industry." USDA, ERS. Vegetables and Specialties Situation and
Outlook Report, TVS-250, March 1990, pp. 16-24.
Welty, John C. "The California Tomato Growers Association's District Meeting Wrap-Up." California Tomato Growers, Inc.
VEGETABLES AND SPECIALTIES is published three times a year (includes
yearbook) by the Economic Research Service, U.S. Department of Agriculture,
Washington, DC 20036-5831. This release contains only the text of the report -- tables and graphics are not included. See supplemental data
files in Lotus 123 (.WK1) format.
Subscriptions to the printed version of the report are available from the
ERS-NASS order desk. Call, toll-free, 1-800-999-6779 and ask for stock
#SUB-VGS-4039, $30/year. ERS-NASS accepts MasterCard and Visa.
Printed copies of the Vegetables and Specialties Situation and Outlook report
are available. For more information contact Gary Lucier, 202-694-5253 or
Charles Plummer, 202-694-5256. The text of the report will also be
available electronically via the ERS website at www.econ.ag.gov.
ERS-USDA news release |