Feng, P. Et Al. 2019 (AFM)
Feng, P. Et Al. 2019 (AFM)
Feng, P. Et Al. 2019 (AFM)
a
School of Life Sciences, Faculty of Science, University of Technology Sydney, PO Box 123, Broadway, NSW 2007, Australia
b
NSW Department of Primary Industries, Wagga Wagga Agricultural Institute, Wagga Wagga, NSW 2650, Australia
c
Climate Change Research Centre and ARC Centre of Excellence for Climate Extremes, University of New South Wales, Sydney, NSW 2052, Australia
d
NSW Department of Primary Industries, Orange Agricultural Institute, NSW 2800, Australia
e
State Key Laboratory of Soil Erosion and Dryland Farming on the Loess Plateau, Northwest A&F University, Yangling 712100, Shaanxi, China
f
College of Resources and Environment, University of Chinese Academy of Science, Beijing 100049, China
Keywords: Accurately assessing the impacts of extreme climate events (ECEs) on crop yield can help develop effective
Extreme climate events agronomic practices to deal with climate change impacts. Process-based crop models are useful tools to evaluate
Wheat yield climate change impacts on crop productivity but are usually limited in modelling the effects of ECEs due to over-
APSIM simplification or vague description of certain process and uncertainties in parameterization. In this study, we
Random forest
firstly developed a hybrid model by incorporating the APSIM model outputs and growth stage-specific ECEs
Hybrid model
indicators (i.e. frost, drought and heat stress) into the Random Forest (RF) model, with the multiple linear
regression (MLR) model as a benchmark. The results showed that the APSIM + RF hybrid model could explain
81% of the observed yield variations in the New South Wales wheat belt of south-eastern Australia, which had a
33% improvement in modelling accuracy compared to the APSIM model alone and 19% improvement compared
to the APSIM + MLR hybrid model. Drought events during the grain-filling and vegetative stages and heat events
immediately prior to anthesis were identified as the three most serious ECEs causing yield losses. We then
compared the APSIM + RF hybrid model with the APSIM model to estimate the effects of future climate change
on wheat yield. It was interesting to find that future yield projected from single APSIM model might have a
1–10% overestimation compared to the APSIM + RF hybrid model. The APSIM + RF hybrid model indicated
that we were underestimating the effects of climate change and future yield might be lower than predicted using
single APSIM informed modelling due to lack of adequately accounting for ECEs-induced yield losses. Increasing
heat events around anthesis and grain-filling periods were identified to be major factors causing yield losses in
the future. Therefore, we conclude that including the effects of ECEs on crop yield is necessary to accurately
assess climate change impacts. We expect our proposed hybrid-modelling approach can be applied to other
regions and crops and offer new insights of the effects of ECEs on crop yield.
⁎
Corresponding author at: State Key Laboratory of Soil Erosion and Dryland Farming on the Loess Plateau, Northwest A&F University, Yangling 712100, Shaanxi,
China.
E-mail address: yuq@nwafu.edu.cn (Q. Yu).
https://doi.org/10.1016/j.agrformet.2019.05.018
Received 22 November 2018; Received in revised form 15 May 2019; Accepted 20 May 2019
Available online 25 May 2019
0168-1923/ © 2019 Elsevier B.V. All rights reserved.
P. Feng, et al. Agricultural and Forest Meteorology 275 (2019) 100–113
ECEs are usually defined as atypical precipitation, temperature, and learning approach (Shalev-Shwartz and Ben-David, 2014), which
other weather factors compared to their historical distributions (IPCC, usually result in better performance compared to conventional linear
2012). During a crop growth cycle, various ECEs are likely to occur and regression models (Everingham et al., 2015; Feng et al., 2018; Jeong
cause varying degrees of yield losses. In this study, we focused on three et al., 2016). However, a major limitation of statistical models is that
agro-climatic extremes, i.e. drought, heat stress, and frost, which are they usually only provide a simple evaluation of impacts, rather than
widely considered to have great impacts on crop growth and yield provide a deeper understanding of physiological constraints required to
(Guarin et al., 2018). Drought is currently the main constraint to crop inform adaptation strategies (Roberts et al., 2017). Thus, results from
yield in rainfed systems. Drought-induced insufficient water supply can statistical models might sometimes be vague and ambiguous in aiding
adversely impact crop growth in all growing stages, by restraining root targeted development of adaptive practices.
growth and leaf expansion during the vegetative growth stage, and In recent years, the value of combining both process-based crop
limiting photosynthesis, carbon allocation and yield formation during models and statistical models is gaining recognition. Pagani et al.
reproductive stages (Chaves et al., 2002). The relationship between (2017b) incorporated outputs from the sugarcane model Canegro
heat stress (high temperatures above specified thresholds) and yield (Inman-Bamber, 1991) and agro-climatic indicators into multiple linear
losses have also been identified in numerous studies (Innes et al., 2015; regressions to reproduce recorded yield. Their results showed that the
Pagani et al., 2017a; Semenov and Shewry, 2011). This relationship combined model increased prediction accuracy by about 20% com-
could be potentially explained by a number of mechanisms, including pared to each individual model. Everingham et al. (2016) obtained si-
decreased net photosynthesis (Rezaei et al., 2015), increased main- milar higher levels of accuracy by combining APSIM model and random
tenance respiration rates (Innes et al., 2015), and accelerated plant forest (RF) algorithm. Guzmán et al. (2017) combined DSSAT model
development (Stratonovitch and Semenov, 2015). Crops are most vul- (Jones et al., 2003) and support vector regression model for a com-
nerable to heat stress at reproductive stages, especially from anthesis to prehensive assessment of groundwater variability to demonstrate that
maturity. A single, isolated extreme heat event at anthesis can con- the hybrid model performed better in characterizing groundwater
siderably reduce grain yield, but a continuous period of heat stress can variability. In evaluating the effects of ECEs on wheat yield, the existing
lead to almost total yield loss (Porter and Semenov, 2005). As with state-of-art studies are still based on either crop models (Cammarano
drought and heat, frost (low temperatures below specified thresholds) and Tian, 2018) or statistical models (García-León et al., 2019). Our
can also reduce crop growth in all growing periods from the seedling study will firstly use the combination of the two kinds of models, i.e.
stage to harvest. For example, leaves of wheat seedlings are vulnerable crop models and machine learning techniques, to explore new insights
to extreme cold conditions and may wither (Fuller et al., 2007). When of the effects of ECEs on wheat yield.
the wheat inflorescence is forming but prior to flowering, frost events Australian wheat production is crucial to global food security, be-
can result in sterile flowers, decreasing grain number (Barlow et al., cause Australia is one of the world’s major grain exporters (Grundy
2015). et al., 2016). The New South Wales (NSW) wheat belt is the main wheat
Two distinct methods have been widely used to examine climate- production area of south-eastern Australia, accounting for 27% of the
yield relationships, i.e. process-based crop models and statistical national production (www.abares.gov.au, 2013-14). However, inter-
models. Process-based crop models have been developed to account for annual wheat yields in this area are highly variable. Compared to the
the complex interactions between the local environment, the crop long-term mean, up to 1 t·ha−1 yield loss has occurred frequently over
genotype, and management practices (Chenu et al., 2017). In recent the past three decades (http://www.abs.gov.au/Agriculture). There is a
years, crop models have been widely used to characterise the effects of very definite possibility that recurrent extreme events drive the inter-
historical and future ECEs on crop yield in multiple regions around the annual variability in wheat yields (Hughes et al., 2015). Moreover,
world (Cammarano and Tian, 2018; Harrison et al., 2014; Jin et al., drought and heat stress are projected to increase due to the changing
2017; Lobell et al., 2015). While process-based crop models can provide climate (BOM and CSIRO, 2016). In this study, we combined the APSIM
a comprehensive understanding of the timing, frequency and intensity model (Holzworth et al., 2014) and RF algorithm to build a hybrid
of ECEs on crop growth (Watson et al., 2017), they have limitations. model to evaluate impacts of ECEs on wheat yields. The main objectives
Some of the limitations relate to over-simplification or vague descrip- were to 1) develop a hybrid model to reproduce historical observed
tion of certain process and uncertainties in parameterisation, which can wheat yields in the NSW wheat belt, 2) quantify the relative importance
lead to inaccurate results (Eitzinger et al., 2013). These limitations are of growth stage-specific drought, heat, and frost events in determining
especially obvious in simulating ECEs (Barlow et al., 2015). For ex- wheat yields, and 3) compare the yield differences projected by the
ample, heat stress impacts are particularly poorly captured in crop APSIM alone and the hybrid model under future climate change.
models (Fischer, 2011; White and Hoogenboom, 2010). For example,
most crop models simulate the effects of high temperature on leaf se- 2. Materials and method
nescence and stem carbohydrate accumulation and distribution, rather
than directly model damage to reproductive organs and processes. This 2.1. Study sites
raises uncertainty over the application of crop models to properly ac-
count for yield losses resulting from ECEs and the validity in assessing The NSW wheat belt (Fig. 1) is located in south-eastern Australia,
long-term impacts of ECEs under climate change (Schauberger et al., with its western border bounded by the semi-arid interior. It accounts
2017). for nearly 30% of the total areas planted to wheat in Australia (www.
Statistical models use various regression methods to link historical abares.gov.au, 2013-14), making it important in terms of both domestic
yields to historical climate data which are then used to make predic- and international food security (Ray et al., 2015). Generally, wheat is
tions about yields under altered climate conditions (Schlenker and mainly grown under rainfed conditions and the typical growing season
Roberts, 2009). They are easy to handle and relatively easy to compute. is May to November (Gomez-Macpherson and Richards, 1995).
With the increasing availability and improved quality of observed data, The NSW wheat belt is characterized by variable topography and
statistical models usually have a high level of accuracy (Folberth et al., climatic conditions. There is an east-west gradient in both elevation and
2019; Innes et al., 2015). Moreover, newly emerging machine learning precipitation/temperature. The eastern part of the wheat belt consists
algorithms may improve the ability of statistical models to explore of mountains with elevation up to 1100 m and the western areas are
climate-yield relationships (Chlingaryan et al., 2018). Machine learning mainly plains. Average growing season temperature ranges from 8.3 °C
algorithms are capable of disentangling the effects of co-linear climate in the south-east to 17.1 °C in the north-west and the average growing
variables and analysing hierarchical and nonlinear relationships be- season precipitation ranges from 171 mm in the south-west to 763 mm
tween the predictors and the response variable through an ensemble in the south-east in 1961–2000 (Wang et al., 2017a). In addition, the
101
P. Feng, et al. Agricultural and Forest Meteorology 275 (2019) 100–113
Table 1
A brief description of the 29 study sites used in the study, including location, Soil No.(details at http://www.asris.csiro.au/), GSR (mm), GST (°C), HY, HDR and
AMWY (t ha−1).
ID Site Soil No. GSR GST HY HDR AMWY
Note: GSR: growing season rainfall; GST: growing season temperature; HY: harvest year; HDR: heading date record year; AMWY: and annual mean wheat yield.
102
P. Feng, et al. Agricultural and Forest Meteorology 275 (2019) 100–113
× (y 2045)3 (1)
103
P. Feng, et al. Agricultural and Forest Meteorology 275 (2019) 100–113
Table 3
List of extreme climate events used in this study. Heat events were calculated at FI, F, and SGF stages. Frost events were calculated at EJ
and FI stages. Drought events were calculated at EJ, FI, F, and SGF stages. Thus, totally 9 weather extreme indicators were used in this
study.
Extreme event Description Growth stage
Heat Number of days with daily maximum temperature > 27 °C FI, F, SGF
Frost Number of days with daily minimum temperature < 0 °C EJ, FI
Drought Number of days with ARID > 0.6 EJ, FI, F, SGF
Note: EJ: end of juvenile; FI: floral initiation; F: flowering; SGF: start of grain filling; ARID: Agricultural Reference Index for Drought.
For RCP8.5, it was fitted by: values and outliers (Elavarasan et al., 2018). Moreover, the RF can
3 approximate functions with both linear and non-linear relations and
267.78 1.6188 × y y 2010
[CO2]year = 1034.3 + 53.342
+ 21.746 × can also identify the relationship between the response and a variable,
4.0143 + 100
y5.2822 which is conditional upon other variables (Hoffman et al., 2018). Given
y 1911 3 that the effects of ECEs on crop yields are often nonlinear (Lobell et al.,
+ 100.65 × 2011; Schlenker and Roberts, 2009), RF is expected to perform well in
100 (2)
assessing the nonlinear relationship. Our previous studies (Feng et al.,
where y is the calendar year from 1900 to 2100 (i.e. y = 1900, 1901, 2018, 2019; Wang et al., 2018) have demonstrated that the RF model
…, 2100). usually performed better than many other machine learning techniques,
in agricultural-based applications. In addition, RF mode is able to
2.6. Climate extremes indicators provide the relative importance of each predictor in determining re-
sponse variable. Therefore, in the present study, we intended to take
In APSIM, wheat cultivation is divided into 11 stages, i.e. sowing, advantage of RF to enhance the ability of APSIM in simulating the ef-
germination, emergence, end of juvenile (EJ), floral initiation (FI), fects of ECEs on wheat yields.
flowering (F), start of grain filling (SGF), end of grain filling, maturity, We also used the MLR to build the hybrid model with the APSIM
harvest rips, and end crop. In this study, we took EJ, FI, F, and SGF into model. MLR is a commonly used regression method to model the linear
consideration, as they represent the four main growing stages. In this relationship between the independent variables and the dependent
study, we intended to evaluate impacts of three kinds of ECEs (Table 3) variable. It is considered to be the extension of ordinary least-squares
at the four main growing stages on wheat yields. The indicators for heat that involves more than one explanatory variable. It is easy to under-
and frost are simple counts of days with maximum/minimum tem- stand and implement, but usually limited in disentangling the nonlinear
peratures above/below fixed thresholds (Tashiro and Wardlaw, 1989; relationships between the predictors and the response.
Zheng et al., 2012). The impact of water deficit was assessed using the
ARID - Agricultural Reference Index for Drought (Woli et al., 2012).
This drought index is a simple, general, soil-plant-atmosphere metric. It 2.8. Hybrid-modelling approach
usually performs better than many other drought indices in agricultural
drought evaluations (Woli et al., 2013). Fig. 2 illustrates the processes of combining the APSIM and RF (or
MLR) models in our study. First, the APSIM wheat module was run to
Ti simulate wheat phenology, biomass and yield based on NVT trial da-
ARIDi = 1
ETo, i (3) tasets. The outputs of phenology date were then used as references for
the calculation of the 9 indicators at the four growth stages. Lastly,
where i represents the ith day, Ti is the transpiration during the ith day
APSIM simulated biomass and the 9 indicators were applied as pre-
(mm d−1), and ETo,i is the reference evapotranspiration on the ith day
dictors in RF (or MLR) for estimating wheat yield. In this study, we
(mm d−1). When calculating ARID, ETo,i is assumed to be equal to
proposed the RF (or MLR) model as an external modification which was
potential evapotranspiration and can be estimated using the Priestley
expected to help improve the performance of APSIM model in simu-
and Taylor (1972) method. Ti is estimated through a macroscopic
lating the effects of growth stage-specific ECEs.
modeling approach which is based on the water content. Detailed de-
We performed the RF model using the R package “randomForest”
scriptions and calculation processes can be found in Woli et al. (2012).
(Liaw and Wiener, 2002). Two parameters were needed to be de-
ARID values fall between 0 and 1. Values higher than 0.6 are usually
termined before the implement of the model, i.e. ntree (the number of
recognized as high water stress, thus we chose 0.6 as the threshold to
trees to grow in the forest) and mtry (the number of randomly selected
evaluate daily drought condition.
predictor variables at each node). We set the ntree as the default values
To calculate these indicators, we first ran APSIM simulations and
of 500. While for mtry, it could affect the model accuracy. As the dataset
obtained wheat phenology information, including dates and duration of
was not large, we adopted a trial and error analysis to determine the
wheat growing stage. Then, according to the phenology information,
value of mtry. Values of 1–10 were tried and 5 was chosen finally as it
stage-specific ECEs indicators were calculated or counted out. After
leaded to a little higher model accuracy. The relative importance of
calculation, we found that heat events rarely happened at the EJ stage
variables was assessed through the “%IncMSE” metric in the RF model.
and frost events rarely happened at the F and SGF stages during the
In addition, the MLR model was performed using the R package “Rattle”
historical period (2008–2017). Thus, we eventually selected 9 extreme
(Williams, 2011).
events at the four growing stages (Table 3).
RF (random forest) is a nonparametric and ensemble learning al- The NVT trial data (516 trials, 2008–2017) were used to calibrate
gorithm originated from classification and regression trees (Breiman, the models. The output yields from the APSIM model were directly used
2001). It is a nonparametric technique which builds multiple decision to compare with the observed data. While for the RF model, a 10-fold
trees and combines them together to obtain a prediction. Thus, the RF cross validation approach was applied to the 516 data. The coefficient
usually presents good accuracy in spite of the presence of missing of determination (R2) and root mean square error (RMSE) were used for
104
P. Feng, et al. Agricultural and Forest Meteorology 275 (2019) 100–113
Fig. 2. Diagram of the input and output per model for the APSIM + RF (or MLR) hybrid model applied in this study. EJ: end of juvenile; FI: floral initiation; F:
flowering; SGF: start of grain filling; RF: random forest; MLR: multiple linear regression.
model evaluation following: date (Fig. 3). Agreement between simulations and observations was
2 described by the root mean square error (RMSE), the coefficient of
n
i=1
(Oi O¯ )(Pi P¯ ) determination (R2) and the slope of the regression lines. As shown in
R2 = n
(Oi O¯ )2
n
(Pi P¯ ) 2 Fig. 3a, the simulated flowering dates were consistent with observed
i=1 i=1 (4) dates, with an RMSE of 5.01 days (R2 = 0.82, y = 0.72x+76.71,
n P < 0.01), suggesting that the APSIM was able to provide a satisfactory
(Oi Pi )2
RMSE = i=1 estimation of wheat flowering dates. As the flowering stage was gen-
n (5) erally viewed as the indicative stage of the entire wheat phenology, it
Where n is the number of samples, Oi and Pi denote observed and si- was likely that APSIM could also provide fairly good estimations of the
mulated values, and Ō represents the mean of observed values. other three growth stages. This laid a foundation for our subsequent
Generally, the model with higher R2 and lower RMSE is considered to calculation of stage-specific ECEs indicators. For wheat yields, the
be the more accurate model. model was able to explain 61% of the variation and the RMSE was 0.86
t ha−1. This was a common and acceptable result for large-scale crop
3. Results model simulations (Jin et al., 2017), even though the accuracy was not
high. In general, inaccurate simulations were due to the absent or rough
3.1. Model performance assumptions around certain factors, including pest, diseases, and
weather extremes as we mentioned above. In subsequent analysis, we
The performance of the APSIM wheat module was evaluated by managed to increase the accuracy through improving the ability of si-
comparing simulated and observed wheat grain yield and flowering mulating ECEs.
Fig. 3. Comparison of observed and APSIM simulated values of grain yield and flowering date from 2008 to 2017 at the 29 sites across the New South Wales wheat
belt. Totally 516 yield data and 47 flowering date data were collected from National Variety Trials of Australia (see text for more detail about this dataset). Dashed
lines are the 1:1 ratio line. Red lines are the linear regression fit.
105
P. Feng, et al. Agricultural and Forest Meteorology 275 (2019) 100–113
We used the MLR model and the RF model as external modification Fig. 4c shows the time series of the three kinds of wheat yields, i.e.
on the APSIM model and created two hybrid models to predict wheat observed, APSIM simulated, and the two hybrid models simulated, from
yield. Compared to the APSIM model alone, both hybrid models showed 2008 to 2017. The observed yields ranged from 3.0 t ha−1 to 5.2 t ha−1,
higher accuracy in reproducing the observed yields (Fig. 4). The with the greatest inter-annual variability. In general, all models’ si-
APSIM + RF hybrid model explained 81% of the variation in observed mulations successfully captured the temporal pattern of the observed
yields, an increase of 33% compared to the APSIM model. It also re- wheat yields. However, the APSIM model tended to slightly over-
duced the RMSE by 0.32 t ha−1. Moreover, the slope of the regression estimate the yields in almost every year. This was particularly evident
function was close to 1.0, meaning that the APSIM + RF hybrid model in 2008, 2010, and 2013, where an overestimate of up to 0.5 t ha−1
was unbiased in simulation of wheat yields for our study area. While for occurred. These overestimations can be attributed to an underestimate
the APSIM + MLR hybrid model, its model accuracy increased slightly of the effects of ECEs on wheat yields. Through incorporating ECEs
compared to the APSIM model alone and was far below the APSIM + indicators, the APSIM + RF hybrid model succeeded in making the si-
RF hybrid model. Thus, the external modification using the RF model mulated yields more consistent with the observed yields.
with ECEs indicators could greatly improve the performance of the
APSIM model.
Fig. 4. Comparison of observed, APSIM simulated, APSIM + MLR simulated, and APSIM + RF simulated wheat yields from 2008 to 2017 at the 29 sites across the
New South Wales wheat belt. (a) observed vs. APSIM + MLR hybrid model simulated. (b) observed vs. APSIM + RF hybrid model simulated. (c) time series of the
four kinds of yields. In (c), error bars indicate the standard deviation from yields at the 29 sites.
106
P. Feng, et al. Agricultural and Forest Meteorology 275 (2019) 100–113
Fig. 5. Number of extremes climate events occurred from 2008 to 2017. Values for each year were averaged values of the 29 study sites.
3.2. Effects and projected changes of ECEs wheat yield to different ECEs at a same stage were different. For ex-
ample, the F stage was more vulnerable to heat rather than drought,
The historical occurrence of ECEs is shown in Fig. 5. Drought was while the SGF stage was more vulnerable to drought rather than heat.
the most commonly occurring ECE during the historical period. In The shaded area (Fig. 6) denotes the 10th to 90th percentile of each
general, the four growth stages, i.e. EJ, FI, F, and SGF, usually last for variable in the calibration dataset. All lines have a sharp change in the
72–88, 37–48, 6–9, and 25–34 days respectively for the four cultivars. shaded area, meaning that a small change of each ECE may have a large
Thus, nearly one quarter, half, and half of the EJ, F, and SGF stages impact on wheat yield. As shown in the boxplots in Fig. 6, drought
respectively experienced drought conditions. While for heat, it com- events were projected to have small increase in occurrence at the four
monly occurred around and post anthesis. The SGF stage was most study stages, but heat events were likely to increase significantly. In
vulnerable to heat stress and up to two-thirds of this stage might be particular, under RCP8.5, heat events during the end of the 21st century
under heat threat. Frost events mainly occurred during the EJ stage and may double compared to the baseline period. Frost events were likely to
a few frost events also occurred at the FI stage. Fig. 4b and Fig. 5 show decrease, by > 10 days at the EJ stage and by 1-4 days at the FI stage.
that there was an obvious and direct relationship between wheat yields However, the decrease of frost events at the EJ stage might also cause
and the occurrences of ECEs. Wheat yields were much lower during yield reductions. Thus, in general, more yield losses were indicated as a
years with a higher occurrence of ECEs, such as 2012 and 2017. result of changes in future ECEs.
According to the above results, the RF model could potentially im-
prove the performance of the APSIM model in simulating the effects of 3.3. Differences between APSIM projected and the hybrid model projected
ECEs. We then obtained the relative importance (percentage values in future wheat yields
Fig. 6) and the marginal effect (lines in Fig. 6) of each predictor from
the RF model. The trend of the line, rather than the actual values, de- As the RF model performed better in improving the performance of
scribes the nature of the dependence between the response and the the APSIM model compared to MLR. We then used the APSIM model
predictor variables. All ECEs, except frost during the EJ stage, had and the APSIM + RF model to evaluate the impacts of future climate
negative effects on wheat yield. Drought events occurred at the SGF and change on wheat yield in the study area. Projected changes in simulated
EJ stages had high importance values, meaning that they were more wheat yield from the APSIM model and the hybrid model for two of the
harmful to wheat growth. The third was heat events at the FI stage. study sites are shown in Fig. 7 (other sites in Fig. A2). We calculated
Thus, even though heat events were more common during the SGF changes of simulated wheat yields for each site and found that trends
stage (Fig. 5), they tended to cause more yield losses if occurred at the from APSIM-simulated wheat yield differed in different sites. However,
FI stage. Frost events at the EJ stage had positive effects on wheat yield. the APIM + RF hybrid model-simulated yield was projected to decrease
This may be due to that winter wheat plant is capable of withstanding in all study sites. For example, Fig. 7 shows the APSIM-simulated yields
extreme cold before the initiation of flowering (Fowler and Carles, were projected to decrease by 0.5–3% at Bellata but increase by 1.5–3%
1979). Moreover, the plant also requires enough exposure to cool at Mayrung. This might be due to different soil conditions and climate
temperature for jarovization, which will affect subsequent growth and projections at the two sites. However, the APSIM + RF hybrid model-
development (Robertson et al., 1996). In addition, the responses of simulated wheat yields were projected to decrease at both sites.
107
P. Feng, et al. Agricultural and Forest Meteorology 275 (2019) 100–113
Fig. 6. Partial dependence of wheat yield change on extreme climate events and projected changes in each event under RCP4.5 and RCP8.5. The random forest model
could provide partial dependence of the change in the response (blue lines) for selected predictors, when accounting for the average effect of all other driver
predictors. The blue lines are smoothed representations of the response, with fitted values (model predictions) for the calibration data. The trend of the line, rather
than the actual values, describes the nature of the dependence between the response and predictors. The shaded area denotes calibration data between the 10th and
90th percentile. The percentages values denote the relative importance of each predictor generated from the random forest model. The box plots indicate the
occurrences of extreme climate events during the baseline (1961–2000) period and two future periods (2041–2060 and 2081–2100) based on the 34 downscaled
GCMs. Box boundaries indicate the 25th and 75th percentiles across GCMs, whiskers below and above the box indicate the 10th and 90th percentiles. The black lines
within each box indicate the multi-model median. EJ, FI, F, and SGF indicate end of juvenile, floral initiation, flowering, and start of grain filling, respectively.
According to the multi-model ensemble mean values (2041–2060), the model and the hybrid model showed that the APSIM + RF hybrid
APSIM + RF hybrid model-simulated wheat yields were 4% and 3% model is better at reproducing historical wheat yields. Using the RF
lower than the baseline levels at Bellata and Mayrung, respectively. algorithm as an external modification on the APSIM model outputs
Moreover, the yield reductions magnified over time. The differences appears to improve the performance of the individual APSIM model
between the two models were mainly caused by different responses to (Fig. 3 and Fig. 4). Moreover, the RF model also outperformed the MLR
ECEs. Climate change might result in various trends of the APSIM model. Everingham et al. (2016) conducted a similar study through
projections at different sites, but the ECEs changes, especially the in- incorporating the biomass simulated by the APSIM model and several
crease of heat and drought events (Fig. 6), were most likely to reduce climate indices into the RF algorithm to simulate sugarcane yield and
the yields to lower levels. also obtained a high R2 of ˜0.8. The most likely explanation is that the
external statistical model may help improve the performance of the
crop model by simulating the effects of these selected climate indices.
4. Discussion
Using the crop model outputs as predictors may also improve the ability
of the statistical model to consider more agro-climatic processes.
The comparison between the results obtained from the APSIM
108
P. Feng, et al. Agricultural and Forest Meteorology 275 (2019) 100–113
Fig. 7. Projected changes in simulated wheat yield from the APSIM model and the APSIM + RF hybrid model for two of the study sites. Changes were estimated
between two future periods (2041–2060 and 2081–2100) and the baseline period (1961–2000) under RCP4.5 and RCP8.5 based on the 34 downscaled GCMs. Box
boundaries indicate the 25th and 75th percentiles across GCMs, whiskers below and above the box indicate the 10th and 90th percentiles. The black lines and
crosshairs within each box indicate the multi-model median and mean respectively.
Keating and Thorburn (2018) proposed the possible research trend of climate change are needed to be assessed. In our study, we found that
blending mechanistic and empirical statistical models. The approach drought events are expected to have a slight increase, while heat events
outlined in this paper can be viewed a feasible method which may be around anthesis and grain-filling periods will likely be more common.
easily extended to other wheat growing regions to obtain new insights Semenov and Shewry (2011) obtained similar results in a study in
to guide agricultural practices. Europe and demonstrated that wheat plants are expected to suffer more
In recent years, researchers have been concerned with ECEs because heat stress than drought in the future. Given the severe and rapid da-
of their remarkable damage to crops (Lesk et al., 2016; Rezaei et al., mage caused by heat stress around anthesis, a single day increase in
2015). It was reported that nearly one-quarter of all damage and losses heat events is likely to cause great wheat yield losses. On the other
in the agriculture sector are caused by ECEs in many countries (FAO, hand, while fewer frost days are expected because of global warming,
2015). As ECEs may impact during different stages of a crop cycle and we found that frost events during vegetative stages had a positive effect
cause varying degrees of yield losses, it is necessary to identify the most on wheat yield across our study region. Thus, in general, ECEs changes
harmful ECEs in a particular region. In our study, we found that will result in future increased risks for wheat production in the study
drought events during grain-filling and vegetative stages have the area.
greatest adverse effects on wheat yields across a wide range of agro- Given the more unfavorable weather conditions in the future, the
climatic zones. Drought events around anthesis may have relatively possible trend of future wheat yields is frequently discussed among
small effects on wheat yields. One reason is probably that yield for- researchers. The most commonly used method to assess climate impacts
mation mainly occurs at the grain filling stage (Royo et al., 2006). is a combination of process-based crop models and GCMs. For example,
Moreover, the number of drought events occurring around anthesis is Qian et al. (2016) reported that an average increase in wheat yields of
relatively low. A short-term drought event is not able to severely reduce 26–37 % to be expected during 2041–2070 compared to a baseline
the final yield (Clarke et al., 1992). period (1971–2000) in the Canadian Prairies using the CERES-Wheat
In contrast, the impacts of heat and frost events on wheat yield are model and two GCMs. Wang et al. (2017b) conducted a study in Aus-
usually direct and rapid (Barlow et al., 2015). We found heat events tralia using the APSIM model and 11 GCMs to demonstrate that there
around anthesis (FI_Heat and F_Heat), result in a large negative impact would be a decrease in the yield in the south-eastern Australian wheat
on wheat yields, even though they may only occur over a few days in belt throughout the 21st century. The majority of these previous and
the growing season, which is consistent with Balla et al. (2009)’s and similar climate change impacts studies only used one single crop model.
Hays et al. (2007)’s studies. This is mainly because the sensitivity of the However, model comparison studies, especially for climate change
wheat plant to various ECEs varies with different growth stages studies, have emphasized the limited ability of crop models to account
(Hlaváčová et al., 2018). During anthesis, the awns or spikes start to for ECEs. Sánchez et al. (2014) observed that a number of crop models
emerge from the flag leaf as the grain begins to form. However, even a adequately predict mean yields, but are less able to predict extreme low
single frost or heat event may cause sterility and aborted grains, thereby yields, due to their inability to handle ECEs. Hochman et al. (2013)
reducing the number of grains in the inflorescence. While during the reported that the APSIM model poorly accounted for ECEs such as se-
grain-filling period, as grains have formed, the adverse impacts from vere frost and was also overly optimistic about water limited yield
heat events will be reduced (Barlow et al., 2015). impacts in some seasons and locations. Therefore, it is questionable
The ECEs that have occurred during the historical period have al- whether crop model-based projections accurately reflect the direction
ready resulted in large yield losses, so their likely changes induced by and magnitude of the effects of climate change on yield.
109
P. Feng, et al. Agricultural and Forest Meteorology 275 (2019) 100–113
In our study, we used the hybrid model which systematically in- 5. Conclusions
corporated the APSIM model output and ECEs indicators through
random forest algorithm to predict future wheat yields and found that a Impacts of heat, frost, and drought events on current and future
single APSIM model may have 1–10% overestimation for future yield wheat yields were analyzed in this study based on a hybrid model
projections compared to the APSIM + RF hybrid model (Fig. A2). The which incorporated the APSIM model output and climate extremes in-
overestimation was mainly caused by underestimating the ECEs-in- dicators into the Random Forest algorithm. The likely changes of the
duced yield losses. This is likely to be a common phenomenon in crop climate extremes were also discussed. The major conclusions are as
model projections, as the most popular crop models poorly account for follows:
impacts of ECEs (Eitzinger et al., 2013). However, as future ECEs are
projected to increase in most part of the world (IPCC, 2012), previous (1) Drought and pre-anthesis heat events are the major climate ex-
projections based on crop models might overestimate the most likely tremes causing current wheat yield losses in the New South Wales
yield level in the future. Thus, whether an increase or a decrease of wheat belt of south-eastern Australia.
wheat yield is projected in a particular region using crop models, the (2) In general, future climate in the New South Wales wheat belt is
most likely achieved yields may be lower due to the underestimation of expected to be more unfavorable. Drought events are projected to
ECEs-induced yield losses. remain at historical levels, while heat events are projected to in-
Appropriate adaptation strategies can be developed in order to crease in the future. Frost events during the vegetative and pre-
maintain and improve wheat yields in the face of current and future anthesis stages will decrease.
increasing ECEs in the NSW wheat belt. Two kinds of strategies are (3) Future yield projections from conventional process-based crop
frequently discussed among researchers, i.e. minimizing and escaping models might have a 1–10% overestimation because of the under-
the adverse effects of ECEs. In terms of minimizing the adverse effects estimation of climate extremes-induced yield losses.
of ECEs, the main approach is to increase the resistance of crops
through breeding. From our study, the heat tolerance traits will be In addition, the combination of process-based crop models and
important. Stratonovitch and Semenov (2015) also reported heat tol- statistical models showed high performance in modelling extreme cli-
erance around flowering in wheat as a key trait for increased yield mate events and is worthy of consideration for future research. We
potential in Europe under climate change. On the other hand, escaping believe this study would provide some useful information for local
the adverse effects of climate extremes is also a potential approach, farmers and policy makers with respect to development of adaptation
which mainly aims to stagger the reproductive stages to avoid suffering strategies in face of increased climate extremes under climate change.
ECEs. Adjusting sowing date is currently the most effective farming
management practice to avoid negative impacts of ECEs. Optimising Acknowledgments
sowing dates can lead to suitable duration of the pre-anthesis period for
accumulating biomass and suitable flowering and grain-filling windows The first author acknowledges the China Scholarship Council (CSC)
without frost, heat, and terminal drought (Bell et al., 2014). Shortening for the financial support for his PhD study. Facilities for conducting this
the whole growth period may also help avoid suffering terminal heat study were provided by the New South Wales Department of Primary
and drought events, but yield may also decline because of short growth Industries and University of Technology Sydney. Thanks to Grains
length. Therefore, more studies that take into account both breeding Research and Development Corporation (GRDC) National Variety Trials
and selection approaches as well as farming practices are required to (NVT), Australia (http://www.nvtonline.com.au/) for their data sup-
maintain and improve wheat yields in face of increasing ECEs. port. We are grateful to the three anonymous reviewers whose com-
ments have greatly improved this manuscript.
Appendix A
Fig. A1. Distribution of observed wheat yields (2008–2017) for the 29 sites across the New South Wales wheat belt.
110
P. Feng, et al. Agricultural and Forest Meteorology 275 (2019) 100–113
Fig. A2. Projected changes in simulated wheat yield per hectare from the APSIM model and the APSIM + RF hybrid model for the study sites. Changes were
estimated between two future periods (2041–2060 and 2081–2100) and the baseline period (1961–2000) under RCP4.5 and RCP8.5 based on the 34 downscaled
GCMs. Box boundaries indicate the 25th and 75th percentiles across GCMs, whiskers below and above the box indicate the 10th and 90th percentiles. The black lines
and crosshairs within each box indicate the multi-model median and mean respectively.
111
P. Feng, et al. Agricultural and Forest Meteorology 275 (2019) 100–113
112
P. Feng, et al. Agricultural and Forest Meteorology 275 (2019) 100–113
113