These and other features, aspects, and advantages of the present invention will become better understood when the following detailed description is read with reference to the accompanying drawings in which wherein:
Product prediction recovery may, for example, involve a training stage and a use stage. During the training stage, a model for product recovery system is generated, values for the parameters of the model are determined based on training data, and the values are inserted for the parameters in the model to complete the product recovery prediction model. Model generation and parameter value determination are based on equations (1)-(25) described below. During the use stage, the completed product recovery prediction model is used to predict the recovery of a product from a crop.
The product, for example, may be sugar, and the crop, for example, may be sugarcane. However, the present invention applies to products recovered from other crops also.
As an example, a sample sugar processing facility may have two harvest seasons in a year. The first season can be from December to July, while the second season can run from August to October. In this example, the first season production and the harvest data from year 1 to year 5 can be used to find the relationship between recovery, age, variety, season (as represented by Julian Date range), and weather conditions. Julian Dates are the numerical values representing the harvesting season, where August 1 has a numerical value of 1 and where July 31 has a numerical value of 365 or 366. It should be noted that selection of August 1 as Julian Date 1 is chosen to be dependent on selected regional weather conditions and can vary by country or by weather conditions.
The planting regions are classified into different zones based on weather, soil, and irrigation conditions. In the first seasons of years 1 to 5, n different varieties are processed. The difference between the planting date and the harvesting date is considered to be the sugarcane age.
According to sample data for this example, the overall recovery variation as a function of actual date and as a function of Julian Date is shown in
The age of the sugarcane load is calculated using the planting date and the harvest date. The ages of the sugarcane loads are observed to vary in the range of 240 days to 750 days. However, most varieties are harvested when their ages are between 300 to 510 days.
Based on this information, the sugarcane loads can be classified, for example, in twenty-three different age groups. The first age group is for all those loads having an age value less than 300 days, while the last age group is for all those loads having an age value greater than 510 days. Each of the other age groups may be assigned an age range of ten days. The use of a ten day age range is based on the assumption that the sucrose accumulation within sugarcane within a ten day time period does not vary significantly and, hence, can be safely considered as constant. However, as desired, a number of age groups other than twenty three may be used, the age ranges may have increments other than ten, and uniform age ranges need not be used.
Sugar recovery is a function of various factors or effects such as variety (cultivar), age of sugarcane loads, season represented by Julian Date ranges, and such other effects as weather conditions, including rainfall, maximum temperature, temperature difference between maximum and minimum temperatures, and humidity such as relative humidity.
This sequential modeling approach is illustrated in
As is shown in
The dominant different varieties (the plant cane and ratoon of the same sugarcane variety are treated as different varieties) are considered during generation of the Julian Date effect model at the block 12 and of the age effect model at the block 14. The independent models generated at the blocks 12 and 14 for the Julian Date and age effects are then integrated at a block 16 to produce a combined model that considers variety, Julian Date, and age effects while predicting (or estimating) sugar recovery.
A prediction error model is generated at a block 18 from the combined model produced at the block 16 and can be attributed to other conditions such as conditions related to weather changes, soil., and/or irrigation. This prediction error model is generated at the block 18. Rainfall data is input at a block 20, and temperature data is input at a block 22. The temperature data includes data on the maximum temperature and on the difference between maximum and minimum temperature by actual date.
The dynamic weather model so obtained is then integrated with the combined (static variety) model at a block 24 to produce a global model that can then be used for predicting sugar recovery.
These various models are generated in accordance with the equations discussed below. The recovery vs. Julian Date plot of
{circumflex over (r)}
d,v
JD=(av)(JDd2)+(bv)(JDd) ∀d,∀v (1)
where d represents harvesting day, v represents variety, JDd is a variable representing Julian Date for harvesting day d, av represents a parameter for variety v so as to model the Julian Date effect on sugar recovery, and bv represents another parameter for variety v so as to model the Julian Date effect on sugar recovery. The harvesting date d is the actual date rather than as Julian Date. Accordingly, equation (1) is the Julian Date model.
Although the effects of Julian Date is assumed to be a second order relationship, the effects of Julian Date can be modeled by using a relationship other than a quadratic relationship. For example, a linear relationship or other non-linear relationship may be used.
The av and bv in equation (1) are determined by plugging the actual data entered at 10 of
If desired, ranges can be assigned to the individual variety parameters av and bv. If so, the values that are determined for the parameters av and bv can be constrained to fall within these ranges.
To find a relationship between age and recovery, the entire sample set of example harvest data used herein was scanned to find a date range in which only a few varieties were used dominantly and in which there was no rainfall or only mild rainfall just before and during the corresponding date range. When a quadratic relationship is assumed, the recovery values that can be predicted using only age information were significantly close to the actual values.
Assuming, a second order relationship between age and recovery, the following equation can be used as a basis for the age dependent model:
{circumflex over (r)}
d,v,a
A=(cv)(Ād,v,a2)+(dv)(Ād,v,a) (2)
where {circumflex over (r)}d,v,aA represents the day/variety/age group wise recovery of sugar as a function of age, Ād,v,a is a variable representing the weighted average age of the load belonging to age group a for variety v on harvesting day d, and cv and dv are parameters for variety v so as to model the age effect on sugar recovery. As in the case of equation (1), the harvesting date d is the actual date rather than as Julian Date. Accordingly, equation (2) is the sugarcane age model.
Although the effects of age is assumed to be a second order relationship, the effects of age can be modeled by using a relationship other than a quadratic relationship. For example, a linear relationship or other non-linear relationship may be used.
The parameters cv and dv in equation (2) are determined by plugging the actual data entered at 10 of
If desired, ranges can be assigned to the individual variety parameters cv and dv. If so, the values that are determined for the parameters cv and dv can be constrained to fall within these ranges.
The combined model to address both Julian Date and age effects can be given according to the following equation:
{circumflex over (r)}
d,v,a
={circumflex over (r)}
d,v
JD
+{circumflex over (r)}
d,v,a
A
−e
v
∀d,∀v,∀a (3)
In Equation (3), a bias term ev has been added for each variety v. By substituting equations (1) and (2) into equation (3), equation (3) can be rewritten according to the following equation:
{circumflex over (r)}
d,v,a=(av)(JDd2)+(bv)(JDd)+(cv)(Ād,v,a2)+(dv)(Ād,v,a)=ev ∀d,∀v,∀a (4)
Equation (4) can be used to determine a predicted sugar recovery for a harvesting day d, for a sugarcane variety v, and for-an age group a. The predicted recovery value for a harvesting day d for all varieties and age groups can be obtained using weight fractions and is given by the following equation:
where Nv is the set of sugarcane varieties, Na is the set of age groups, and Wd,v,a is a weight fraction for a load of age group a and variety v on harvesting day d.
The weight fractions Wd,v,a are determined by dividing the weight of a load for a given harvesting day d, variety v, and age group a by the sum of weights of all loads for varieties v, and age groups a on harvesting day d.
The combined model represented by equations (4) and (5) can be fitted to the sample production and harvest data of an industry in order to estimate the parameters av, bv, cv, dv, and ev. The estimation of these parameters is solved as an optimization problem according to the following objective function:
where Nd is the set of harvesting days and εdabs represents the absolute error, which is calculated in accordance with the following equations:
εdabs≧Rd−{circumflex over (r)}d ∀d (7)
εdabs≧−(Rd{circumflex over (r)}d)∀d (8)
where Rd is the actual sugar recovery in percent on harvesting day d. The quantity εdabs always stores the positive difference between Rd and {circumflex over (r)}d. However, to use linear programming (LP) relaxation of the constraints given by Equations (7) and (8), the following constraint is included into the optimization problem:
εdabs≦Rd {d (9)
Also, a few additional LP tightening constraints can be applied, if desired, as follows and can be obtained using the domain knowledge about the relationship between age and recovery:
R
d−2.0≦{circumflex over (r)}d,v,a≦Rd+0.75 ∀d,∀v,∀(a=1, . . . ,5,16, . . . ,23) (10)
R
d−0.75≦{circumflex over (r)}d,v,a≦Rd+2.0 ∀d,∀v,∀(a=6, . . . ,15) (11)
The constraints given by Equations (9) to (11) are optional constraints to make the optimization search space more compact. The upper and lower bounds on the parameters obtained while modeling the Julian Date and age effects separately can also be included in the optimization problem.
As in the case of the parameters av, bv, cv, and dv, ranges can be assigned, if desired, to the parameter ev. If so, the value that is determined for the parameters ev can be constrained to fall within these ranges.
The range constraints that are placed on the parameters av, bv, cv, dv, and ev, are very specific to the sample data and need to pre-estimated for other harvest data, as sugarcane is a weather sensitive crop. The linear optimization problem with the objective function given by Equation (6) and subject to the constraints as given above is solved to estimate the optimal parameter values for parameters av, bv, cv, dv, and ev. Alternatively, since the main aim of the objective function given by Equation (6) is error minimization, this objective function could instead be a non-linear objective function.
The actual and the predicted recoveries based on the sample set of example data are plotted in
f
d
=R
d
−{circumflex over (r)}
d
∀d (12)
where fd is the residual error on harvesting day d, Rd is the actual recovery on harvesting day d, and {circumflex over (r)}d is the estimated or predicted recovery on harvesting day d. The residual error fd is mainly caused by ignoring conditions other than Julian Date and age. Such other conditions may include, for example, weather and/or irrigation and/or soil conditions.
The residual error fd of equation (12), which is obtained after modeling the Julian Date and age effects, is an indication, for example, of unmodeled weather effects. For example, the residual error using weather information such as rainfall, maximum temperature, and the difference between maximum and minimum temperatures (i.e., delta temperature) can be modeled as discussed below. The residual error can be predicted using weather information as given by the following equation:
{circumflex over (f)}
d
={circumflex over (f)}
d
RF
+{circumflex over (f)}
d
MT
+{circumflex over (f)}
d
ΔT
∀d (13)
where {circumflex over (f)}dRF is the residual rain fall model that considers the effect of rainfall for the last n months rain fall (e.g., n=8), {circumflex over (f)}dMT is the residual maximum temperature model that considers the effect of the maximum temperature for the last r months (e.g., r=6), and {circumflex over (f)}dΔT is the residual delta temperature model that considers the effect of the difference between the maximum and minimum temperature for the last r months.
The residual rainfall model is dynamic in nature and includes three terms as given by the following equation:
{circumflex over (f)}
d
RF
={circumflex over (f)}
d
RF
+{circumflex over (f)}
d
RF
+{circumflex over (f)}
d
RF
∀d (14)
The first term {circumflex over (f)}dRF
where i represents a rainfall summation index, rfi is a parameter useful in modeling the rainfall effect on the recovery of sugar, z is a summation index representing zone or area, Nz is the set of all zones, WZd,z represents a weight fraction for a sugarcane load from zone z on harvesting day d, and RFd,z is a variable representing the rainfall in zone z on harvesting day d. There are ten rainfall parameters rf1, . . . , rf10 in Equation (15), which will be determined while predicting the effect of last ten days rainfall on recovery. This term helps in analyzing the effect of the last ten days rainfall on recovery. (These ten days are the ten days just prior to harvesting day d.) A time period other than ten days can instead be used in connection with equation (15).
The second term {circumflex over (f)}dRF
where m is a summation index. Hence, using Equations (15) and (16), the rainfall effect for the last sixty days (2 months) is considered. There are five rainfall parameters rf11, . . . , rf15 in Equation (16). A time period other than eleven to sixty days and slots other than ten day slots can instead be used in connection with equation (16).
The rainfall effect for the remaining six months is captured in monthly slots (slots of 30 days) in the last term of Equation (14) as given by the following equation:
There are six rainfall parameters rf16, . . . , rf21 in Equation (17). Hence, in total, there are twenty-one parameters in this dynamic residual rainfall model to predict effect of rainfall on the recovery of sugar from sugarcane. A number of months other than six and slots other than thirty day slots can instead be used in connection with equation (17).
The residual model for predicting the effect of maximum temperature on sugar recovery is also considered dynamic in nature and contains two terms as given by the following equation:
{circumflex over (f)}
d
MT
={circumflex over (f)}
d
MT
+{circumflex over (f)}
d
MT
∀d (18)
The first term {circumflex over (f)}dMT
where j represents a maximum temperature summation index, mtj is a maximum temperature parameter useful in modeling the maximum temperature effect on sugar recovery, and MTd,z is a variable for the maximum temperature in zone z on harvesting day d. Hence, there are in total ten parameters mt1, . . . , mt10 in the dynamic residual model that captures the effect of maximum temperature on sugar recovery prediction. A time period other than the last two months and slots other than ten day slots can instead be used in connection with equation (19), and a time period other than the remaining four months (out of the last six months) and slots other than thirty day slots can instead be used in connection with equation (20).
The dynamic delta temperature model that models the effect of the difference between maximum and minimum temperatures on sugar recovery is very similar to the maximum temperature model and is given by the following equation:
{circumflex over (f)}
d
ΔT
={circumflex over (f)}
d
ΔT
+{circumflex over (f)}
d
ΔT
∀d (21)
The first term {circumflex over (f)}dΔT
where k represents a delta temperature summation index, where δtk is a delta temperature parameter useful in modeling the delta temperature effect on sugar recovery, and ΔTd,z is the delta temperature variable for zone z on harvesting day d. Hence, in the dynamic residual model that captures the effect of delta temperature on sugar recovery, there are ten parameters δt1, . . . , δt10 which will be determined during recovery prediction using linear programming optimization techniques operating on the sample set of example data. A time period other than the last two months and slots other than ten day slots can instead be used in connection with equation (22), and a time period other than the remaining four months (out of the last six months) and slots other than thirty day slots can instead be used in connection with equation (23).
The combined dynamic residual model that predicts the effect of weather conditions on sugar recovery comprises, for example, a total forty-one parameters (twenty-one for rainfall and ten each for maximum temperature and delta temperature). These parameters may be determined by plugging the actual data entered at 10 of
The results obtained after using the global model for parameter estimation and recovery predictions are encouraging. The plot of actual recovery versus predicted recovery is shown in
The last step in the modeling of sugar recovery prediction is to combine the “Static Variety Model” (which considers Julian Date and age effects) and the “Dynamic Weather Model” (which is function of rainfall, maximum temperature, and the difference between maximum and minimum temperatures) by modifying Equation (5) to include the effects of weather as given by the following equation:
and by modifying the optimization objective function given by Equation (6) as given by the following equation:
As discussed above, this modeling approach described above may be implemented in accordance with the flow chart of
The independent models generated at the blocks 12 and 14 for the Julian Date and age effects are then integrated at a block 16 in accordance with equations (3)-(5) to produce a combined model that considers variety, Julian Date, and age effects while predicting (estimating) sugar recovery.
The prediction error model is generated at the block 18 in accordance with equations (12)-(23), the rainfall data is input at the block 20, and the temperature data is input at the block 22.
The dynamic weather model so obtained is then integrated with the combined (static variety) model at a block 24 in accordance with equation (24) to produce a final global model for sugar recovery prediction. Also at the block 24, the parameters av, bv, cv, dv, ev, rfi, mtj, and δtk of equation (24) are determined in accordance with equation (25), in accordance with the constraints given by equations (7)-(23), in accordance with the sample planting data input at the block 10, in accordance with the sample rainfall data input at the block 20, and in accordance with the sample temperature data input at the block 22.
These parameters can then be inserted into the global model of equation (24) for use in predicting sugar recovery from future sugarcane crops as shown in
The program corresponding to the flow chart of
Certain modifications of the present invention have been discussed above. Other modifications of the present invention will occur to those practicing in the art of the present invention. For example, the present invention has been described above in connection with sugarcane crops. However, the present invention could be used in connection with other crops.
Accordingly, the description of the present invention is to be construed as illustrative only and is for the purpose of teaching those skilled in the art the best mode of carrying out the invention. The details may be varied substantially without departing from the spirit of the invention, and the exclusive use of all modifications which are within the scope of the appended claims is reserved.