The disclosed embodiments relate generally to a system and method for improving the operability of a print production environment and, more particularly to an improved approach for forecasting print production demand in the print production environment.
Document production environments, such as print shops, convert printing orders, such as print jobs, into finished printed material. A print shop may process print jobs using resources such as printers, cutters, collators and other similar equipment. Typically, resources in print shops are organized such that when a print job arrives from a customer at a particular print shop, the print job can be processed by performing one or more production functions.
In one example of print shop operation, product variety (e.g., the requirements of a given job) can be low, and the associated steps for a significant number of jobs might consist of printing, inserting, sorting and shipping. In another example, product variety (corresponding, for instance, with job size) can be quite high and the equipment used to process these jobs (e.g. continuous feed machines and inserting equipment) can require a high changeover time. Experience working with some very large print shops has revealed that print demand exhibits a tremendous variety of time series behavior.
Forecasting demand for a given large print shop can be useful in, among other things, managing shop resources. However, traditional approaches of forecasting (as found in associated literature) may be insufficient to suitably forecast demand in large print shops with considerable print job variability. For instance, in literature relating to forecasting a preference toward using pooled demand forecast (as opposed to forecasting components individually and summing the forecasts to obtain an aggregate forecast) has been expressed. It has been found, however, that pooled demand forecasting can break down in, among other environments, print production environments when the job related demand exhibits relatively high levels of variability.
In U.S. patent application Ser. No. 11/868,993 to Rai et al. entitled System and Method of Forecasting Print Job Related Demand (filed on Oct. 9, 2007 and published on Apr. 9, 2009 as U.S. Patent Application Publication No. 2009/0094094-A1) demand data is collected for a print production environment and then segmented into a first demand series with at least two demand components and a second demand series with at least one demand component. The demand components are derived with a segmentation technique, which technique can be performed with a database attribute (e.g., an attribute, such as job or form type, client, duplex/simplex (i.e., media “plex”); obtained from a data warehouse, by time slice (e.g., Mondays or firsts of the month), or by statistical thresholding (e.g., demand over and under 30,000 prints). In practice a first demand related forecast is generated with a combination of the at least two demand components and a second demand related forecast may be generated with the at least one demand component. In one approach, the first and second demand related forecasts may be shown or used in an aggregate plot, the aggregate plot being used to obtain forecast demand data for the print production environment.
Forecasting with the disclosed approach of the '993 Patent Application is well suited for its intended purpose. In particular, use of the disclosed approach can clearly lead to significantly improved forecasting (i.e., reasonably low levels of mean absolute percentage error [MAPE]). Nonetheless, it is believed that the approach disclosed and claimed below may, under certain circumstances, result in even lower levels of MAPE.
In one aspect of the disclosed embodiments there is disclosed a print demand forecasting system for use with a print production system in which multiple print jobs are processed over a selected time interval. The print demand forecasting system includes: a data collection tool, said data collection tool collecting print demand data for each print job processed during the selected time interval; mass memory for storing the collected print demand data; a computer implemented service manager for processing the stored print demand data to obtain a first time series component and a second time series component, said computer implemented service corresponding the first time series component with a first forecast model and the second time series component with a second forecast model; and a selector process, operating with said computer implemented service manager to select one of the first forecast model and the second forecast model, the selected one of the first forecast model and the second forecast model being used to obtain forecast data for a selected time outside of the selected time interval.
In another aspect of the disclosed embodiments there is disclosed A print demand forecasting method for use with a print production system in which multiple print jobs are processed over a selected time interval. The print demand forecasting method includes: collecting print demand data for each print job processed during the selected time interval; storing the collected print demand data in memory; processing the stored print demand data to obtain a first time series component and a second time series component; corresponding the first time series component with a first forecast model and the second time series component with a second forecast model; selecting one of the first forecast model and the second forecast model; and using the selected one of the first forecast model and the second forecast model obtain forecast data for a selected time outside of the selected time interval.
Referring to
In one example, the DCT is a programmable subsystem (possibly assuming the form of a suitable application programmable interface) capable of capturing data, including performance or demand related data, from the output device at selected time intervals. It should be appreciated that, consistent with U.S. Pat. No. 7,242,302 to Rai et al., the pertinent portions of which are incorporated herein by reference, the output device could assume a number of forms, such as a handheld device, PDA, or RFID related device. The DCT 18 may communicate with mass memory 20 for short term storage of, among other things, demand related data. Additionally, a wide variety of performance related information from the output device 16, including information relating to job type, client, duplex/simplex, page counts and impression counts, just to name a few, may be stored in mass memory 20.
The data processing center 12 includes a “service manager” 24 communicating with a “data warehouse” 26. In one illustrated embodiment, the service manager comprises a processing platform that is capable of performing the types of forecasting calculations described below. As contemplated, a variety of data from the document production centers 10, including demand data from mass memory 20, is stored in the data warehouse. The data warehouse may also store job performance related data in the form of a database to facilitate a data segmentation approach, as described below. In the illustrated approach of
Referring now to
An approach of the disclosed embodiments will now be applied to the time series of
With the time series segmentation of
Referring to
For the sake of understanding the modeling of S with the multinominal logistic regression, consider a process that takes on four values, {a, b, c, d}, and let the probabilites of the values be pa, pb, pc, and pd, and the sum of the probabilities be 1. The model might then include the following three logistic regression equations (of course, more equations would be formed for additional components:
Where,
α0, β0, γ0=model parameters to be estimated;
αT, βT, γT=model parameters to be estimated; and
X prepresents a co-variate vector (in this case, previous values)
In essence, fitting this model (including solving the above equations) provides a forecaster. It produces an estimator of the probabilities (pa, pb, pc, pd) for a next observation and the best forecast is the one with the highest probability. The above forecaster performs surprisingly well with an error of 2.4%, and has been found to work well as a time series forecaster. Putting the models together operates, in part, as follows:
It has been found that error in modeling of the forecaster for S (e.g., an error of 10%) can produce inadequate forecasts for the process. For the time series of
S: Multinom(X23˜X1+ . . . +X22) (i.e., looks back 22 obs to forecast the next one
Referring specifically to
Referring to
Referring to
Top: ARIMA (5,1,1)×(0,1,0)7;
Middle: ARIMA(0,2,8)×(0,2,1)9;
Bottom: ARIMA(1,2,1)×(1,2,1)6; and
S: Multinom(X12˜X1+ . . . +X11) (i.e., looks back 11 obs to forecast the next one).
The disclosed embodiments demonstrates that if time series can be segmented into individual time series that can be modeled well by a popular method like ARIMA, then the individual models can be forecast and the proper forecast used at the right time. The process that chooses among an interleaved time series is a selector process—a random process that takes on categorical variables. The selector process can be suitably obtained through use of multinomial logistic regression.
The following are some observations regarding the disclosed embodiments:
Based on the above description, the following features of the disclosed embodiments should now be apparent:
The claims, as originally presented and as possibly amended, encompass variations, alternatives, modifications, improvements, equivalents, and substantial equivalents of the embodiments and teachings disclosed herein, including those that are presently unforeseen or unappreciated, and that, for example, may arise from applicants/patentees and others.
It will be appreciated that various of the above-disclosed and other features and functions, or alternatives thereof, may be desirably combined into many other different systems or applications. Also that various presently unforeseen or unanticipated alternatives, modifications, variations or improvements therein may be subsequently made by those skilled in the art which are also intended to be encompassed by the following claims. Unless specifically recited in a claim, steps or components of claims should not be implied or imported from the specification or any other claims as to any particular order, number, position, size, shape, angle, color, or material.
Number | Name | Date | Kind |
---|---|---|---|
5970224 | Salgado et al. | Oct 1999 | A |
6611726 | Crosswhite | Aug 2003 | B1 |
6650433 | Keane et al. | Nov 2003 | B1 |
6805502 | Rai et al. | Oct 2004 | B2 |
6941013 | Drayer | Sep 2005 | B1 |
7002702 | Machida | Feb 2006 | B1 |
7051328 | Rai et al. | May 2006 | B2 |
7061636 | Ryan et al. | Jun 2006 | B2 |
7079266 | Rai et al. | Jul 2006 | B1 |
7125179 | Rai et al. | Oct 2006 | B1 |
7152589 | Ekeroth et al. | Dec 2006 | B2 |
7230744 | Christodoulou et al. | Jun 2007 | B2 |
7242302 | Rai et al. | Jul 2007 | B2 |
7562062 | Ladde et al. | Jul 2009 | B2 |
7584116 | Kakouros et al. | Sep 2009 | B2 |
7716022 | Park et al. | May 2010 | B1 |
7755778 | Rai et al. | Jul 2010 | B2 |
7953681 | Rai | May 2011 | B2 |
20010055123 | Ryan et al. | Dec 2001 | A1 |
20020124756 | Rai et al. | Sep 2002 | A1 |
20020129081 | Rai et al. | Sep 2002 | A1 |
20030149747 | Rai et al. | Aug 2003 | A1 |
20040130745 | Fabel et al. | Jul 2004 | A1 |
20040267485 | Penov et al. | Dec 2004 | A1 |
20060224440 | Rai | Oct 2006 | A1 |
20060226980 | Rai et al. | Oct 2006 | A1 |
20060241900 | Hu et al. | Oct 2006 | A1 |
20070019228 | Rai et al. | Jan 2007 | A1 |
20070078585 | Pomeroy et al. | Apr 2007 | A1 |
20070091355 | Rai | Apr 2007 | A1 |
20070092323 | Lin et al. | Apr 2007 | A1 |
20070124182 | Rai | May 2007 | A1 |
20070177191 | Eschbach et al. | Aug 2007 | A1 |
20070236724 | Rai et al. | Oct 2007 | A1 |
20070247657 | Zhang et al. | Oct 2007 | A1 |
20070247659 | Zhang et al. | Oct 2007 | A1 |
20070279675 | Quach et al. | Dec 2007 | A1 |
20070293981 | Rai | Dec 2007 | A1 |
20080285067 | Rai | Nov 2008 | A1 |
20090021773 | Rai | Jan 2009 | A1 |
20090021774 | Rai | Jan 2009 | A1 |
20090021775 | Rai | Jan 2009 | A1 |
20090157579 | Rai | Jun 2009 | A1 |
20100290079 | Rai et al. | Nov 2010 | A1 |
Entry |
---|
U.S. Appl. No. 11/868,993 to Rai et al. entitled System and Method of Forecasting Print Job Related Demand (filed on Oct. 9, 2007 and published on Apr. 9, 2009 as U.S. Patent Application Publication No. 20090094094). |
Julian J. Faraway, Extending the Linear Model with R: Generalized Linear, Mixed Effects and Nonparametric Regression Models, Chapman & Hall/CRC, 2005. |
Mestha et al., “Control Elements in Production Printing and Publishing Systems: DocuColoriGen3”, Conference on Decision and Control , Dec. 2003, Proceedings of the 42nd IEEE. |
Potts et al., “Scheduling with batching: A review”, European Journal of Operational Research 120 (2000) 228-249. |
Crovella et al., “Task Assignment in a Distributed System: Improving Performance by Unbalancing Load”, ACM SIGMETRICS Performance Evaluation Review, vol. 26, No. 1, ACM, 1998. |
Schroeder et al., “Evaluation of Task Assignment Policies for Supercomputing Servers: The Case for Load Unbalancing and Fairness”, Cluster Computer 7.2 (2004): 151-161. |
Number | Date | Country | |
---|---|---|---|
20100030617 A1 | Feb 2010 | US |