Time series data is recorded for a number of business processes in order to identify past performance metrics and to predict future performance. Such time series data has been used to model various machines, such as printing devices, to determine the rate at which such machines use consumables. However, inaccurate forecasting of time series data reduces operational efficiency and can cause unnecessary expense due to excess or shortage of inventory, non-optimal resource usage, premium freight or labor charges, or the like. One challenge is finding a modeling framework that most accurately describes a wide variety of observed behavior among customers or machines and for different time periods for a particular customer or machine.
This disclosure is not limited to the particular systems, devices and methods described, as these may vary. The terminology used in the description is for the purpose of describing the particular versions or embodiments only, and is not intended to limit the scope.
As used in this document, the singular forms “a,” “an,” and “the” include plural references unless the context clearly dictates otherwise. Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art. Nothing in this disclosure is to be construed as an admission that the embodiments described in this disclosure are not entitled to antedate such disclosure by virtue of prior invention. As used in this document, the term “comprising” means “including, but not limited to.”
In an embodiment, a system for forecasting consumption of a consumable for a machine may include a processor and a processor-readable non-transitory storage medium in communication with the processor. The processor-readable storage medium may contain one or more programming instructions that, when executed, cause the processor to receive consumption time series data for a consumable for a plurality of machines, where the consumption time series data for each machine comprises, for each of a plurality of time periods, an amount of the consumable consumed by the machine during the time period, and, for at least one machine of the plurality of machines, determine a model consumption forecast for each of a plurality of dynamic linear models, based on the consumption time series data for the consumable for the machine and the dynamic linear model, determine a final consumption forecast based on the plurality of model consumption forecasts, and determine an amount of the consumable for the machine based on the final consumption forecast.
In an embodiment, a method of forecasting consumption of a consumable for a machine may include receiving, by a computing device, consumption time series data for a consumable for a plurality of machines, where the consumption time series data for each machine comprises, for each of a plurality of time periods, an amount of the consumable consumed by the machine during the time period; and, for at least one machine of the plurality of machines, determining, by the computing device for each of a plurality of dynamic linear models, a model consumption forecast for the machine based on the consumption time series data for the consumable for the machine and the dynamic linear model, determining, by the computing device, a final consumption forecast based on the plurality of model consumption forecasts, and providing an amount of the consumable for the machine based on the final consumption forecast.
The following terms shall have, for the purposes of this application, the respective meanings set forth below.
As used herein, the terms “sum,” “product” and similar mathematical terms are construed broadly to include any method or algorithm in which a single datum is derived or calculated from a plurality of input data.
A “computing device” refers to a computer, a processor and/or any other component, device or system that performs one or more operations according to one or more programming instructions. An illustrative computing device is described in reference to
A “consumable” refers to any product or service that is used in the performance of an operation. For example, an amount of paper used by a printing device may be determined by a print volume. Alternately, an amount of toner used by a printing device may be estimated by the print volume or measured by a decrease in the amount of available toner. Similarly, an amount of work performed by an individual may be used to determine the amount of a service that is performed.
A “machine” refers to a device used to perform a task. In a print production environment, a machine may include, without limitation, a printing device.
A “printing device” refers to a machine capable of performing one or more print-related functions. For example, a print device may include a printer, a scanner, a copy machine, a multifunction device, a collator, a binder, a cutter or other similar equipment. A “multifunction device” is a device that is capable of performing two or more distinct functions. For example, a multifunction device may have print and scan capabilities.
A “service” refers to one or more operations. Illustrative services may include, without limitation, printing, copying, binding, delivering materials, procurement, production, and the like.
A “service provider” refers to an entity that performs one or more services for a user. A service provider may generally perform, for example and without limitation, print services, copy services, construction services, delivery services, and/or any other types of services.
The present disclosure teaches methods and systems for forecasting the consumption of a consumable for a large disparate population of customers and/or machines. Historical information (i.e., consumption time series data) may be collected for each customer and/or machine. The historical information may be ordered as a time series with discrete data points identified for particular time periods.
For example, the amount of toner used to print a page may be estimated based on the type of print operation being performed and the machine on which it is performed based on historical data. In a print embodiment, the consumption time series data that is received 105 may include print volume usage based on one or more of print impression meter reads and toner usage. Print impression meter reads may refer to reading a counter associated with a printing device that keeps track of the number of print impressions made by the printing device. Toner usage may refer to determining the actual amount of toner used by a printing device (based on, for example, an amount of toner used or remaining in a toner cartridge or other receptacle). In an embodiment, consumption time series data may be recorded for each type of toner present in a printing device. For example, a color printing device may include 3, 4 or more toner receptacles, for which consumption time series data may be separately stored.
In an embodiment, consumption time series data that is received 105 may include consumption time series data for a consumable for a plurality of machines having a similar machine type. For example, consumption time series data may be received 105 for printing devices of the same make and model. Alternately, consumption time series data may be received 105 for printing devices in the same class of printers, such as desktop printing devices, high-volume printing devices, or the like. Reception 105 of consumption time series data for similar classifications and groupings of machines may also be performed within the scope of this disclosure.
The computing device may determine 110, for each of a plurality of dynamic linear models, a model consumption forecast for at least one machine of the plurality of machines based on the consumption time series data for the consumable for the machine and the dynamic linear model. In other words, each model consumption forecast may correspond to a dynamic linear model and a machine, and a plurality of model consumption forecasts may be determined 110 for each machine. Each dynamic linear model may comprise a Bayesian model having an identified set of parameters. The parameters for each dynamic linear model may correspond to a common set of features amongst the dynamic linear models. However, each model may have different values for the set of parameters. Each state space model may have operational parameters that vary in time based on customer behavior. External perturbations to the system may be included in a model to improve forecasting. In an embodiment, information may be shared across customers in similar subpopulations in order to increase forecasting accuracy.
In an embodiment, one or more of the dynamic linear models used to make the determination of the model consumption forecast may be additive models. Additive models may add together month-to-month (or other period-based) predictions and unusual variations, such as seasonal components to the model. For example, if an increase in production of approximately 1000 units occurs every December, 1000 units may be added to a month-to-month forecast for December based on the seasonal variation.
In an embodiment, one or more of the dynamic linear models used to make the determination of the model consumption forecast may be multiplicative models. Multiplicative models may multiply a month-to-month (or other period-based) prediction by a factor associated with an unusual variation, such as a seasonal component. For example, if an increase in production of approximately 10% occurs in December, the month-to-month prediction for December may be multiplied by approximately 1.10 in order to account for the seasonal variation.
In an embodiment, dynamic linear models having a large range of seasonality (unusual variation) to trend (period-based) weighting may be used to accommodate machines operated by customers that are largely seasonal, customers that are largely trending, and customers that fall somewhere in between. In an embodiment, the weighting factors for the dynamic linear models may range from about 1E-06 to about 1E+05 (i.e., about 0.000001 to about 100,000). Dynamic linear models having different weighting factors may also be used within the scope of this disclosure.
In an embodiment, the dynamic linear models may have a large range in signal to noise ratio used to separate the underlying signal and the natural variability of the customer behavior. In an embodiment, the signal to noise ratio for the dynamic models may range from about 2E−01 to 1.1E+05 (i.e., about 0.2 to about 110,000). Dynamic linear models having alternate signal-to-noise ratios may also be used within the scope of this disclosure.
In an embodiment, a machine's periodic consumable level (such as a print volume) can be considered an observable time series Yt, where the machine data is specified as impressions (meter reds) or consumable usage (percentage of a toner bottle used or remaining). A Bayesian state space model may be used with the assumption that there is an unobservable state time series θt and that Yt is an imprecise measurement of θt. Knowledge may be measured as a probability, and forecasts may be conditional distributions based on prior knowledge. As such, a state space model may include the following:
State time series θtεp: t=0,1,2, . . . , where θt is a Markov chain:
π(θt|θ0,θ1, . . . ,θt−1)=λ(θt|θt−1)
Observable time series Ytεm: t=1,2,3, . . . , where Yt are independent and depend on θt only.
Dynamic linear models (or Gaussian linear state space models) may be applied to the machine data. Yt and θt may be specified by their means and variances, and the transformations from θt→Yt and θt−1→θt may be linear. In an embodiment, different dynamic linear models may combine the trend (periodic or period-based) and seasonal (intermittent or unusual) components with varying relative weights. In an embodiment, each dynamic linear model may combine the trend and seasonal components in one of an additive and a multiplicative mode. In an embodiment, the dynamic linear models may have varying signal to noise ratios. In an embodiment, the dynamic linear models have initial conditions dependent on other machines in the same product family. Based on the above, the following equations result:
Observation Equation: Yt=Ftθt+vt with vt≈Nm(0,Vt)
System Equation: θt=Gtθt−1+wt, with w≈Np(0,Wt)
Prior Distribution: θ0≈Np(m0,C0), where
Yt represents the observations at time t, θt represents the state vectors at time t, Ft is a known matrix transforming state into observation, Gt is a known matrix defining the time evolution of the state vector, vt and wt are independent Gaussian random vectors. For a model in which each time period is a month, 12 time periods may be considered when determining an estimation for a subsequent time period. In such a case, Ft and Gt may be the following:
The observation variance, Vt, is a 1×1 matrix that may be initialized when the first full time period's usage information is available. For an additive model, Vt=Yf. For a multiplicative model,
In an embodiment where a printing device uses meter reads of impressions printed, Yf is the first time period's actual estimated monthly copy volume (EMCV). In an embodiment where a printing device identifies the percentage of a consumable remaining or used, Yf is the first time period's actual usage.
The system variance matrix, Wt, is a 12×12 matrix that may be initialized when the first time period's usage or impression count is available:
For an additive model, W1,1=Model Factor One*Yf and W12,12=Model Factor Three*Yf; and for a multiplicative model,
where W1,1 may correspond to the variance of the seasonal component and W12,12 may correspond to the variance of the trend component for the dynamic linear model.
The system state matrix, mt, is a 1×12 matrix that may be initialized when the first time period's usage or impression count is available:
m
0=[0 0 0 0 0 0 0 0 0 0 0 m12].
For an additive model, m12 is equal to Yfam. For a multiplicative model, m12 is equal to ln(Yfam) (i.e., the natural logarithm of Yfam).). Yfam, may correspond to the family average of machines of a similar type, such as printing devices of the same make and model.
The system covariance matrix, C0, is a 12×12 matrix that may be initialized when the first time period's usage or impression count is available:
For an additive model, C1=Model Factor Two*Yf and C2=107*Dfam, where Dfam, is the default EMCV or supply usage for a particular machine type corresponding to the machine being considered. For a multiplicative model,
C1 corresponds to a seasonal variation and C2 corresponds to a trend or periodic variation.
Exemplary parameters for a set of dynamic linear models are disclosed in Table 1 below.
Referring back to
In an embodiment, the final consumption forecast may be determined 115 by determining a weight for each of the plurality of dynamic linear models. The weight for each dynamic linear model may be a dynamically adjustable number that is determined based on the accuracy of the particular model with respect to the consumption time series data for the machine over a plurality of trailing time periods. In other words, each model may be compared to prior consumption time series data to determine whether the model approximates the consumption time series closely or not. Based on the closeness of the model to the prior consumption time series data, the weight for the model may be dynamically adjusted. Other methods of determining 115 the final consumption forecast may also be performed within the scope of this disclosure.
An amount of the consumable may be provided 120 for the machine based on the final consumption forecast. For example, an order may be placed for the consumable based on the final consumption forecast. In an embodiment, the order may be placed automatically. The order may be filled by providing 120 the consumable to an operator of the machine.
In an embodiment, providing 120 an amount of the consumable may include determining a safety stock value. The safety stock value may be summed with the final consumption forecast in order to determine an amount of the consumable to be provided 120 to the operator of the machine. In an embodiment, the safety stock value may be determined based on a variance in the consumption time series data for the consumable for the machine.
For example, a safety stock value may be determined based on a customer stocking requirement, such as 30 days stock with P=95% confidence. Using the customer stocking requirement, the Bayesian framework allows calculation of the stocking level to be performed. Because forecasts are conditional distributions, a forecast of mean m and variance V may be defined as f(t)=N(m,V). Using this function, a stocking level L implies that the customer will be in-stock with a probability determined by the following equation:
where erf is the error function:
Solving for the stocking level L in terms of P, m, and V yields: L=m+V√{square root over (2)}*erf−1(2P−1). As such, this will allow inventory to be provided where it is most needed. Customers having greater variances may receive more safety stock than customers with smaller variances.
In an embodiment, the dynamic linear models may incorporate trend, periodicity, regression and variance components. In an embodiment, characteristics of members of a similar subpopulation may be considered for each member. A plurality of models may be selected by identifying clusters of customer behavior. The models may be combined into a single aggregate forecast (i.e., the final consumption forecast).
In an embodiment, an expected volume for a new machine may be determined based upon recent behavior for similar machines if such data is available. Moreover, the expected volume may be determined and/or modified based on business expectations. The initial prediction may be modified as additional information becomes available.
A controller 320 interfaces with one or more optional memory devices 325 to the system bus 300. These memory devices 325 may include, for example, an external or internal DVD drive, a CD ROM drive, a hard drive, flash memory, a USB drive or the like. As indicated previously, these various drives and controllers are optional devices.
Program instructions, software or interactive modules for providing the interface and performing any querying or analysis associated with one or more data sets may be stored in the ROM 310 and/or the RAM 315. Optionally, the program instructions may be stored on a tangible computer readable medium such as a compact disk, a digital disk, flash memory, a memory card, a USB drive, an optical disc storage medium, such as a Blu-ray™ disc, and/or other non-transitory storage media.
An optional display interface 330 may permit information from the bus 300 to be displayed on the display 335 in audio, visual, graphic or alphanumeric format. Communication with external devices, such as a print device, may occur using various communication ports 340. An illustrative communication port 340 may be attached to a communications network, such as the Internet or an intranet.
The hardware may also include an interface 345 which allows for receipt of data from input devices such as a keyboard 350 or other input device 355 such as a mouse, a joystick, a touch screen, a remote control, a pointing device, a video input device and/or an audio input device.
Various of the above-disclosed and other features and functions, or alternatives thereof, may be combined into many other different systems or applications. Various presently unforeseen or unanticipated alternatives, modifications, variations or improvements therein may be subsequently made by those skilled in the art, each of which is also intended to be encompassed by the disclosed embodiments.