The present invention relates to methods and systems for forecasting product demand using a causal methodology, based on multiple regression techniques, and in particular to an improved method for forecasting product demand including a data cleansing process.
Accurate demand forecasts are crucial to a retailer's business activities, particularly inventory control and replenishment, and hence significantly contribute to the productivity and profit of retail organizations. Additionally, predicting the impact of promotions and price discounts on product demand is crucial for retail marketing, promotion planning, and replenishment activities.
Aprimo, a division of Teradata Corporation, has developed a suite of analytical applications for the retail business, referred to as Aprimo Demand Chain Management (DCM), which provides retailers with the tools they need for product demand forecasting, planning and replenishment. The Aprimo Demand Chain Management forecasting application assists retailers in accurately forecasting product sales at the store/SKU (Stock Keeping Unit) level to ensure high customer service levels are met, and inventory stock at the store level is optimized and automatically replenished. The Aprimo DCM forecasting application helps retailers anticipate increased demand for products and plan for customer promotions by providing the tools to do effective product forecasting through a responsive supply chain.
In U.S. patent application Ser. Nos. 11/613,404; 11/967,645; 12/982,251; and 12/982,251; and U.S. Pat. No. 7,996,254; Teradata Corporation has presented improvements to the DCM Application Suite for forecasting and modeling product demand during promotional and non-promotional periods. The forecasting methodologies described in these references seek to establish a cause-effect relationship between product demand and factors influencing product demand in a market environment. Such factors may include current product sales rates, product price changes, promotional activities, competitive information, weather conditions, and other factors. A product demand forecast is generated by combining an uplift coefficient determined through regression analysis of weekly historical demand data and the causal factors influencing product demand, with an Average Rate of Sale (ARS) value generated by the DCM application, and a seasonal factor selected for the product.
This novel methodology, referred to as Regression Event Uplift (REU), analyzes the impact of historical promotions on future promotional sales. It uses a methodology that calculates and models the partial role of various causal factors on the demand simultaneously. It is a multiple regression model that analyzes the effect of several causal factors such as price discount, media type, duration of promotion, etc. REU calculates a set of coefficients for each input variable which are used to forecast the future promotional uplifts.
Incorrect or inconsistent data, referred to as noise, in a customers' data can create problems in REU calculations. Incorrect or inconsistent data can lead to false conclusions and misdirected results for regression analysis. It is therefore important to employ a strong cleansing logic in the REU module to prevent anomalies and unexpected data points being fed into the regression analysis.
A new implementation of REU, employing an improved data cleansing methodology, is described below.
In the following description, reference is made to the accompanying drawings that form a part hereof, and in which is shown by way of illustration specific embodiments in which the invention may be practiced. These embodiments are described in sufficient detail to enable one of ordinary skill in the art to practice the invention, and it is to be understood that other embodiments may be utilized and that structural, logical, optical, and electrical changes may be made without departing from the scope of the present invention. The following description is, therefore, not to be taken in a limited sense, and the scope of the present invention is defined by the appended claims.
As stated above, the causal demand forecasting methodology seeks to establish a cause-effect relationship between product demand and factors influencing product demand in a market environment. A product demand forecast is generated by blending the various influencing factors in accordance with corresponding regression coefficients determined through the analysis of historical product demand and factor information. The multivariable regression equation can be expressed as:
LN=base+α1var1+α2var2+ . . . +αnvarn;
where LN represents demand; var1 through varn represent causal variables, such as current product sales rate, product price, weather, promotional activities, and other factors; and α1 through αn represent regression coefficients determined through regression analysis using historical sales, price, promotion, and other causal data.
The Aprimo DCM Application Suite may be implemented within a three-tier computer system architecture as illustrated in
Presentation tier 101 includes a PC or workstation 111 and standard graphical user interface enabling user interaction with the DCM application and displaying DCM output results to the user. Application tier 103 includes an application server 113 hosting the DCM software application 114. Database tier 103 includes a database server containing a database 116 of product price and demand data accessed by DCM application 114.
As illustrated in
Contribution: Contribution module 211 provides an automatic categorization of SKUs, merchandise categories and locations based on their contribution to the success of the business. These rankings are used by the replenishment system to ensure the service levels, replenishment rules and space allocation are constantly favoring those items preferred by the customer.
Seasonal Profile: The Seasonal Profile module, also referred to as the Intelligent Profile (IPC) Clustering module, 212 automatically calculates seasonal selling patterns at all levels of merchandise and location. This module draws on historical sales data to automatically create seasonal models for groups of items with similar seasonal patterns. The model might contain the effects of promotions, markdowns, and items with different seasonal tendencies.
Demand Forecasting: The Demand Forecasting module 213 provides store/SKU level forecasting that responds to unique local customer demand. This module considers both an item's seasonality and its rate of sales (sales trend) to generate an accurate forecast. The module continually compares historical and current demand data and utilizes several methods to determine the best product demand forecast.
Promotions Management: The Promotions Management module 214 automatically calculates the precise additional stock needed to meet demand resulting from promotional activity.
Automated Replenishment: Automated Replenishment module 215 provides the retailer with the ability to manage replenishment both at the distribution center and the store levels. The module provides suggested order quantities based on business policies, service levels, forecast error, risk stock, review times, and lead times.
Time Phased Replenishment: Time Phased Replenishment module 216 Provides a weekly long-range order forecast that can be shared with vendors to facilitate collaborative planning and order execution. Logistical and ordering constraints such as lead times, review times, service-level targets, min/max shelf levels, etc. can be simulated to improve the synchronization of ordering with individual store requirements.
Allocation: The Allocation module 217 uses intelligent forecasting methods to manage pre-allocation, purchase order and distribution center on-hand allocation.
Load Builder: Load Builder module 218 optimizes the inventory deliveries coming from the distribution centers (DCs) and going to the retailer's stores. It enables the retailer to review and optimize planned loads.
Capacity Planning: Capacity Planning module 219 looks at the available throughput of a retailer's supply chain to identify when available capacity will be exceeded.
Referring to
In step 320, the historical demand data for products having seasonal selling patterns is adjusted, i.e., deseasonalized, by dividing the actual historical demand values by their corresponding seasonal factors according to equation 1, dsdemandyr,wk=demandyr,wk/SFwk. The seasonally adjusted demand (dsdemand) is then used as input to the causal framework and the forecasting module of the DCM forecasting application.
In step 330, regression preprocessing is performed to select the set of causal factors that have statistically significant effects on historical product demand, and to prepare the causal factor data 208 for analysis.
In step 340, regression coefficients (α1, α2, α3, . . . αn) are calculated using the deseasonalized demand data and tracked causal factors 308. These regression coefficients are combined in step 350 to generate an uplift coefficient for each product.
In step 360, the uplift coefficient is combined with the DCM Average Rate of Sale (ARS) calculation results provided by the forecasting module of the DCM forecasting application for the product, and the appropriate seasonal factor, to generate the final product demand forecast for the product:
FCSTi=ARSi×SFi×uplifti
Within the Data Transformation module 410, processes are provided for data extraction 412, e.g., extraction of sales data, discounts, media types, promotions, and other information; de-seasonalization 414, i.e., the removal of seasonal affects from historical demand data; and initial demand analysis 416, e.g., stockout replacement analysis, regular demand outlier analysis, partially promoted week analysis, etc.
Model Assignment module 420 includes processes 422 and 424 for assigning model types to each product within a department and location (PDL). Products are categorized according to historical promotional behavior into four different model types:
Model Analysis module 430 includes separate analysis processes for the different model types. HH model processes 440 include dynamic sale pattern analysis 442, outlier analysis 444, media analysis 446, and regression analysis 448. Similarly, LH models processes 450 include outlier analysis 452, media analysis 454, and regression analysis 456; and AGG/GRP model processes 460 include media analysis 462, dynamic sale pattern analysis 464, outlier analysis 466, and regression analysis 468.
As stated earlier, incorrect or inconsistent data, referred to as noise, in a customers' data can create problems in REU calculations. Incorrect or inconsistent data can lead to false conclusions and misdirected results for regression analysis. Incorrect or inconsistent data is often revealed as outliers—data points which are distant from the majority of data points in a dataset—and should be removed from REU calculations.
The identification of outliers in product sales data for products that are frequently promoted can be more difficult, as sales values during product promotions may significantly exceed average sales during non-promotional periods.
Establishing a single set of high and low boundary values to identify regular (non-promotional) sales outliers may erroneously identify promotional sales values as outliers, whereas establishing a single set of high and low boundary values to identify outliers based upon promotional, or the total of promotional and non-promotional sales, may fail to identify regular sales outliers.
A new implementation of the REU algorithm employing an improved data cleansing methodology is illustrated by the flow chart of
The updated REU algorithm includes the following data cleansing steps which enhance the forecast accuracy:
A more detailed explanation of these data cleansing steps is provided below:
Instructions of the various software routines discussed herein, are stored on one or more storage modules in the system shown in
Data and instructions of the various software routines are stored in respective storage modules, which are implemented as one or more machine-readable storage media. The storage media include different forms of memory including semiconductor memory devices such as dynamic or static random access memories (DRAMs or SRAMs), erasable and programmable read-only memories (EPROMs), electrically erasable and programmable read-only memories (EEPROMs) and flash memories; magnetic disks such as fixed, floppy and removable disks; other magnetic media including tape; and optical media such as compact disks (CDs) or digital video disks (DVDs).
The instructions of the software routines are loaded or transported to each device or system in one of many different ways. For example, code segments including instructions stored on floppy disks, CD or DVD media, a hard disk, or transported through a network interface card, modem, or other interface device are loaded into the device or system and executed as corresponding software modules or layers.
The foregoing description of various embodiments of the invention has been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed. Many alternatives, modifications, and variations will be apparent to those skilled in the art in light of the above teaching.
This application claims priority under 35 U.S.C. §119(e) to the following co-pending and commonly-assigned patent application, which is incorporated herein by reference: Provisional Patent Application Ser. No. 61/783,400, entitled “METHOD AND SYSTEM FOR DATA CLEANSING TO IMPROVE PRODUCT DEMAND FORECASTING,” filed on Mar. 14, 2013, by David Chan and Ghadamali Bagherikaram. This application is related to the following commonly-assigned patents and patent applications, which are incorporated by reference herein: Application Ser. No. 11/613,404, entitled “IMPROVED METHODS AND SYSTEMS FOR FORECASTING PRODUCT DEMAND USING A CAUSAL METHODOLOGY,” filed on Dec. 20, 2006, by Arash Bateni, Edward Kim, Philip Liew, and J. P. Vorsanger; Application Ser. No. 11/967,645, entitled “TECHNIQUES FOR CAUSAL DEMAND FORECASTING,” filed on Dec. 31, 2007, by Arash Bateni, Edward Kim, J. P. Vorsanger, and Rong Zong; Application Ser. No. 12/982,251, entitled “METHODS AND SYSTEMS FOR FORECASTING PRODUCT DEMAND USING PRICE ELASTICITY OF DEMAND WITHIN A CAUSAL METHODOLOGY,” filed on Dec. 30, 2010, by Arash Bateni and Edward Kim; Application Ser. No. 13/691,679, entitled METHODS AND SYSTEMS FOR FORECASTING PRODUCT DEMAND FOR PRODUCTS WITH DYNAMIC SALES PATTERNS,” filed on Nov. 30, 2012, by Arash Bateni and David Chan; and U.S. Pat. No. 7,996,254, entitled “IMPROVED METHODS AND SYSTEMS FOR FORECASTING PRODUCT DEMAND DURING PROMOTIONAL EVENTS USING A CAUSAL METHODOLOGY,” issued on Aug. 9, 2011, by Arash Bateni, Edward Kim, Harmintar Atwal, and J. P. Vorsanger.
Number | Date | Country | |
---|---|---|---|
61783400 | Mar 2013 | US |