The present invention relates to multi-channel retailing competition and, more particularly, to a simulation system and methodology to assist retail entities to make multi-channel business decisions dynamically.
Larger retail companies use multiple channels to sell their merchandise. Multiple channels include physical stores, web sites, catalogs, etc. The industry continues to invest heavily to sustain and renew these channels (e.g., global e-retail market size reached $80B in 2006). But multi-channel retailers are facing the challenges of how to make right strategies (price, service, etc.) to adopt the competing and dynamic multi-channel environment. Multi-channel retailing competition is very complex as:
To make informed decisions, it is necessary for retailers to be able to identify channel influences and customer preferences according to sales and price history of different channels/retailers.
In one aspect there is provided a system, method and computer program product that provides the ability for retailers to devise a current channel strategy (e.g., adaptive price setting) that considers competitors in a dynamic competing environment, and that enables computing a competitive advantage of a channel.
According to one aspect, there is provided a system, method and computer program product to estimate a price for selling a product j in a commerce channel comprising: a) receiving, at a processor device, real market data including sales and price history data of a product j sold by one or more retailers over one or alternate sales channels t; b) generating, by the processor device, a competitive advantage parameter value based on the sales and price history data; and, c) computing, utilizing the competitive advantage parameter value, an optimum price for a particular product to be marketed in one of the one or alternate sales channel.
Further to this aspect, the generating, by the processor device, of a competitive advantage parameter value comprises: receiving at input nodes of configured neural network, a data vector including the prices and customer preferences of a product j at different times, the configured neural network representing a dynamic procedure of competition in retailer market; propagating the data to one or more intermediate nodes of the configured neural network; and, implementing a function, at the intermediate nodes, for calculating a customer preference parameter value Cijt.
In a further aspect, the method includes implementing a back propagation computation of said neural network to obtain gradients for use in calculating said customer preference parameter value Cijt.
Furthermore, the method comprises: calculating a competitive advantage ψijk of customer segment i and product j in channel t according to:
ωijt(k)=max{θ1max{0,pjs−pjt},θ2max{0,Cijt−Cijs}}
where k denotes a time period, pjs (pjt) denotes the price of product “j” in respective channel s(t), cijt(cijs) denotes a calculated customer preference parameter value for respective channel t(s) of customer segment “i” while buying product “j”, and theta “θ” represents a relationship between two products. For example, θ1, θ2 are the weights of price and customer preference, respectively, e.g., for the same θ2, θ1 being large indicates customer valuing price more. θ1 and θ2 are identified from real marketing data, hence not only their ratio, but their values are meaningful.
In one embodiment, the method further comprises: producing, at output nodes of said neural network, a simulated sales volume qijt based on said competitive advantage parameter value ψijk, wherein said sales volume is computed for a channel t in a particular price setting.
Further to this embodiment, the simulated sales volume qijt is computed at output nodes of the configured neural network according to:
q
ij
t(k)=[adjustment factor]ψimt
where ψimt is the competitive advantage parameter value.
The objects, features and advantages of the present invention will become apparent to one skilled in the art, in view of the following detailed description taken in combination with the attached drawings, in which:
1) Data Integration module 11 initializes the Dynamic Competition Model by obtaining (e.g., receiving) and integrating initialization data from multiple sources into a unique platform (such as shown in
2) Parameter Identification module 20 using a Judge Neural Network 100: Apart from the initialization data, there exist key parameters in the Dynamic Competition Model that are first identified. A Judge Neural Network 100 is configured to determine the key parameters in the Dynamic Competition Model. With sales and price history data as the input, the JNN determines key parameters including but not limited to: channel influence and customer preference parameters. Two parameters include θ1, θ2 which are independent but will be identified together. The “competitive advantage” variable will affect the estimated sales quantity through the propagation along the neural network. That is, in one embodiment, the parameters in the JNN are identified using real market data. A Newton Descend method commonly used to train unknown parameters may be used, in one embodiment, to reduce the value of the cost function. Alternative to Newton Descend method, the unknown parameters can be trained using, for example, Gradient Descent method, Quasi-Newton method or Generation Algorithm (see Nocedal, Jorge and Wright, Stephen J. (1999) Numerical Optimization, Springer-Verlag, contents and disclosure of which is incorporated by reference). As will be described in detail herein below, the JNN uses the gradients (e.g., the change rates of sales quantity to factors) calculated from the backward neural network of the JNN, e.g., as shown in
3) Competition Process Simulation 30: After computing these key parameters via the JNN, the configured JNN 100 is used to perform a simulation and produce a final solution such as: an optimization of the price of channels in competition, or, the observing of resulting effects of different price setting and channel coordination strategies.
In one embodiment, the Judge Neural Network is a type of Neural Network and, as shown in the example Node A of the JNN portion 120 shown in
Several functions are defined using the “m” in-nodes as the input to compute the output of this node. Based on proper assumptions (i.e., the assumption should be reasonable and accepted by all customers), these functions include:
max{max{pjs−pjt,0},max{cijt−vijs}}
such as shown in
ψijt(k)=max{θ1max{0,pjs−pjt},θ2max{0,Cijt−Cijs}}
After the prices and customer preferences are set in the input layer, the output 225 of a first middle layer such as shown in Node A is obtained according to the functions defined in the network, which then serve as the input to a further (e.g., a second) middle layer (not shown), and the propagation continues until the output layer is determined. For example, a situation exists that the value of output layer can be known without considering some other value. For example, in the forward propagation (see
In certain portions of the middle layers, a key variable, ψijt(k) 225, where k represents a different time (i.e., different set of training data), is used to represent a calculation of a competitive advantage of customer segment i and product j in channel t, which helps in the computation of variable qijt 250, a variable representing sales volume of product j in channel t from customer segment i. For example, the ψijt(k) variable 225 is the bridge of variable 250 and the variables observed like prices. Especially, in identification, the “help” is prominent since the derivatives are computed from backward neural networks where variable 225 is a key middle node as
In one embodiment, JNN portion 120 shown in
In one embodiment, shown in
After computing these parameters, the configured JNN is used to perform a final simulation and produce a final solution, such as described herein below with respect to
As mentioned, in one embodiment, for the supervised training approach used to train the Judge Neural Network 100, there are four types of input parameters: product price, product cost (cost of manufacturing product or the cost retailers pay), customer preference, and customer population. During each iteration, the sales volume computed by JNN is compared with the real data, and the error is calculated, then the parameters are adjusted, e.g., using Newton method. The gradients are computed from the backward network of JNN 122 of
(e.g., error being the differences between the real sales quantity and the sales quantity computed by JNN) is lower than a pre-set limit. In one embodiment, optimal parameters are the ones getting least error.
where θmj is the product relationship between two products, e.g., products j and m (for example, if someone purchases a TV set, it is possible that he/she will buy a DVD player. Then the θmj parameter between TV set and DVD player would be a large positive value, for example;
is a computed adjustment factor, t is the channel, and k is time (training data). To obtain the value of qijt(k), the parameters β0, β1, and θmj are first identified in JNN with β0, β1 being two factors that affect the shape of a logistic function which is used widely for prediction. It is understood that the logistic function is a very common function and widely used to describe things having upper and lower bounds and is applied to the JNN; however, any other reasonable functions, which are monotone and have upper and lower bounds, are suitable. Cos tm and price pm (as explained herein above) and competitive advantage variable ψijt(k) are the input variables of
Thus, in one aspect, the JNN 100 assists in making multi-channel decision(s) dynamically for an entity, e.g., retailer, company or other business organization, by enabling one to identify channel influence and customer preference according to sales and price history of different channels/retailers; and, to give channel strategy (e.g. adaptive price setting) that considers competitors in a dynamic competing environment. That is, JNN describes a dynamic procedure of competition in a retailer market, i.e., when using JNN to make multi-channel decision(s), JNN will consider the change and reaction of all channels—and not just treat them in static state.
As described above at 30,
In this embodiment, Equation 1) provides an example objective function of a tth channel represented as Gt (pjt,qijt) which incorporates sales volumes, profit, market share, etc., represented as:
That is, there is obtained an objective function Gt for each channel t, where equation 1) is a special case where the objective function is set to be profit that can be computed by multiplying the sales quantity with the difference between price and cost of product t.
In a further aspect, as shown in
Where k is a time point, tlag denotes the time lag for competition (i.e., as a retailer typically cannot get real-time information to make decision and can only change the price based on some prior information) and is a nonnegative integer, if and only if “t” is an e-channel tlag=0 where e-channel represent an internet commerce (where a retailer “r” sell products on-line). Since via the internet, the retailer r will get more real-time information, its time lag is set to zero, showing this advantage of e-channel to traditional ones; and, λt and δtr are parameters describing different behaviors of different retailers. For example, if the retailer will change the price tempestuously, λt would set to be a large value. λt and δtr are the parameters describing the behaviors of retailers. Like λt, if the retailer will change the price tempestuously, δtr, would set to be a large value. The difference between λt and δtr is that λt is trying to increase own sales volume while δtr considers to decrease the sales volume of competitors.
In a further aspect, the method further determines how two channels cooperate when they belong to the same company. Normally, when the sales from one channel increases or drops, sales from the other channels is likely to fluctuate in the same way. Accordingly, the formula of equation 2) is revised to equation 3) as follows:
Thus, in the competition simulation, time is updated from k=1, and the price in the next time k+1 is computed according to equation (3), where the parameters λt, ttag, and δtr are set in advance and other variables, e.g., θ1, θ2 (
is the grads computed by the backward propagation of JNN. The last summation term represents the endeavor for decreasing the profit of the competitors.
In one embodiment, both G and
can be obtained from JNN. After a pre-defined K iteration steps, a price is computed that best fits in the business competition. That is, by using the backward propagation of the JNN, the
can be computed and, then, according to (3), the price at every time point in a dynamic simulation can be obtained.
As will be appreciated by one skilled in the art, aspects of the present invention may be embodied as a system, method or computer program product. Accordingly, aspects of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a “circuit,” “module” or “system.” Furthermore, aspects of the present invention may take the form of a computer program product embodied in one or more computer readable medium(s) having computer readable program code embodied thereon.
Any combination of one or more computer readable medium(s) may be utilized. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with a system, apparatus, or device running an instruction.
A computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with a system, apparatus, or device running an instruction. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc. or any suitable combination of the foregoing.
Computer program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++ or the like and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The program code may run entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
Aspects of the present invention are described below with reference to flowchart illustrations (e.g.,
The computer program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which run on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more operable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be run substantially concurrently, or the blocks may sometimes be run in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
While there has been shown and described what is considered to be preferred embodiments of the invention, it will, of course, be understood that various modifications and changes in form or detail could readily be made without departing from the spirit of the invention. It is therefore intended that the scope of the invention not be limited to the exact forms described and illustrated, but should be construed to cover all modifications that may fall within the scope of the appended claims.