The invention relates to methods and apparatuses that allow to compare entities, as for example companies or trademarks, that provide goods or services. Relevant information or data on the companies and their products is retrieved from an information space, as for example the internet, in an automatic fashion.
For example, for marketing reasons a comparison between given companies operating in the same business area is desirable. However, due to the vast and generally unstructured information this is not easy to achieve. Conventionally, marketing analysts have to browse available information sources, for example mailing catalogs, retail stores or also internet web shops. In the marketing business a manual analysis of competitors' profiles and prizing strategies is then necessary. As a result, often questions should be answered like which products X of company A correspond to products Y of the portfolio of company B? It can also be interesting to know which products or product groups of company A significantly deviate from the pricing structure of company B's corresponding products.
Conventionally, the necessary information collection and analysis is performed in a manual fashion. This requires a considerable human effort, as for example in terms of traditional market research collection has to be done through polling customers or pear group members. Also analysis for marketing purposes has been achieved so far in a very low-tech fashion. This requires human operators for aggregating the relevant information and for finding corresponding products of different makes and evaluating the pricing information.
This disclosure presents methods and apparatuses for comparing entities that provide goods or services as products. Specifically, entity identifiers are set for each entity, and a product category is set for defining classes of goods or services provided by said entities. Then, query results are obtained by querying at least one web service, wherein the web service provides predetermined product parameters as characteristic information on the products of the product category as a function of the entity identifier. One can then map, for example, product parameters from said web service to a predetermined data format for said product parameters which enables to further process the aggregated information.
For example, the method can be employed for comparing companies based on available product pricing information. Then, a product category can be selected, said product category comprising a plurality of products. For comparing companies pricing and product feature parameters for products of said selected product category is retrieved. This is for instance done by a price finding robot, said price finding robot providing said pricing and product feature parameters for products as a function of a company name. Then, products having similar product feature parameters and stemming from different companies are detected, and a difference in pricing features of those products is calculated.
This strategy allows to group products that have similar features but derive from different companies. One can then form clusters of products. An analysis with respect to average pricing levels in the groups or clusters is then feasible.
Hence, data formats for pricing and feature information stemming from different web services may also be automatically retrieved and mapped into a common format. The method allows to evaluate systematic deviations in the companies' pricing policies or to detect if companies do not have a matching product portfolio.
This disclosure also describes an apparatus that may be implemented to carry out such a method for comparing entities. In one embodiment an apparatus comprises an input means for setting entity identifiers and at least one product category identifier. The apparatus comprises a processing platform which is communicatively coupled to the internet for querying at least one web service for retrieving predetermined product parameters as characteristic information on the products of the product category. This information is obtained as a function of the entity identifier as a query result. The processing platform may also map said product parameters to a predetermined data format for said product parameters. For example, a similar apparatus can be implemented as a computer. Additionally, a computer program may be implemented based on one of the above aspects of a method for comparing entities to initiate an execution of such a method on a computer.
In the following aspects and embodiments of the invention are described with reference to the figures in the drawings.
In the following embodiments of the method and apparatus for comparing entities according to the present invention are described with reference to the enclosed figures.
An embodiment of the invention, for example, can be employed when two companies are given that operate in the same or similar business. For example, those companies may offer tangible products, as for example digital cameras. However, a comparison can be performed also between companies providing services having, for example, trade names. The following aspects and embodiments take advantage of the information offered in the internet, as for example, through price finding robots as a middle layer for data acquisition. Hence, this framework operates in an automatic manner and is therefore extremely cost efficient. Additionally, state of the art information retrieval and text mining techniques can be included.
Price finding robots provide information on a variety of products together with specific product parameters. As one dedicated product parameter the price or market prize of such a product can be obtained from a price finding robot.
The internet 5 among other elements comprises a plurality of web services and in particular price finding robots 6, 7, 8. Price finding robots search for products and their respective prices, for example, in online shops of the internet 5. For example, the price finding robot 7 browses online shops 11, 12, 13 and provides information on a product including usually the lowest available price. For example,
This abundant information on products and prices available on the internet or World Wide Web is used for estimating and comparing companies providing those products. Through querying the price finding robots the pricing structure, offer and demand regarding certain products or services can be extracted. Such price finder robots are particularly suited to track parameters regarding retail products, as for example digital cameras. However, also other products or web services can be monitored or employed. For example, internet book shops provide pricing and further information on certain authors and the prizes of their books, titles, number of editors and so forth.
The processing platform 2 of the apparatus 1 shown in
In step S1 an entity identifier and a product category for comparison is selected. An entity identifier is, for example, a company name, a make, a trademark name, a brand name, but may be also a person's or party's name. Recalling the example of digital cameras an entity identifier can be, for example, the make “Canon”, “Fuji” or “Panasonic”. The product category, for example, can be defined as “digital cameras”.
Next, in step S2 the processing platform as shown in
For example by selecting only price finder robots from a given country a cross company comparison by means of product pricing can be performed on a fine grained level. This means on a country level, regional or world wide level comparisons are feasible.
For example, product parameter P1 relates to the make “Panasonic” plus the type “Lumix DMC-TZ3 EG”. Product parameter P2 reflects the product category “digital camera”. Product parameters P3-P8 correspond to technical features of the digital camera proposed in the query result WP. For example, P4 states the number of pixels, P5 the resolution, P6 supported data format types, P7 the sensitivity and P8 the diameter of the filter screw thread. Product parameter P9 relates to the available price of the product.
The processing platform 2 is, for example, implemented with an HTML-file wrapper that extracts the textual information from the fields containing the product parameters P1-P9 from the web page WP. Such wrappers can be adapted to specific price finder robots or their proprietary data format in which the products are presented in the query results WP, i.e. the information retrieval from the query results WP is executed through dedicated language wrappers. Language or HTML-wrappers are known in the art.
In
Product parameters or features of products may also vary across price finding robot platforms. For example, the same feature “number of mega pixels” can be labeled as “resolution” or “effective points”. Hence, the mapping step S3 optionally also includes an assignment of the product parameters corresponding to the same semantic feature to the same standard denomination. Such mapping functions can be implemented manually by specifying specific mapping rules. Once, those mapping rules are established the mapping of the specific product features or product parameters from different price finding robots can be done automatically. As a result of the mapping step S3 product data for each model or make is acquired. Hence, data structures containing the structured information on products belonging to different companies or makes are provided.
The product parameters, for example, for entry or data structure C1 can be regarded as a three-dimensional vector. Analogously, all entries in the lists L1, L2 can be represented by three-dimensional vectors. A similarity measure, for example, can be the metric difference between those parameter vectors for each entry C1-CN, F1-FM. As a result, one obtains a mapping of entries in the list L1 to entries in the list L2. One can also contemplate other similarity measures such as cosine similarities or Pearson correlations.
For example, the product C1=“Canon Ixus 37” corresponds best to the product F2=“Magenta Fuji X1”. This is indicated by a mapping arrow M1. Similarly, mappings between C2 and F1 (M2), C3 and Fn (M3) and Cn and F3 (M4) can be established. Certainly, other differing similarity functions or measures can be implemented. As a result, a bipartite match making of products from the two lists L1 and L2 is obtained.
Now, an analysis in step S4 as illustrated in
In an alternative or optional analysis step a portfolio gap analysis can be performed based on the acquired data. For example, if no match can be found as illustrated in
According to another aspect of a method for comparing companies based on their products available on the internet, the analysis can be extended to grouping certain products. For example, products having similar but not like product parameters sets are clustered into product groups. For example, one product group with respect to digital cameras could be lower budget cameras opposed to high end mirror reflex cameras. Then, average prizes for these clusters or products belonging to a cluster are aggregated and compared on a per-cluster rather than on a product basis.
A variety of clustering methods are available, as for example k-means, expectation maximization, density-based clustering etc.
For example, the cluster CL1 refers to a plurality of digital cameras having a low resolution and high noise. This would correspond to cheap consumer low budget cameras. On the other hand, cluster CL3 may refer to high-end digital cameras with a high resolution and low noise levels. Another exemplary cluster is shown as CL2 in the diagram of
Instead of matching single products from different companies or makes, now clusters with average prices can be compared to each other. Clustering allows also an overall portfolio gap analysis with respect to companies or producers. For example, one may detect that a certain make, say “Fuji”, does not offer cameras that can be classified into cluster CL2.
Finally, significant deviations in prices of corresponding products or products of different companies belonging to same clusters can be computed. This leads to a cross company comparison for the products. In particular, the pricing information may be relevant for marketing issues. In a display as shown in
An exemplary overall score could summarize individual scores or differences between pricing policies. For example, as a result of the analysis, one may find that cameras by the make “Canon” appear to be 30% more expensive than comparable products by the brand “Fuji”. Regarding marketing issues this might indicate that Fuji's customers are willing to pay a premium for the higher quality products or a more prestigious brand name.
Hence, the several contemplated aspects and embodiments of the invention provide for an automatic price based comparison of companies, in particular, for business to customer products. The overall framework either implemented as a computer program or apparatus, for example computer, carries out the relevant method and analysis steps for the price based comparison. Additionally, the user may input similarity measures to find corresponding products in an automatic fashion. For example, product or service A of company B is comparable to product or service C of make D. Further, the application allows to compare single products but also to compare product groups or clusters by a way of applying standard clustering techniques on parameter features that are extracted from the product descriptions from web service query results. Additionally, portfolio gap analyses may be carried out. The above mentioned aspects and embodiments may specifically enhance and facilitate the strategies of marketing analysts.