A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.
The field relates generally to information processing systems, and more particularly to product configuration in information processing systems.
Enterprises offer various types of support with their products including, for example, warranties, technical support services and product installation, deployment and repair services. Large enterprises may have numerous support options, which can be combined in multiple ways with, for example, hardware and software products.
Errors in product configuration and corresponding support offer data are common, and may result in incorrect order fulfillment and a lack of appropriate product support. Under conventional approaches, there are no systems in place to determine product and support offering issues in advance of their occurrence and to adequately handle such problems when they occur.
Embodiments provide a product configuration validation platform in an information processing system.
For example, in one embodiment, a method comprises receiving product selection data, wherein the product selection data characterizes at least one combination of at least two products. In the method, the product selection data is analyzed using one or more machine learning algorithms. The method further comprises predicting based, at least in part, on the analyzing, whether the at least one combination is anomalous. One or more alerts are generated in response to predicting that the at least one combination is anomalous.
Further illustrative embodiments are provided in the form of a non-transitory computer-readable storage medium having embodied therein executable program code that when executed by a processor causes the processor to perform the above steps. Still further illustrative embodiments comprise an apparatus with a processor and a memory configured to perform the above steps.
These and other features and advantages of embodiments described herein will become more apparent from the accompanying drawings and the following detailed description.
Illustrative embodiments will be described herein with reference to exemplary information processing systems and associated computers, servers, storage devices and other processing devices. It is to be appreciated, however, that embodiments are not restricted to use with the particular illustrative system and device configurations shown. Accordingly, the term “information processing system” as used herein is intended to be broadly construed, so as to encompass, for example, processing systems comprising cloud computing and storage systems, as well as other types of processing systems comprising various combinations of physical and virtual processing resources. An information processing system may therefore comprise, for example, at least one data center or other type of cloud-based system that includes one or more clouds hosting tenants that access cloud resources. Such systems are considered examples of what are more generally referred to herein as cloud-based computing environments. Some cloud infrastructures are within the exclusive control and management of a given enterprise, and therefore are considered “private clouds.” The term “enterprise” as used herein is intended to be broadly construed, and may comprise, for example, one or more businesses, one or more corporations or any other one or more entities, groups, or organizations. An “entity” as illustratively used herein may be a person or system. On the other hand, cloud infrastructures that are used by multiple enterprises, and not necessarily controlled or managed by any of the multiple enterprises but rather respectively controlled and managed by third-party cloud providers, are typically considered “public clouds.” Enterprises can choose to host their applications or services on private clouds, public clouds, and/or a combination of private and public clouds (hybrid clouds) with a vast array of computing resources attached to or otherwise a part of the infrastructure. Numerous other types of enterprise computing and storage systems are also encompassed by the term “information processing system” as that term is broadly used herein.
As used herein, “real-time” refers to output within strict time constraints. Real-time output can be understood to be instantaneous or on the order of milliseconds or microseconds. Real-time output can occur when the connections with a network are continuous and a user device receives messages without any significant time delay. Of course, it should be understood that depending on the particular temporal nature of the system in which an embodiment is implemented, other appropriate timescales that provide at least contemporaneous performance and output can be achieved.
As used herein, “natural language” is to be broadly construed to refer to any language that has evolved naturally in humans. Non-limiting examples of natural languages include, for example, English, Spanish, French and Hindi.
As used herein, “natural language processing (NLP)” is to be broadly construed to refer to interactions between computers and human (natural) languages, where computers are able to derive meaning from human or natural language input, and respond to requests and/or commands provided by a human using natural language.
As used herein, “natural language understanding (NLU)” is to be broadly construed to refer to a sub-category of natural language processing in artificial intelligence where natural language input is disassembled and parsed to determine appropriate syntactic and semantic schemes in order to comprehend and use languages. NLU may rely on computational models that draw from linguistics to understand how language works, and comprehend what is being said by a user.
As used herein, “natural language generation (NLG)” is to be broadly construed to refer to a computer process that transforms data into natural language. For example, NLG systems decide how to put concepts into words. NLG can be accomplished by training machine learning models using a corpus of human-written texts.
As used herein, “application programming interface (API)” or “interface” refers to a set of subroutine definitions, protocols, and/or tools for building software. Generally, an API defines communication between software components. APIs permit programmers to write software applications consistent with an operating environment or website. APIs are used to integrate and pass data between applications, and may be implemented on top of other systems.
The customer devices 102 and administrator devices 103 can comprise, for example, Internet of Things (IoT) devices, desktop, laptop or tablet computers, mobile telephones, or other types of processing devices capable of communicating with the product configuration validation platform 110 over the network 104. Such devices are examples of what are more generally referred to herein as “processing devices.” Some of these processing devices are also generally referred to herein as “computers.” The customer devices 102 and administrator devices 103 may also or alternately comprise virtualized computing resources, such as virtual machines (VMs), containers, etc. The customer devices 102 and/or administrator devices 103 in some embodiments comprise respective computers associated with a particular company, organization or other enterprise.
The terms “customer,” “administrator” or “user” herein are intended to be broadly construed so as to encompass numerous arrangements of human, hardware, software or firmware entities, as well as combinations of such entities. Product configuration validation services may be provided for users utilizing one or more machine learning models, although it is to be appreciated that other types of infrastructure arrangements could be used. At least a portion of the available services and functionalities provided by the product configuration validation platform 110 in some embodiments may be provided under Function-as-a-Service (“FaaS”), Containers-as-a-Service (“CaaS”) and/or Platform-as-a-Service (“PaaS”) models, including cloud-based FaaS, CaaS and PaaS environments.
Although not explicitly shown in
In some embodiments, the administrator devices 103 are assumed to be associated with repair technicians, system administrators, information technology (IT) managers, software developers, release management personnel or other authorized personnel configured to access and utilize the product configuration validation platform 110.
The customer devices 102 are respectively associated with one or more customers placing orders for products. As explained in more detail herein, along with a given product (e.g., personal computer, laptop, storage array, etc.), an enterprise may offer support products such as, but not necessarily limited to, warranties and installation, deployment and repair services for the given product. Enterprises may sell these support products in conjunction with a main product (e.g., hardware, appliances, software, etc.).
Large enterprises may, for example, have a catalog of thousands to millions of support offers, which may be combined in numerous ways with other products. With conventional approaches, errors in accumulating product data and data entry for product data can result in differences between what is intended to be sold and what is actually sold to customers. For example, product data may include a significant number of attributes associated with a service product that define the kind of services to be provided to a customer. The attributes may specify, for example, the duration of a service plan, a payment schedule, included parts, materials and/or labor, when service is to be provided (e.g., next business day, within a designated time, etc.), and availability of support (e.g., 24 hours per day, 7 days per week, weekdays, weekends, etc.). With current approaches, the description of a product may not be consistent with the product's configuration and/or attributes. Additionally, with conventional techniques, inconsistent product descriptions may lead to product combinations which are not appropriate. For example, if a particular higher-end product is typically sold with a same business day or better support plan, an order in which the higher-end product is paired with a lower-end support plan may be incorrect.
Current systems lack adequate approaches to find issues with product descriptions and product combinations. The systems are reactive in nature, seeking out issues in response to customer complaints. Moreover, current computing systems are not equipped and do not have adequate computational resources to handle the textual and contextual analysis required to identify such problems, link the problems to a large number (e.g., thousands to millions) of orders, and ultimately correct the issues.
The embodiments advantageously provide techniques to use machine learning to analyze new product and sales (e.g., order) data based on historical product and sales data known to be valid. As a result, the embodiments provide techniques to proactively flag anomalous product data instances and anomalous product combinations prior to impacting customers. As an additional advantage, the embodiments leverage NLP, NLU, NLG and other machine learning algorithms to generate product descriptions from inputted product attributes to improve product description accuracy and eliminate errors stemming from incorrect descriptions.
The product configuration validation platform 110 in the present embodiment is assumed to be accessible to the customer devices 102 and/or administrator devices 103 and vice versa over the network 104. The network 104 is assumed to comprise a portion of a global computer network such as the Internet, although other types of networks can be part of the network 104, including a wide area network (WAN), a local area network (LAN), a satellite network, a telephone or cable network, a cellular network, a wireless network such as a WiFi or WiMAX network, or various portions or combinations of these and other types of networks. The network 104 in some embodiments therefore comprises combinations of multiple different types of networks each comprising processing devices configured to communicate using Internet Protocol (IP) or other related communication protocols.
As a more particular example, some embodiments may utilize one or more high-speed local networks in which associated processing devices communicate with one another utilizing Peripheral Component Interconnect express (PCIe) cards of those devices, and networking protocols such as InfiniBand, Gigabit Ethernet or Fibre Channel. Numerous alternative networking arrangements are possible in a given embodiment, as will be appreciated by those skilled in the art.
Referring to
The product data engine 120 comprises one or more databases including product data for products of an enterprise. Such product data may comprise, but is not necessarily limited to, product stock keeping unit (SKU) identifiers (e.g., numbers), product descriptions and attributes corresponding to respective products. In a non-limiting operational example, referring to the table 201 in
As can be seen in table 201, if applicable, the product attributes (e.g., duration, category and service level) for SKUs 123-1111, 123-3333, 456-4444 and 456-5555 are consistent with the product descriptions and are not anomalous. Referring to SKU 123-2222 in table 201, product data in the product data engine 120 includes a description of a service product that indicates, for example, “2 Years, ProSupport Plus 4 Hour.” However, unlike the other SKUs in table 201, the product description for SKU 123-2222 is not consistent with the corresponding attributes of a “ProSupport” service plan having a 1 year duration, and promising service within the next business day of reporting a problem, and has been labelled as anomalous. As can be understood from table 201, some of the product data in the product data engine 120 may include descriptions of products that are not consistent with the products' configurations and/or attributes. For example, in the case of SKU 123-2222, the product description does not align with any of the 3 attributes (duration, category or service level). As a result, under conventional approaches, a seller or customer may think they are selling or purchasing 2 years of ProSupport Plus, 4-hour protection, but instead 1 year of ProSupport, next business day protection will be fulfilled programmatically. As explained, in more detail herein, the product data from the product data engine 120 is provided to the anomaly prediction engine 140 to identify anomalous instances of product data (e.g., like SKU 123-2222) so that the anomalies can be corrected before an order is fulfilled. As noted herein below, based on historical product description and attributes data from a product data corpus 125, the machine learning layer 141 of the anomaly prediction engine 140 is trained to identify normal and anomalous product descriptions. For example, referring to the table 201, the anomaly prediction layer 142 identifies that the product descriptions for SKUs 123-1111, 123-3333, 456-4444 and 456-5555 are normal, and not anomalous. In contrast, the anomaly prediction layer 142 identifies that the product description for SKU 123-2222 is anomalous and flags it for review.
The product data from the product data engine 120 is provided to the order management engine 150 and is used by the sales layer 151 in connection with recommending and selling product combinations to customers. For example, as noted herein above, along with a given product, an enterprise may offer support products, and sell these support products in conjunction with a main product. However, if the product data from the product data engine 120 includes product descriptions that are inconsistent with the product attributes, this may lead to product combinations which are not appropriate. For example, as noted above, if a particular higher-end product is typically sold with a same business day or better support plan, an order in which the higher-end product is paired with a lower-end support plan may be incorrect. In more detail, referring to the table 202 in
Referring to the product description generation engine 130 in
The product description generation engine 130 generates a product description 135 from corresponding product attributes data 121 using highly sophisticated neural networks and NLG techniques. Illustrative embodiments leverage one or more transformers. As used herein, a “transformer” is to be broadly construed to refer to a deep learning model that differentially weighs the significance of portions of input data. Similar to recurrent neural networks (RNNs), transformers manage sequential input data. However, transformers do not necessarily process the data in order, and utilize mechanisms which provide context for any position in an input sequence. By identifying context, a transformer does not need to process the beginning of a data sequence before the end of the data sequence, which allows for more parallelization than RNNs to reduce training time. Non-limiting examples of transformer-based neural network machine learning models that may be used by the embodiments are a Bidirectional Encoder Representations from Transformers (BERT) model, a GPT model and a GPT2/3 model, which may use context from both directions, and use encoder parts of transformers to learn a representation for each token.
In some embodiments, a pre-trained transformer 132 trained with large amounts of general-purpose text is then trained with custom enterprise corpus data (e.g., historical product description and attributes data 126 from product data corpus 125) for accurate sentence generation. The transformer 132 solves sequence to sequence issues while handling long-range issues. Since the attribute descriptions in natural language are essentially a sequence of words in the body of text, and the output sentence is also a sequence of words, this can be modeled as a many-to-many sequence-to-sequence problem which the transformer 132 solves.
A many-to-many sequence-to-sequence configuration has two core building blocks (encoder and decoder) and one supporting capability called an “attention mechanism.” The encoder-decoder architecture solves sequence-to-sequence problems like text generation or sentence generation tasks where input and output sequences are of different lengths. Considering an input textual message is similar to a time series model where the words come one after another in time and/or space, the transformer 132 comprises a sophisticated encoder-decoder with a multi-head attention mechanism.
The architecture of a transformer 132 in an illustrative embodiment is shown in
Illustrative embodiments utilize a Key2Text (K2T) or an OpenAI™ GPT2 transformer pre-trained with large amounts (e.g., millions of instances) of textual corpus data using, for example, PyTorch® and/or Tensorflow® software. The textual corpus data used for pre-training comprises, for example, community data from sources such as, but not necessarily limited to, Wikipedia®, news sources, social media (e.g., Twitter®) feeds, etc. Considering the data to be analyzed by the product description generation engine 130 will involve a massive amount of enterprise specific product data, the transformer models are customized and re-trained with domain-specific data from the product data corpus 125.
Illustrative embodiments utilize a Key2Text Python® package that leverages pre-trained K2T-base transformers to generate word combinations (e.g., sentences, phrases, etc.) from a set of input words. The package is built on top of a T5 transformer model and simplifies the APIs to use the model.
Another approach in illustrative embodiments is to leverage an advanced OpenAI™ GPT2 transformer, which is trained using a massive amount of general-purpose text and capable of generating multiple word combinations from the same input words. A pre-trained GPT2 model is created from a transformer pipeline and the attribute values are passed in concatenated text to generate word combinations.
In illustrative embodiments, a pre-trained transformer is augmented with training data from corpus data that includes enterprise specific domain data (e.g., historical product description and attributes data 126 from product data corpus 125). This can be achieved by leveraging pre-processing component 133 to load and pre-process (e.g., remove stop words, emojis, etc., perform stemming, lemmatization, etc.) custom training datasets to prepare for tokenization. Tokenizing converts words to numbers, which facilitates building of word vectors for NLP tasks. Tokenization can be done using a HuggingFace function “DistilBertTokenizerFast( )” Once the model is augmented with a domain specific corpus, generated product descriptions will be highly accurate and standardized, eliminating manual entry by humans and resolving the aforementioned problem of discrepancies between product descriptions and product attributes.
When products (e.g., main products and associated service products) are selected during an ordering process by customers via, for example, customer devices 102, the sales layer 151 of the order management engine 150 will invoke a request to the anomaly prediction engine 140 to detect any mismatch in the product selection (e.g., problematic combination of an identified main product with an identified service product). Upon detection of an anomalous combination, the sales layer 151 is notified by the anomaly prediction engine 140 of the anomaly and proactively generates one or more alerts, which may be transmitted to one or more administrators, via, for example, administrator devices 103, to correct the products in the order prior to order fulfillment by the fulfillment layer 152 and/or prior to the provision of services by the services layer 153. Generating an alert may comprise activating a flag requesting review of the order prior to order fulfillment. To detect anomalous product combinations, the training layer 143 of the machine learning layer 141 of the anomaly prediction engine 140 is trained with historical orders data from the order data repository 160 to learn which product combinations are typically used (e.g., considered normal). It is to be understood that although the embodiments are discussed in terms of main products (e.g., hardware, appliances, software, etc.) and associated service products, the embodiments are not limited thereto, and may apply to other types of products and combinations depending on the nature of the enterprise. Additionally, the embodiments are not necessarily limited to combinations where two products have been identified, and may be applied to detect anomalies in combinations where more than two products have been identified.
As noted above, in another aspect, based on input data from the product data engine 120, the anomaly prediction engine 140 identifies as anomalous instances where a product description is not consistent with the corresponding attributes of the product. As noted herein, some of the product data in the product data engine 120 may include descriptions of products that are not consistent with the products' configurations and/or attributes. For example, the addition of new product attributes and description data to the product data engine 120 will invoke a request to the anomaly prediction engine 140 to detect any product descriptions that do not reflect corresponding product attributes. Upon detection of an anomalous combination, the product data engine 120 is notified by the anomaly prediction engine 140 of the anomaly and proactively generates one or more alerts, which may be transmitted to one or more administrators, via, for example, administrator devices 103, to correct the product description. Generating an alert may comprise activating a flag requesting review of the product description. For example, this feature can be used to alert personnel in real-time of inconsistencies with the product attributes responsive to generating (e.g., authoring) product data (e.g., at the time of generating new product descriptions). To detect anomalous product descriptions, the training layer 143 of the machine learning layer 141 of the anomaly prediction engine 140 is trained with historical product description and attributes data from the product data corpus 125 to learn which product description and attribute combinations are typically used (e.g., considered normal).
Referring to the operational flow 700 in
As noted herein above, depending on the nature of the anomaly (e.g., inconsistent product description or problematic product combination), the training layer 143 of the machine learning (ML) layer 141 is trained with historical product description and attributes data 126 and historical orders data 161. The anomaly prediction engine 140 includes a pre-processing component 144, which processes the incoming product description and attributes data 122 and/or order data 155, and the historical product description and attributes data 126 and historical orders data 161 for analysis by the ML layer 141. For example, the pre-processing component 144 removes any unwanted characters, punctuation, and stop words. As can be seen in
The anomaly prediction engine 140 uses an unsupervised learning approach to detect anomalies in product selections (e.g., identified product combinations) and product descriptions. Normal product descriptions with respect to particular attributes are learned from historical product description and attributes data 126, and product selections of normal product combinations are learned the historical orders data 161. For example, the table 800 in
Anomaly detection or outlier detection identifies situations that are not considered normal based on the observation of the properties being considered. For example, according to illustrative embodiments, in normal orders, each transaction will have a specific main product and an attached service product and its attributes. During product mismatch situations, these features vary from the normal orders, and are considered as outliers or anomalies.
The machine learning layer 141 leverages unsupervised learning methodology for outlier detection of product descriptions and product combinations at the time of ordering and product data creation. The unsupervised learning methodology may utilize, for example, shallow or deep learning. In an embodiment, the machine learning layer 141 implements multivariate anomaly detection using an isolation forest algorithm, which does not require labeled training data. The isolation forest algorithm identifies anomalies among the normal observations, by setting up a threshold value in a contamination parameter that can apply for real-time predictions. The isolation forest algorithm has the capacity to scale up to handle extremely large data sizes (e.g., terabytes) and high-dimensional problems with a large number of attributes, some of which may be irrelevant and potential noise. The isolation forest algorithm has relatively low linear time complexity and prevents masking and swamping effects in anomaly detection. A masking effect is where a model predicts normal behavior when the behavior is anomalous. A swamping effect is where a model predicts anomalous behavior when the behavior is normal.
In illustrative embodiments, the machine learning model used by the machine learning layer 141 isolates an anomaly by creating decision trees over random attributes. This random partitioning produces significantly shorter paths since fewer instances of anomalies result in smaller partitions, and distinguishable attribute values are more likely to be separated in early partitioning. As a result, when a group (e.g., forest) of random trees collectively produces shorter path lengths for some particular points, then they are highly likely to be anomalies. A larger number of splits are required to isolate a normal point, while an anomaly can be isolated by a shorter number of splits. For example, referring to the plots 1001 and 1002 in
According to one or more embodiments, when products are selected via the sales layer 151 of an order management engine 150 (e.g., both on-line and offline), selected products and their attributes, as well as customer and region, etc. are input to the trained model of the machine learning layer 141 for prediction. If the model detects that the selected products vary from the typical selections in the orders from the historical orders data 161, that product selection will be flagged as an anomaly or outlier. This is an indication that the product selection is unusual. According to one or more embodiments, in a normal condition, the flag is set to NORMAL and orders are fulfilled. The flag may be set to ANOMALOUS when the machine learning model of the anomaly prediction engine 140 determines that product selections are anomalous. When the flag is set to ANOMALOUS, the order is prevented from being fulfilled and an administrator is notified via, for example, one of the administrator devices 103.
In connection with the operation of the anomaly prediction engine 140,
In illustrative embodiments, the machine learning model (e.g., isolation forest model) is trained using historical data (e.g., historical product description and attributes data 126, and historical orders data 161). If the anomaly prediction layer 142 identifies a given product description or combination as deviating from typical product descriptions or combinations, and/or having anomaly scores exceeding a threshold, the anomaly prediction layer 142 identifies the given product description or combination as anomalous (e.g., anomalous 145-1). If the anomaly prediction layer 142 identifies a given product description or combination as being consistent with typical product descriptions or combinations, and/or having anomaly scores below a threshold, the anomaly prediction layer 142 identifies the given product description or combination as normal (e.g., normal 145-2).
According to one or more embodiments, the product data corpus 125, order data repository 160 and other data corpuses, repositories or databases referred to herein can be configured according to a relational database management system (RDBMS) (e.g., PostgreSQL). In some embodiments, the product data corpus 125, order data repository 160 and other data corpuses, repositories or databases referred to herein are implemented using one or more storage systems or devices associated with the product configuration validation platform 110. In some embodiments, one or more of the storage systems utilized to implement the product data corpus 125, order data repository 160 and other data corpuses, repositories or databases referred to herein comprise a scale-out all-flash content addressable storage array or other type of storage array.
The term “storage system” as used herein is therefore intended to be broadly construed, and should not be viewed as being limited to content addressable storage systems or flash-based storage systems. A given storage system as the term is broadly used herein can comprise, for example, network-attached storage (NAS), storage area networks (SANs), direct-attached storage (DAS) and distributed DAS, as well as combinations of these and other storage types, including software-defined storage.
Other particular types of storage products that can be used in implementing storage systems in illustrative embodiments include all-flash and hybrid flash storage arrays, software-defined storage products, cloud storage products, object-based storage products, and scale-out NAS clusters. Combinations of multiple ones of these and other storage products can also be used in implementing a given storage system in an illustrative embodiment.
Although shown as elements of the product configuration validation platform 110, the product data engine 120, product data corpus 125, product description generation engine 130, anomaly prediction engine 140, order management engine 150 and/or order data repository 160 in other embodiments can be implemented at least in part externally to the product configuration validation platform 110, for example, as stand-alone servers, sets of servers or other types of systems coupled to the network 104. For example, the product data engine 120, product data corpus 125, product description generation engine 130, anomaly prediction engine 140, order management engine 150 and/or order data repository 160 may be provided as cloud services accessible by the product configuration validation platform 110.
The product data engine 120, product data corpus 125, product description generation engine 130, anomaly prediction engine 140, order management engine 150 and/or order data repository 160 in the
At least portions of the product configuration validation platform 110 and the elements thereof may be implemented at least in part in the form of software that is stored in memory and executed by a processor. The product configuration validation platform 110 and the elements thereof comprise further hardware and software required for running the product configuration validation platform 110, including, but not necessarily limited to, on-premises or cloud-based centralized hardware, graphics processing unit (GPU) hardware, virtualization infrastructure software and hardware, Docker containers, networking software and hardware, and cloud infrastructure software and hardware.
Although the product data engine 120, product data corpus 125, product description generation engine 130, anomaly prediction engine 140, order management engine 150, order data repository 160 and other elements of the product configuration validation platform 110 in the present embodiment are shown as part of the product configuration validation platform 110, at least a portion of the product data engine 120, product data corpus 125, product description generation engine 130, anomaly prediction engine 140, order management engine 150, order data repository 160 and other elements of the product configuration validation platform 110 in other embodiments may be implemented on one or more other processing platforms that are accessible to the product configuration validation platform 110 over one or more networks. Such elements can each be implemented at least in part within another system element or at least in part utilizing one or more stand-alone elements coupled to the network 104.
It is assumed that the product configuration validation platform 110 in the
The term “processing platform” as used herein is intended to be broadly construed so as to encompass, by way of illustration and without limitation, multiple sets of processing devices and one or more associated storage systems that are configured to communicate over one or more networks.
As a more particular example, the product data engine 120, product data corpus 125, product description generation engine 130, anomaly prediction engine 140, order management engine 150, order data repository 160 and other elements of the product configuration validation platform 110, and the elements thereof can each be implemented in the form of one or more LXCs running on one or more VMs. Other arrangements of one or more processing devices of a processing platform can be used to implement the product data engine 120, product data corpus 125, product description generation engine 130, anomaly prediction engine 140, order management engine 150, order data repository 160, as well as other elements of the product configuration validation platform 110. Other portions of the system 100 can similarly be implemented using one or more processing devices of at least one processing platform.
Distributed implementations of the system 100 are possible, in which certain elements of the system reside in one data center in a first geographic location while other elements of the system reside in one or more other data centers in one or more other geographic locations that are potentially remote from the first geographic location. Thus, it is possible in some implementations of the system 100 for different portions of the product configuration validation platform 110 to reside in different data centers. Numerous other distributed implementations of the product configuration validation platform 110 are possible.
Accordingly, one or each of the product data engine 120, product data corpus 125, product description generation engine 130, anomaly prediction engine 140, order management engine 150, order data repository 160 and other elements of the product configuration validation platform 110 can each be implemented in a distributed manner so as to comprise a plurality of distributed elements implemented on respective ones of a plurality of compute nodes of the product configuration validation platform 110.
It is to be appreciated that these and other features of illustrative embodiments are presented by way of example only, and should not be construed as limiting in any way. Accordingly, different numbers, types and arrangements of system elements such as the product data engine 120, product data corpus 125, product description generation engine 130, anomaly prediction engine 140, order management engine 150, order data repository 160 and other elements of the product configuration validation platform 110, and the portions thereof can be used in other embodiments.
It should be understood that the particular sets of modules and other elements implemented in the system 100 as illustrated in
For example, as indicated previously, in some illustrative embodiments, functionality for the product configuration validation platform can be offered to cloud infrastructure customers or other users as part of FaaS, CaaS and/or PaaS offerings.
The operation of the information processing system 100 will now be described in further detail with reference to the flow diagram of
In step 1702, product selection data characterizing at least one combination of at least two products is received by, for example, an anomaly prediction engine 140. In step 1704, the product selection data is analyzed using one or more machine learning algorithms.
In step 1706, based, at least in part, on the analyzing, a prediction is made whether the at least one combination is anomalous. The predicting comprises identifying the at least one combination as an outlier combination of at least a first product identifier and a second product identifier, wherein the one or more machine learning algorithms utilize an unsupervised learning technique to identify the at least one combination as an outlier combination. The one or more machine learning algorithms comprise an isolation forest algorithm trained with training data comprising historical product selection data. The historical product selection data comprises historical product combinations and one or more product attributes.
In step 1708, one or more alerts are generated in response to predicting that the at least one combination is anomalous. In illustrative embodiments, the product selection data comprises an order for the at least one combination, wherein the generating of the one or more alerts comprises activating a flag requesting review of the order prior to order fulfillment.
In illustrative embodiments, the method further comprises receiving product attributes data for at least one product, and generating, using at least one natural language generation algorithm, a description of the at least one product based, at least in part, on the product attributes data. The at least one natural language generation algorithm utilizes one or more transformers to generate one or more word combinations in the description. The at least one natural language generation algorithm utilizes one or more neural networks in combination with the one or more transformers to generate the one or more word combinations. The at least one natural language generation algorithm is trained with historical product description and attributes data.
In illustrative embodiments, the anomaly prediction engine 140 also receives product description and attributes data, and analyzes the product description and attributes data using the one or more machine learning algorithms. The anomaly prediction engine 140 predicts based, at least in part, on the analyzing of the product description and attributes data, whether one or more product descriptions are anomalous. One or more additional alerts are generated in response to predicting that the one or more product descriptions are anomalous. The one or more machine learning algorithms are trained with historical product description and attributes data.
It is to be appreciated that the
The particular processing operations and other system functionality described in conjunction with the flow diagram of
Functionality such as that described in conjunction with the flow diagram of
Illustrative embodiments of systems with a product configuration validation platform as disclosed herein can provide a number of significant advantages relative to conventional arrangements. For example, the product configuration validation platform uses machine learning to proactively predict anomalies or mismatches of various products selected as part of sales orders. For example, the embodiments utilize an isolation forest algorithm to detect outlier product combinations such as, but not necessarily limited to, product combinations of hardware, appliance or software products with service products. As an additional advantage, the embodiments generate alerts for unusual and/or misconfigured product configurations in real-time (e.g., during the sales process) to prevent fulfillment of unwanted orders.
The embodiments advantageously leverage an unsupervised learning approach and machine learning models to detect anomalies in product descriptions as well as product combinations. Responsive to identifying product descriptions which are inconsistent with corresponding product attributes, the embodiments generate alerts at the time of product data creation to avoid costly errors in the supply chain that may occur from incorrect product descriptions.
The embodiments further provide technical solutions which leverage state of the art NLP, NLU and NLG techniques by further training a pre-trained transformer with enterprise product data. Illustrative embodiments advantageously automatically generate product descriptions from inputted product attributes, thereby eliminating human configuration issues.
Unlike conventional approaches, illustrative embodiments provide technical solutions which formulate programmatically and with a high degree of accuracy, the detection of anomalies in product descriptions and product combinations. By utilizing historical product description and attribute data, and historical orders data corresponding to normal situations, and leveraging a sophisticated machine learning algorithm, anomalous product configurations are identified.
It is to be appreciated that the particular advantages described above and elsewhere herein are associated with particular illustrative embodiments and need not be present in other embodiments. Also, the particular types of information processing system features and functionality as illustrated in the drawings and described above are exemplary only, and numerous other arrangements may be used in other embodiments.
As noted above, at least portions of the information processing system 100 may be implemented using one or more processing platforms. A given such processing platform comprises at least one processing device comprising a processor coupled to a memory. The processor and memory in some embodiments comprise respective processor and memory elements of a virtual machine or container provided using one or more underlying physical machines. The term “processing device” as used herein is intended to be broadly construed so as to encompass a wide variety of different arrangements of physical processors, memories and other device components as well as virtual instances of such components. For example, a “processing device” in some embodiments can comprise or be executed across one or more virtual processors. Processing devices can therefore be physical or virtual and can be executed across one or more physical or virtual processors. It should also be noted that a given virtual device can be mapped to a portion of a physical one.
Some illustrative embodiments of a processing platform that may be used to implement at least a portion of an information processing system comprise cloud infrastructure including virtual machines and/or container sets implemented using a virtualization infrastructure that runs on a physical infrastructure. The cloud infrastructure further comprises sets of applications running on respective ones of the virtual machines and/or container sets.
These and other types of cloud infrastructure can be used to provide what is also referred to herein as a multi-tenant environment. One or more system elements such as the product configuration validation platform 110 or portions thereof are illustratively implemented for use by tenants of such a multi-tenant environment.
As mentioned previously, cloud infrastructure as disclosed herein can include cloud-based systems. Virtual machines provided in such systems can be used to implement at least portions of one or more of a computer system and a product configuration validation platform in illustrative embodiments. These and other cloud-based systems in illustrative embodiments can include object stores.
Illustrative embodiments of processing platforms will now be described in greater detail with reference to
The cloud infrastructure 1800 further comprises sets of applications 1810-1, 1810-2, . . . 1810-L running on respective ones of the VMs/container sets 1802-1, 1802-2, . . . 1802-L under the control of the virtualization infrastructure 1804. The VMs/container sets 1802 may comprise respective VMs, respective sets of one or more containers, or respective sets of one or more containers running in VMs.
In some implementations of the
In other implementations of the
As is apparent from the above, one or more of the processing modules or other components of system 100 may each run on a computer, server, storage device or other processing platform element. A given such element may be viewed as an example of what is more generally referred to herein as a “processing device.” The cloud infrastructure 1800 shown in
The processing platform 1900 in this embodiment comprises a portion of system 100 and includes a plurality of processing devices, denoted 1902-1, 1902-2, 1902-3, . . . 1902-K, which communicate with one another over a network 1904.
The network 1904 may comprise any type of network, including by way of example a global computer network such as the Internet, a WAN, a LAN, a satellite network, a telephone or cable network, a cellular network, a wireless network such as a WiFi or WiMAX network, or various portions or combinations of these and other types of networks.
The processing device 1902-1 in the processing platform 1900 comprises a processor 1910 coupled to a memory 1912. The processor 1910 may comprise a microprocessor, a microcontroller, an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), a central processing unit (CPU), a graphical processing unit (GPU), a tensor processing unit (TPU), a video processing unit (VPU) or other type of processing circuitry, as well as portions or combinations of such circuitry elements.
The memory 1912 may comprise random access memory (RAM), read-only memory (ROM), flash memory or other types of memory, in any combination. The memory 1912 and other memories disclosed herein should be viewed as illustrative examples of what are more generally referred to as “processor-readable storage media” storing executable program code of one or more software programs.
Articles of manufacture comprising such processor-readable storage media are considered illustrative embodiments. A given such article of manufacture may comprise, for example, a storage array, a storage disk or an integrated circuit containing RAM, ROM, flash memory or other electronic memory, or any of a wide variety of other types of computer program products. The term “article of manufacture” as used herein should be understood to exclude transitory, propagating signals. Numerous other types of computer program products comprising processor-readable storage media can be used.
Also included in the processing device 1902-1 is network interface circuitry 1914, which is used to interface the processing device with the network 1904 and other system components, and may comprise conventional transceivers.
The other processing devices 1902 of the processing platform 1900 are assumed to be configured in a manner similar to that shown for processing device 1902-1 in the figure.
Again, the particular processing platform 1900 shown in the figure is presented by way of example only, and system 100 may include additional or alternative processing platforms, as well as numerous distinct processing platforms in any combination, with each such platform comprising one or more computers, servers, storage devices or other processing devices.
For example, other processing platforms used to implement illustrative embodiments can comprise converged infrastructure.
It should therefore be understood that in other embodiments different arrangements of additional or alternative elements may be used. At least a subset of these elements may be collectively implemented on a common processing platform, or each such element may be implemented on a separate processing platform.
As indicated previously, components of an information processing system as disclosed herein can be implemented at least in part in the form of one or more software programs stored in memory and executed by a processor of a processing device. For example, at least portions of the functionality of one or more elements of the product configuration validation platform 110 as disclosed herein are illustratively implemented in the form of software running on one or more processing devices.
It should again be emphasized that the above-described embodiments are presented for purposes of illustration only. Many variations and other alternative embodiments may be used. For example, the disclosed techniques are applicable to a wide variety of other types of information processing systems and product configuration validation platforms. Also, the particular configurations of system and device elements and associated processing operations illustratively shown in the drawings can be varied in other embodiments. Moreover, the various assumptions made above in the course of describing the illustrative embodiments should also be viewed as exemplary rather than as requirements or limitations of the disclosure. Numerous other alternative embodiments within the scope of the appended claims will be readily apparent to those skilled in the art.
Number | Name | Date | Kind |
---|---|---|---|
10282237 | Johnson | May 2019 | B1 |
10534928 | Roden | Jan 2020 | B1 |
20220309390 | Arnautov | Sep 2022 | A1 |
20220351132 | Tatituri | Nov 2022 | A1 |
Entry |
---|
Wikipedia, “Isolation Forest,” https://en.wikipedia.org/w/index.php?title=Isolation_forest&oldid=1029032059, Jun. 17, 2021, 7 pages. |
Jeremy H. “4 Microservices Examples: Amazon, Netflix, Uber, and Etsy,” https://blog.dreamfactory.com/microservices-examples/, Jul. 14, 2021, 12 pages. |
J. Lewis et al., “Microservices: A Definition of This New Architectural Term,” https://martinfowler.com/articles/microservices.html, Mar. 25, 2014, 32 pages. |
A. Vaswani et al., “Attention Is All You Need,” arXiv:1706.03762v5, Dec. 6, 2017, 15 pages. |
U.S. Appl. No. 17/502,169, filed in the name of Bijan Kumar Mohanty et al. on Oct. 15, 2021, and entitled “Application Programming Interface Anomaly Detection.” |
U.S. Appl. No. 17/828,357, filed in the name of Bijan Kumar Mohanty et al. on May 31, 2022, and entitled “Microservices Anomaly Detection.” |
Number | Date | Country | |
---|---|---|---|
20240020475 A1 | Jan 2024 | US |