This application is based on and claims priority under 35 U.S.C. § 119 to Indian Provisional Application No. 202141057905, filed on Dec. 13, 2021, and Indian Complete Application No. 202141057905, filed on Nov. 10, 2022, the disclosures of which are incorporated by reference herein in their entireties.
The present disclosure relates to an electronic device, and more specifically, to a method and the electronic device for on-device lifestyle recommendations.
Fashion is an integral part of human life. Access to fashion trends in the real world is available. But, channelizing and analyzing user's fashion and bringing the right content in the right context with respect to fashion trends are not available. A user can connect to current trends of fashion easily only upon knowing his/her own style appropriately. A bridge between one's fashion and fashion trends outside is a missing connection. Thus, it is desired to provide a useful method and/or system for bridging the user's fashion and the fashion trends in the real world.
Segmenting the identified fashion objects from the image includes determining a feature vector of the image using a Convolution Neural Network (CNN) model; determining Region of Interests (ROIs) of the image by providing the feature vector to a Region Proposal Network (RPN); optimizing scales of the ROIs by providing the feature vector and the predicted ROIs to a Feature Pyramid Network (FPN); refining an alignment of the ROIs; and determining the segmented fashion objects including output masks, labels, and coordinates of the identified fashion objects in the ROIs using a plurality of neural network models.
Provided is a method for on-device lifestyle recommendations.
According to an aspect of the disclosure, a method of an electronic device for on-device lifestyle recommendations, includes: receiving a user input; determining a fashion context based on the user input; dynamically clustering fashion objects in at least one image stored in the electronic device based on the fashion context; and displaying a lifestyle recommendation including the clustered fashion objects.
The dynamically clustering the fashion objects in the at least one image may include: identifying the fashion objects in the at least one image by analyzing the at least one image stored using an artificial intelligence (AI) model; generating a fashion knowledge graph including different classes of the identified fashion objects; traversing the fashion context through the fashion knowledge graph; and dynamically clustering the fashion objects in the different classes obtained based on the traversal.
The method may further include updating the fashion knowledge graph based on a user action on the recommendation.
The method may further include updating the fashion knowledge graph based on receiving and analyzing a new image.
The generating the fashion knowledge graph may include: segmenting the identified fashion objects from the image; classifying the segmented fashion objects to different classes; determining personal and social attributes of the segmented fashion objects in the different classes; and generating the fashion knowledge graph including the different classes of the segmented fashion objects, wherein each segmented fashion object in each class is assigned with either a personal tag or a social tag based on the personal and social attributes.
The dynamically clustering the fashion objects in the different classes may include: determining a weightage of a match between the at least one class of segmented fashion objects and the fashion context; and dynamically clustering the segmented fashion objects with the assigned tag in the at least one class based on the weightage.
The segmenting the identified fashion objects from the image may include: determining a feature vector of the image using a Convolution Neural Network (CNN) model; determining Region of Interests (ROIs) of the image by providing the feature vector to a Region Proposal Network; optimizing scales of the ROIs by providing the feature vector and the predicted ROIs to a Feature Pyramid Network (FPN); refining an alignment of the ROIs; and determining the segmented fashion objects including output masks, labels, and coordinates of the identified fashion objects in the ROIs using a plurality of neural network models.
The classifying the segmented fashion objects to the different classes may include: obtaining labels of the segmented fashion objects; and performing one of: based on the labels of the segmented fashion objects being clothes, classifying the segmented fashion objects into a pattern class, a fabric class, and an attire class, and based on the labels of the segmented fashion objects being fashion accessories, classifying the segmented fashion objects into a fashion accessory class.
The determining the personal and social attributes of the segmented fashion objects in the different classes may include: identifying each person in the image by detecting faces of people in the image; determining a relationship of each person with a user of the electronic device; segregating the segmented fashion objects of each person; and determining the personal and social attributes of the segregated fashion objects based on the relationship of each person with the user.
According to an aspect of the disclosure, an electronic device for on-device lifestyle recommendations, includes: a display; a memory storing instructions; a processor configured to execute the instructions to: detect a user input on the electronic device; determine a fashion context based on the user input; dynamically cluster fashion objects in at least one image stored in the electronic device based on the fashion context; and control the display to display a lifestyle recommendation including the clustered fashion objects.
The processor may be further configured to execute the instructions to: identify the fashion objects in the at least one image by analyzing the at least one image stored using an artificial intelligence (AI) model; generate a fashion knowledge graph including different classes of the identified fashion objects; traverse the fashion context through the fashion knowledge graph; and dynamically cluster the fashion objects in the different classes obtained based on the traversal.
The processor may be further configured to execute the instructions to update the fashion knowledge graph based on a user action on the recommendation.
The processor may be further configured to execute the instructions to update the fashion knowledge graph based on receiving and analyzing a new image.
The processor may be further configured to execute the instructions to: segment the identified fashion objects from the image; classify the segmented fashion objects to different classes; determine personal and social attributes of the segmented fashion objects in the different classes; and generate the fashion knowledge graph including the different classes of the segmented fashion objects, wherein each segmented fashion object in each class is assigned with either a personal tag or a social tag based on the personal and social attributes.
The processor may be further configured to execute the instructions to: determine a weightage of a match between the at least one class of segmented fashion objects and the fashion context; and dynamically cluster the segmented fashion objects with the assigned tag in the at least one class based on the weightage.
The processor may be further configured to execute the instructions to: determine a feature vector of the image using a Convolution Neural Network (CNN) model; determine Region of Interests (ROIs) of the image by providing the feature vector to a Region Proposal Network; optimize scales of the ROIs by providing the feature vector and the predicted ROIs to a Feature Pyramid Network (FPN); refine an alignment of the ROIs; and determine the segmented fashion objects including output masks, labels, and coordinates of the identified fashion objects in the ROIs using a plurality of neural network models.
The processor may be further configured to execute the instructions to: obtain labels of the segmented fashion objects; and performing one of: based on the labels of the segmented fashion objects being clothes, classify the segmented fashion objects into a pattern class, a fabric class, and an attire class, and based on the labels of the segmented fashion objects being fashion accessories, classify the segmented fashion objects into a fashion accessory class.
The processor may be further configured to execute the instructions to: identify each person in the image by detecting faces of people in the image; determine a relationship of each person with a user of the electronic device; segregate the segmented fashion objects of each person; and determine the personal and social attributes of the segregated fashion objects based on the relationship of each person with the user.
The above and other aspects, features, and advantages of certain embodiments of the present disclosure will be more apparent from the following description taken in conjunction with the accompanying drawings, in which:
Various embodiments provide a method and an electronic device for on-device lifestyle recommendations. The electronic device provides fashion-centric on-device content by performing fashion-centric personnel and social attribution of various images stored in the electronic device by analyzing the images through neural networks and brings fashion information handy to the user and solves the problem of bringing appropriate fashion content as and when needed to the user.
An aspect of various embodiments herein is to segment, extract, store, cluster, and classify personnel attributed fashion components of images stored in the electronic device and bring contextually to a user based on user interaction. The proposed method is handy during message composition, replies to photos, collaborative search, fashion smart selects and share, fashion-centric image search, feedback to e-commerce sites, and unified and outfit-matching backgrounds. The method allows the electronic device which enables the user to store, access and share the fashion content as a whole or selected part of an image or cluster formed from the components of various images
The embodiments herein and the various features and advantageous details thereof are explained more fully with reference to the non-limiting embodiments that are illustrated in the accompanying drawings and detailed in the following description. Descriptions of well-known components and processing techniques are omitted so as to not unnecessarily obscure the embodiments herein. Also, the various embodiments described herein are not necessarily mutually exclusive, as some embodiments can be combined with one or more other embodiments to form new embodiments. The term “or” as used herein, refers to a non-exclusive or, unless otherwise indicated. The examples used herein are intended merely to facilitate an understanding of ways in which the embodiments herein can be practiced and to further enable those skilled in the art to practice the embodiments herein. Accordingly, the examples should not be construed as limiting the scope of the embodiments herein.
As is traditional in the field, embodiments may be described and illustrated in terms of blocks which carry out a described function or functions. These blocks, which may be referred to herein as managers, units, modules, hardware components or the like, are physically implemented by analog and/or digital circuits such as logic gates, integrated circuits, microprocessors, microcontrollers, memory circuits, passive electronic components, active electronic components, optical components, hardwired circuits and the like, and may optionally be driven by firmware. The circuits may, for example, be embodied in one or more semiconductor chips, or on substrate supports such as printed circuit boards and the like. The circuits constituting a block may be implemented by dedicated hardware, or by a processor (e.g., one or more programmed microprocessors and associated circuitry), or by a combination of dedicated hardware to perform some functions of the block and a processor to perform other functions of the block. Each block of the embodiments may be physically separated into two or more interacting and discrete blocks without departing from the scope of the disclosure. Likewise, the blocks of the embodiments may be physically combined into more complex blocks without departing from the scope of the disclosure.
The accompanying drawings are used to help easily understand various technical features and it should be understood that the embodiments presented herein are not limited by the accompanying drawings. As such, the present disclosure should be construed to extend to any alterations, equivalents and substitutes in addition to those which are particularly set out in the accompanying drawings. Although the terms first, second, etc. may be used herein to describe various elements, these elements should not be limited by these terms. These terms are generally only used to distinguish one element from another.
Accordingly, the embodiments herein provide a method for on-device lifestyle recommendations. The method includes detecting, by an electronic device, a user input on the electronic device. The method includes determining, by the electronic device, a fashion context based on the user input. The method includes dynamically clustering, by the electronic device, fashion objects in images stored in the electronic device based on the fashion context. The method includes displaying, by the electronic device, a lifestyle recommendation including the clustered fashion objects.
Accordingly, the embodiments herein provide an electronic device for on-device lifestyle recommendations. The electronic device includes a fashion object recommendation engine, a memory, a processor, where the fashion object recommendation engine is coupled to the memory and the processor. The fashion object recommendation engine is configured for detecting a user input on the electronic device. The fashion object recommendation engine is configured for determining a fashion context based on the user input. The fashion object recommendation engine is configured for dynamically clustering fashion objects in images stored in the electronic device based on the fashion context. The fashion object recommendation engine is configured for displaying a lifestyle recommendation including the clustered fashion objects.
Unlike related art methods and systems, the electronic device provides fashion-centric on-device content by performing fashion-centric personnel and social attribution of various images stored in the electronic device by analyzing the images through neural networks and bringing fashion information handy to the user and solving the problem of bringing appropriate a fashion content as and when needed to the user, which improves user experience.
Unlike related art methods and systems, the electronic device segments, extracts, stores, clusters, and classifies personnel attributed fashion components of images stored in the electronic device and brings context to a user. The proposed method is handy during message composition, replies to photos, collaborative search, fashion smart selects and share, fashion-centric image search, feedback to e-commerce sites, and unified and outfit-matching backgrounds. The method allows the electronic device which enables the user to store, access and share the fashion content as a whole or selected part of an image or cluster formed from the components of various images.
Related art solutions provide a way to search fashion content in the web but the proposed method allows the electronic device to bring relevant content available on the electronic device. The proposed method also allows the electronic device to fashion-centric on-device content accessing, and recommend the right contextual content. The proposed method solves the problem by thoroughly analyzing the existing images on the electronic device in terms of various fashion aspects by using Artificial Intelligence (AI) techniques. This analysis is used to build a fashion knowledge graph of the user and the user's close community. The knowledge graph is accessed and provided to the user as and when required by sensing the fashion context of user interactions and presenting the data, building solutions by using data and providing appropriate recommendations.
Referring now to the drawings, and more particularly to
The fashion object recommendation engine 110 detects the user input (e.g. user interactions, selecting a text, media, etc.) on the electronic device 100. Further, the fashion object recommendation engine 110 determines a fashion context based on the user input. Further, the fashion object recommendation engine 110 dynamically clusters fashion objects (e.g. jeans, shirt, cap, ornaments, watch, wallet, etc.) in images stored in the electronic device 100 based on the fashion context. Further, the fashion object recommendation engine 110 displays a lifestyle recommendation (e.g. image, text, etc.) including the clustered fashion objects on the display 150.
In an embodiment, for dynamically clustering the fashion objects in the images, the fashion object recommendation engine 110 identifies the fashion objects in the images by analyzing the images stored using an AI model 206. Further, the fashion object recommendation engine 110 creates a fashion knowledge graph including different classes of the identified fashion objects. Further, the fashion object recommendation engine 110 traverses the fashion context through the fashion knowledge graph. Further, the fashion object recommendation engine 110 dynamically clusters the fashion objects in the different classes obtained upon the traversal.
In an embodiment, the fashion object recommendation engine 110 updates the fashion knowledge graph based on a user action (e.g. choosing, ignoring, etc.) on the recommendation. In another embodiment, the fashion object recommendation engine 110 updates the fashion knowledge graph upon receiving and analyzing a new image.
In an embodiment, for creating the fashion knowledge graph including the different classes of the identified fashion object, the fashion object recommendation engine 110 segments the identified fashion objects from the image. Further, the fashion object recommendation engine 110 classifies the segmented fashion objects into different classes. Further, the fashion object recommendation engine 110 determines the personal and social attributes of the segmented fashion objects in the different classes. Further, the fashion object recommendation engine 110 creates the fashion knowledge graph including the different classes of the segmented fashion objects, where each segmented fashion object in each class is assigned either a personal tag or a social tag based on the personal and social attributes.
In an embodiment, for dynamically clustering the fashion objects in the different classes, the fashion object recommendation engine 110 determines a weightage of a match between the classes of segmented fashion objects and the fashion context. Further, the fashion object recommendation engine 110 dynamically clusters the segmented fashion objects with the assigned tag in the classes based on the weightage.
In an embodiment, for segmenting the identified fashion objects from the image, the fashion object recommendation engine 110 determines a feature vector of the image using a Convolution Neural Network (CNN) model 402. Further, the fashion object recommendation engine 110 determines Region of Interests (ROIs) of the image by providing the feature vector to a Region Proposal Network (RPN). Further, the fashion object recommendation engine 110 optimizes scales of the ROIs by providing the feature vector and the predicted ROIs to a Feature Pyramid Network (FPN). Further, the fashion object recommendation engine 110 refines an alignment of the ROIs. Further, the fashion object recommendation engine 110 determines the segmented fashion objects including output masks, labels, and coordinates of the identified fashion objects in the ROIs using a plurality of neural network models.
In an embodiment, for classifying the segmented fashion objects into the different classes, the fashion object recommendation engine 110 obtains labels of the segmented fashion objects. The fashion object recommendation engine 110 classifies the segmented fashion objects into a pattern class, a fabric class, and an attire class, when the labels of the segmented fashion objects are clothes. The fashion object recommendation engine 110 classifies the segmented fashion objects into a fashion accessory class, when the labels of the segmented fashion objects are fashion accessories.
In an embodiment, for determining the personal and social attributes of the segmented fashion objects in the different classes, the fashion object recommendation engine 110 identifies each person in the image by detecting the faces of people in the image. Further, the fashion object recommendation engine 110 determines the relationship of each person with the user of the electronic device 100. Further, the fashion object recommendation engine 110 segregates the segmented fashion objects of each person. Further, the fashion object recommendation engine 110 determines the personal and social attributes of the segregated fashion objects based on the relationship of each person with the user.
The memory 120 stores the images. The memory 120 includes applications 121 installed in the electronic device 100. Further, the memory 120 includes a CMH database 122, and a lifestyle engine database 113. The memory 120 stores instructions to be executed by the processor 130. The memory 120 may include non-volatile storage elements. Examples of such non-volatile storage elements may include magnetic hard discs, optical discs, floppy discs, flash memories, or forms of electrically programmable memories (EPROM) or electrically erasable and programmable (EEPROM) memories. In addition, the memory 120 may, in some examples, be considered a non-transitory storage medium. The term “non-transitory” may indicate that the storage medium is not embodied in a carrier wave or a propagated signal. However, the term “non-transitory” should not be interpreted that the memory 120 is non-movable. In some examples, the memory 120 can be configured to store larger amounts of information than its storage space. In certain examples, a non-transitory storage medium may store data that can, over time, change (e.g., in Random Access Memory (RAM) or cache). The memory 120 can be an internal storage unit or it can be an external storage unit of the electronic device 100, a cloud storage, or any other type of external storage.
The processor 130 is configured to execute instructions stored in the memory 120. The processor 130 may be a general-purpose processor, such as a Central Processing Unit (CPU), an Application Processor (AP), or the like, a graphics-only processing unit such as a Graphics Processing Unit (GPU), a Visual Processing Unit (VPU) and the like. The processor 130 may include multiple cores to execute the instructions. The communicator 140 is configured for communicating internally between hardware components in the electronic device 100. Further, the communicator 140 is configured to facilitate communication between the electronic device 100 and other devices via one or more networks (e.g. Radio technology). The communicator 140 includes an electronic circuit specific to a standard that enables wired or wireless communication.
Although the
The image scanner 111A periodically scans and gets the images analyzed to build the fashion knowledge graph. The fashion segmentation engine 111B segments the images in the context of the fashion i.e. type and where it is worn. The image metadata extractor 111C extracts metadata of the image (location, time, time zone etc.). The fashion classifier 111D classifies each fashion segment into different classes based on the physical and social features of the attire. The personal attributer 111F identifies the person to which fashion segment and attributes to the person. The social attributer 111E attributes the social relationship of the user and, the user's fashion segments to the other persons whose fashion segments are present. The dynamic fashion cluster creator 111G creates the clusters dynamically based on the user or use case need basis so that all relevant fashion data can be presented to the user readily. The match identifier 111H matches the different fashion segments based on their physical and social properties. The fashion attributes builder 112 collates all the images and fashion attributes built based on different fashion AI models and mash-up happens. The lifestyle engine database 113 contains all fashion data of the user and the user's social contacts are stored here.
The lifestyle engine proxy 114A is an interface through which the lifestyle attributes of the user can be extracted from the lifestyle engine database 113. The dynamic cluster & match identifier proxy 114B is an interface through which the dynamic clusters and the identified matches can be accessed. The lifestyle image knowledge graph builder 114C builds the fashion knowledge graph based on the current fashion context. The application context adapter 114D tunes the fashion knowledge graph to the current use case context and need. The recommendation engine 115A provides the recommendations based on the current context and the fashion knowledge graph that is built based on the fashion attributes. The feedback analyzer 115B analyses the user actions and feedback on the user actions on the recommendations provided so that the fashion knowledge graph can be learnt.
The fashion object recommendation engine 110 attributes segmented fashion accessories and uses them while composing a message. Further, the fashion object recommendation engine 110 provides text-driven image suggestions followed by image-driven text suggestions. The proposed method is socially collaborative, and enables the electronic device 100 for decentralized fashion searching and identifying close aid for authenticated reviews of products. The electronic device 100 performs social personalized and localized fashion searches and fashion-centric smart select in an image or screenshot for easy access to personal fashion accessories information. The proposed method is localized and supports contextual search of fashion and feeds it back to shopping apps for personalized refinement.
A function associated with the AI model 206 may be performed through the non-volatile/volatile memory 120, and the processor 130. One or a plurality of processors 130 controls the processing of the input data in accordance with a predefined operating rule or the AI model 206 stored in the non-volatile/volatile memory 120. The predefined operating rule or the AI model 206 is provided through training or learning. Here, being provided through learning means that, by applying a learning method to a plurality of learning data, the predefined operating rule or the AI model 206 of the desired characteristic is made. The learning may be performed in the electronic device 100 itself in which the AI model 206 according to an embodiment is performed, and/or may be implemented through a separate server/system. The AI model 206 may consist of a plurality of neural network layers. Each layer has a plurality of weight values and performs a layer operation through the calculation of a previous layer and an operation of a plurality of weights. Examples of neural networks include, but are not limited to, convolutional neural network (CNN), deep neural network (DNN), recurrent neural network (RNN), restricted Boltzmann Machine (RBM), deep belief network (DBN), bidirectional recurrent deep neural network (BRDNN), generative adversarial networks (GAN), and deep Q-networks. The learning method is a method for training a predetermined target device (for example, a robot) using a plurality of learning data to cause, allow, or control the target device to make a determination or prediction. Examples of the learning method include, but are not limited to, supervised learning, unsupervised learning, semi-supervised learning, or reinforcement learning.
Although the
Recognizing the fashion aspect in an image and suggesting a background relevant to it.
Accessing and presenting media or text based on users' communication.
Community search of fashion elements can be done in closed groups.
Personalized shopping suggestions are enabled.
Enabling fashion planning for gatherings.
Enabling fashion-centric smart selection feature.
The various actions, acts, blocks, steps, operations, or the like in the flow diagram 300 may be performed in the order presented, in a different order, or simultaneously. Further, in some embodiments, some of the actions, acts, blocks, steps, or the like may be omitted, added, modified, skipped, or the like without departing from the scope of the present disclosure.
In the example scenario, initially, the fashion object recommendation engine 110 detects the context of the fashion in the current conversation. Further, the fashion object recommendation engine 110 identifies the persons and relation of fashion elements. Further, the fashion object recommendation engine 110 generates relevant suggestions. Further, the fashion object recommendation engine 110 provides fashion-centric photo replies shown to the user.
Consider, two friends Kris and Radhe met in the morning and had a conversation. Kris and Radhe wanted to complement each other by typing texts in their own electronic devices 100A, 100B (e.g. smartphone) via the instant chat application as shown in the
At 1105, Radhe starts typing to complement Kris in her electronic device 100B, and the fashion object recommendation engine 110 detects the context of the conversation and displays an image of Kris captured that morning using the electronic device 100B as the recommendation. Radhe selects the image of Kris 1106 from the recommendation. At 1107, the fashion object recommendation engine 110 converges the user-typed text, and the selected image and then suggests a followed text 1108 based on the user-selected image and previous message context.
In the example scenario with reference to the
With reference to the
In the example scenario with reference to the
With reference to the
In the example scenario with reference to the
In the example scenario with reference to the
In the example scenario with reference to the
In the example scenario with reference to the
In the example scenario with reference to the
The embodiments disclosed herein can be implemented using at least one hardware device and performing network management functions to control the elements.
The foregoing description of the specific embodiments will so fully reveal the general nature of the embodiments herein that others can, by applying current knowledge, readily modify and/or adapt for various applications such specific embodiments without departing from the generic concept, and, therefore, such adaptations and modifications should and are intended to be comprehended within the meaning and range of equivalents of the disclosed embodiments. It is to be understood that the phraseology or terminology employed herein is for the purpose of description and not of limitation. Therefore, while the embodiments herein have been described in terms of example embodiments, those skilled in the art will recognize that the embodiments herein can be practiced with modification within the scope of the embodiments as described herein.
Number | Date | Country | Kind |
---|---|---|---|
202141057905 | Dec 2021 | IN | national |