This application is based upon and claims priority to Chinese Patent Application No. 202011262630.8, filed on Nov. 12, 2020, the entire contents of which are incorporated herein by reference.
The present invention relates to the technical field of intelligent services, in particular to an online shopping system based on facial emotional state analysis and a method thereof.
In recent years, technologies such as artificial intelligence and big data have developed rapidly. The combination of virtual economy and real economy driven by Internet and big data has brought revolutionary changes to people's work and lifestyle. Online shopping breaks the traditional shopping mode, and the data-driven recommendation algorithm introduces personalized factors such as commodity attributes and users' browsing transaction records into online shopping, analyzes users' shopping intentions through the recommendation algorithm, and gives the corresponding shopping recommendation lists, thus improving shopping efficiency of the users.
The traditional shopping process can be regarded as a communication process between merchants and customers, and merchants can give corresponding product recommendations by observing the facial emotional changes of customers. However, online shopping is a process of human-computer interaction. Shopping systems often don't analyze the changes in users' facial emotions, and the changes of users' emotions in the process of interaction will have a certain impact on shopping choices and shopping efficiency. However, the influence of users' emotional changes in the shopping environment is seldom taken into consideration for shopping recommendations, and corresponding recommendations cannot be provided according to users' emotions.
The object of the present invention is to provide an online shopping system and method based on facial emotional state analysis. Aiming at the lack of facial emotional analysis in online shopping at present, a facial emotion recognition method based on the combination of macro-expressions and micro-expressions analyzes the change of a user's emotional state in a shopping environment through the facial image, and analyzes the shopping intention of the user according to different emotional states and human-computer interaction contents, thereby improving the shopping experience of the user.
To solve the above technical problems, an embodiment of the present invention provides the following solution:
An online shopping system based on facial emotional state analysis, comprising:
an online shopping module configured to provide an online shopping interactive interface for a user and collect facial image data and interactive behavior data of the user in an online shopping process;
a facial expression recognition module configured to recognize an emotional state of the user according to the collected facial image data in the online shopping process of the user;
a shopping intention analysis module configured to decide a shopping intention of the user according to the recognized emotional state and the interactive behavior data of the user; and
a shopping recommendation adjustment module configured to dynamically adjust a commodity recommendation strategy for the user according to the decided shopping intention of the user.
Preferably, the online shopping module comprises:
a login sub-module configured to allow the user to input a user name and a password to register and log in to the system, and input account information and basic information of the user into a user database for storage;
a display sub-module comprising a three-level structure, namely a homepage, a commodity display page and a commodity detail page, wherein the homepage is a commodity category page; the display sub-module is configured for displaying commodity categories on the homepage for users to select interesting commodity categories to enter the commodity display page, and then for users to select interesting commodities from different commodities displayed on the commodity display page to enter the commodity detail page for browsing within a predetermined time; and
an acquisition sub-module configured to acquire the facial image data and interactive behavior data of the user in the online shopping process.
Preferably, the facial expression recognition module comprises:
a facial macro-expression feature extraction sub-module configured to extract facial macro-expression features according to the collected facial image data;
a facial micro-expression feature extraction sub-module configured to extract facial micro-expression features according to the collected facial image data;
a facial expression locating sub-module configured to locate different types of facial expressions according to the extracted facial macro-expression features and facial micro-expression features;
a facial emotional state recognition sub-module configured to hierarchically fuse different types of facial expression features in an orderly manner, and use a support vector machine to construct a classifier for facial emotional state recognition to classify a current emotional state of the user.
Preferably, the facial macro-expression feature extraction sub-module is specifically configured to:
adopt a public macro-expression dataset for training to obtain a macro-expression image coding model, wherein, the macro-expression image coding model is a bilinear convolutional neural network model;
implement noise reduction, segmentation and normalization pretreatment on the collected facial image data; and
input the pretreated data into the macro-expression image coding model, and extract the facial macro-expression features of the user.
Preferably, the facial micro-expression feature extraction sub-module is specifically configured to:
adopt a public micro-expression dataset for training to obtain a micro-expression image coding model, wherein the micro-expression image coding model is a two-stream difference network model;
implement noise reduction, segmentation and normalization pretreatment on the collected facial image data; and
input the pretreated data into the micro-expression image coding model, and extract the facial micro-expression features of the user.
Preferably, the facial expression locating sub-module is specifically configured to:
extract fine-grained change characteristics of local regions of interest in a face by the bilinear convolutional neural network model, and locate durations of different types of expressions in the process of data acquisition; based on the influence of deep local region extraction features on the classification of facial macro-expressions, micro-expressions, and natural expressions, select efficient local facial features for fusion with overall features to improve classification accuracy, and label each frame of images in a video sequence with emotional features, so as to locate different types of facial expressions in the video sequence.
Preferably, the facial emotion state recognition sub-module is specifically configured to: implement hierarchical fusion in an orderly manner according to different types of facial expression features, extract features that can best represent emotional categories from the facial macro-expression features, facial micro-expression features and fusion features according to a support vector machine classifier, and construct an optimal classification model to recognize a current emotional state of the user.
Preferably, the shopping intention analysis module is specifically configured to:
decide that the user is not interested in shopping content at this time when the emotional state of the user is recognized as a negative state or a negative micro-expression state is hidden with a positive facial expression, the browsing time of the user is short and interaction frequency is high; and
decide that the user is interested in shopping content at this time when the emotional state of the user is identified as a positive state or a positive micro-expression state is hidden by a negative facial expression, the browsing time of the user is long and interaction frequency is low.
Preferably, the shopping recommendation adjustment module is specifically configured to:
recommend similar products to the user when the emotional state of the user is positive, the interaction frequency with the shopping system is low and the browsing time is long; and
adjust the commodity recommendation strategy for the user when the emotional state of the user is negative, the interaction frequency with the shopping system is high and the browsing time is short.
An online shopping method based on facial emotional state analysis, comprising the following steps of:
providing an online shopping interactive interface for a user and collecting facial image data and interactive behavior data of the user in an online shopping process;
recognizing an emotional state of the user according to the collected facial image data in the online shopping process of the user;
deciding a shopping intention of the user according to the recognized emotional state and the interactive behavior data of the user; and
dynamically adjusting a commodity recommendation strategy for the user according to the decided shopping intention of the user.
The technical solution provided by the embodiment of the invention has at least the following beneficial effects:
In the embodiment of the invention, the emotional state of the user in the shopping environment is analyzed by collecting the facial image data of the user in the shopping process, and the shopping intention of the user is predicted according to the analysis of the emotional state of the user and the interaction behavior between the user and the shopping system, and the corresponding shopping recommendation is provided, thereby improving the shopping experience of the user.
In order to explain the technical solution in the embodiments of the present invention more clearly, the drawings used in the description of the embodiments will be briefly introduced below. Obviously, the drawings in the following description are only some embodiments of the present invention, and other drawings can be obtained according to these drawings without paying creative work.
In order to make the object, technical solution and advantages of the present invention clearer, the embodiments of the present invention will be further described in detail with reference to the accompanying drawings.
An embodiment of the present invention first provides an online shopping system based on facial emotional state analysis, as shown in
an online shopping module 101 configured to provide an online shopping interactive interface for a user and collect facial image data and interactive behavior data of the user in an online shopping process;
a facial expression recognition module 102 configured to recognize an emotional state of the user according to the collected facial image data in the online shopping process of the user;
a shopping intention analysis module 103 configured to decide a shopping intention of the user according to the recognized emotional state and the interactive behavior data of the user; and
a shopping recommendation adjustment module 104 configured to dynamically adjust a commodity recommendation strategy for the user according to the decided shopping intention of the user.
The online shopping system provided by the embodiment of the invention analyzes the emotional state of the user in the shopping environment by collecting the facial image data of the user in the shopping process, predicts the shopping intention of the user according to the analysis of the emotional state of the user and the interaction behavior between the user and the shopping system, and gives corresponding shopping recommendations, thereby improving the shopping experience of the user.
otherwise, the recommended content of goods is adjusted until the appropriate goods are selected, and the system operation ends.
Furthermore, the online shopping module 101 includes:
a login sub-module configured to allow the user to input a user name and a password to register and log in to the system, and input account information and basic information of the user into a user database for storage; wherein, after logging in successfully, the user enters the personal space and can see the basic personal information of the user;
a display sub-module comprising a three-level structure, namely a homepage (i.e., a commodity category page), a commodity display page and a commodity detail page, wherein the display sub-module is configured for displaying commodity categories (for example: canvas bag, mobile phone case, umbrella, keyboard, mouse and so on) on the homepage for users to select interesting commodity categories to enter the commodity display page (display different products of the product category selected by the user, such as different types of canvas bags and different types of mobile phone cases, etc.), and then for users to select interesting commodities from different commodities displayed on the commodity display page to enter the commodity detail page for browsing within a predetermined time so that so that the user can make purchase choices from different commodities; and
an acquisition sub-module configured to acquire the facial image data and interactive behavior data of the user in the online shopping process.
Furthermore, the facial expression recognition module 102 includes:
a facial macro-expression feature extraction sub-module configured to extract facial macro-expression features according to the collected facial image data;
a facial micro-expression feature extraction sub-module configured to extract facial micro-expression features according to the collected facial image data;
a facial expression locating sub-module configured to locate different types of facial expressions according to the extracted facial macro-expression features and facial micro-expression features;
a facial emotional state recognition sub-module configured to hierarchically fuse different types of facial expression features in an orderly manner, and use a support vector machine to construct a classifier for facial emotional state recognition to classify a current emotional state of the user.
Furthermore, the facial macro-expression feature extraction sub-module is specifically configured to:
adopt a public macro-expression data set for training to obtain a macro-expression image coding model, wherein, the macro-expression image coding model is a bilinear convolutional neural network model;
implement noise reduction, segmentation and normalization pretreatment on the collected facial image data; and
input the pretreated data into the macro-expression image coding model, and extract the facial macro-expression features of the user.
Furthermore, the facial micro-expression feature extraction sub-module is specifically configured to:
adopt a public micro-expression data set for training to obtain a micro-expression image coding model, wherein the micro-expression image coding model is a two-stream difference network model;
implement noise reduction, segmentation and normalization pretreatment on the collected facial image data; and
input the pretreated data into the micro-expression image coding model, and extract the facial micro-expression features of the user.
A micro-expression is a kind of short-lived and imperceptible facial expression that the user cannot help showing in shopping scenes. The feature extraction model is also trained by a public micro-expression data set. The micro-expressions differ from macro-expressions in that they have a short duration, a low intensity and are only related to local face, so micro-expression features are extracted by a two-stream difference network model without identity information.
Furthermore, the facial expression locating sub-module is specifically configured to:
extract fine-grained change characteristics of local regions of interest in a face by the bilinear convolutional neural network model, and locate durations of different types of expressions in the process of data acquisition; based on the influence of deep local region extraction features on the classification of facial macro-expressions, micro-expressions and calm expressions, select efficient local facial features for fusion with overall features to improve classification accuracy, and label each frame of images in a video sequence with emotional features, so as to locate different types of facial expressions in the video sequence.
Compared with macro-expressions, micro-expressions have a short duration and a low intensity, which makes the difference between images and natural neutral facial expressions not obvious, and the changes between micro-expression facial images and natural facial images are subtle. Therefore, the task of macro-expression and micro-expression location is transformed into fine-grained image classification, and the features of fine-grained image changes are extracted by a bilinear convolutional neural network model to identify micro-expressions and other facial actions. At the same time, considering the features related to local facial regions when facial expressions occur, the overall and local relations of facial expression images are further integrated, and the classification of emotional states is further assisted by extracting fine-grained change features of local regions of interest. By paying attention to the influence of deep local region extraction features on the classification of facial micro-expressions, macro-expressions and calm expressions, efficient local features and overall facial features are selected, the classification accuracy is improved by feature fusion, and the emotional features of each frame of the images in the video sequence are labeled with emotional features, and then different types of facial expressions in the video sequence are located.
Furthermore, the facial emotion state recognition sub-module is specifically configured to: implement hierarchical fusion in an orderly manner according to different types of facial expression features, extract features that can best represent emotional categories from the facial macro-expression features, facial micro-expression features and fusion features according to a support vector machine classifier, and construct an optimal classification model to recognize a current emotional state of the user.
Wherein, the bilinear convolutional neural network model is used to extract the fusion features of the images of the whole and local areas of the face as macro-expression features. In the process of micro-expression feature extraction, the first frame of a facial image collected at the beginning of shopping of the user in a calm state is taken as a reference and input into the self-coding network model to obtain the user's identity feature, and then each subsequent frame of the images is input into the self-coding network model with the same structure to extract the superimposed features of facial micro-expressions and identity information, and then a differential network is used to remove the identity information in the superimposed features and retain the micro-expression features. In order to obtain the optimal emotion classification result, firstly, the macro-expression and micro-expression features of the face are weighted and fused and input into the support vector machine model for the first emotion recognition. Then, the images with inaccurate fusion feature classification are classified by macro-expression and micro-expression features through threshold division. By orderly hierarchical fusion of different types of expression features, the current user's emotional state can be accurately recognized, and is applied to the whole online shopping process to monitor and feedback the user's interest in goods in real time and improve the user's shopping efficiency.
Furthermore, the shopping intention analysis module 103 is specifically configured to:
decide that the user is not interested in shopping content at this time when the emotional state of the user is recognized as a negative state or a negative micro-expression state is hidden with a positive facial expression, the browsing time of the user is short and interaction frequency is high; and
decide that the user is interested in shopping content at this time when the emotional state of the user is identified as a positive state or a positive micro-expression state is hidden by a negative facial expression, the browsing time of the user is long and interaction frequency is low.
Furthermore, the shopping recommendation adjustment module 104 is specifically configured to:
recommend similar products to the user when the emotional state of the user is positive, the interaction frequency with the shopping system is low and the browsing time is long; and adjust the commodity recommendation strategy for the user when the emotional state of the user is negative, the interaction frequency with the shopping system is high and the browsing time is short.
As a specific implementation of the present invention,
The system layer is a network online shopping system. A user registers and logs in to the system by entering a user name and a password. The User logs in to the account and fills in basic data and inputs them into the user database for storage. After logging in successfully, the user can enter the personal space and see the basic personal information of the user.
The data layer is used for collecting facial image data and interactive behavior data of users during online shopping. The facial image data acquisition module is mainly used for acquiring facial macro-expression features and micro-expression features when users shop online, locating the occurrence time of different types of facial expressions, and further judging the emotional state of the current users; online interactive behavior data collection of input devices adopts an online shopping platform to collect behavior data such as user's browsing time of goods and operation frequency of the interactive devices, which is mainly used to analyze the browsing time and interest degree of goods during online shopping.
The feature layer includes facial macro-expression, micro-expression feature extraction and interactive data feature extraction. The macro-expression feature extraction of facial image data adopts a large number of public expression data sets to train the image self-coding model to extract the macro expression features of the facial image; with regard to the short and imperceptible facial micro-expressions that users involuntarily reveal during shopping, the feature extraction model is also trained by using the public micro-expression data sets. The difference from macro-expressions is that micro-expressions have a short duration, a low intensity and are only locally related to the face. Therefore, the micro-expression features are extracted by removing identity information through the two-stream difference network model. With regards to the feature extraction of interactive data, the median, mean, minimum, maximum, range, standard deviation, variance and other statistical features of the interactive data are extracted.
The emotion layer locates the durations of different types of expressions in the data collection process by extracting fine-grained features of local regions of interest in the face, pays attention to the influence of deep local region extraction features on the classification of calm expressions, micro-expressions and other facial states, selects efficient local features and overall facial features through relevant feature selection algorithms, improves the classification accuracy through feature fusion, marks each frame of images in the video sequence with emotional features, and analyzes the start, peak and end points of micro-expressions to realize location of facial micro-expressions in the video sequence. At the same time, different types of facial expression features are hierarchically fused in an orderly manner. The features that best represent the emotional category are extracted from the macro-facial expression features, micro-facial expression features and fusion features by the support vector machine classifier, and the optimal classification model is constructed to recognize the current user's facial emotional state.
The intention layer judges the degree of interest in the current shopping content through the facial expressions, micro-expressions, browsing time and interaction frequency of the user in the shopping process. When the user's expression is of a negative emotion category or hides his negative micro-expression category with positive facial expressions, the user's browsing time is short and the interaction frequency is high, it means that the user is not interested in shopping content. When the user is in a positive emotional state, the browsing time is longer and the interaction frequency is low, it means that the user is interested in shopping content.
The interactive layer is a shopping recommendation adjustment module, which judges whether the system changes the recommended commodity category and commodity content according to the evaluation result of the online shopping system on the user's interest degree. The shopping recommendation adjustment module mainly depends on the system's judgment on recognition of the facial expressions and comprehension of the shopping intention. When the result of facial expression and micro-expression recognition is a negative emotional state, the system determines that the user is in a state of declining degree of interest at this moment, which is not suitable for recommending this kind of product content to the user, and then adjusts the product category recommendation to the user to improve the shopping efficiency of the user.
On the basis of facial image data obtained in the online shopping process, the interactive data with dynamic changes and spatiotemporal changes of facial macro-expressions, micro-expressions and calm expressions are processed for the characteristics of image modes in the interactive perception and cognition of users in shopping, so as to realize robust emotional cognition, accurate intention comprehension and process shopping interaction, finally realize the integration of interactive feedback processes and improve the shopping experience of users.
Accordingly, the embodiment of the present invention also provides an online shopping method based on facial emotional state analysis, as shown in
providing an online shopping interactive interface for a user and collecting facial image data and interactive behavior data of the user in an online shopping process;
recognizing an emotional state of the user according to the collected facial image data in the online shopping process of the user;
deciding a shopping intention of the user according to the recognized emotional state and the interactive behavior data of the user; and
dynamically adjusting a commodity recommendation strategy for the user according to the decided shopping intention of the user.
In the method, the specific contents of each step can be referred to the above-mentioned embodiments, and will not be repeated here.
The online shopping method provided by the embodiment of the present invention analyzes the emotional state of the user in the shopping environment by collecting the facial image data of the user in the shopping process, predicts the shopping intention of the user according to the analysis of the emotional state of the user and the interaction behavior between the user and the shopping system, and gives corresponding shopping recommendations, thereby improving the shopping experience of the user.
The above is only a preferred embodiment of the present invention, and is not intended to limit the present invention. Any modifications, equivalent substitutions, improvements, etc. made within the spirit and principles of the present invention shall be included in the scope of protection of the present invention.
Number | Date | Country | Kind |
---|---|---|---|
202011262630.8 | Nov 2020 | CN | national |