Embodiments of the present disclosure relate to a question recommendation method, a question recommendation device, a question recommendation system, an electronic device, and a non-volatile readable storage medium.
With the rapid development of Internet technology, people's lives have become considerably more convenient. People can search for and browse information of interest through the Internet, and can also conduct online consultation, online diagnosis, or the like through the Internet. The Internet has accumulated a large amount of content information over time, and the search results on a traditional search engine web page are large in quantity and may include much repetitive and irrelevant content, which makes it difficult for the user to find, in a short time, information of interest, the question that the user wants to consult, or the like.
At least one embodiment of the present disclosure provides a question recommendation method, and the question recommendation method comprises: obtaining a candidate question set of a user, the candidate question set comprising a plurality of candidate questions; obtaining user behavior data, and obtaining a user interest parameter based on the user behavior data; based on the user interest parameter and the plurality of candidate questions, obtaining at least one similarity feature between each candidate question of the plurality of candidate questions and the user interest parameter; based on basic user information, the plurality of candidate questions, and the at least one similarity feature, sorting the plurality of candidate questions to obtain a question sequence; and based on an order of the question sequence, recommending at least one candidate question in the question sequence to the user.
For example, in the question recommendation method provided by at least one embodiment of the present disclosure, based on the basic user information, the plurality of candidate questions, and the at least one similarity feature, sorting the plurality of candidate questions to obtain the question sequence, comprises: by using a ranking model, combining the basic user information, the plurality of candidate questions, and the at least one similarity feature to form an input feature vector of the ranking model, obtaining a score corresponding to each candidate question of the plurality of candidate questions, and sorting the plurality of candidate questions according to a value of the score corresponding to each candidate question of the plurality of candidate questions, so as to obtain the question sequence.
For example, in the question recommendation method provided by at least one embodiment of the present disclosure, based on the user interest parameter and the plurality of candidate questions, obtaining at least one similarity feature between each candidate question of the plurality of candidate questions and the user interest parameter, comprises: by using at least one similarity matching model, obtaining the at least one similarity feature between each candidate question and the user interest parameter based on the user interest parameter and the plurality of candidate questions.
For example, in the question recommendation method provided by at least one embodiment of the present disclosure, the at least one similarity matching model comprises at least one of a group consisting of a cosine similarity model, a Jaccard similarity model, an edit distance similarity model, a word mover's distance similarity model, and a deep structured semantic similarity model.
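By way of non-limiting illustration only, three of the similarity matching models named above (cosine similarity, Jaccard similarity, and edit distance similarity) may be sketched in Python as follows. The function names, the whitespace tokenization, and the normalization of the edit distance into a similarity value are assumptions made for this sketch and are not prescribed by the embodiments.

```python
import math
from collections import Counter


def cosine_similarity(a: str, b: str) -> float:
    """Cosine similarity between bag-of-words count vectors."""
    va, vb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(va[t] * vb[t] for t in va)
    norm = (math.sqrt(sum(c * c for c in va.values()))
            * math.sqrt(sum(c * c for c in vb.values())))
    return dot / norm if norm else 0.0


def jaccard_similarity(a: str, b: str) -> float:
    """Size of the token intersection divided by the size of the token union."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed with a two-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def edit_similarity(a: str, b: str) -> float:
    """Assumed normalization: 1 minus distance over the longer length."""
    longest = max(len(a), len(b))
    return 1.0 - edit_distance(a, b) / longest if longest else 1.0


def similarity_features(interest: str, candidate: str) -> list:
    """One similarity feature per matching model for a single candidate question."""
    return [
        cosine_similarity(interest, candidate),
        jaccard_similarity(interest, candidate),
        edit_similarity(interest, candidate),
    ]
```

In such a sketch, similarity_features would be called once for each candidate question, with the user interest parameter represented as a text string.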
For example, in the question recommendation method provided by at least one embodiment of the present disclosure, the ranking model comprises a Wide&Deep model.
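By way of non-limiting illustration only, the scoring and sorting performed with a Wide&Deep model may be sketched in Python as follows. A Wide&Deep model in general sums the output of a linear ("wide") component and the output of a feed-forward neural network ("deep") component before applying a sigmoid. The sketch below keeps only that structure: it omits embeddings, cross-product feature transformations, and training, and all names and the hand-picked parameter values are assumptions made for illustration.

```python
import math


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def relu(values):
    return [max(0.0, v) for v in values]


def dense(x, weights, bias):
    """One fully connected layer; `weights` holds one row per output unit."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def wide_and_deep_score(wide_x, deep_x, params) -> float:
    """Sum the wide (linear) logit and the deep (MLP) logit, then apply a sigmoid."""
    wide_logit = sum(w * x for w, x in zip(params["wide_w"], wide_x))
    hidden = relu(dense(deep_x, params["deep_w1"], params["deep_b1"]))
    deep_logit = sum(w * h for w, h in zip(params["deep_w2"], hidden))
    return sigmoid(wide_logit + deep_logit + params["bias"])


def rank(candidates, params):
    """candidates: list of (question, wide_x, deep_x) tuples.

    Returns the questions sorted by descending score.
    """
    scored = [(wide_and_deep_score(w, d, params), q) for q, w, d in candidates]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [q for _, q in scored]


# Hand-picked, untrained parameters used only to exercise the code.
EXAMPLE_PARAMS = {
    "wide_w": [1.0, -1.0],
    "deep_w1": [[1.0, 0.0], [0.0, 1.0]],
    "deep_b1": [0.0, 0.0],
    "deep_w2": [1.0, 1.0],
    "bias": 0.0,
}
```

In practice the input feature vector would be formed from the basic user information, the candidate question, and the at least one similarity feature, and the parameters would be learned from data rather than fixed by hand.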
For example, in the question recommendation method provided by at least one embodiment of the present disclosure, obtaining the candidate question set of the user comprises: accessing a data knowledge base, the data knowledge base comprising a plurality of knowledge question sets; obtaining the basic user information, and establishing a user tag set based on the basic user information; and associating the user tag set with the data knowledge base, and obtaining the candidate question set according to the plurality of knowledge question sets.
For example, in the question recommendation method provided by at least one embodiment of the present disclosure, the user tag set comprises a multi-level tag set, the multi-level tag set comprises tags of a plurality of levels, and tags of different levels are of different types.
For example, in the question recommendation method provided by at least one embodiment of the present disclosure, the question recommendation method is used to recommend a question related to disease, a first-level tag of the multi-level tag set is an age group, a second-level tag of the multi-level tag set is a time period, a third-level tag of the multi-level tag set is a type of disease, and a fourth-level tag of the multi-level tag set is a complication.
For example, in the question recommendation method provided by at least one embodiment of the present disclosure, each of the plurality of knowledge question sets comprises: a standard question, a standard answer corresponding to the standard question, and an extended question corresponding to the standard question.
For example, in the question recommendation method provided by at least one embodiment of the present disclosure, obtaining the candidate question set of the user further comprises: establishing the data knowledge base.
For example, in the question recommendation method provided by at least one embodiment of the present disclosure, establishing the data knowledge base comprises: retrieving a data set from a network, and classifying the data set according to intention to form the plurality of knowledge question sets, so as to establish the data knowledge base.
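By way of non-limiting illustration only, classifying a retrieved data set according to intention may be sketched in Python as follows. The keyword lists, the first-match rule, and the function names are hypothetical; an actual embodiment may classify by manual sorting or by any other suitable classifier.

```python
from collections import defaultdict

# Hypothetical keyword lists; intentions are checked in this order.
INTENT_KEYWORDS = {
    "diet": ("eat", "food", "foods", "diet"),
    "exercise": ("exercise", "exercises", "sport"),
    "medication": ("drug", "drugs", "medicine", "insulin"),
}


def classify_intention(question: str) -> str:
    """Return the first intention whose keywords appear in the question."""
    tokens = {t.strip("?.,!").lower() for t in question.split()}
    for intention, keywords in INTENT_KEYWORDS.items():
        if tokens & set(keywords):
            return intention
    return "other"


def build_knowledge_base(questions):
    """Group the retrieved questions by intention."""
    grouped = defaultdict(list)
    for question in questions:
        grouped[classify_intention(question)].append(question)
    return dict(grouped)
```

Each group produced in this way would then be organized into standard questions, extended questions, and standard answers to form a knowledge question set.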
For example, in the question recommendation method provided by at least one embodiment of the present disclosure, the question recommendation method is used to recommend a question related to disease, and the data set is based on at least one of a group consisting of a medical consultation data set between a doctor and a patient, a hotspot question associated with the disease, and a reward question associated with the disease.
For example, in the question recommendation method provided by at least one embodiment of the present disclosure, associating the user tag set with the data knowledge base, and obtaining the candidate question set according to the plurality of knowledge question sets, comprises: establishing a mapping relationship between the user tag set and the standard question in the data knowledge base, matching the user tag set with the standard question in the data knowledge base, and combining the knowledge question set corresponding to a matched standard question to form the candidate question set.
For example, in the question recommendation method provided by at least one embodiment of the present disclosure, obtaining the candidate question set of the user comprises: retrieving the candidate question set of the user stored beforehand.
For example, in the question recommendation method provided by at least one embodiment of the present disclosure, obtaining the user interest parameter based on the user behavior data comprises: analyzing the user behavior data, and converting a question clicked by the user or a word or sentence which the user is interested in into the user interest parameter.
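By way of non-limiting illustration only, converting clicked questions into a user interest parameter may be sketched in Python as follows. Representing the user interest parameter as a list of the most frequent keywords, as well as the stop-word list and the function name, are assumptions made for this sketch.

```python
from collections import Counter

# Hypothetical stop-word list used only for this sketch.
STOP_WORDS = {"what", "the", "a", "an", "for", "should", "do", "does",
              "is", "are", "to", "with", "of"}


def user_interest_parameter(clicked_questions, top_k=5):
    """Return the top_k most frequent non-stop-word tokens in the clicked questions."""
    counts = Counter()
    for question in clicked_questions:
        tokens = [t.strip("?.,!\"'").lower() for t in question.split()]
        counts.update(t for t in tokens if t and t not in STOP_WORDS)
    return [term for term, _ in counts.most_common(top_k)]
```

The resulting keyword list can then be compared with each candidate question by a similarity matching model.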
At least one embodiment of the present disclosure provides a question recommendation device, and the question recommendation device comprises: a set acquisition circuit, configured to obtain a candidate question set of a user, the candidate question set comprising a plurality of candidate questions; a behavior analysis circuit, configured to obtain user behavior data and obtain a user interest parameter based on the user behavior data; a feature generation circuit, configured to obtain at least one similarity feature between each candidate question of the plurality of candidate questions and the user interest parameter based on the user interest parameter and the plurality of candidate questions; a question sorting circuit, configured to sort the plurality of candidate questions to obtain a question sequence based on basic user information, the plurality of candidate questions, and the at least one similarity feature; and a recommendation circuit, configured to recommend at least one candidate question in the question sequence to the user based on an order of the question sequence.
For example, in the question recommendation device provided by at least one embodiment of the present disclosure, the question sorting circuit comprises: a question sorting sub-circuit, configured to, by using a ranking model, combine the basic user information, the plurality of candidate questions, and the at least one similarity feature to form an input feature vector of the ranking model, obtain a score corresponding to each candidate question of the plurality of candidate questions, and sort the plurality of candidate questions according to a value of the score corresponding to each candidate question of the plurality of candidate questions, so as to obtain the question sequence.
For example, in the question recommendation device provided by at least one embodiment of the present disclosure, the set acquisition circuit comprises: a knowledge base access circuit, configured to access a data knowledge base, the data knowledge base comprising a plurality of knowledge question sets; an information acquisition circuit, configured to obtain the basic user information and establish a user tag set based on the basic user information; and a candidate set generation circuit, configured to associate the user tag set with the data knowledge base and obtain the candidate question set according to the plurality of knowledge question sets, the candidate question set comprising the plurality of candidate questions.
For example, in the question recommendation device provided by at least one embodiment of the present disclosure, the set acquisition circuit further comprises: a knowledge base establishing circuit, configured to retrieve a data set from a network and classify the data set according to intention to form the plurality of knowledge question sets, so as to establish the data knowledge base.
For example, in the question recommendation device provided by at least one embodiment of the present disclosure, the candidate set generation circuit comprises: a candidate set generation sub-circuit, configured to establish a mapping relationship between the user tag set and the standard question in the data knowledge base, match the user tag set with the standard question in the data knowledge base, and combine the knowledge question set corresponding to a matched standard question to form the candidate question set.
For example, in the question recommendation device provided by at least one embodiment of the present disclosure, the behavior analysis circuit comprises: a behavior analysis sub-circuit, configured to analyze the user behavior data and convert a question clicked by the user or a word or sentence which the user is interested in into the user interest parameter.
For example, in the question recommendation device provided by at least one embodiment of the present disclosure, the feature generation circuit comprises: a feature generation sub-circuit, configured to, by using at least one similarity matching model, obtain the at least one similarity feature between each candidate question and the user interest parameter based on the user interest parameter and the plurality of candidate questions.
At least one embodiment of the present disclosure provides a question recommendation system, and the question recommendation system comprises a terminal and a question recommendation server. The terminal is configured to send request data to the question recommendation server. The question recommendation server is configured to, in response to the request data: obtain a candidate question set of a user, the candidate question set comprising a plurality of candidate questions; obtain user behavior data, and obtain a user interest parameter based on the user behavior data; based on the user interest parameter and the plurality of candidate questions, obtain at least one similarity feature between each candidate question of the plurality of candidate questions and the user interest parameter; and based on basic user information, the plurality of candidate questions, and the at least one similarity feature, sort the plurality of candidate questions to obtain a question sequence. The terminal is further configured to display the first N candidate questions in the question sequence, where N is an integer greater than or equal to 1.
At least one embodiment of the present disclosure provides an electronic device. The electronic device comprises: a processor; and a memory, comprising one or more computer program modules. The one or more computer program modules are stored in the memory and configured to be executed by the processor, and the one or more computer program modules comprise instructions for performing the question recommendation method according to any one of the embodiments of the present disclosure.
At least one embodiment of the present disclosure provides a non-volatile readable storage medium on which computer instructions are stored. The question recommendation method according to any one of the embodiments of the present disclosure is performed when the computer instructions are executed by a processor.
In order to clearly illustrate the technical solutions of the embodiments of the present disclosure, the drawings of the embodiments will be briefly described in the following. It is obvious that the described drawings are only related to some embodiments of the present disclosure and thus are not limitative to the present disclosure.
In order to make objects, technical details and advantages of the embodiments of the present disclosure apparent, the technical solutions of the embodiments will be described in a clearly and fully understandable way in connection with the drawings related to the embodiments of the present disclosure. Apparently, the described embodiments are just a part but not all of the embodiments of the present disclosure. Based on the described embodiments herein, those skilled in the art can obtain other embodiment(s), without any inventive work, which should be within the scope of the present disclosure.
Unless otherwise defined, all the technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art to which the present disclosure belongs. The terms “first,” “second,” etc., which are used in the description and the claims of the present application for disclosure, are not intended to indicate any sequence, amount or importance, but distinguish various components. Also, the terms such as “a,” “an,” etc., are not intended to limit the amount, but indicate the existence of at least one. The terms “comprise,” “comprising,” “include,” “including,” etc., are intended to specify that the elements or the objects stated before these terms encompass the elements or the objects and equivalents thereof listed after these terms, but do not preclude the other elements or objects. The phrases “connect,” “connected,” “coupled,” etc., are not intended to define a physical connection or mechanical connection, but may include an electrical connection, directly or indirectly. “On,” “under,” “right,” “left” and the like are only used to indicate relative position relationship, and when the position of the object which is described is changed, the relative position relationship may be changed accordingly.
With economic development and the improvement of living standards, more and more people suffer from chronic diseases of varying degrees. Chronic disease is a general term for diseases that are not infectious and in which pathological damage forms through long-term accumulation. Common chronic diseases mainly include cardiovascular and cerebrovascular diseases, cancer, diabetes, chronic respiratory diseases, etc. Cardiovascular and cerebrovascular diseases include hypertension, cerebral stroke, coronary heart disease, etc. Statistics show that one of the causes of chronic diseases is an unhealthy lifestyle. For example, an unhealthy lifestyle includes an irrational diet, insufficient exercise, tobacco use, excessive alcohol use, etc. Therefore, for patients with chronic diseases, in addition to providing medical treatment (for example, treatment with drugs) according to the patients' conditions, doctors also need to provide patients with reasonable advice (for example, dietary advice, exercise advice, etc.) on coping with chronic diseases as an auxiliary means of medical treatment, so as to better control and prevent chronic diseases. With the development of network technology, smart medical care provides more convenience, especially in delivering health knowledge related to chronic disease prevention, control, and self-inspection, which can improve patients' awareness of diseases and enable more effective prevention and treatment. Therefore, an intelligent question answering system for health education provided for patients with chronic diseases can efficiently and accurately resolve the doubts and questions of patients. At the same time, given the current domestic shortage of medical resources, the intelligent question answering system for health education can also reduce the burden on medical workers and facilitate long-term health management.
The inventors of the present disclosure have noticed that, currently, a medical question answering system or online consultation relies on the condition described by the patient and the question raised by the patient. Usually, because the information expressed by the patient is not clear enough, the answers fed back by the system may not adequately fit the patient's situation or may be returned in an unsuitable order. In addition, general medical question answering systems do not use user information to provide targeted health knowledge, which makes it difficult to provide personalized and diversified services.
At least one embodiment of the present disclosure provides a question recommendation method, a question recommendation device, a question recommendation system, an electronic device, and a non-volatile readable storage medium. The question recommendation method includes: obtaining a candidate question set of a user, the candidate question set including a plurality of candidate questions; obtaining user behavior data, and obtaining a user interest parameter based on the user behavior data; based on the user interest parameter and the plurality of candidate questions, obtaining at least one similarity feature between each candidate question of the plurality of candidate questions and the user interest parameter; based on basic user information, the plurality of candidate questions, and the at least one similarity feature, sorting the plurality of candidate questions to obtain a question sequence; and based on an order of the question sequence, recommending at least one candidate question in the question sequence to the user.
The question recommendation method provided by at least one embodiment of the present disclosure not only effectively avoids the problem of inappropriate feedback answers caused by unclear expression by the patient, but can also provide personalized question recommendation according to individual factors, self-characteristics, and the like (for example, basic user information, click behaviors and browsing behaviors of the user, etc.), so that users can learn knowledge related to their own health in a more targeted manner. In at least some other embodiments, the question recommendation method can also make the question recommendation more relevant, personalized, and diversified by using a ranking model, and can further take the final feedback order into account, thereby better satisfying the requirements of the user and effectively improving user experience.
The following is a non-limiting description of the question recommendation method provided by at least one embodiment of the present disclosure through several examples or embodiments. As described below, different features in these specific examples or embodiments can be combined with each other to obtain new examples or embodiments in case of no conflict, and these new examples or embodiments also fall within the protection scope of the present disclosure.
A question recommendation method 10 provided by at least one embodiment of the present disclosure may be applied to scenarios such as medical intelligent question answering, online diagnosis, health consultation, etc., and for example, can be applied to an intelligent question answering system of chronic disease health knowledge. For example, in an embodiment, as illustrated in
Step S100: obtaining a candidate question set of a user, the candidate question set including a plurality of candidate questions.
Step S140: obtaining user behavior data, and obtaining a user interest parameter based on the user behavior data.
Step S150: based on the user interest parameter and the plurality of candidate questions, obtaining at least one similarity feature between each candidate question of the plurality of candidate questions and the user interest parameter.
Step S160: based on basic user information, the plurality of candidate questions, and the at least one similarity feature, sorting the plurality of candidate questions to obtain a question sequence.
Step S170: based on an order of the question sequence, recommending at least one candidate question in the question sequence to the user.
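By way of non-limiting illustration only, the flow from step S150 to step S170 may be sketched in Python as follows, assuming that the candidate question set (step S100) and the user interest parameter (step S140) have already been obtained. The function names, the token-overlap similarity, and the callable-based interface are assumptions made for this sketch; any similarity matching model and any ranking model may be substituted.

```python
def token_overlap(interest: str, question: str) -> float:
    """Toy similarity feature: shared tokens divided by all tokens."""
    a = set(interest.lower().split())
    b = set(question.lower().rstrip("?").split())
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def recommend_questions(candidates, interest, user_info,
                        similarity_fns, score_fn, top_n=3):
    """Compute similarity features, score, sort, and return the first top_n questions."""
    scored = []
    for question in candidates:
        # Step S150: one similarity feature per similarity matching model.
        features = [fn(interest, question) for fn in similarity_fns]
        # Step S160: the ranking model scores each candidate question.
        scored.append((score_fn(user_info, question, features), question))
    # Step S160: sort by descending score (ties keep the original order).
    scored.sort(key=lambda pair: pair[0], reverse=True)
    # Step S170: recommend according to the order of the question sequence.
    return [question for _, question in scored[:top_n]]


if __name__ == "__main__":
    questions = [
        "What should children with diabetes eat for breakfast?",
        "What exercises are suitable for children with type 1 and type 2 diabetes after lunch?",
    ]
    # Stand-in ranking model: the score is simply the sum of the features.
    print(recommend_questions(questions, "breakfast diet", {"age": 9},
                              [token_overlap], lambda info, q, f: sum(f), top_n=1))
```

Passing the similarity functions and the scoring function as arguments keeps the sketch independent of any particular model choice.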
For example, in an embodiment, as illustrated in
The question recommendation method 10 provided by the embodiments of the present disclosure will be described in detail below with reference to
The question recommendation method 10 provided by the present disclosure will be described in detail below with reference to the accompanying drawings through examples.
In step S110, a data knowledge base is accessed, and the data knowledge base includes a plurality of knowledge question sets. For example, in the case where the question recommendation method 10 is applied to an intelligent question answering system for chronic disease health knowledge, the data knowledge base can be a data knowledge base associated with chronic diseases (for example, diabetes, hypertension, etc.), and includes complete basic health knowledge related to chronic diseases. The data knowledge base includes a plurality of knowledge question sets. For example, the basic health knowledge in the data knowledge base is classified according to intention (such as diet, exercise, medication, examination, complications, surgery, treatment, symptoms, etc.), and standard questions and corresponding extended questions and standard answers are sorted (for example, manually sorted) under each intention to form one knowledge question set, thus forming the data knowledge base.
For example, in at least one embodiment of the present disclosure, each of the plurality of knowledge question sets may include the standard question, the standard answer corresponding to the standard question, and the extended question corresponding to the standard question. For example, in an example, the user is a diabetic patient. Under the intention of “diet,” the sorted standard question may include “What foods should the diabetic patient eat?”, and the extended question corresponding to the standard question may include, for example, “What foods are good for the diabetic patient?”, “What foods are suitable for the diabetic patient?”, “What foods should the diabetic patient eat to lower the level of blood glucose?”, etc. In this case, the knowledge question set under the intention of above-mentioned “diet” may include the aforementioned standard question and extended questions, and corresponding standard answers.
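By way of non-limiting illustration only, one knowledge question set of the kind described above may be represented in Python as follows. The class name and field names are assumptions made for this sketch, and the standard answer is left as a placeholder because its content is not specified here.

```python
from dataclasses import dataclass, field


@dataclass
class KnowledgeQuestionSet:
    intention: str
    standard_question: str
    standard_answer: str
    extended_questions: list = field(default_factory=list)

    def all_questions(self):
        """The standard question followed by its extended questions."""
        return [self.standard_question, *self.extended_questions]


diet_set = KnowledgeQuestionSet(
    intention="diet",
    standard_question="What foods should the diabetic patient eat?",
    standard_answer="<standard answer placeholder>",  # content not specified here
    extended_questions=[
        "What foods are good for the diabetic patient?",
        "What foods are suitable for the diabetic patient?",
        "What foods should the diabetic patient eat to lower the level of blood glucose?",
    ],
)

# A data knowledge base can then be a mapping from intention to knowledge question sets.
data_knowledge_base = {"diet": [diet_set]}
```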
It should be noted that the standard question and extended question listed in the embodiments of the present disclosure are only illustrative, and can be adjusted and updated according to application scenarios, medical practices, etc. In addition, similarly, the standard answer can also be adjusted and updated according to application scenarios, medical practices, etc., and can further be adjusted and updated according to the adopted language (for example, Chinese, English, etc.), which is not specifically limited in the embodiments of the present disclosure.
For example, in the embodiments of the present disclosure, the data knowledge base may be established in advance and pre-stored in the local terminal or in the server, may be established (for example, by the server) when the question recommendation method 10 is implemented, or may be read from other devices. The embodiments of the present disclosure are not specifically limited in this aspect, which can be set according to actual needs. The establishment of the data knowledge base will be described in detail in the following.
Step S120: obtaining the basic user information, and establishing a user tag set based on the basic user information.
For example, in at least one embodiment of the present disclosure, the basic user information may include, for example, the age, gender, height, weight, waist circumference, lifestyle habits, and the like of the user. For example, in an example, if the user is a diabetic patient, the basic user information also includes the type of diabetes, confirmed chronic complications, existing symptoms, and the like. For example, in another example, if the user is not a diabetic patient, the basic user information also includes the medical history of the user, fasting blood glucose level, two-hour postprandial blood glucose level, or the like. For example, in still another example, if the user is a hypertensive patient, the basic user information also includes the diastolic blood pressure, systolic blood pressure, type of hypertension comorbidity, symptoms, and the like of the user. The embodiments of the present disclosure do not specifically limit the content included in the basic user information, which can be set according to actual needs.
For example, in an example, the basic user information may be obtained from the online real-time information of the user, for example, from an online health record platform, an information management system (for example, a laboratory information management system) of a medical institution (for example, a hospital or a medical examination institution), an electronic device for physical examination report, and the like under permission.
For example, in an example, in the case where the user is a registered user of a certain software platform (for example, a health management platform), the user voluntarily fills in and saves the basic information (for example, the name, gender, age, height, weight, waist circumference, lifestyle habits, etc.) while registering, and thus the basic information of the user can be obtained directly from the database (for example, the back end) associated with the specific platform. For example, in another example, in the case where the user is not a registered user of the specific platform (e.g., the health management platform) or the user has not completed or saved the basic information in the specific platform, the basic information of the user may be collected by a third-party platform (for example, an information management system of a hospital or a medical examination institution, etc.) or related electronic devices (for example, a smart bracelet, a smart watch, etc.). For example, in an example, when a user uses a certain webpage or a certain platform for the first time, the basic information of the user may also be obtained according to the personal information entered by the user in the text box. The embodiments of the present disclosure are not specifically limited in this aspect, which can be adjusted according to actual conditions.
For example, the health management platform may interface with at least one (for example, multiple) medical institution, and obtain basic information and examination results of patients participating in physical examination at the medical institution. For example, the health management platform may also obtain basic user information from smart terminals (for example, smart measuring instruments, smart bracelets, smart watches, smart clothes, etc.) as well as at least one type of physical sign data (for example, the pulse, body temperature, heart rate, respiration, electroencephalogram, electrocardiogram, blood pressure, blood glucose, electromyogram, etc.) of the patient detected by the sensor in the smart terminal. For example, the health management platform may periodically (for example, daily) obtain, from multiple medical institutions, the examination results of patients participating in physical examination at these medical institutions, and store the examination results in advance in a database (or a memory) associated with the health management platform. It should be noted that the embodiments of the present disclosure do not specifically limit the source of the basic user information, and the source can be set according to actual needs. For example, in some other examples, the basic user information can also be filled in by the user online. For example, a corresponding web page may be provided, the user fills in his or her own information in the web page, the web page sends the information filled in by the user to the server, and the server organizes the information to obtain the basic user information.
For example, in an example, after the basic user information is obtained according to the above content, a tag set is established based on the obtained basic user information.
For example, the tag of the user may include gender, name, age, and the like of the user. For example, in an example, the tag also includes the name of the disease, such as type I diabetes, etc. For example, in another example, the tag also includes the name of the complication, such as diabetic foot, etc.
The process of establishing the tag set based on the obtained basic user information is described below in conjunction with
It should be noted that the specific tags listed in the description of the foregoing embodiments of the present disclosure are only illustrative and not limitative.
For example, in at least one embodiment of the present disclosure, the user tag set may include a multi-level tag set, the multi-level tag set includes tags of a plurality of levels, and tags of different levels are of different types.
For example, in at least one embodiment of the present disclosure, nearly two hundred basic rules can be established according to different age groups, time periods, types of disease, presence or absence of symptoms, presence or absence of complications, etc., and these rules may extend to different types of diabetes, combinations of multiple types of symptoms, and combinations of multiple types of complications. Thus, the multi-level tag set can cover the basic information and interest orientation of the user.
For example, in an example, the first-level tag of the multi-level tag set is an age group, the second-level tag of the multi-level tag set is a time period, the third-level tag of the multi-level tag set is a type of disease, and the fourth-level tag of the multi-level tag set is a complication.
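By way of non-limiting illustration only, a four-level tag set of this kind may be built in Python as follows. The class and function names and the age cut-offs are hypothetical, and the meal-time periods are example values used only to make the sketch concrete; none of these is prescribed by the embodiments.

```python
from dataclasses import dataclass
from datetime import time


@dataclass(frozen=True)
class UserTagSet:
    age_group: str            # first-level tag
    time_period: str          # second-level tag
    disease_type: str         # third-level tag
    complications: frozenset  # fourth-level tag (may hold several values)


def age_group(age: int) -> str:
    # Hypothetical cut-offs chosen only for illustration.
    if age < 18:
        return "child"
    if age < 60:
        return "adult"
    return "elderly"


# Example time periods of the day (assumed values).
TIME_PERIODS = {
    "breakfast": (time(6), time(8)),
    "lunch": (time(11), time(15)),
    "dinner": (time(17), time(20)),
}


def time_period(now: time) -> str:
    for name, (start, end) in TIME_PERIODS.items():
        if start <= now <= end:
            return name
    return "other"


def build_tag_set(age, now, disease_type, complications=()):
    """Derive the multi-level tag set from basic user information and the current time."""
    return UserTagSet(age_group(age), time_period(now),
                      disease_type, frozenset(complications))
```

The single-valued tags are stored as strings, whereas the complication tag is stored as a set because a user may have several complications.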
It should be noted that the embodiments of the present disclosure do not limit the specific rules of the user tag set, which can be set according to actual needs.
Step S130: associating the user tag set with the data knowledge base, and obtaining the candidate question set according to the plurality of knowledge question sets, the candidate question set including the plurality of candidate questions.
For example, in at least one embodiment of the present disclosure, the user tag set established in step S120 can be matched with the data knowledge base accessed in step S110, and the candidate question set can be obtained according to rule association.
For example, in at least one embodiment of the present disclosure, in step S130, associating the user tag set with the data knowledge base and obtaining the candidate question set according to the plurality of knowledge question sets includes: first establishing a mapping relationship between the user tag set and the standard question in the data knowledge base, matching the user tag set with the standard question in the data knowledge base, and then combining the knowledge question set corresponding to the matched standard question to form the candidate question set.
As illustrated in
For different age groups, different corresponding rule schemes are provided by combining the second-level tag (that is, different time periods of the day) and the third-level tag (e.g., diabetes types).
For example, there is no limitation on the order in which the tags of the respective levels are determined. For example, the age group, the time period, and the type of diabetes are each a single choice, so it may be more convenient to classify by them first; the type of complication and the like may be a multiple choice, which is more complicated and may be determined later.
For example, in at least one embodiment of the present disclosure, for children with type 1 and type 2 diabetes, during the breakfast period (for example, 6:00-8:00), questions related to diet and nutrition advice, notes for blood glucose 2 hours after breakfast, and the like may be consulted. For example, in an example, for children with type 1 and type 2 diabetes, during the breakfast period (for example, 6:00-8:00), the corresponding tag includes “type 1 and type 2 diabetes,” “child,” and “6:00-8:00,” and the corresponding candidate questions may include: “What should children with diabetes eat for breakfast?”, “What are the precautions for children with type 1 and type 2 diabetes after breakfast?”, “What equipment does the diabetic patient use to measure the blood glucose level,” etc. For children with type 1 and type 2 diabetes, during the lunch period (for example, 11:00-15:00), questions related to diet and nutrition advice, exercise advice, precautions, and the like may be consulted. For example, in an example, for children with type 1 and type 2 diabetes, during the lunch period (for example, 11:00-15:00), the corresponding tag includes “type 1 and type 2 diabetes,” “child,” and “11:00-15:00,” and the corresponding candidate questions may include: “What should children with diabetes eat for lunch?”, “What are the precautions for children with type 1 and type 2 diabetes after lunch?”, “What exercises are suitable for children with type 1 and type 2 diabetes after lunch?”, etc. For children with type 1 and type 2 diabetes, during the dinner period (for example, 17:00-20:00), questions related to diet and nutrition advice, exercise advice, precautions, medication-related notice, and the like may be consulted. 
For example, in an example, for children with type 1 and type 2 diabetes, during the dinner period (for example, 17:00-20:00), the corresponding tag includes “type 1 and type 2 diabetes,” “child,” and “17:00-20:00,” and the corresponding candidate questions may include: “What should children with type 1 and type 2 diabetes eat for dinner?”, “What are the precautions for diabetic patients after dinner?”, “What exercises are suitable for children with diabetes after dinner?”, “How do children with type 1 and type 2 diabetes take medicine at night?”, etc. It should be noted that the recommendation questions listed in the embodiments of the present disclosure are only exemplary and not limitative.
For example, in at least one embodiment of the present disclosure, for adults with type 1 and type 2 diabetes, during the breakfast period (for example, 6:00-8:00), questions related to diet and nutrition advice, notes for blood glucose 2 hours after breakfast, and the like may be consulted. For example, in an example, for adults with type 1 and type 2 diabetes, during the breakfast period (for example, 6:00-8:00), the corresponding tag includes “type 1 and type 2 diabetes,” “adult,” and “6:00-8:00,” and the corresponding candidate questions may include: “What should adults with diabetes eat for breakfast?”, “What are the precautions for adults with type 1 and type 2 diabetes after breakfast?”, “Can adult diabetic patients drink milk in the morning?”, etc. During the lunch period (for example, 11:00-15:00), questions related to diet and nutrition advice, exercise advice, precautions, and the like may be consulted. For example, in an example, for adults with type 1 and type 2 diabetes, during the lunch period (for example, 11:00-15:00), the corresponding tag includes “type 1 and type 2 diabetes,” “adult,” and “11:00-15:00,” and the corresponding candidate questions may include: “What should adults with type 1 and type 2 diabetes eat for lunch?”, “What are the precautions for adults with diabetes after lunch?”, “What exercises are suitable for adults with type 1 and type 2 diabetes after lunch?”, “How do adults with type 1 and type 2 diabetes take medicine at noon?”, etc.
During the dinner period (for example, 17:00-20:00), questions related to diet and nutrition advice, exercise advice, precautions, notes for blood glucose 2 hours after dinner, medication-related notice, and the like may be consulted. For example, in an example, for adults with type 1 and type 2 diabetes, during the dinner period (for example, 17:00-20:00), the corresponding tag includes “type 1 and type 2 diabetes,” “adult,” and “17:00-20:00,” and the corresponding candidate questions may include: “What should adults with type 1 and type 2 diabetes eat for dinner?”, “What are the precautions for adults with type 1 and type 2 diabetes after dinner?”, “What exercises are suitable for adults with type 1 and type 2 diabetes after dinner?”, “How do adults with type 1 and type 2 diabetes take medicine at night?”, etc. It should be noted that the recommendation questions listed in the embodiments of the present disclosure are only exemplary and not limitative.
For example, in at least one embodiment of the present disclosure, for elderly diabetic patients and gestational diabetic patients, during the breakfast period (for example, 6:00-8:00), questions related to diet and nutrition advice, notes for blood glucose 2 hours after breakfast, and the like may be consulted. For example, in an example, for elderly diabetic patients and gestational diabetic patients, during the breakfast period (for example, 6:00-8:00), the corresponding tag includes “diabetes,” “gestational diabetes,” “elder,” and “6:00-8:00,” and the corresponding candidate questions may include: “What should elderly diabetic patients and gestational diabetic patients eat for breakfast?”, “What are the precautions for elderly diabetic patients and gestational diabetic patients after breakfast?”, “How do elderly diabetic patients and gestational diabetic patients take medicine in the morning?”, etc. During the lunch period (for example, 11:00-15:00), questions related to diet and nutrition advice, notes for blood glucose 2 hours after lunch, exercise advice, precautions, and the like may be consulted. For example, in an example, for elderly diabetic patients and gestational diabetic patients, during the lunch period (for example, 11:00-15:00), the corresponding tag includes “diabetes,” “gestational diabetes,” “elder,” and “11:00-15:00,” and the corresponding candidate questions may include: “What should elderly diabetic patients and gestational diabetic patients eat for lunch?”, “What are the precautions for elderly diabetic patients and gestational diabetic patients after lunch?”, “What exercises are suitable for elderly diabetic patients and gestational diabetic patients after lunch?”, “How do elderly diabetic patients and gestational diabetic patients take medicine at noon?”, etc. 
During the dinner period (for example, 17:00-20:00), questions related to diet and nutrition advice, exercise advice, precautions, notes for blood glucose 2 hours after dinner, medication-related notice, and the like may be consulted. For example, in an example, for elderly diabetic patients and gestational diabetic patients, during the dinner period (for example, 17:00-20:00), the corresponding tag includes “diabetes,” “gestational diabetes,” “elder,” and “17:00-20:00,” and the corresponding candidate questions may include: “What should elderly diabetic patients and gestational diabetic patients eat for dinner?”, “What are the precautions for elderly diabetic patients and gestational diabetic patients after dinner?”, “What exercises are suitable for elderly diabetic patients and gestational diabetic patients after dinner?”, “How do elderly diabetic patients and gestational diabetic patients take medicine at night?”, etc. It should be noted that the recommendation questions listed in the embodiments of the present disclosure are only exemplary and not limitative.
For example, in at least one embodiment of the present disclosure, for pre-diabetes patients, during the breakfast period (for example, 6:00-8:00), questions related to diet and nutrition advice, notes for blood glucose 2 hours after breakfast, and the like may be consulted. For example, in an example, for pre-diabetes patients, during the breakfast period (for example, 6:00-8:00), the corresponding candidate questions may include: “What should pre-diabetes patients eat for breakfast?”, “What are the precautions for pre-diabetes patients after breakfast?”, “How do pre-diabetes patients take medicine in the morning?”, etc. During the lunch period (for example, 11:00-15:00), questions related to diet and nutrition advice, exercise advice, precautions, and the like may be consulted. For example, in an example, for pre-diabetes patients, during the lunch period (for example, 11:00-15:00), the corresponding candidate questions may include: “What should pre-diabetes patients eat for lunch?”, “What are the precautions for pre-diabetes patients after lunch?”, “How do pre-diabetes patients take medicine at noon?”, etc. During the dinner period (for example, 17:00-20:00), questions related to diet and nutrition advice, exercise advice, precautions, and the like may be consulted. For example, in an example, for pre-diabetes patients, during the dinner period (for example, 17:00-20:00), the corresponding candidate questions may include: “What should pre-diabetes patients eat for dinner?”, “What are the precautions for pre-diabetes patients after dinner?”, “What exercises are suitable for pre-diabetes patients after dinner?”, “How do pre-diabetes patients take medicine at night?”, etc. It should be noted that the recommendation questions listed in the embodiments of the present disclosure are only exemplary and not limitative.
For example, in at least one embodiment of the present disclosure, after the first-level tag (for example, the age group), the second-level tag (for example, the time period of a day), and the third-level tag (for example, the type of disease) are determined, the complication may serve as the fourth-level tag for rule association. According to clinical data, about 10 years after the onset of diabetes, 30% to 40% of patients may have at least one complication, such as cardiovascular disease, kidney disease, retinopathy, neuropathy, lower extremity vascular disease, diabetic foot, etc.
For example, in some exemplary rule schemes, if there is no confirmed complication, questions related to one or more complications are selected according to the existing symptoms of the user, and these related questions are then added to the candidate question set corresponding to the user, that is, to form the plurality of candidate questions corresponding to the user. For example, in the case of no confirmed complication, the existing symptoms of the user are combined with clinical data so as to recommend questions related to one or more complications. For example, in an example, high blood pressure, pain in the precordial area, palpitation, chest tightness, and the like are symptoms of cardiovascular disease complications. For example, when the user already has symptoms of high blood pressure and pain in the precordial area, the user tag set may include the tag “cardiovascular disease,” so that some questions related to cardiovascular disease complications can be recommended, for example, “What are the symptoms of cardiovascular disease complications?”, “Why do you feel palpitation and chest tightness?”, “What should you do if you have palpitation and chest tightness?”, etc. The embodiments of the present disclosure are not specifically limited in this aspect. For example, in another example, foamy urine, difficulty in urination, lower extremity edema, eyelid edema, and the like are symptoms of diabetic nephropathy complications. For example, in another example, blurred vision, decreased vision, and the like are symptoms of retinopathy complications. For example, in another example, slurred speech, decreased memory, persistent numbness and tingling of hands and feet, swelling, and the like are symptoms of neuropathic complications. For example, in another example, intermittent numbness and weakness of the lower limbs, claudication, night pain in the lower limbs, and the like are symptoms of lower limb vascular disease complications.
For example, in another example, long-term leg ulcers, thickened and swelled fingertips or tiptoes, big toe bone protruding, and the like are symptoms of diabetic foot complications.
For example, if a complication has been diagnosed, related questions can be recommended based on the complication. For example, in an example, if the user tag includes “cardiovascular disease,” the matched questions in the data knowledge base may include: “If the diabetic patient has cardiovascular disease complications, what treatment should be used?”, “What are the symptoms of cardiovascular disease complications of the diabetic patient?”, etc. The embodiments of the present disclosure are not specifically limited in this aspect.
It should be noted that the classification of various diseases and the classification of various symptoms in complications described in the embodiments of the present disclosure are only to describe how to establish the mapping relationship between the user tag set and the data knowledge base, that is, only to illustrate the above specific rule schemes for purpose of description. The classification and symptom analysis of specific diseases may be adjusted and set based on a large amount of clinical data, professional experience judgments, etc. The embodiments of the present disclosure are not specifically limited in this aspect.
With reference to
For example, in at least one embodiment of the present disclosure, the user is a hypertensive patient. The rule scheme for hypertensive patients is different from the rule scheme for diabetic patients. For example, in an example, the rule scheme of a hypertensive patient can be directly determined as pre-hypertension, mild hypertension, moderate hypertension, and severe hypertension according to values of diastolic blood pressure and systolic blood pressure, so as to execute different rule schemes. For example, in the rule scheme for hypertensive patients, for the first-level tag (i.e., the age group), a stage is specifically set for the elderly over 80 years old, which is classified as critical elderly hypertension, and blood pressure must be monitored at any time. In addition, because diabetes is a common complication of hypertension, it is also necessary to simultaneously monitor blood pressure and blood glucose at a specific time in the rule scheme for hypertensive patients.
It should be noted that the rule scheme provided by the embodiments of the present disclosure is only illustrative, and the embodiments of the present disclosure are not limited in the specific rule scheme, which may be set according to actual needs.
For example, in the embodiments of the present disclosure, after the mapping relationship between the user tag set and the standard question in the data knowledge base is established, the user tag set is matched with the standard question in the data knowledge base by using methods such as entity recognition, keyword matching, deep learning, etc. Then the knowledge question set corresponding to the matched standard question is retrieved from the data knowledge base, so as to form the candidate question set. In the above, the entity recognition refers to the recognition of entities with specific meanings in the text, for example, a name of a person, a place, time, etc. For example, the keyword matching method includes broad matching, exact matching, phrase matching, negative matching, and the like. For example, in an example, if the text content of the tag is “diabetes,” it can be matched to the recommendation question containing the word “diabetes” in the data knowledge base, for example, “What is the recipe for diabetic patients?”, “What exercise is suitable for diabetic patients?”, “What are the symptoms of diabetic patients?”, or the like, and the embodiments of the present disclosure are not specifically limited in this aspect.
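The keyword matching step can be sketched as follows. This is only a broad-match illustration under stated assumptions: the dictionary KNOWLEDGE_BASE, its layout, the extended question, the placeholder answers, and the function match_candidates are all hypothetical, and entity recognition or deep learning matching is not shown.

```python
# Hypothetical data knowledge base: standard question -> knowledge question set.
KNOWLEDGE_BASE = {
    "What should children with diabetes eat for breakfast?": {
        "extended": ["What breakfast is suitable for a diabetic child?"],
        "answer": "(standard answer text)",
    },
    "What are the symptoms of patients with type 1 diabetes?": {
        "extended": [],
        "answer": "(standard answer text)",
    },
    "How often do patients with hypertension monitor blood pressure every day?": {
        "extended": [],
        "answer": "(standard answer text)",
    },
}

def match_candidates(tag_set, knowledge_base):
    """Broad keyword match: keep a standard question if any tag occurs in it."""
    candidates = []
    for standard_question, question_set in knowledge_base.items():
        text = standard_question.lower()
        if any(tag.lower() in text for tag in tag_set):
            # The matched standard question and its extended questions
            # together contribute to the candidate question set.
            candidates.append(standard_question)
            candidates.extend(question_set["extended"])
    return candidates

candidates = match_candidates(["diabetes", "child"], KNOWLEDGE_BASE)
for question in candidates:
    print(question)
```

A substring test is the simplest form of broad matching; exact, phrase, or negative matching would replace the condition inside the loop.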
For example, in an example, at 12 p.m., an elderly person over 60 years old with type 1 diabetes and complications usually wants to get the corresponding diet advice and blood glucose monitoring advice, and also wants to know about questions related to complications. Through the foregoing steps, a corresponding user tag set is established based on the basic information of the user (i.e., including tags: “type 1 diabetes,” “complication,” “over 60 years old” and “12 p.m.”). The user tag set is associated with the data knowledge base, a candidate question set is obtained from multiple knowledge question sets included in the data knowledge base, and the candidate question set includes multiple candidate questions. For example, according to the user tag set, a plurality of related candidate questions can be matched from the data knowledge base by methods such as entity recognition, keyword matching, deep learning, and the like, so as to form the candidate question set. For example, the plurality of candidate questions corresponding to the user in the above example may include: “What are the recipes for three meals a day for diabetic patients?”, “What exercise is suitable for elderly diabetic patients?”, “How often do diabetic patients monitor blood glucose every day?”, “What are the symptoms of patients with type 1 diabetes?”, and “How to treat diabetic complications and peripheral neuropathy?”.
Therefore, by establishing the mapping relationship between the user tag set and the standard question in the data knowledge base, convenient conditions are provided for quickly retrieving the corresponding knowledge question set (including the standard question, the extended question, and the standard answer) from the data knowledge base through the user tag set while establishing rule association in the subsequent process.
Step S140: obtaining user behavior data, and obtaining a user interest parameter based on the user behavior data.
For example, in at least one embodiment of the present disclosure, user behavior data (for example, user behavior logs) can be acquired from software on a client terminal or from a Web server, or user behavior data can also be collected in a customized manner, which is not limited in the embodiments of the present disclosure. For example, the user behavior data can include all behavior data generated while the user visits a website, such as visiting, browsing, clicking, etc. That is, the user behavior data can reflect the specific behavior of the user, for example, which link or which page the user clicks on, which search term the user adopts, etc. For example, in at least one embodiment of the present disclosure, the user interest parameter can be obtained by analyzing the user behavior data described above. For example, in an example, the feedback behavior of the user may include the explicit feedback behavior and the implicit feedback behavior. For example, the explicit feedback behavior includes the explicit feedback of the user on the answer, such as clearly choosing whether the answer is helpful. As illustrated in FIG. 3B, in an example, on a certain software platform (for example, a health management platform), when a user raises a certain question, for example, “How often do patients with hypertension monitor blood pressure every day?”, the platform asks the user “Does the answer help you?” after providing the answer to the question. According to whether the user clicks on “YES” or “NO,” the feedback of the user on the answer can be clearly known, which can reflect the interest and concern of the user. The implicit feedback behavior refers to behavior that does not directly reflect the preference of the user, but reflects the preference of the user in an indirect way, for example, through the frequency with which the user clicks and browses within a certain period of time.
For example, in an example, the health knowledge read by the user can be summarized by the maximal marginal relevance (MMR) algorithm. The MMR algorithm extracts sentences from the document according to their importance to form the summary, the high-frequency words in the summary can then be obtained by the term frequency-inverse document frequency (TF-IDF) method, and these high-frequency words (also called keywords) are also important features reflecting the interest and concern of the user.
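The keyword extraction stage can be sketched with a small TF-IDF computation. This is a minimal sketch under stated assumptions: the function name tfidf_keywords, whitespace tokenization, and the toy texts are hypothetical, and the MMR summarization stage is not shown (the texts below stand in for already produced summaries).

```python
import math
from collections import Counter

def tfidf_keywords(documents, doc_index, top_k=3):
    """Return the top_k words of one document ranked by TF-IDF score."""
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each word appears.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))
    counts = Counter(tokenized[doc_index])
    total = len(tokenized[doc_index])
    scores = {
        word: (count / total) * math.log(n_docs / doc_freq[word])
        for word, count in counts.items()
    }
    # Sort by descending score, breaking ties alphabetically.
    return sorted(scores, key=lambda w: (-scores[w], w))[:top_k]

# Toy stand-ins for summaries of health knowledge read by the user.
docs = [
    "insulin dose insulin timing",
    "diet and exercise advice",
    "diet and sleep advice",
]
print(tfidf_keywords(docs, 0, top_k=1))
```

Words that are frequent in one summary but rare across the others receive the highest scores, which is why they can serve as keywords reflecting the interest of the user.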
For example, in an example, if a user uses a specific software platform (for example, a health management platform) in a certain device for the first time, user behavior data can be obtained by analyzing the log of the application stored in the device. The log may be a log stored after startup and operation of the device this time, or may be a log stored after startup and operation of the device last time. The embodiments of the present disclosure are not specifically limited in this aspect, which can be adjusted according to actual conditions.
For example, in an example, the question clicked by the user, the high-frequency word browsed by the user, the keyword, and the like, which reflect the interest and concern of the user, are converted into the user interest parameter. For example, the user interest parameter may be a numerical vector to reflect the interest and concern of the user. For example, word embedding is performed on the question clicked by the user or the word and sentence that the user is interested in, and the embedding vector of the user-interest word and sentence is generated, which forms the aforementioned “user interest parameter.” The word embedding can be understood as a mapping relationship, which can map or embed a word in a text space into another numerical vector space by a certain method. In other words, the word embedding can express the vocabulary and complete sentence in vector form.
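One simple way to turn user-interest words into a numerical vector is to average their word embeddings. The sketch below assumes a tiny hand-written embedding table (TOY_EMBEDDINGS) and a hypothetical function embed_text; a real system would use trained embeddings, and averaging is only one possible way to obtain a sentence-level vector.

```python
# Hypothetical 3-dimensional word embeddings (illustrative values only).
TOY_EMBEDDINGS = {
    "diabetes": [0.9, 0.1, 0.0],
    "breakfast": [0.1, 0.8, 0.1],
    "exercise": [0.0, 0.2, 0.8],
}

def embed_text(text, embeddings, dim=3):
    """Average the embeddings of the known words in the text."""
    vectors = [embeddings[w] for w in text.lower().split() if w in embeddings]
    if not vectors:
        # No known word: fall back to the zero vector.
        return [0.0] * dim
    return [sum(component) / len(vectors) for component in zip(*vectors)]

# Words taken from questions the user clicked on form the user interest parameter.
user_interest = embed_text("diabetes breakfast", TOY_EMBEDDINGS)
print(user_interest)
```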
Step S150: based on the user interest parameter and the plurality of candidate questions, obtaining at least one similarity feature between each candidate question of the plurality of candidate questions and the user interest parameter.
For example, in at least one embodiment of the present disclosure, based on the user interest parameter and the plurality of candidate questions, obtaining at least one similarity feature between each candidate question of the plurality of candidate questions and the user interest parameter may include: by using at least one similarity matching model, obtaining the at least one similarity feature between each candidate question and the user interest parameter based on the user interest parameter and the plurality of candidate questions.
For example, in at least one embodiment of the present disclosure, the at least one similarity matching model includes at least one of a group consisting of a cosine similarity model, a Jaccard similarity model, an edit distance (Levenshtein) similarity model, a word mover's distance (WMD) similarity model, and a deep structured semantic (DSSM) similarity model.
For example, in an example, a user interest parameter A (for example, a numerical vector) is provided, and any candidate question (for example, a standard question or an extended question) in the candidate question set is subjected to word embedding to generate the embedding vector B of the candidate question, where the vector B is also a numerical vector. As described above, the word embedding maps a word or a complete sentence in a text space into a numerical vector space. The user interest parameter A and the embedding vector B of a certain candidate question are input into each of the above-mentioned similarity matching models, and each similarity matching model outputs one similarity feature between the numerical vectors A and B. The larger the value of the similarity feature, the more similar the word and sentence corresponding to the vector A and the word and sentence corresponding to the vector B. The embodiments of the present disclosure do not limit the number of similarity matching models. For example, in an example, if five similarity matching models are used, there may be five similarity features between the vectors A and B. For example, in an example, if three similarity matching models are used, the vectors A and B may have three similarity features.
In the following, several similarity matching models mentioned above are briefly introduced.
(1) The cosine similarity: the cosine similarity uses the cosine value of the angle between two vectors as a measure of the difference between two individuals. The closer the cosine value is to 1, the more similar the two vectors A and B are. The following formula is usually used to calculate the cosine similarity:
$$\cos(\theta) = \frac{A \cdot B}{\lVert A \rVert \, \lVert B \rVert} = \frac{\sum_{i=1}^{n} A_i B_i}{\sqrt{\sum_{i=1}^{n} A_i^{2}} \, \sqrt{\sum_{i=1}^{n} B_i^{2}}}$$
In addition, the similarity features output by the cosine similarity model are continuous.
(2) The Jaccard distance: the Jaccard distance is used to measure the degree of discrimination between two sets according to the proportion of differing elements among all elements of the two sets. It is expressed by the following formula, where J(A, B) is the Jaccard similarity coefficient:
$$d_J(A, B) = 1 - J(A, B) = \frac{|A \cup B| - |A \cap B|}{|A \cup B|}, \qquad J(A, B) = \frac{|A \cap B|}{|A \cup B|}$$
In addition, the similarity features output by the Jaccard similarity model are continuous.
(3) The edit distance, also known as the Levenshtein distance, refers to the minimum number of operations required to convert string A into string B by using character operations. The permitted character operations include modifying a character, inserting a character, and deleting a character. Generally speaking, the smaller the edit distance of two strings, the more similar the two strings are. If two strings are identical, the edit distance of the two strings is 0. In addition, the similarity features output by the edit distance similarity model are continuous.
(4) The word mover's distance (WMD) measures the semantic similarity between two documents as a whole by finding the minimum cumulative distance that the embedded words of one document need to travel to reach the embedded words of the other document. The similarity features output by the WMD similarity model are continuous.
(5) The DSSM model: the DSSM model is a deep structured semantic model, which maps the two texts to be matched into a low-dimensional space, so that the relevance problem is transformed into a distance between vectors in the low-dimensional space. The model can not only be used to predict the semantic similarity of two sentences, but can also obtain the low-dimensional semantic vector representation of the sentences. In addition, the similarity features output by the DSSM model are discrete.
For example, in at least one embodiment of the present disclosure, the above five similarity matching models can be applied to the user interest parameter and the candidate questions at the same time, so that, for each candidate question, five similarity features with respect to the user interest parameter can be obtained, one for each of the five similarity matching models.
It should be noted that the similarity matching model used in the embodiments of the present disclosure may not be limited to the similarity matching model described above, and other similarity matching models can also be used, as long as the same or similar technical effects can be achieved, that is, the similarity between two vectors can be calculated, which is not specifically limited in the embodiments of the present disclosure. In addition, the embodiments of the present disclosure do not limit the number of similarity matching models used, which can be set according to actual needs.
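Three of the similarity matching models introduced above can be sketched in a few lines. This is a minimal illustration under stated assumptions: all function names are hypothetical, the Jaccard distance is computed on whitespace-separated word sets, and WMD and DSSM are omitted because they require trained embeddings or a trained network.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two numerical vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_a * norm_b)

def jaccard_distance(text_a, text_b):
    """One minus the Jaccard similarity coefficient of the two word sets."""
    set_a, set_b = set(text_a.lower().split()), set(text_b.lower().split())
    union = set_a | set_b
    if not union:
        return 0.0  # two empty texts are treated as identical
    return 1.0 - len(set_a & set_b) / len(union)

def edit_distance(a, b):
    """Levenshtein distance computed row by row with dynamic programming."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # delete a character
                               current[j - 1] + 1,       # insert a character
                               previous[j - 1] + cost))  # modify a character
        previous = current
    return previous[-1]

def similarity_features(vec_a, vec_b, text_a, text_b):
    """Collect one feature per similarity matching model."""
    return [
        cosine_similarity(vec_a, vec_b),
        jaccard_distance(text_a, text_b),
        edit_distance(text_a, text_b),
    ]

features = similarity_features([1.0, 0.0], [1.0, 0.0],
                               "diabetes breakfast", "diabetes lunch")
print(features)
```

Note that the cosine similarity grows with similarity, whereas the Jaccard distance and the edit distance shrink with it; a ranking model that consumes all of them can learn the direction of each feature from training data.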
Step S160: based on basic user information, the plurality of candidate questions, and the at least one similarity feature, sorting the plurality of candidate questions to obtain a question sequence.
For example, in at least one implementation of the present disclosure, based on basic user information, the plurality of candidate questions, and the at least one similarity feature, sorting the plurality of candidate questions to obtain a question sequence may include: by using a ranking model, combining the basic user information, the plurality of candidate questions, and the at least one similarity feature to form an input feature vector of the ranking model, obtaining a score corresponding to each candidate question of the plurality of candidate questions, and sorting the plurality of candidate questions according to a value of the score (for example, in order of the score from high to low) corresponding to each candidate question of the plurality of candidate questions, so as to obtain the question sequence.
For example, in at least one embodiment of the present disclosure, the basic information of a certain user includes: gender is male (for example, the corresponding discrete feature is “0”), and suffering from diabetes (for example, the corresponding discrete feature is “1”). The embedding vector corresponding to one of the candidate questions is [0.3, 0.5, 0.6]. The similarity features between the candidate question and the user interest parameter include: cosine similarity of 0.85, Jaccard distance of 0.91, edit distance of 3, WMD of 1.17, and DSSM of 2. In this case, these features are combined into the input feature vector of the ranking model, namely the vector [0, 1, 0.3, 0.5, 0.6, 0.85, 0.91, 3, 1.17, 2]. It should be noted that the feature data provided in this embodiment is only exemplary, the specific value of the feature data may be set according to experimental results or actual conditions, and the embodiments of the present disclosure are not specifically limited in this aspect.
Here, the basic user information, the plurality of candidate questions, and the at least one similarity feature form the input feature vector of the ranking model, so that the individual factors of the user (for example, basic user information, user behavior data, etc.) are fully considered for personalized question recommendation, which allows the user to more specifically learn knowledge related to self-health.
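The composition of the input feature vector in the above example amounts to a concatenation, as in the following minimal sketch. The function name build_input_vector is hypothetical; the numbers are the exemplary values given above.

```python
def build_input_vector(user_features, question_embedding, similarity_features):
    """Concatenate the three feature groups in a fixed order."""
    return list(user_features) + list(question_embedding) + list(similarity_features)

x = build_input_vector(
    [0, 1],                       # discrete basic user information (male, diabetes)
    [0.3, 0.5, 0.6],              # embedding vector of the candidate question
    [0.85, 0.91, 3, 1.17, 2],     # cosine, Jaccard, edit distance, WMD, DSSM
)
print(x)
```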
For example, in at least one embodiment of the present disclosure, the ranking model used is the classic Wide&Deep model proposed by Google. In the above example, the vector [0, 1, 0.3, 0.5, 0.6, 0.85, 0.91, 3, 1.17, 2] is input to the Wide&Deep model as the input feature vector. The core idea of the model is to combine the memory ability of the linear model with the generalization ability of the deep neural network model, and reflect the integration of relevance and diversity in the recommended scene. This model is used to score the candidate questions in the question candidate set, and the corresponding candidate questions are sorted in the order of the scores from high to low, so as to obtain the question sequence. Then, according to actual needs, the first N questions in the question sequence are shown to the user, where N is an integer greater than or equal to 1.
For example, in at least one embodiment of the present disclosure, the score of a candidate question can be expressed as a conditional probability P(y|x). For example, y represents a tag corresponding to a certain user behavior. For example, if the user clicks on the candidate question, then y=1; and if the user does not click on the candidate question, then y=0. Here, x represents the input feature vector. The input feature vector x includes the discrete features of the basic user information described above, the continuous embedding vector of the candidate question, and at least one similarity feature between the candidate question and the user interest parameter (the similarity feature may be of continuous or discrete type). For example, the probability value P(y=1|x) output for y=1 is used as the final score of the candidate question, so that the score (i.e., the probability value) of each candidate question in the candidate question set can be output through the ranking model (i.e., the Wide&Deep model).
The principle of the Wide&Deep model is briefly introduced below in conjunction with
As illustrated in
For the Wide part, the Wide model is a generalized linear model (for example, logistic regression) with the following formula:
$$y = w^{T}x + b$$
For example, x represents the feature vector [x1, x2, x3 . . . ], w represents the parameter vector [w1, w2, w3 . . . ], b is the bias term, and y is the output tag (label). The output is mapped by the sigmoid function to a probability value between 0 and 1. The input features used in the Wide part are discrete features, for example, discrete features of basic user information and discrete similarity features.
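As an illustration of the Wide part only, the following minimal sketch computes the linear term and passes it through the sigmoid function. The weight values and the bias are hypothetical numbers chosen for illustration; in practice they would be learned during training.

```python
import math
from typing import Sequence


def sigmoid(z: float) -> float:
    """Map a real value to the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def wide_score(x: Sequence[float], w: Sequence[float], b: float) -> float:
    """Return sigmoid(w^T x + b) for a discrete feature vector x."""
    if len(x) != len(w):
        raise ValueError("x and w must have the same length")
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return sigmoid(z)


# Discrete user features from the running example: male -> 0, diabetes -> 1.
x_wide = [0.0, 1.0]
w = [0.4, -0.2]   # hypothetical parameter vector
b = 0.2           # hypothetical bias term
p = wide_score(x_wide, w, b)
print(p)
```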
For the Deep part, the Deep model is a feed-forward neural network. Generally, the input of the deep neural network model is continuous dense features. The sparse high-dimensional features need to be subjected to embedding (dimensionality reduction) so as to be converted to low-dimensional dense features, which are used as the input of the first hidden layer and are updated by back-propagation according to the final error (loss). The activation function f of the hidden layer usually adopts the ReLU function, which mitigates the vanishing gradient problem. Therefore, the input features used in the Deep part are continuous features, such as the embedding vector of the candidate question and the continuous similarity feature.
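The forward pass of such a feed-forward network can be sketched as below. The layer sizes and weight values are hypothetical and chosen only so that the arithmetic is easy to follow; the disclosure does not specify the network architecture.

```python
from typing import List, Sequence, Tuple

Layer = Tuple[Sequence[Sequence[float]], Sequence[float]]  # (weights, bias)


def relu(values: Sequence[float]) -> List[float]:
    """ReLU activation applied element-wise."""
    return [max(0.0, v) for v in values]


def dense(x: Sequence[float], weights: Sequence[Sequence[float]],
          bias: Sequence[float]) -> List[float]:
    """Affine transform; `weights` holds one row per output unit."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def deep_forward(x: Sequence[float], layers: Sequence[Layer]) -> List[float]:
    """Pass continuous features through each hidden layer with ReLU."""
    a = list(x)
    for weights, bias in layers:
        a = relu(dense(a, weights, bias))
    return a


# Hypothetical two-layer network applied to the embedding [0.3, 0.5, 0.6].
layers = [
    ([[1.0, 0.0, -1.0], [0.5, 0.5, 0.5]], [0.0, 0.0]),  # 3 inputs -> 2 units
    ([[1.0, 1.0]], [0.0]),                              # 2 inputs -> 1 unit
]
a_final = deep_forward([0.3, 0.5, 0.6], layers)
print(a_final)
```

In this toy network the first hidden unit receives a negative pre-activation and is zeroed by ReLU, so only the second unit contributes to the final activation.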
During model training, the gradient is calculated according to the final error and back-propagated to the Wide part and Deep part to continuously update the parameters of the model, so as to obtain the final model. It should be noted that training the Wide model and the Deep model at the same time does not mean model fusion, but the weighted sum of the results of the two models serves as the final prediction result:
$$P(Y=1 \mid x) = \sigma\left(w_{wide}^{T}[x, \phi(x)] + w_{deep}^{T}a^{(l_f)} + b\right)$$
In the above formula, σ is the sigmoid function, w_wide is the weight of the Wide part, x is the original feature vector, ϕ(x) is the cross feature, for example, the new feature combined after one-hot encoding, w_deep is the weight applied to the output of the last activation layer of the neural network of the Deep part, l represents the hidden layer, f represents the activation function, a^(l_f) is the activation of the final hidden layer l_f obtained by applying the activation function f layer by layer to the input features, and b is the bias term.
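To make the combination of the two parts concrete, the sketch below evaluates the weighted sum inside the sigmoid for one candidate question. It assumes that the cross feature ϕ(x) is the pairwise product of binary discrete features, which is one common choice and not necessarily the one used in the disclosure; all weight values are hypothetical.

```python
import math
from typing import List, Sequence


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def cross_features(x: Sequence[float]) -> List[float]:
    """Pairwise products of discrete features (an assumed form of phi(x))."""
    n = len(x)
    return [x[i] * x[j] for i in range(n) for j in range(i + 1, n)]


def wide_deep_probability(
    x_wide: Sequence[float],
    a_final: Sequence[float],
    w_wide: Sequence[float],
    w_deep: Sequence[float],
    b: float,
) -> float:
    """sigmoid(w_wide^T [x, phi(x)] + w_deep^T a_final + b)."""
    wide_in = list(x_wide) + cross_features(x_wide)
    if len(wide_in) != len(w_wide) or len(a_final) != len(w_deep):
        raise ValueError("weight and feature dimensions do not match")
    z = sum(w * v for w, v in zip(w_wide, wide_in))
    z += sum(w * v for w, v in zip(w_deep, a_final))
    return sigmoid(z + b)


# Hypothetical values: two discrete user features and a one-unit deep output.
score = wide_deep_probability(
    x_wide=[0.0, 1.0],
    a_final=[0.7],
    w_wide=[0.1, 0.3, 0.5],   # two raw features plus one cross feature
    w_deep=[1.0],
    b=-1.0,
)
print(score)
```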
Model training uses joint training. In ensemble learning, each model is trained individually and the models are fused only in the final prediction stage, whereas in joint training the fusion is performed in the training stage. The training error is fed back to both the Wide model and the Deep model to update the weights. Therefore, the Wide model focuses on the cross product of discrete features, and non-linear transformation is performed on the original features to generate the memory of feature interactions. The Deep model focuses on generalization, and the deep neural network uses low-dimensional dense features and requires only a small amount of feature engineering to better generalize to feature combinations that do not appear in the training samples, thereby improving the generalization ability of the model. After the model is trained, the model is deployed to the question recommendation scenario.
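The way a single training error reaches both parts can be written out as follows. This derivation assumes the logistic (cross-entropy) loss, which is the usual choice for a click/no-click label but is not stated in the disclosure; z denotes the argument of the sigmoid function and a^{(l_f)} denotes the final activation of the Deep part.

```latex
\begin{aligned}
z &= w_{wide}^{T}[x, \phi(x)] + w_{deep}^{T}a^{(l_f)} + b, \qquad p = \sigma(z),\\
L &= -\bigl[\,y\log p + (1-y)\log(1-p)\,\bigr],\\
\frac{\partial L}{\partial z} &= p - y,\\
\frac{\partial L}{\partial w_{wide}} &= (p - y)\,[x, \phi(x)],\qquad
\frac{\partial L}{\partial w_{deep}} = (p - y)\,a^{(l_f)},\qquad
\frac{\partial L}{\partial b} = p - y,\\
\frac{\partial L}{\partial a^{(l_f)}} &= (p - y)\,w_{deep}.
\end{aligned}
```

The same factor (p − y) appears in every gradient, which is why the Wide weights and the Deep weights are updated together from one error signal; the last line is the quantity that is back-propagated further through the hidden layers of the Deep part.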
It should be noted that more information about the Wide&Deep model can be found in other related references, and the above description of the present disclosure is only a schematic introduction.
For example, in an example of the present disclosure, in the question recommendation method 10, the ranking model (i.e., the Wide&Deep model) is adopted to compose the input feature vector by basic user information, the plurality of candidate questions, and at least one similarity feature. Each of the candidate questions included in the candidate question set returns a score. According to the value of the score, the candidate questions are sorted to obtain the question sequence. For example, in an example, the scores are sorted from high to low. For example, in another example, the scores are sorted from low to high. It should be noted that the scores can also be sorted in other orders, and the embodiments of the present disclosure are not specifically limited in this aspect.
In step S170, based on an order of the question sequence, at least one candidate question in the question sequence is recommended to the user. For example, in an example, the question sequence is sorted according to the order of the scores from high to low, and the first N candidate questions in the question sequence are recommended to the user, where N is an integer greater than or equal to 1. For example, the first 5 (or other numerical values) candidate questions in the question sequence are selected as the final recommendation questions and recommended to the user.
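Sorting by score and selecting the first N candidate questions can be sketched as below. The function name `recommend_top_n`, the example questions, and the scores are hypothetical and serve only to illustrate the high-to-low ordering described above.

```python
from typing import Dict, List


def recommend_top_n(scores: Dict[str, float], n: int) -> List[str]:
    """Sort candidate questions by score (high to low) and keep the first n."""
    if n < 1:
        raise ValueError("N must be an integer greater than or equal to 1")
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return [question for question, _ in ranked[:n]]


# Hypothetical scores returned by the ranking model for three candidates.
scores = {
    "What should a diabetic eat for breakfast?": 0.82,
    "How is insulin stored?": 0.91,
    "What causes diabetic foot?": 0.40,
}
top = recommend_top_n(scores, 2)
print(top)
```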
For example, in an embodiment, when the user is inputting the question that the user wants to consult into the input text box, the platform can match the related question in the data knowledge base according to the text information input by the user through, for example, entity recognition, keyword matching, or other suitable methods. These questions are displayed at the top of the input text box in the user interface. For example, as illustrated in
It should be noted that some steps of step S110 to step S170 may be selectively executed according to actual conditions, which are not specifically limited in the embodiments of the present disclosure.
For example, in an embodiment, when the same user uses a certain platform multiple times, if the basic information has not changed, step S110 to step S130 can be omitted, and the pre-stored candidate question set of the user can be directly retrieved. For example, in an example, when a user uses a certain platform (for example, a health management platform) for the first time, the platform has generated a corresponding candidate question set for the user, which is saved in the database associated with the platform. When the user uses the platform for the second time, step S110 to step S130 may not be repeated to obtain the candidate question set of the user, but the candidate question set of the user can be obtained by retrieving the candidate question set previously stored in the associated database. The embodiments of the present disclosure are not specifically limited in this aspect, which may be adjusted according to actual conditions.
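One possible way to reuse a stored candidate question set is sketched below. The class `CandidateSetStore`, the in-memory dictionary standing in for the database, and the `generate_candidates` callback standing in for steps S110 to S130 are all assumptions made for illustration.

```python
from typing import Callable, Dict, List, Tuple

BasicInfo = Dict[str, str]
Fingerprint = Tuple[Tuple[str, str], ...]


class CandidateSetStore:
    """Keeps the candidate question set of each user until the basic info changes."""

    def __init__(self, generate: Callable[[BasicInfo], List[str]]) -> None:
        self._generate = generate
        self._cache: Dict[str, Tuple[Fingerprint, List[str]]] = {}

    def get(self, user_id: str, basic_info: BasicInfo) -> List[str]:
        fingerprint = tuple(sorted(basic_info.items()))
        cached = self._cache.get(user_id)
        if cached is not None and cached[0] == fingerprint:
            return cached[1]            # basic info unchanged: skip regeneration
        candidates = self._generate(basic_info)
        self._cache[user_id] = (fingerprint, candidates)
        return candidates


calls: List[BasicInfo] = []


def generate_candidates(info: BasicInfo) -> List[str]:
    """Hypothetical stand-in for steps S110 to S130."""
    calls.append(dict(info))
    return [f"Questions about {info['condition']}"]


store = CandidateSetStore(generate_candidates)
first = store.get("user-1", {"gender": "male", "condition": "diabetes"})
second = store.get("user-1", {"gender": "male", "condition": "diabetes"})
third = store.get("user-1", {"gender": "male", "condition": "hypertension"})
print(len(calls))
```

The second call returns the stored set without regenerating it, while the third call regenerates the set because the basic information has changed.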
The question recommendation method 10 provided by at least one embodiment of the present disclosure not only effectively avoids the problem of inappropriate feedback answers caused by unclear expression of the patient, but also provides personalized question recommendation with regard to individual factors of the user (for example, basic user information, user behavior data, etc.), so that the user can learn the knowledge related to health conditions in a more targeted manner. Moreover, the ranking model (i.e., the Wide&Deep model) fuses various features, such as basic user information, the output of each similarity matching model, and the like, so as to make the question recommendation relevant, personalized, and diversified while also paying attention to the final feedback sequence, thereby achieving a more user-friendly effect.
Referring to
Compared with the question recommendation method 10 illustrated in
For example, in an example, a web crawler can be used to retrieve the data set containing a large amount of data from a network such as the Internet. For example, the web crawler, also known as the web spider, is a program or script that automatically crawls information on the World Wide Web according to certain rules.
For example, in an embodiment of the present disclosure, when the above-mentioned question recommendation method 50 is applied to a medical intelligent question answering scenario, the question recommendation method 50 can be used to recommend the questions related to a certain disease, and the data set can be based on at least one of a group consisting of the diagnosis data set between the doctor and the patient, the hot issue related to the disease, and the reward question. For example, in an example, the reward question may be a question that requires payment for consultation on certain websites (for example, www.xywy.com, www.39.net, etc.), and the embodiments of the present disclosure are not specifically limited in this aspect.
For example, the TF-IDF method may be used to extract the high-frequency keyword from the data set, such as the “symptoms,” “treatment,” “blood glucose,” “diet,” “medication,” “examination,” “insulin,” “diabetic foot,” etc. According to the high-frequency keyword, a large amount of data from the data set can be classified based on intention, which can facilitate the construction of the data knowledge base. For example, the deep learning algorithm Text-CNN is used to classify the data set based on intention, and standard questions and corresponding extended questions and standard answers are sorted (for example, manually sorted) under each intention, thereby establishing the complete data knowledge base containing basic health knowledge related to chronic diseases, such as diabetes, hypertension, and the like. For example, in an example, based on the extracted high-frequency keyword, multiple types of intentions can be manually determined. For example, the type of intention can be manually determined according to the degree of attention of people, the frequency of occurrence of keywords, etc., and the embodiments of the present disclosure are not specifically limited in this aspect. For example, in an example, the following intention categories are manually determined: diet, exercise, medication, examination, complications, surgery, treatment, symptoms, etc., and the embodiments of the present disclosure are not specifically limited in this aspect. Then, corresponding data (for example, the questions matching this type of intention) can be manually organized under each category of intention and used as training data for model training by using the deep learning algorithm Text-CNN. Then a large amount of data from the data set may be input into the trained model, and the model outputs the intention category corresponding to each data, so as to realize the intention classification of a large amount of data. 
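The keyword extraction step can be illustrated with a small TF-IDF sketch. It assumes whitespace-tokenized text and a toy corpus invented for illustration; a real data set would need a proper tokenizer, and the Text-CNN intention classifier described above is not shown.

```python
import math
from collections import Counter
from typing import List, Sequence


def tfidf_keywords(docs: Sequence[str], top_k: int) -> List[str]:
    """Rank terms by their TF-IDF weight summed over all documents."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term occurs.
    df: Counter = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    scores: Counter = Counter()
    for tokens in tokenized:
        for term, count in Counter(tokens).items():
            tf = count / len(tokens)
            idf = math.log(n_docs / df[term])
            scores[term] += tf * idf
    return [term for term, _ in scores.most_common(top_k)]


# Toy corpus (hypothetical); each string stands for one crawled record.
docs = [
    "insulin insulin dose",
    "diet plan",
    "diet exercise plan",
    "exercise plan",
]
keywords = tfidf_keywords(docs, top_k=1)
print(keywords)
```

A term that occurs in every document receives an inverse document frequency of zero, so terms that are common to the whole data set are not selected as keywords.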
In order to improve the accuracy of intention classification, standard questions and corresponding extended questions and standard answers can be manually filtered, sorted and supplemented under each intention. For example, when the question recommendation method 50 is executed for the first time, step S510 is executed to establish the data knowledge base, and the data knowledge base is stored in a server, a memory, or a database. When the question recommendation method 50 is executed again later, step S510 can be omitted, and the data knowledge base can be directly accessed, thereby improving the processing efficiency. For example, step S510 can be executed once in a while, or the data knowledge base can be updated in other suitable ways, so that the update and optimization of the data knowledge base can be realized, and the candidate questions obtained in the subsequent steps can be more relevant to user needs and more relevant to the cognition level of current society.
The technical effects achieved by the question recommendation method 50 illustrated in
At least one embodiment of the present disclosure further provides a question recommendation device.
For example, in an example, the set acquisition circuit 600 is configured to obtain a candidate question set of a user, and the candidate question set includes a plurality of candidate questions. For example, the set acquisition circuit 600 includes a knowledge base access circuit 610, an information acquisition circuit 620, and a candidate set generation circuit 630. For example, the knowledge base access circuit 610 is configured to access a data knowledge base including a plurality of knowledge question sets. For example, the information acquisition circuit 620 is configured to obtain the basic user information and establish a user tag set based on the basic user information. For example, the candidate set generation circuit 630 is configured to associate the user tag set with the data knowledge base and obtain the candidate question set according to the plurality of knowledge question sets, and the candidate question set includes a plurality of candidate questions. For example, the behavior analysis circuit 640 is configured to obtain user behavior data and obtain a user interest parameter based on the user behavior data. For example, the feature generation circuit 650 is configured to obtain at least one similarity feature between each candidate question of the plurality of candidate questions and the user interest parameter based on the user interest parameter and the plurality of candidate questions. For example, the question sorting circuit 660 is configured to sort the plurality of candidate questions to obtain a question sequence based on basic user information, the plurality of candidate questions, and the at least one similarity feature. For example, the recommendation circuit 670 is configured to recommend at least one candidate question in the question sequence to the user based on an order of the question sequence.
For example, the specific operations that the knowledge base access circuit 610, the information acquisition circuit 620, the candidate set generation circuit 630, the behavior analysis circuit 640, the feature generation circuit 650, the question sorting circuit 660, and the recommendation circuit 670 are configured to perform may be understood with reference to the related descriptions of the question recommendation methods 10 and 50 provided by at least one embodiment of the present disclosure described above, and details are not described herein again.
For example, in at least one embodiment of the present disclosure, the question sorting circuit 660 in the question recommendation device 60 includes a question sorting sub-circuit 661. The question sorting sub-circuit 661 is configured to, by using a ranking model, combine the basic user information, the plurality of candidate questions, and the at least one similarity feature to form an input feature vector of the ranking model, obtain a score corresponding to each candidate question of the plurality of candidate questions, and sort the plurality of candidate questions according to a value of the score corresponding to each candidate question of the plurality of candidate questions, so as to obtain the question sequence.
For example, the specific operations that the question sorting sub-circuit 661 is configured to perform may be understood with reference to the related descriptions of the question recommendation methods 10 and 50 provided by at least one embodiment of the present disclosure described above, and details are not described herein again.
For example, in at least one embodiment of the present disclosure, the question recommendation device 60 further includes a knowledge base establishing circuit 601. The knowledge base establishing circuit 601 is configured to retrieve a data set from a network and classify the data set according to intention to form the plurality of knowledge question sets, so as to establish the data knowledge base.
For example, the specific operations that the knowledge base establishing circuit 601 is configured to perform may be understood with reference to the related descriptions of the question recommendation method 50 provided by at least one embodiment of the present disclosure described above, and details are not described herein again.
For example, in at least one embodiment of the present disclosure, the candidate set generation circuit 630 in the question recommendation device 60 includes a candidate set generation sub-circuit 631. The candidate set generation sub-circuit 631 is configured to establish a mapping relationship between the user tag set and the standard question in the data knowledge base, match the user tag set with the standard question in the data knowledge base, and combine the knowledge question set corresponding to the matched standard question to form the candidate question set.
For example, the specific operations that the candidate set generation sub-circuit 631 is configured to perform may be understood with reference to the related descriptions of the question recommendation methods 10 and 50 provided by at least one embodiment of the present disclosure described above, and details are not described herein again.
For example, in at least one embodiment of the present disclosure, the behavior analysis circuit 640 in the question recommendation device 60 includes a behavior analysis sub-circuit 641. The behavior analysis sub-circuit 641 is configured to analyze the user behavior data and convert a question clicked by the user or a word or sentence which the user is interested in into the user interest parameter.
For example, the specific operations that the behavior analysis sub-circuit 641 is configured to perform may be understood with reference to the related descriptions of the question recommendation methods 10 and 50 provided by at least one embodiment of the present disclosure described above, and details are not described herein again.
For example, in at least one embodiment of the present disclosure, the feature generation circuit 650 in the question recommendation device 60 includes a feature generation sub-circuit 651. The feature generation sub-circuit 651 is configured to, by using at least one similarity matching model, obtain the at least one similarity feature between each candidate question and the user interest parameter based on the user interest parameter and the plurality of candidate questions.
For example, the specific operations that the feature generation sub-circuit 651 is configured to perform may be understood with reference to the related descriptions of the question recommendation methods 10 and 50 provided by at least one embodiment of the present disclosure described above, and details are not described herein again.
It should be noted that the set acquisition circuit 600, the knowledge base access circuit 610, the information acquisition circuit 620, the candidate set generation circuit 630, the behavior analysis circuit 640, the feature generation circuit 650, the question sorting circuit 660, the recommendation circuit 670, the feature generation sub-circuit 651, the behavior analysis sub-circuit 641, the candidate set generation sub-circuit 631, the question sorting sub-circuit 661, and the knowledge base establishing circuit 601 in the embodiments of the present disclosure can be implemented as hardware such as a processor, a controller, or the like, software that can implement related functions, or a combination of hardware and software. The embodiments of the present disclosure do not limit the specific implementation manners.
It should also be noted that, in the embodiments of the present disclosure, the question recommendation device 60 may further include more circuits, and is not limited to the set acquisition circuit 600, the knowledge base access circuit 610, the information acquisition circuit 620, the candidate set generation circuit 630, the behavior analysis circuit 640, the feature generation circuit 650, the question sorting circuit 660, the recommendation circuit 670, the feature generation sub-circuit 651, the behavior analysis sub-circuit 641, the candidate set generation sub-circuit 631, the question sorting sub-circuit 661, and the knowledge base establishing circuit 601 described above, which can be determined according to actual needs, and the embodiments of the present disclosure are not limited in this aspect.
It should be understood that the question recommendation device 60 provided by the embodiments of the present disclosure can implement the aforementioned question recommendation methods 10 and 50, and can also achieve similar technical effects to those of the aforementioned question recommendation methods 10 and 50, and details are not described herein again.
At least one embodiment of the present disclosure further provides a question recommendation system.
For example, the above-mentioned operations that the question recommendation server 720 is configured to perform may be understood with reference to the question recommendation methods 10 and 50 provided by at least one embodiment of the present disclosure, and details are not described herein again.
For example, in an example, the terminal 710 included in the question recommendation system 70 may be implemented as a client terminal (such as a mobile phone, a computer, etc.), and the question recommendation server 720 may be implemented as a server terminal (such as a server).
For example, in an example, as illustrated in
The question recommendation system 70 provided by at least one embodiment of the present disclosure can implement the question recommendation methods 10 and 50 provided by the foregoing embodiments, and can also achieve similar technical effects as those of the question recommendation methods 10 and 50 provided by the foregoing embodiments, and details are not described herein again.
At least one embodiment of the present disclosure further provides an electronic device.
For example, the memory 820 and the processor 810 may be provided on the server terminal (or the cloud), for example, set in the aforementioned question recommendation server 720, so as to execute one or more steps in the question recommendation method described in
For example, the processor 810 may be a central processing unit (CPU), a digital signal processor (DSP), or other forms of processing units with data processing capabilities and/or program executing capabilities, such as a field programmable gate array (FPGA), etc. For example, the central processing unit (CPU) may be an X86 or ARM architecture, etc. The processor 810 may be a general-purpose processor or a special-purpose processor, and may control other components in the electronic device 80 to perform desired functions.
For example, the memory 820 may include any combination of one or more computer program products, and the computer program product may include various forms of computer-readable storage media, such as a volatile memory and/or a non-volatile memory. For example, the volatile memory may include a random access memory (RAM) and/or a cache memory. For example, the non-volatile memory may include a read-only memory (ROM), a hard disk, an erasable programmable read-only memory (EPROM), a portable compact-disk read-only memory (CD-ROM), a USB memory, a flash memory, etc. One or more computer program modules 821 may be stored in the computer-readable storage medium, and the processor 810 may run the one or more computer program modules 821 to implement various functions of the electronic device 80. The computer-readable storage medium may also store various application programs and various data, such as various data used and/or generated by the application programs. The specific functions and technical effects of the electronic device 80 may be understood with reference to the above descriptions of the question recommendation method, and details are not described herein again.
As illustrated in
Generally, the following devices can be connected to the I/O interface 950: the input device 960 including, for example, a touch screen, a touch pad, a keyboard, a mouse, a camera, a microphone, an accelerometer, a gyroscope, or the like; the output device 970 including, for example, a liquid crystal display (LCD), a speaker, a vibration device, or the like; the storage device 980 including, for example, a magnetic tape, a hard disk, or the like; and the communication device 990. The communication device 990 may allow the display terminal 900 to perform wireless or wired communication with other electronic devices to exchange data. Although
At least one embodiment of the present disclosure further provides a non-volatile readable storage medium.
For example, the storage medium may be any combination of one or more computer-readable storage media. For example, one computer-readable storage medium contains computer-readable program codes for obtaining the candidate question set of the user, another computer-readable storage medium contains computer-readable program codes for obtaining user behavior data and obtaining the user interest parameter based on the user behavior data, still another computer-readable storage medium contains computer-readable program codes for obtaining at least one similarity feature between each candidate question of the plurality of candidate questions and the user interest parameter based on the user interest parameter and the plurality of candidate questions, and further still another computer-readable storage medium contains computer-readable program codes for sorting the plurality of candidate questions to obtain a question sequence based on basic user information, the plurality of candidate questions, and the at least one similarity feature. Certainly, each of the above-mentioned program codes can also be stored in the same computer-readable medium, which is not limited in the embodiments of the present disclosure. For example, when the program codes are read by a computer, the computer can execute the program codes stored in the computer storage medium and execute, for example, the question recommendation method provided by any one of the embodiments of the present disclosure.
For example, the storage medium may include a memory card of a smart phone, a storage component of a tablet computer, a hard disk of a personal computer, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM), a portable compact-disk read-only memory (CD-ROM), a flash memory, or any combination of the foregoing storage media, and may also be other suitable storage media. For example, the readable storage medium may also be the memory 820 in
It should be noted that the storage medium 100 can be applied to the question recommendation server 720, and the technician can make a selection according to specific scenarios, which is not limited herein.
For example, the user terminal 310 may be a computer 310-1 or a portable terminal 310-2 illustrated in
For example, the network 320 may be a single network, or a combination of at least two different networks. For example, the network 320 may include, but is not limited to, one or a combination of a local area network, a wide area network, a public network, a private network, the Internet, a mobile communication network, and the like.
For example, the server 330 may be a single server or a server group, and each server in the server group is connected through a wired network or a wireless network. For example, the wired network may use the twisted pair, coaxial cable, optical fiber transmission, or the like for communication, and the wireless network may use the 3G/4G/5G mobile communication network, Bluetooth, Zigbee, WiFi, or the like for communication. The present disclosure does not limit the types and functions of the network here. The server group can be centralized, such as a data center, or can also be distributed. The server may be local or remote. For example, the server 330 may be a general-purpose server or a dedicated server, and may be a virtual server or a cloud server.
For example, the database 340 may be used to store various data used, generated, and output from the work of the user terminal 310 and the server 330. The database 340 may be connected to or communicate with the server 330 or a part of the server 330 via the network 320, may be directly connected to or communicate with the server 330, or may be connected to or communicate with the server 330 through a combination of the above two methods. In some embodiments, the database 340 may be a stand-alone device. In other embodiments, the database 340 may also be integrated in at least one of the user terminal 310 and the server 330. For example, the database 340 may be set on the user terminal 310 or on the server 330. For another example, the database 340 may also be distributed, a part of the database 340 is set on the user terminal 310, and the other part of the database 340 is set on the server 330.
For example, in an example, firstly, the user terminal 310 (for example, the user's mobile phone) may send the request data to the server 330 via the network 320 or other technologies (for example, Bluetooth communication, infrared communication, etc.). Next, the server 330 obtains the candidate question set of the user in response to the request data, and the candidate question set includes candidate questions. Then, the server 330 obtains user behavior data, and obtains the user interest parameter based on the user behavior data. For example, the user behavior data is transmitted from the user terminal 310 to the server 330 via the network 320. Then, the server 330 obtains at least one similarity feature between each candidate question of the candidate questions and the user interest parameter based on the user interest parameter and the candidate questions. Then, the server 330 sorts the candidate questions based on the basic user information, the candidate questions, and the at least one similarity feature, so as to obtain the question sequence, and then sends the first N candidate questions in the question sequence to the user terminal 310 via the network 320 or other technologies (for example, Bluetooth communication, infrared communication, etc.). Finally, the user terminal 310 receives and displays the first N candidate questions from the server 330.
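The server-side sequence described above can be sketched end to end as follows. This is a simplified illustration under several assumptions: the user interest parameter is approximated by the text of clicked questions, a single token-level Jaccard similarity stands in for the similarity features, and `score_fn` is a hypothetical placeholder for the trained ranking model.

```python
from typing import Callable, Dict, List, Sequence

ScoreFn = Callable[[Dict[str, int], str, List[float]], float]


def jaccard_similarity(a: str, b: str) -> float:
    """Token-level Jaccard similarity between two texts."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0


def handle_request(
    user_info: Dict[str, int],
    candidates: Sequence[str],
    clicked_questions: Sequence[str],
    score_fn: ScoreFn,
    n: int,
) -> List[str]:
    """Return the first n candidate questions to send back to the terminal."""
    # User interest parameter derived from behavior data (assumed: clicked text).
    interest = " ".join(clicked_questions)
    scored = []
    for question in candidates:
        features = [jaccard_similarity(question, interest)]
        scored.append((score_fn(user_info, question, features), question))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [question for _, question in scored[:n]]


def similarity_only_score(info: Dict[str, int], question: str,
                          features: List[float]) -> float:
    """Placeholder for the ranking model: uses the similarity feature alone."""
    return features[0]


result = handle_request(
    user_info={"gender": 0, "diabetes": 1},
    candidates=["foot care tips", "insulin dose timing"],
    clicked_questions=["insulin dose"],
    score_fn=similarity_only_score,
    n=1,
)
print(result)
```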
In the present disclosure, the term “plurality” refers to two or more than two unless specifically defined otherwise.
Those skilled in the art may easily think of other embodiments of the present disclosure after considering the specification and practicing the disclosure disclosed herein. The present disclosure is intended to cover any variations, uses, or adaptive changes of the present disclosure. These variations, uses, or adaptive changes follow the general principles of the present disclosure and include common knowledge or conventional technical means in the technical field that are not disclosed in the present disclosure. The description and the embodiments are exemplary only, and the scope and spirit of the present disclosure are pointed out by the following claims.
It should be understood that the present disclosure is not limited to the precise structure that has been described above and illustrated in the drawings, and various modifications and changes can be made without departing from the scope. The protection scope of the present disclosure should be based on the protection scope of the claims.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2020/093390 | 5/29/2020 | WO | 00 |