This application claims priority under 35 U.S.C. §119 to Korean Patent Application No. 10-2014-0002410, filed on Jan. 8, 2014, the disclosure of which is incorporated herein by reference in its entirety.
The present invention relates to an opinion analyzing system and method that analyze an opinion change based on an elapse of time and provide an analysis result.
As the number of users of social media such as Blog, Twitter, Facebook, etc. increases rapidly, texts expressing an opinion (expression) about a specific event, product, and policy are exponentially increasing.
Such an opinion is an important factor for a strategic plan of a company, which plans an event or a product, or an institute that establishes a policy.
In particular, examples of opinions which are changed with time includes a case, which establishes a marketing strategy by checking a change in a company image, and a case which analyzes public opinion about a policy to establish an appropriate counterstrategy.
Moreover, positive or negative images of candidates are shown in social media during an election period, and are analyzed and used as a news article.
An analysis of a large amount of opinion is proposed as an alternative of the existing survey of public opinion which is made for a small-scale population sample, and thus is high in reliability. Therefore, the analysis is expected to be applied to various fields.
However, although an opinion analyzing method is needed, a related art opinion analyzing method cannot analyze a cause by which various opinions are changed with time, and moreover cannot provide a basis of an analysis result.
Accordingly, the present invention provides an opinion analyzing system and method based on an elapse of time, which analyze a change progress of opinion, and provide a basis of an opinion analysis result by using a text and an image, thereby intuitively recognizing opinion information, analyzed from a large number of texts, and an opinion change based on an elapse of time.
In one embodiment, an opinion analyzing server based on an elapse of time includes: a communication unit configured to receive an opinion analysis request signal, including an opinion analysis target keyword and analysis target period information, from a user terminal; a collection unit configured to collect text information including the opinion analysis target keyword from a database according to the opinion analysis request signal which is received through the communication unit; and an opinion analyzing unit configured to analyze the text information collected by the collection unit, and provide opinion information according to the analysis result.
The opinion analyzing unit may classify, by predetermined category, the text information collected by the collection unit, and digitize an amount of the text information by category to analyze the text information.
The opinion analyzing unit may count number of matches between words included in the text information collected by the collection unit and predetermined core words by category, and classify the text information by category according to the count result.
The opinion analyzing server may further include a summary text generating unit configured to generate a summary text according to the text information analysis result of the opinion analyzing unit.
The summary text generating unit may cluster the text information collected by the collection unit by using a clustering algorithm, and generate the summary text by using the clustered text information.
The collection unit may collect the text information from web data which is described in a natural language, and collect image information associated with the opinion analysis target keyword.
The opinion analyzing server may further include a summary image generating unit configured to generate a summary image by using the image information collected by the collection unit according to the text information analysis result of the opinion analyzing unit.
In another embodiment, an opinion analyzing method based on an elapse of time includes: setting a keyword and search period of an opinion analysis target; analyzing text information of an opinion based on the set keyword and search period; and providing, as a graph and a summary text, opinion information about the analysis result of the text information.
The analyzing may include: collecting text information which is generated during the set search period and includes the set keyword; classifying the collected text information by predetermined category; and digitizing an amount of the classified text information by category to analyze the text information.
The analyzing may include: counting number of matches between words included in the collected text information and predetermined words by category; and when the number of matches is equal to or greater than predetermined number of times, classifying the text information as a category corresponding to the predetermined words.
The providing of opinion information may include generating a graph of an amount of the text information by category based on time by using the analysis result of the text information.
The opinion analyzing method may further include collecting image information associated with the keyword, and generating and providing a summary image according to the analysis result of the text information.
The providing of opinion information may include clustering the collected text information by using a clustering algorithm, and selecting and providing representative text information from among pieces of the clustered text information, or listing words, which are included in the clustered text information a predetermined number of times or more, to provide a summary text.
In yet another embodiment, an opinion analyzing system based on an elapse of time includes: a user terminal configured to transmit an opinion analysis request signal including an opinion analysis target keyword and analysis target period information; and an opinion analyzing server configured to, when the opinion analysis request signal is received from the user terminal, collect text information including the opinion analysis target keyword from a database according to the opinion analysis request signal, and analyze the collected text information to generate a summary text of an opinion.
The opinion analyzing server may classify the collected text information by predetermined category, and digitize an amount of the text information by category to analyze the text information.
The opinion analyzing server may compare predetermined words by category and words included in the collected text information to count number of matches therebetween, and when the number of matches is equal to or greater than predetermined number of times, classify the text information as a corresponding category.
The opinion analyzing server may generate a graph of an amount of the text information by category based on time according to the analysis result of the text information.
The opinion analyzing server may select representative text information from among pieces of the text information according to the analysis result of the text information, or list words, which are included in the text information a predetermined number of times or more, to generate the summary text.
The opinion analyzing server may collect image information associated with the opinion analysis target keyword from the database, and generate a summary image by using the collected image information.
Other features and aspects will be apparent from the following detailed description, the drawings, and the claims.
Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.
Referring to
The user terminal 100 transmits an opinion analysis request signal, including an opinion analysis target keyword and analysis target period information, to the opinion analyzing server 200 over the communication network. In this case, the user terminal 100 receives a menu or key manipulation of a user to set the opinion analysis target keyword and a period by using a web browser or a dedicated application, transmits the opinion analysis request signal to the opinion analyzing server 200, and receives an opinion analysis result corresponding to the opinion analysis request signal from the opinion analyzing server 200.
When the opinion analysis request signal is received from the user terminal 100 over the communication network, the opinion analyzing server 200 collects a text information including keyword from the database 300, and analyzes the collected text information to generate a summary text.
Moreover, the opinion analyzing server 200 analyzes opinion by setting the opinion analysis target keyword and the period. The opinion analyzing server 200 generates a summary text or summary image of the opinion analysis result, and transmits the generation result to the user terminal 100 over the communication network. At this time, the opinion analyzing server 200 collects a text or an image from the database 300.
Referring to
The input unit 120 receives a web browser or dedicated application execution signal, keywords, and a period setting signal from the user.
The control unit 130 activates the web browser or the dedicated application signal according to the web browser or dedicated application execution signal input through the input unit 120, and transmits the opinion analysis request signal to the opinion analyzing server 200 through the communication unit 110.
The display unit 140 displays the opinion analysis result, received from the opinion analyzing server 200 over the communication network, according to a control of the control unit 130.
The control unit 130 displays the opinion analysis result, received from the opinion analyzing server 200 over the communication network, in the display unit 140, and stores the opinion analysis result in the storage unit 150.
Referring to
The communication unit 210 interoperates with the user terminal 100 to receive the opinion analysis request signal including the opinion analysis target keyword and the analysis target period information from the user terminal, and supplies the opinion analysis request signal to the collection unit 220.
The collection unit 220 collects a document including keywords, text information, and an image from the database 300 of
The opinion analyzing unit 230 performs an opinion analysis by using the document or text information and image which are supplied from the collection unit 220.
The summary text generating unit 240 and the summary image generating unit 250 respectively generate a summary text and a summary image, which are associated with opinion, according to an analysis result of the opinion analyzing unit 230, and transmit the summary text and the summary image to the user terminal 100 through the communication unit 210. Here, text information which is an opinion (various kinds of sensitivities or views) analysis target is collected from all web data, which is described in a natural language, such as news, Blog Twitter, Facebook, or the like.
The opinion analyzing server 200 according to an embodiment of the present invention compares
predetermined core words by category and words included in the text information collected by the collection unit 220, and when the number of matches between the core words and the words included in the text information is equal to or greater than the predetermined number of times, the opinion analyzing server 200 classifies the text information as a corresponding category, and digitizes an amount of the text information by category to analyze the text information.
However, in the above-described embodiment, for understanding of those skilled in the art, an example of the opinion analysis of the opinion analyzing server 200 according to an embodiment of the present invention has been described above, but the opinion analysis according to an embodiment of the present invention is not limited to an embodiment.
An opinion may be classified as two categories, namely, a positive category and a negative category, and may also be classified as categories which are more subdivided than satisfaction, easiness, fear, and disappointment.
The opinion analyzing server 200 digitizes the amount of the text information by category to analyze the text information. For example, the opinion analyzing server 200 may digitize an opinion, which is classified as a category “satisfaction” of the text information including predetermined keywords, in units of a predetermined period (for example, one hour, or a day).
The opinion analyzing server 200 according to an embodiment of the present invention may classify an opinion for text information, such as a Blog post, a Facebook state message, and a Twitter tweet, by using an opinion classifier which is learned through machine learning, and then count the number of text information corresponding to an arbitrary time range.
However, the opinion analysis of the opinion analyzing server 200 according to an embodiment of the present invention is not limited to the above-described embodiment. For example, an opinion may be digitized (for example, 25%, or 40%) by using a rate at which each opinion category occupies whole opinion within an arbitrary time range, without digitizing an absolute amount of each opinion category.
Moreover, the opinion analyzing server 200 generates a graph of the amount of text information by category based on time according to an analysis result of the text information.
The summary text generating unit 240 of the opinion analyzing server 200 selects representative text information from among pieces of collected text information according to the analysis result of the text information, or lists words, which are included in the text information a predetermined number of times or more, to generate a summary text.
In another embodiment, the summary text generating unit 240 clusters text information collected by the collection unit 220 by using a clustering algorithm, and generates the summary text by using the clustered text information.
In this case, the summary text generating unit 240 may change the text information to a word vector or a morpheme vector to apply the clustering algorithm, or calculate an edit distance, such as a Levenshtein distance, from all text information to perform clustering.
In order to apply the clustering algorithm, k-means clustering or k-nearest neighbors clustering may be used as the predetermined number of clusters, and the number of clusters may be dynamically determined through hierarchical clustering.
When the opinion analyzing server 200 selects the representative text information from among the pieces of collected text information, the opinion analyzing server 200 selects text information, which is closest to a centroid of a cluster, as the representative text information of the cluster to generate the summary text.
The summary image generating unit 240 of the opinion analyzing unit 200 generates the summary image by using keywords-related image information which is collected from the database 300 by the collection unit 220.
The user terminal 100 receives an analysis result from the opinion analyzing server 200, and as illustrated in
In the graph of
The graphs of
Referring to
Referring to
Referring to
Referring to
The collection unit 220 of the opinion analyzing server 200 collects text information (for example, a tweet of Twitter) from the database 300, and the opinion analyzing unit 230 analyzes the text information, generates a graph of an opinion change progress, and generates and provides summary texts by category (easiness, sorrow, impression, fear, disappoint, and opposition).
That is, according to
A user, which checks an analysis result by using the display unit 140 of the user terminal 100, can check summary texts of categories “easiness” and “sorrow” to know the reason that an opinion for the categories is dominant. It can be seen that there were a lot of opinions in which human death being small is easy, and there were a lot of opinions in which the accident victims cause sorrow.
According to
According to
According to
The opinion analysis result including the graph and summary text of
Referring to
In step S300, the opinion analyzing method analyzes text information of an opinion including the keyword according to the keyword and search period which are set in steps S100 and S200.
In step S400, the opinion analyzing method provides a graph and a summary text as the analysis result of the text information of the opinion which is obtained in step S300.
Here, step S300 collects the text information which is generated during the search period which is set in step S200 and includes the keyword which is set in step S100.
Step S300 classifies the collected text information by predetermined category, and digitizes an amount of the text information by category to analyze the text information.
In another embodiment, step S300 compares a match between words included in the collected text information and predetermined words by category, counts the number of matches, and when the number of matches is equal to or greater than the predetermined number of times, classifies the text information as a category corresponding to the predetermined words.
However, the above-described embodiment of step S300 is merely for understanding of those skilled in the art, and is not limited to the above-described example. As another example of the text information analyzing method in step S300, the opinion analyzing method according to an embodiment of the present invention may classify an opinion for text information, such as a Blog post, a Facebook state message, and a Twitter tweet, by using an opinion classifier which is learned through machine learning, and then count the number of text information corresponding to an arbitrary time range.
In step S400, the opinion analyzing method generates a graph of an amount of text information by category based on time by using the analysis result of the text information which is obtained in step S300. In this case, the opinion analyzing method generates a graph of a change progress of single opinion based on time as shown in
The opinion analyzing method based on an elapse of time according to an embodiment of the present invention further includes an operation that collects keyword-related image information from a database, and generates and provides a summary image according to the analysis result of step S300.
Moreover, step S400 clusters the collected text information by using the clustering algorithm, and selects and provides representative text information from among pieces of the clustered text information, or lists words, which are included in the clustered text information a predetermined number of times or more, to provide a summary text.
Here, step S400 has been described as only an example for understanding of those skilled in the art, and is not limited to the above-described example.
Referring to
The process clusters the opinion-related document, which is extracted in step S311, by using the clustering algorithm in step S312.
In step S313, the process filters a small or irrelevant document from a plurality of clusters which are obtained in step S312. In step S314, the process selects a representative document of each of the plurality of clusters. In step S315, the process supplies the selected representative document to a user. Here, step S314 selects a document, which is closest to a centroid of a cluster, as the representative document.
Referring to
In step S322, the process clusters a news article title, which is extracted in step S321, by using the clustering algorithm.
Subsequently, in step S323, the process filters out a news article title irrelevant to the clustered news article title which is obtained by using the clustering algorithm.
In step S324, the process selects a representative news article tile of each of a plurality of clusters. In step S325, the process provides the selected representative news article title.
As described above, the opinion analyzing system and method based on an elapse of time analyze and provide a change progress of opinion depending on input keywords and a predetermined period, thereby intuitively determining an opinion change in each of images of an event, a brand, and a person which are recognized by the public.
Moreover, according to the embodiments of the present invention, users' satisfaction for a specific product or service can be easily determined, and it is possible to trace a change process of public opinion for a specific policy. Also, the opinion analyzing system and method may be applied to a marketing or crisis counterstrategy.
Moreover, according to the embodiments of the present invention, opinion is analyzed, and change progress information of the analyzed opinion is provided by using a text or an image, thereby easily determining bases of various opinions.
A number of exemplary embodiments have been described above. Nevertheless, it will be understood that various modifications may be made. For example, suitable results may be achieved if the described techniques are performed in a different order and/or if components in a described system, architecture, device, or circuit are combined in a different manner and/or replaced or supplemented by other components or their equivalents. Accordingly, other implementations are within the scope of the following claims.
Number | Date | Country | Kind |
---|---|---|---|
10-2014-0002410 | Jan 2014 | KR | national |