This application claims the conventional priority based on Japanese patent application serial No. 2005-266410, filed on Sep. 14, 2005, the disclosure of which is incorporated herein by reference.
1. Field of the Invention
The present invention generally relates to a data display technique, and particularly to an apparatus and method for displaying data, and a data display program for displaying the data corresponding to the ordered item (for example, year). More particularly, the invention displays the transition in the number of cases released of the document data for every keyword contained in the released document data.
2. Description of the Related Art
Research institutes of universities and enterprises publish useful research every year at annual meetings or in treatises.
The number of cases released peaks temporarily in the fourth year for the annual meeting and in the sixth year for the treatises, and falls temporarily in the sixth and eighth years, respectively. That is, the peak and the fall occur in the treatises just two years after they occur in the annual meeting. This may be because treatises take considerable time for contribution, peer review and printing, so that research is published in the treatises later than at the annual meeting even when it is carried out in the same period.
The data (annual publication data) of the number of cases released in each year for each research institute can be displayed in a table format, employing a technique for displaying the inputted data in the table format as described in a non-patent document (refer to Operation handbook, Standard Excel, Bible for all functions, 2003, Yoshinori Murata, Gijutsu Hyoron-Sha, 2004.2.1 published).
Conventionally, there is provided a technique for manually sorting the annual publication data of the documents containing a certain keyword (for example, research institute name, research field name), for example in descending order of the number of cases released.
However, with the conventional technique, it is difficult to automatically display the transition in the number of cases released for every keyword to grasp it at a glance. For example, if the research institute is adopted as the keyword, it is not possible to automatically display visually the transition in the number of cases released for each research institute to grasp at a glance a trend in the number of cases released, such as a tendency in which the number of cases released recently increases or decreases in the research institute.
Also, for example, if the research field is adopted as the keyword, it is not possible with the conventional technique to display the data so that one can grasp at a glance whether the number of cases released in each research field has recently tended to increase or decrease.
It is an object of the present invention to provide an apparatus for displaying data that automatically displays the data (for example, numerical data) corresponding to an ordered item (for example, year) related to a description data (for example, document) containing a data item (for example, keyword). It is another object of the present invention to provide a method for displaying data that automatically displays the data (for example, numerical data) corresponding to an ordered item (for example, year) related to a description data (for example, document) containing a data item (for example, keyword). It is still another object of the present invention to provide a data display program that automatically displays the data (for example, numerical data) corresponding to an ordered item (for example, year) related to a description data (for example, document) containing a data item (for example, keyword). More particularly, it is an object of the invention to automatically display the transition in the number of cases released for every keyword in a format in which the trend in the number of cases released for the documents containing each keyword can be grasped at a glance.
The apparatus for displaying data of the present invention displays data corresponding to an ordered item. The apparatus comprises a data input unit inputting the data corresponding to the ordered item related to a description data containing a data item, a reference value calculation unit calculating, for each of the data item, a reference value that is referenced in sorting the inputted data related to the description data containing each of the data item, based on the inputted data related to the description data containing each of the data item, a data sort unit sorting the data related to the description data containing each of the data item in the ascending or descending order of the calculated reference values, and a data display unit creating a display data based on the sorted data and displaying the created display data on a screen.
Preferably, the apparatus for displaying data of the present invention displays numerical data corresponding to an ordered item. The apparatus comprises a data input unit inputting the numerical data corresponding to the ordered item related to a description data containing a data item, a reference value calculation unit calculating, for each of the data item, a reference value that is referenced in sorting the numerical data, based on the input numerical data, a data sort unit sorting the numerical data related to the description data containing each of the data item in the ascending or descending order of the calculated reference values, and a data display unit creating a display data based on the sorted numerical data and displaying the created display data on a screen.
Preferably, in the apparatus of the present invention, the reference value calculation unit calculates the reference value, based on the order value of the ordered item.
Preferably, in the apparatus of the present invention, the reference value calculation unit calculates, as the reference value, the average of the mean value, the mode value and the median of the order values of the ordered items.
Preferably, in the apparatus of the present invention, the reference value calculation unit calculates, as the reference value, the mean value of the order values of the ordered items.
Preferably, in the apparatus of the present invention, the reference value calculation unit calculates, as the reference value, the mode value of the order values of the ordered items.
Preferably, in the apparatus of the present invention, the reference value calculation unit calculates, as the reference value, the median of the order values of the ordered items.
Preferably, in the apparatus of the present invention, the data display unit displays the data in which the reference value calculated for each of the data item is plotted as a part of the display data on the screen.
Preferably, in the apparatus of the present invention, the data display unit further displays the display data on the screen in a format where the reference value calculated for each of the data item is written down with the display data.
Preferably, in the apparatus of the present invention, the data input unit comprises a data accumulation unit accumulating an inputted bibliography data, a data item extraction unit extracting a data item from the accumulated bibliography data, and a data creation unit creating the data corresponding to the ordered item related to the description data containing each of the data item, based on the extracted data item and the bibliography data, and the reference value calculation unit calculates, for each of the data item, the reference value that is referenced in sorting the data related to the description data containing each of the data item, based on the created data related to the description data containing each of the data item.
The method for displaying data of the present invention displays data corresponding to an ordered item. The method comprises inputting the data corresponding to the ordered item related to the description data containing a data item, calculating, for each of the data item, a reference value that is referenced in sorting the inputted data related to the description data containing each of the data item, based on the inputted data related to the description data containing each of the data item, sorting the data related to the description data containing each of the data item in the ascending or descending order of the calculated reference values, creating a display data based on the sorted data, and displaying the created display data on a screen.
The program for displaying data of the present invention displays data corresponding to an ordered item. The program causes a computer to execute inputting the data corresponding to the ordered item related to a description data containing a data item, calculating, for each of the data item, a reference value that is referenced in sorting the inputted data related to the description data containing each of the data item, based on the inputted data related to the description data containing each of the data item, sorting the data related to the description data containing each of the data item in the ascending or descending order of the calculated reference values, creating a display data based on the sorted data, and displaying the created display data on a screen.
The apparatus, method, and program of the present invention automatically sort the data (for example, numerical data) corresponding to the ordered item (for example, year) related to the description data (for example, document) containing each keyword based on the reference value and display the sorted data on the screen.
More specifically, the apparatus, method, and program of the present invention automatically sort the data (annual publication data) of the number of cases released in each year for the document containing individual keywords based on the reference value and display the transition in the number of cases released for a plurality of keywords after sorting. Therefore, it is possible to grasp a trend in the number of cases released for each keyword at a glance.
The data input unit 11 inputs a data (for example, numerical data) corresponding to an ordered item related to a description data (for example, document) containing a data item (for example, keyword). The data input unit 11 inputs, for example, an annual publication data as shown in
Also, the data input unit 11 inputs, for example, the data as shown in
In the embodiment of the present invention, the data input unit 11 may input a bibliography data, create a data (for example, data of the number of cases released in each year for the document containing individual keywords (annual publication data)) corresponding to the ordered item related to the description data (for example, document) containing a data item (for example, keyword), based on the input bibliography data, and output the created data.
The reference value calculation unit 12 calculates, for each of the data item, the reference value that is referenced in sorting the output data based on the data corresponding to the ordered item related to the description data (for example, document) containing each of the data item, which is outputted from the data input unit 11. A calculation example of the reference value will be described later.
The data sort unit 13 sorts the data corresponding to the ordered item related to the description data (for example, document) containing each of the data item, based on the reference value calculated by the reference value calculation unit 12. The data sort unit 13 sorts the data (for example, annual publication data) corresponding to the ordered item for each of the data item in the ascending or descending order of the reference values.
The data display unit 14 creates the data to be displayed (display data) based on the data sorted by the data sort unit 13, and displays the created display data.
The data input unit 11 may include data accumulation unit 111, data item extraction unit 112, and data creation unit 113, for example, as shown in
In this embodiment of the invention, the data input unit 11 need not include the data item extraction unit 112 and the data creation unit 113; in that case, it may directly output the data (for example, annual publication data) corresponding to the ordered item related to the description data (for example, document) containing a data item (for example, keyword), which is inputted into the data accumulation unit 111.
The data display unit 14 includes display data creation unit 141 and display unit 142. The display data creation unit 141 creates a display data to be displayed, based on the data (for example, annual publication data) corresponding to the ordered item for each of the data items sorted by the data sort unit 13. The display unit 142 displays the display data created by the display data creation unit 141 on the screen.
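As an illustration of what the display data creation unit 141 might hand to the display unit 142, the following minimal sketch arranges already-sorted data into a grid whose rows follow the sorted keyword order and whose columns are the years. The function name `build_display_grid`, the input layout and the sample values are assumptions made for this sketch and do not come from the embodiment; a contour, kinked line or bubble chart could then be drawn from such a grid.

```python
def build_display_grid(sorted_items, years):
    """Arrange sorted per-keyword counts into row labels and a 2-D grid.

    sorted_items: list of (keyword, {year: count}) pairs, already sorted
                  by reference value (hypothetical input layout).
    years:        list of year order values that form the columns.
    """
    labels = [keyword for keyword, _ in sorted_items]
    # A year with no recorded cases is shown as 0.
    grid = [[counts.get(year, 0) for year in years]
            for _, counts in sorted_items]
    return labels, grid


# Hypothetical sample data, not taken from the figures.
sample_items = [("B university", {1: 3, 2: 1}),
                ("A university", {2: 4, 3: 2})]
labels, grid = build_display_grid(sample_items, [1, 2, 3])
```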
First of all, an annual publication data is inputted into the data input unit 11 (step S1). For example, the annual publication data for each keyword (each research institute in
Next, the reference value calculation unit 12 calculates a reference value for each keyword, based on the inputted annual publication data (step S2). For example, the mean value, the mode value and the median for the years of publication are obtained, and the mean of these three values is calculated as the reference value. Herein, the mean value for the years of publication is the sum, over all years, of the year of publication multiplied by the number of cases released in that year, divided by the total number of cases released. For example, for A university in
Also, the mode value for the years of publication is the value of year in which the number of cases released is largest. For example, for A university, the mode value is the value “6” of the year (sixth year) in which the number of cases released is largest, “10”. Also, the median for the years of publication is the value of year to which the middle data belongs in the data of the number of cases released. For example, for A university, the median is the value “6” of the year (sixth year) to which the ninth data as the middle data belongs in the data of the number of cases released from 1 to 17.
Accordingly, the reference value for A university is calculated as (5.29+6+6)/3=5.76.
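The calculation at step S2 can be sketched as follows. This is only an illustrative sketch: the function names are hypothetical, and the per-year counts for A university are invented so that they agree with the totals quoted in the text (17 cases in all, 10 of them in the sixth year); they are not the data of the figure. How ties in the mode and an even number of cases in the median are handled is not stated in the text, so the choices below (earliest year, lower middle) are assumptions.

```python
def mean_year(counts):
    """Weighted mean of the year values, weighted by cases released."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no cases released")
    return sum(year * n for year, n in counts.items()) / total


def mode_year(counts):
    """Year with the largest number of cases (earliest year on ties: assumption)."""
    return max(sorted(counts), key=lambda year: counts[year])


def median_year(counts):
    """Year to which the middle case belongs (lower middle if even: assumption)."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no cases released")
    middle = (total + 1) // 2  # 1-based position of the middle case
    running = 0
    for year in sorted(counts):
        running += counts[year]
        if running >= middle:
            return year


def reference_value(counts):
    """Average of the mean, mode and median of the years of publication."""
    return (mean_year(counts) + mode_year(counts) + median_year(counts)) / 3


# Hypothetical counts {year: cases}; chosen only to match the quoted totals.
a_university = {4: 5, 5: 2, 6: 10}
ref = reference_value(a_university)
```

With these assumed counts, the mean rounds to 5.29, the mode and the median are both 6, and the reference value rounds to 5.76, which agrees with the worked example.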
Of course, in the embodiment of the present invention, a calculation method for the reference value is not limited to the above described method, but the calculated mean value, mode value or median for the years of publication may be directly employed as the reference value, or the mean value, mode value and median for the years of publication may be appropriately combined to calculate the reference value based on a predetermined calculation method.
In the embodiment of the present invention, the data corresponding to the ordered item (for example, year) inputted into the data input unit 11 is not limited to the numerical data, but the inputted data may be a language representation, for example. That is, when the data as shown in
Of course, in the embodiment of the present invention, the reference value may be calculated by converting each language representation inputted as the data corresponding to the ordered item (for example, year) into the predetermined numerical value associated with each language representation, and employing the numerical value after conversion.
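The conversion of language representations into numerical values might look like the following minimal sketch. The labels ("none", "small", "medium", "large"), the numbers assigned to them and the function names are all assumptions for illustration; the text only states that each representation is associated with a predetermined numerical value.

```python
# Hypothetical mapping from language representations to numerical values.
LABEL_TO_VALUE = {"none": 0, "small": 1, "medium": 2, "large": 3}


def to_numeric(labels_by_year):
    """Replace each language representation with its predetermined number."""
    return {year: LABEL_TO_VALUE[label]
            for year, label in labels_by_year.items()}


def weighted_mean_year(values_by_year):
    """Mean of the year values, weighted by the converted numbers."""
    total = sum(values_by_year.values())
    if total == 0:
        raise ValueError("all converted values are zero")
    return sum(year * v for year, v in values_by_year.items()) / total


# Hypothetical input: year order value -> language representation.
converted = to_numeric({1: "small", 2: "large"})
mean_value = weighted_mean_year(converted)
```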
Next, the data sort unit 13 sorts the annual publication data for each keyword, based on the calculated reference value (step S3). For example, the data sort unit 13 sorts the annual publication data for each keyword in the ascending order of the reference values. The data sort unit 13 may sort the annual publication data for each keyword in the descending order, based on the calculated reference value.
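Step S3 can be sketched as a sort keyed on the reference value. The function name, the data layout and the sample values below are hypothetical. Because Python's `sorted` is stable, keywords with equal reference values keep their input order in this sketch.

```python
def sort_by_reference(annual_data, reference, descending=False):
    """Return (keyword, data) pairs ordered by each keyword's reference value.

    annual_data: {keyword: per-year data} (hypothetical layout).
    reference:   {keyword: reference value calculated at step S2}.
    """
    return sorted(annual_data.items(),
                  key=lambda item: reference[item[0]],
                  reverse=descending)


# Hypothetical sample values, not taken from the figures.
annual_data = {"A university": [0, 2, 5], "B university": [4, 1, 0],
               "C university": [0, 0, 6]}
reference = {"A university": 5.76, "B university": 3.2, "C university": 7.1}

ascending_names = [k for k, _ in sort_by_reference(annual_data, reference)]
descending_names = [k for k, _ in
                    sort_by_reference(annual_data, reference, descending=True)]
```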
Next, the data display unit 14 creates a display data based on the sorted annual publication data and displays the created display data on a screen (step S4). The data display unit 14 converts the data of the number of cases released in each year for each research institute into contour line data, and displays a screen where the data of the number of cases released in the treatises in each year for each research institute is represented with the contour line, as shown in
In a screen display example as shown in
As seen from
In the embodiment of the present invention, at step S4, the data display unit 14 may convert the data of the number of cases released in each year for each keyword into contour line data, create the kinked line data in which the reference value is plotted for each keyword, and display the contour line data of the number of cases released in each year for each keyword and the kinked line data of the reference value for each keyword as indicated by the bold kinked line on the same screen, as shown in
Also, in the embodiment of the invention, at step S4, the data display unit 14 may convert the data of the number of cases released in each year for each keyword into the kinked line data, and display a screen in which data of the number of cases released in each year for each keyword is represented as the kinked line, as shown in
Also, in the embodiment of the present invention, at step S4, the data display unit 14 may display the data of the number of cases released in each year for each keyword as a bubble chart on the screen, as shown in
Various screen display examples of the display data are described below.
The screen display example as shown in
However, as will be apparent from the median for the years of publication for each research institute, a lot of research institutes have the same score. Accordingly, the research institutes having the same score may be arranged in any order. Hence, to avoid the same score, the median for the years of publication is not simply employed, but the annual publication data may be rearranged and displayed in terms of the reference value calculated based on the mode value, the mean value and the median, as in the screen display example of
First of all, the bibliography data is inputted into the data input unit 11 (step S11). For example, the bibliography data as shown in
To extract the keyword, Chasen, a well-known morphological analysis tool, is employed. Chasen decomposes a Japanese sentence into words and estimates the part of speech of each word. An example of dividing a Japanese sentence using Chasen is described below.
For example, if the Japanese sentence "Gakko he iku" ("go to school") is inputted, the following result is obtained.
(gakko) (gakko) (gakko) noun-general
(he) (he) (he) particle-case particle-general
(iku) (iku) (iku) verb-independent (godan kagyo) long consonant, fundamental form
EOS
The sentence is decomposed with one word in each line, and the information of reading and part of speech is appended to each word.
As a part-of-speech tagging system for English, the tagger of Brill (Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging, Computational Linguistics, Vol. 21, No. 4, p. 543-565, 1995) is well known; it estimates the part of speech of each word in an English sentence.
In the embodiment of the invention, each morpheme obtained by decomposing the title by Chasen is extracted as the keyword.
For example, in the embodiment of the invention, among the morphemes obtained by decomposing the titles by Chasen, only those morphemes that appear in the bibliography data a number of times greater than or equal to a threshold may be extracted as keywords.
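The threshold-based extraction can be sketched as below. Note that this sketch does not call Chasen: a plain whitespace split on English titles stands in for the morphological analysis, and the function names, the sample titles and the threshold value are assumptions for illustration.

```python
from collections import Counter


def tokenize(title):
    """Stand-in for morphological analysis: lowercase whitespace split."""
    return title.lower().split()


def extract_keywords(titles, threshold=2):
    """Return the tokens that occur at least `threshold` times over all titles."""
    occurrences = Counter(token for title in titles
                          for token in tokenize(title))
    return sorted(token for token, n in occurrences.items() if n >= threshold)


# Hypothetical titles, not taken from the bibliography data of the figures.
titles = ["Statistical machine translation",
          "Dictionary based translation",
          "Morpheme analysis"]
keywords = extract_keywords(titles, threshold=2)
```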
Next, the data creation unit 113 creates and outputs the annual publication data based on the extracted keyword and bibliography data (step S13). At step S13, the data of the number of cases released in each year for the document containing the keyword extracted at step S12 in the title is created as the annual publication data for the keyword, for example.
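Step S13 can be illustrated with the following minimal sketch, which counts, for each keyword, the documents per year whose title contains that keyword. The record layout `(year, title)`, the function name and the sample records are hypothetical, and a whitespace split again stands in for the morphological analysis of the titles.

```python
def annual_publication_data(records, keywords):
    """Count, per keyword and year, the documents whose title has the keyword.

    records:  iterable of (year, title) pairs (hypothetical layout).
    keywords: keywords extracted at step S12.
    Returns {keyword: {year: number of cases released}}.
    """
    data = {keyword: {} for keyword in keywords}
    for year, title in records:
        tokens = set(title.lower().split())
        for keyword in keywords:
            if keyword in tokens:
                data[keyword][year] = data[keyword].get(year, 0) + 1
    return data


# Hypothetical bibliography records: (year order value, title).
records = [(1, "Statistical machine translation"),
           (1, "Dictionary based translation"),
           (2, "Translation memory"),
           (2, "Morpheme analysis")]
annual = annual_publication_data(records, ["translation", "morpheme"])
```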
Next, the reference value calculation unit 12 calculates the reference value for each keyword, based on the annual publication data (step S14). The data sort unit 13 then sorts the annual publication data for each keyword, based on the calculated reference value (step S15).
Finally, the data display unit 14 creates a display data based on the sorted annual publication data and displays the created display data on a screen (step S16).
As shown in
The publication of “dialog”, “morpheme”, “probability”, “dictionary” and “statistics” in the annual meeting was active in the earlier years, as shown in
In the embodiment of the invention, the annual publication data may be created, employing the bibliography data containing the predetermined keyword. As an example, the research trend for translation is minutely investigated, and the invention is applied, employing only the data containing the translation as the title of the bibliography data published in the annual meeting. Its results are shown in
Also seeing
Moreover, the present invention may be practiced as a program read and executed by a computer. The program implementing the present invention may be stored in an appropriate computer-readable recording medium such as a portable memory, a semiconductor memory or a hard disk, and may be provided on that recording medium or distributed over a network via a communication interface.
Number | Date | Country | Kind |
---|---|---|---|
2005-266410 | Sep 2005 | JP | national |