Field of Invention
The present invention relates to a method and a system for assigning a published subject to one of a plurality of pre-defined fields of knowledge, in particular for assigning a technical or product specification, respectively, to a technical field, using a plurality of pre-defined buzzwords to describe the subject. It also refers to a method and system for finding a published subject in one of a plurality of pre-defined fields of knowledge. The method and system of the present invention can be utilized in particular for technical and patent searches but is not limited to this utilization.
Description of Prior Art
When users search for certain information in the internet, they usually enter one or more buzzwords and receive a one-dimensional list of proposals by internet search engines or e-commerce vendors, whereas the hits are linearly structured from the best match on the top to the least match at the bottom. One-dimensional lists match to computer programs, whereas the human eye prefers to work two dimensionally. With regard to two-dimensionally displayed information mind maps and tag clouds are common. However, these two dimensional presentations of buzzwords lack priority-related structures as of the kind of linear structured lists.
Besides, users that consume information there are more and more users that contribute information in the internet (contributors). The online encyclopedia Wikipedia is one of the most famous examples. The contributors typically work for free. One can imagine that much more users could be motivated to contribute information, if they were paid. So-called crowdsourcing services try to leverage that potential. A crowdsourcing service may consist in preprocessing publications for enterprises by assigning them to topics, furtheron called buzzwords. The crowd that provide the service via the internet, furtheron called experts, need to be provided with tools that enable them to easily find and select buzzwords that apply for an analyzed publication.
One example of such an analysis of publications is the analysis of patent publications found within patent monitoring: Based on patent literature, being found within a monitored IPC class, today the company's patent specialists manually decide whether the publication is interesting for the company and sometimes further on assigns it to company internal buzzwords. The outsourcing of such manual, time consuming tasks often fails, because the search criteria that indicate whether patent publication is interesting for the company are regarded to be company specific. This sweeps off scale effects that might make outsourcing to external Experts profitable. If a way of using publications' assigned buzzwords across companies could be found, the work of assigning patent publications to buzzwords could be outsourced and thus organized in a much more efficient way.
Therefore, solutions to solve the above problems are required. The present invention provides such solution.
It is an object of the present invention, to provide a method and system for supporting and efficiently managing search tasks, in particular in distributed knowledge and crowdsourcing environments.
It is a further object of the invention, to provide a basic configuration for providing an up-to-date knowledge classification scheme and visualizing the same both to contributors and users. More specifically, it is an object to provide a platform which facilitates dynamic updating of such schemes on the one hand and efficient browsing of the knowledge basis on the other.
The present invention solves the set task with a 2-dimensional (or 3-dimensional) buzzword map that arranges buzzwords on the map depending on the frequency of combined appearance with other buzzwords on the map, measured in certain contexts.
According to an aspect of the present invention, a method for assigning a published subject to a field of knowledge comprises that each of the buzzwords is assigned to a 2- or 3-dimensional element which is arranged at a defined location in a 2- or 3-dimensional map, wherein the respective positions of the elements or positional relations of the elements to each other reflect a relation between the contents of the respective buzzwords, and that the plurality of the elements associated to the pre-defined plurality of buzzwords is displayed on a display screen as a 2- or 3-dimensional image, that each of the elements has a pre-defined extension in each of the dimensions of the map, in particular being shaped as an ellipse, circle, rectangle, or square in a 2-dimensional map or as a ellipsoid, sphere, brick, or cube in a 3-dimensional map, and is adapted to be browsed for browsing within the n-dimensional map.
According to another aspect of the invention, a method for finding a published subject in a certain field of knowledge comprises, further to the features of the method mentioned above, that in the image of the map displayed on the display screen a single element or a sub-area or sub-space, respectively, containing plural elements is selected, and that subject or those subjects are displayed in a window on the display screen or on a separate display, to which the selected element or elements is/are associated.
According to a further aspect of the invention, a system for assigning a published subject to a field of knowledge or for finding a subject within a field of knowledge comprises a first database, wherein a set of fields of knowledge is stored; a second database, wherein a set of buzzwords is stored, each assigned to an element with a predetermined location in a 2- or 3-dimensional map, wherein positional relations of elements to each other reflect a contextual relation between the contents of the respective buzzwords; a third database storing a plurality of published subjects, wherein at least one buzzword is assigned to each of the subjects; a search entity for assigning a published subject loaded from the third database to a field of knowledge or for finding a published subject, based on a positional relation of at least one element to at least one other element in the 2- or 3-dimensional map, and the corresponding buzzword, in a field of knowledge; at least one display entity for displaying an image of the 2- or 3-dimensional map with the elements which are assigned to buzzwords, and at least one input entity or browser for providing inputs into the system, wherein the browser is adapted for browsing within the map displayed on the display.
In an embodiment of the invention, a list of all assigned published subjects is established, wherein the respective position of the most relevant buzzword in the 2- or 3-dimensional map or its positional relations to other relevant buzzwords are associated to the subject.
In a further embodiment, buzzwords of different levels of abstraction are used to describe the subject, wherein the respective level is marked-up in the associated element in the 2- or 3-dimensional map, in particular as a pre-defined color of the element or frame structure of a 2-dimensional element or shell structure of a 3-dimensional element.
In an embodiment of the invention, the extensions of the elements are correlated to the frequency of the appearance of the corresponding buzzword in the course of assigning a plurality of published subjects to the pre-defined fields of knowledge and building the 2- or 3-dimensional map.
In a still further embodiment of the invention, arranging the elements in the map starts with an initial set of buzzwords assigned to elements with pre-defined locations in the map, and the map is dynamically updated with each executed assignment of buzzwords to a subject, by adjusting the respective position or positional relations of the elements which are associated to the newly assigned buzzwords. In this regard, the dynamical updating is based on at least one of: a co-existence of buzzwords, a relation strength indicator indicating the strength of a relation between several buzzwords in a newly classified subject, and a confidence indicator indicating the level of confidence of an assignment of a buzzword to a newly classified subject.
In a further refined embodiment, the dynamical updating includes introducing new buzzwords into the assigning procedure and corresponding new elements are being introduced in the 2- or 3-dimensional map, wherein the positional relations of the new element to at least two existing elements are defined on the basis of a linguistic relation of the new buzzword to at least two existing buzzwords.
In an embodiment of the inventive system, the search entity comprises at least one of a search engine or human being.
In another embodiment of the system, the system comprises a browser for browsing within the map displayed on the display.
In a still further embodiment of the inventive system, the publication data base is implemented on a system server or as a freely accessible data base, and the search entities are adapted to access the system server database or public database, respectively.
In a still further embodiment of the invention, the system comprises a processing entity for dynamically updating an initial set of buzzwords assigned to elements with pre-defined locations in the map, the processing entity being connected to a plurality of data input entities which are adapted for specifying buzzwords or respective elements in the n-dimensional map.
In an embodiment of the inventive system the so far mentioned set of buzzwords is understood as set of primary buzzwords that is related to sets of secondary buzzwords. By choosing a primary buzzword displayed on a 2- or 3-dimensional map, one set or plural sets of secondary buzzwords displayed in other windows are automatically arranged, depending on the frequency of the assignment of the secondary buzzword to the chosen primary buzzword.
In a further embodiment of the inventive system, additionally to the chosen buzzword an observation bandwidth around the primary buzzword is being set, in order to trigger the display of associated secondary buzzwords in a separate window.
At least in certain embodiments, the present invention has at least one of the following effects/advantages:
The inventive buzzword map arranges buzzwords such, that buzzwords which usually occur together are visualized close to each other. The person that searches for certain buzzwords thus finds them with a higher probability in the neighborhood of already found ones.
The inventive map eases the outsourcing of the analysis of publications, e.g. to a crowdsourcing service. Considering the enormous amount of well-educated experts globally that are online with their smartphones, tablets and laptops and ready to casually earn some money by solving generic tasks as assigning publications to buzzwords, information may be globally structured in a new dimension.
The inventive map enables as well the reverse step of searching for publications matching to buzzwords or buzzword combinations
The inventive Buzzword map may be also applied for listing up products that are probably of interest for the visitor of an e-commerce website.
And more generally the Buzzword map can be applied for displaying any kind of search result.
The buzzword A on the highest detail level, that by its meaning covers all buzzwords B-D on the map, is positioned in the center of the map. Buzzword B, among all buzzwords one level below buzzword A with the highest total frequency among the ‘relatives’ of A, is positioned vertically above buzzword A. B has two ‘satellites’ or ‘daughters’ B1, B2 with very low total frequency. Buzzword C, likewise one level below buzzword A with the second highest total frequency, is positioned vertically below buzzword A. Buzzword D, two levels below buzzword A with the third highest total frequency, is positioned to the left of buzzword A. Buzzword E, one level above buzzword A with almost the same total frequency as A, is positioned to the right of buzzword A at the largest distance to A. In this exemplary display configuration, the elements corresponding to the buzzwords are shown as circles or concentrical ring structures, respectively, wherein the number of rings corresponds to the level of abstraction of the respective buzzword, and the extension (diameter) of the elements corresponds to a predetermined relevance of the buzzword. This relevance is determined independently of the formation of the initial map but will be changed in the course of a subsequent dynamical updating of a map, see further below.
In the exemplary embodiments of
Whereas in the above-mentioned figures all tags are of the same size and shown in black-and-white, in a practical implementation the sizes and/or colors of the tags can be different, depending on the relevance or frequency of appearance, respectively, of the underlying buzzwords.
Furthermore, the figure shows that meanwhile from buzzword A ‘relatives’ have been derived, at different hierarchical levels, in the figure designated with numerals A1, A2, A3, and A11, A12, A13. Likewise, buzzword C has now ‘daughters’ C1, C2, and C3.
In the figure, the positional relation between buzzwords A and D is explained in more detail by indicating the relevant vector FAD and the distance dAD are indicated, as well as the vectors FDB between the elements D and B, FDC between the elements D and C, and FDE between the elements D and E. The distance between elements A and D is dependent on the frequency of joint appearance of D and B and can be dependent on the total appearance of buzzword D, whereas the direction component of the vector FAD depends on the positional relations of element D with respect to elements B, C, and E and can, in the simplest case, be derived from a vector addition of the respective vectors FDB, FDC, and FDE.
What also can be derived from a comparison of
The above-referenced frequency of appearance of a buzzword can be understood as the number of times
a) the buzzword has been assigned to publications by offices, experts and/or regarded as relevant by a particular customer (individual point of view) or customers (overall point of view) or
b) the buzzword has been viewed, commented or purchased by a consumer (individual point of view) or consumers (overall point of view).
According to a further aspect, the direction between buzzwords A and D (wherein D can be considered as a ‘daughter’ of A) depends on the relative frequency of common appearance of D with each of the neighbor elements (buzzwords) B, C, and E. Depending on the context, for the exemplary relation between D and B the frequency hDB can mean
c) the number of publications to which D and B have been both assigned by offices, experts and/or regarded as relevant by a customer (individual point of view) or customers (overall point of view) or
d) the number of consumers (overall point of view), which/or the number of times a particular consumer (individual point of view) have shown interest in both D and B, divided by the total frequency HB of the neighbor buzzword B.
In a further embodiment of the invention the frequencies h and H may, depending on the context, be weighted by the level of importance, e.g. low, medium, high, that experts or customers assign to buzzwords, and the level of trust in the expert's ability to judge (context 1) or the degree of similarity of consumer profiles (context 2).
A display unit 107 is provided for displaying the 2- or 3-dimensional map with the elements assigned to the buzzwords, and a keyboard or touchpad function 109 serves for providing inputs into the system by a user (expert or customer), in particular for designating buzzwords to a publication or for selecting one or more elements in the map, to find the publication which has been classified by using the referenced buzzword(s). A search engine 111 is provided for assigning a publication in the third database 105 to a field of knowledge in the first database 101 or for finding the publication belonging to a certain element upon the user's input. A processing unit 113 is provided for dynamically updating an initial set of buzzwords and corresponding elements in the map displayed on a display 107 with new information which is input by the user on the keyboard or touchpad 109. The operation of the above-referenced system components is in line with the method described further above and will, therefore, not be repeated here.
The user may search for publications which are assigned to certain descriptor elements. The result may be displayed as colored areas on the IPC class map (primary buzzword map).
While an embodiment of the present invention is illustrated and described, various modifications and improvements can be made by persons skilled in this art. The embodiment of the present invention is therefore described in an illustrative but not restrictive sense. It is intended that the present invention may not be limited to the particular forms as illustrated, and that all modifications which maintain the spirit and realm of the present invention are within the scope as defined in the appended claims.