Information retrieval comprises the searching for information in documents, searching for documents themselves, searching for metadata which describe documents, as well as searching within data bases. Data bases may comprise stand-alone databases or hyper-text network databases such as the World Wide Web.
A search engine has an information retrieval system designed to help to find information stored in a computer system such as the World Wide Web or inside a proprietary network or within a personal computer of a user. Such a search engine allows a user to seek for content meeting specific criteria which are formed usually by keyphrases. A search engine retrieves a list of items that match those input criteria. This list is often sorted according to some measure of relevance. A search engine can be formed by a web search engine which searches for information on the public World Wide Web. Other kinds of search engines can be formed by enterprise search engines which search in intranets of companies. Well-known search engines are for instance “google” or “MSN-search” of Microsoft.
Conventional search engines use a search index which takes into account links from one document to other documents to assign a relevance of a document for a key phrase. A disadvantage of conventional search engines is that they do not integrate the users into the search. In most search engines the user has no influence as to the search process which is performed automatically by the search engine. There is no pro-active feedback by the user to influence the quantity and/or the quality of the search results. Accordingly, it often happens that search results even when listed according to a relevance do not meet the expectation of the user, i.e. the listed document are not the desired documents or documents of poor quality.
Another disadvantage of conventional search engines is that the ranking of the listed documents can be manipulated by so-called spam-pages, i.e. documents which are linked with other documents to influence the ranking performed by the search engine.
Accordingly, it is an object of the present invention to provide a search engine which supplies to the users with more relevant documents.
The invention provides a search engine comprising search indices for entities, wherein the reputation of a tag, the so called tag reputation, which classifies an entity, is updated by said searching machine depending on a rating input by said user and depending on a user reputation of said user.
In an embodiment of the search engine according to the present invention the entity is a network (data) entity of a data network, e.g. a web page.
In an embodiment of the search engine according to the present invention a network is formed by a set of interconnected computers, ranging from local area networks (LANs) to wide area networks (WANs).
In a further embodiment of the search engine according to the present invention the data network is formed by the internet.
In an embodiment of the search engine according to the present invention said entity is formed by a web page.
In an embodiment of the search engine according to the present invention the entity is formed by a web service.
In an embodiment of the search engine according to the present invention each entity comprises a uniform resource identifier.
In an embodiment of the search engine according to the present invention the search engine is formed by a server of a service provider connected to a network.
In an embodiment of the search engine according to the present invention the search engine comprises a memory for storing a search index list comprising for a plurality of tags corresponding uniform resource identifiers of network entities classified by the respective tags.
In an embodiment of the search engine according to the present invention the tag is formed by a key phrase or a meta tag.
In an embodiment of the search engine according to the present invention the search engine comprises a memory for storing a user reputation index list comprising for a plurality of user IDs of users corresponding user reputations of the respective users, the so called user reputations.
In an embodiment of the search engine according to the present invention the search engine comprises a memory for storing a tag reputation index list comprising for a plurality of tags corresponding (tag) reputations of the respective tags.
In an embodiment of the search engine according to the present invention the search engine is connected via network to user clients.
In an embodiment of the search engine according to the present invention a search index update message is generated by an authorised user client and transmitted via a network-interface of said network to said search engine.
In an embodiment of the search engine according to the present invention the search index update message comprises a user ID of said user,
a rating for an entity and a uniform resource identifier of the entity.
In an embodiment of the search engine according to the present invention the search index update message further comprises
at least one tag for classifying the entity,
an optional comment of the user with respect to another comment input by another user with respect to that entity or input by another user with respect to the entity and a comment reference for the respective comment.
In an embodiment of the search engine according to the present invention the search engine comprises a memory for storing for each entity a corresponding comment index list which comprises a plurality of user IDs of users and comments input by said users for the respective entities.
In an embodiment of the search engine according to the present invention the search engine after receiving a search index update message from a client performs an update of said search index list by inserting tags of said search index update message into said search index list along with a uniform resource identifier of said search index update message.
In an embodiment of the search engine according to the present invention an update of the user reputation of a first user having a comment for an entity stored in said comment index list of said entity is performed after reception of a search index update message from a second user by said search engine depending on a user reputation of said second user and depending on the rating given by said second user in said search index update message, wherein the updated user reputation of said first user is stored by said search engine in a user reputation index list.
In an embodiment of the search engine according to the present invention a rating is given by said second user with respect to comments of said first user for said entity.
In an embodiment of the search engine according to the present invention the search engine performs an update of a tag reputation of all tags within a received search index update message depending on the user reputation of the respective user and stores the updated tag reputations in the tag reputation index list of the respective entity.
In an embodiment of the search engine according to the present invention the search engine performs an update of said comment index list of said entity by storing the comments of the last search index update message of said user with respect to said entity.
The invention further provides a method for indexing entities,
wherein a tag reputation of a tag which classifies an entity is updated depending on a user reputation of a user and depending on a rating input by said user.
The invention further provides a system for indexing entities,
wherein a tag reputation of a tag which classifies an entity is updated depending on a user reputation of a user and depending on a rating input by said user.
The invention further provides a client for providing a search index update message for a search engine connected to said client via a network,
wherein said search index update message comprises:
a user ID of a user,
a rating input by said user for an entity,
a uniform resource identifier of said entity,
at least one tag for classifying said entity,
and an optional comment of a user with respect to said entity or with respect to a comment of another user for said entity, wherein a tag reputation of said tag is updated depending on a user reputation of said user and depending on said rating input by said user.
As can be seen from
In the embodiment shown in
The search index list SIL comprises for a plurality of tags t corresponding uniform resource identifiers (URI) for network entities classified by the respective tags t.
The user reputation index list URIL as stored in the memory 6-2 of the database 6 comprises for a plurality of user IDs of users corresponding user reputations UR of the respective users.
The tag reputation index list TRIL as stored in the memory 6-3 of the database 6 comprises for a plurality of tags t corresponding tag reputations TR.
A comment index list CIL as stored in the memory 6-4 comprises a plurality of user IDs of users and comments C input by that users for the respective entities.
Each user, such as user A, user B in
After that another user, i.e. user B performs a normal search by supplying a tag t which classifies an entity to the search engine 2. The search engine 2 performs a search and provides user B with a sorted list of URIs for the corresponding tag respecting a tag ratings TRs of the respective entities such as shown for example in
In a further step user B can select a uniform resource identifier URI of an entity to get access to this entity E. The entity E can be any kind of document such as a web page or a webservice. A web page can be provided, for instance, by a server 5 as shown in
After starting the search operation the user can decide in step S1 whether he wishes to perform an indexing operation or a normal search operation. When the user decides to perform a normal search he inputs tags t in a step S2 and transmits the tags t in a further step S3 to the search engine 2. The tags t can be formed by key words or keyphrases classifying the respective entity. If the user decides to perform an indexing update he performs a user log-in in step S4 to authenticate himself for the search engine 2.
In a further step S5 the user selects an entity such as “eclipse.org” and can view in a further step S6 all comments given by other users with respect to the entity. Possible comments might be, for instance: “eclipse.org is a good tool”
In a further step S7 the user can input further tags or key phrases for the respective entity such as “C++”.
Furthermore, the user can input in a further step S8 his own comment C for the entity, for example a comment for the entity “eclipse.org”, but also a comment to the comments given by the other users. In a further step S9 a search index update message SIUM is generated automatically by the client computer 4 and transmitted in step S10 to the search engine 2.
In a further possible embodiment the search index update message SIUM further comprises at least one tag t for classifying the entity, a comment C of a user with respect to another (user) comment C′ input by another user or directly with respect to the entity. The rating R of the search index update message SIUM indicates, for example the quality of the referenced entity. The tags t are, for instance keywords or keyphrases to classify the referenced entity. The classification of the entities is built up dynamically. The user comment C and the comment reference is optional.
After a call of the search engine 2 by a user via the network 3 in a step 1 the user informs the search engine 2 in step S2 whether he wishes to perform an indexing update or whether he wishes a normal search operation.
In case of a normal search operation the search engine 2 waits for tags t input by the user in step S3. Then, the search engine 2 performs a normal search operation on basis of the input tags t in step S4. The normal search operation is shown in more detail in
In a first step S4-1 the search engine 2 gets tags ti received from the searching user.
Then the search engine looks up all uniform resource identifiers stored in the search index list SIL for the input tags ti in step S4-2.
As can be seen from
In a further step S7 the search engine 2 updates the user reputation index list URIL as shown in
In a further step S9, the search engine 2 performs an update of the tag reputation index list TRIL as shown in
In the example shown in
As can be seen from
User D inputs a search index update message wherein the rating given for the entity is negative (R=−). Accordingly, the reputation of the entry given by the user A and the user reputation UR of user A himself is diminished. Furthermore, a new tag “VI” is stored as a characterising feature of the entity “eclipse.org” as long as the user reputation UR of user D is positive. In a further embodiment of the method according to the present invention the user has not only the possibility to give comments C with respect to a network entity such as eclipse.org but also to give a comment C for a comment given by another user. For example, by clicking a reply button the user has the possibility to give a comment of his own to a comment of another user. An update of a user reputation UR of a first user which has given a comment C for an entity as stored in the comment index list CIL of said entity can be performed after reception of a search index update message SIUM of a second user depending on the user reputation UR of said second user and depending on the rating given by said second user in the search index update message SIUM. For example, if a second user having high user reputation UR, comments negatively on a comment of a first user the user reputation UR of the first user will be diminished significantly. In the same manner other users having a high user reputation commenting positively on a comment given by a first user can increase the user reputation UR of the first user.
In the method and system according to the present invention a quality of the search is increased significantly because each user has the opportunity to influence the tag reputation TR of a tag t which classifies a network entity. This tag reputation TR depends on the ratings R given by the users and depending on the user reputation UR of these users. Ratings of users with high user reputations UR have a strong impact on the tag reputation TR of a tag t. The reputation of a user i.e. is user reputation UR depends on the comments C he gives for the network entities. Furthermore, the user reputation UR of the user depends also on the comments C given by the other users with respect to the network entity or with respect to his comments. An authorised user known to the search engine 2 has the possibility to influence the index lists, thus increasing the quality of the search and increasing the search speed. These users have the ability to assign dynamically tags for specific uniform resource identifiers. The users further have the possibility to input comments C for referenced entities so that the users are motivated to read comments of other users. Furthermore, the method and system according to present invention allows to identify useless entries, i.e. SPAM and to eliminate such entries.
| Number | Date | Country | Kind |
|---|---|---|---|
| EP07009524 | May 2007 | EP | regional |