Method for adjusting concept-based keyword functions, and search engine employing the same

Information

  • Patent Application
  • 20070174319
  • Publication Number
    20070174319
  • Date Filed
    June 09, 2006
    18 years ago
  • Date Published
    July 26, 2007
    16 years ago
Abstract
A search engine applying concept-based keyword functions involves the application of a keyword function that was generated by computing a keyword and that corresponds to and that represents concepts of the keyword. Contents of the keyword function can be adjusted through constant training with clicking actions of a user. A search conducted in the search engine based on the keyword function can locate information related to synonyms of the keyword, words related to the keyword, etc., thereby permitting a comprehensive and accurate web page data search.
Description

BRIEF DESCRIPTION OF THE DRAWINGS

Other features and advantages of the present invention will become apparent in the following detailed description of the preferred embodiments with reference to the accompanying drawings, of which:



FIG. 1 is a block diagram of the first preferred embodiment of a search engine according to the present invention;



FIG. 2 is a flowchart to illustrate a method for generating a keyword function in the first preferred embodiment;



FIG. 3 is a schematic diagram to illustrate the method of generating the keyword function;



FIG. 4 is a flowchart to illustrate the operation of the search engine;



FIG. 5 is a block diagram of the second preferred embodiment of a search engine according to the present invention; and



FIG. 6 is a flowchart to illustrate the operation of the second preferred embodiment of the search engine according to the present invention.


Claims
  • 1. A method for adjusting a concept-based keyword function which is adapted to train using a plurality of web pages in a web page database, comprising: (A) providing a keyword function, the keyword function including a plurality of keyword parameters and correlation values corresponding thereto;(B) receiving web page clicking information associated with clicking one of the web pages in the web page database; and(C) through a data training scheme and according to a correlation between the clicked web page and the keyword function, automatically determining and classifying the clicked web page and adjusting the keyword function to obtain a new keyword function.
  • 2. The method according to claim 1, wherein step (C) is realized using a data processing technique selected from the group consisting of neural network, Naïve Bayes, and SVM.
  • 3. The method according to claim 1, wherein step (C) includes the following sub-steps: (C1) selecting from the clicked web page learning results that are extremely stable and learning results that are extremely unstable as re-training data, and regarding the rest as test data;(C2) combining the re-training data and the clicked web page to define a new training model, and combining the test data and the clicked web page to define new test data;(C3) finding a correlation between the keyword function and the new test data and a correlation between the new training model and the new test data to obtain a two-dimensional model; and(C4) with the correlation between the keyword function and the new test data serving as a weight of the keyword function, and with the correlation between the new training model and the new test data serving as a weight of the new training model, combining the weights to obtain a new keyword function representing new learning results.
  • 4. A search engine applying a concept-based keyword function, which is adapted to conduct a search using an adjustable keyword function, said search engine comprising: a web page database including a plurality of web pages, each of the web pages being represented by a vector function including a plurality of keyword parameters and keyword weights corresponding thereto;a search module for searching said web page database for a plurality of web pages correlated to the keyword function;a keyword function generating system including an adjusting module which provides the keyword function, the keyword function including a plurality of keyword parameters and correlation values corresponding thereto, which receives web page information associated with clicking a web page in said web page database, and which, through a data training scheme and according to a correlation between the clicked web page and the keyword function, automatically determines and classifies the clicked web page and adjusts the keyword function; anda keyword function database for storing the keyword function.
  • 5. The search engine according to claim 4, wherein determination and classification of the clicked web page and adjustment of the keyword function are performed by said keyword function generating system using a data processing technique selected from the group consisting of neural network, Naïve Bayes, and SVM.
  • 6. The search engine according to claim 4, wherein said keyword function generating system performs data processing, which includes: selecting from the clicked web page learning results that are extremely stable and learning results that are extremely unstable as re-training data, and regarding the rest as test data; combining the re-training data and the clicked web page to define a new training model, and combining the test data and the clicked web page to define new test data; finding a correlation between the keyword function and the new test data and a correlation between the new training model and the new test data to obtain a two-dimensional model; and, with the correlation between the keyword function and the new test data serving as a weight of the keyword function, and with the correlation between the new training model and the new test data serving as a weight of the new training model, combining the weights to obtain a new keyword function representing new learning results.
Priority Claims (1)
Number Date Country Kind
095103005 Jan 2006 TW national