Claims
- 1. A knowledge container, including:
an indication of an object; and at least one tag, wherein each tag associates the object to a knowledge map representation of a discrete perspective of a domain of knowledge.
- 2. The knowledge container of claim 1, wherein the object is one of content and resources.
- 3. The knowledge container of claim 1, further including administrative meta-data, comprised of structured information about the object.
- 4. The knowledge container of claim 1, wherein the indication of the object is the object itself.
- 5. The knowledge container of claim 1, wherein the indication of the object is a pointer to the object.
- 6. The knowledge container of claim 4, wherein the knowledge container includes:
marked content that is a textual representation of the object; selective demarcation of regions of the textual representation of the object; and a plurality of indicators of the nature of the content.
- 7. The knowledge container of claim 1, wherein each tag includes a weight indication representing a strength of association of the knowledge container to a particular node.
- 8. The knowledge container of claim 3, wherein the administrative metadata contains a description of the method used to assign the knowledge container to a particular node, including:
SME designation; autocontextualization; source mapping based on where the knowledge container came from; and dialog response.
- 9. The knowledge container of claim 1, wherein said at least one tag is associated with nodes from a single taxonomy.
- 10. The knowledge container of claim 1, wherein said at least one tag is associated with nodes from a plurality of taxonomies.
- 11. The knowledge container of claim 1, wherein the object indicates a person's interests, information needs, and entitlements.
- 12. The knowledge container of claim 11, wherein the tags for the knowledge container include a weight representing:
a strength of the person's interest or information need; relevancy to a question; and expertise of a provider.
- 13. The knowledge container of claim 12, wherein the tags for the knowledge container associate the knowledge container with various portions of the knowledge map.
- 14. The knowledge container of claim 11, wherein the person's entitlements are represented as tags to nodes of an entitlement taxonomy.
- 15. The knowledge container of claim 1, wherein the knowledge container is represented by a markup language such that it is displayable using template-based automated processing.
- 16. A method of processing at least one tag to generate a summary of a knowledge container, comprising the steps of:
generating a natural language template based on at least one tag stored inside the knowledge container; and merging content from the knowledge container and the tagged concept-nodes into the template.
- 17. An autocontextualization method to automatically associate a knowledge container with a knowledge map having a plurality of taxonomies representative of selected discrete perspectives of a knowledge domain, each taxonomy having nodes corresponding to a conceptual area within the discrete perspective that the taxonomy represents, the autocontextualization method comprising:
using a feature recognizer to determine features of the knowledge container; employing a classification system to classify the knowledge container based on the determined features; generating a preliminary list of nodes to which the knowledge container may be associated; and determining a weight indicating a strength of association therewith.
- 18. The autocontextualization method of claim 17, further including the steps of:
truncating nodes from the preliminary list based on the strength of association indicated by the weights; and generating an indication that the remaining nodes are associated with the knowledge container.
- 19. The autocontextualization method of claim 17, further including:
following the classifying step, adjusting the weights determined by the classification system by applying an inference engine based on a set of rules regarding relationships between the nodes.
- 20. The autocontextualization method of claim 17, further including:
following the classifying step, adjusting the preliminary list of nodes generated by the classification system by applying an inference engine based on a set of rules regarding relationships between nodes.
- 21. The autocontextulization method of claim 17, wherein the feature recognizer recognizes as features at least some of:
dates; times; numbers; monetary amounts; people's names; organization names; product names; company names; technical terminology; noun phrases; verb phrases; and syntactic relationships.
- 22. The autocontextualization method of claim 17, wherein the step of generating a preliminary list of nodes further includes the step of identifying the features within the content most relied upon by the classifier in making the classification.
- 23. An organization of a contiguous entity of knowledge, comprising:
a plurality of knowledge containers, each knowledge container having an indication of a constituent portion of the entity of knowledge, each constituent portion of the entity relating to a different topic; and at least one tag associated with said knowledge container, wherein the tag represents an association of a constituent portion of the knowledge container to a concept node.
- 24. The organization of claim 23, further comprising at least one link associated with a first knowledge container, wherein said at least one link associates said first knowledge container to at least a second knowledge container.
- 25. The organization of claim 23, wherein each of the knowledge containers are subordinate knowledge containers, further including:
a master knowledge container that includes an indication of the entire entity of knowledge, wherein each of the subordinate knowledge containers include a link to the master knowledge container.
- 26. A method of processing a query to identify a particular knowledge container, associated with a knowledge map, that is relevant to the query, wherein the knowledge map includes at least one taxonomy representing a discrete perspective of a knowledge domain, wherein the at least one taxonomy is organized into a group of nodes, the nodes representing conceptual areas within the discrete perspective, and wherein the nodes have an indication of knowledge, including the particular content associated therewith, said method comprising the steps of:
(a) processing the query to identify nodes of the taxonomies within the knowledge map that are potentially relevant conceptual areas; (b) identifying knowledge map regions surrounding at least one of the identified nodes; (c) performing a content-based retrieval over the knowledge containers associated with the nodes in each identified region, to retrieve an ordered list of potentially relevant knowledge containers, where each retrieved knowledge container is assigned a numerical relevance score representing a quality of association between the retrieved knowledge container and the query; (d) combining the ordered lists for the identified regions into a single re-ordered list, based on calculating the quality of associations between the knowledge container in the list, the knowledge map, and the query; and (e) returning as a result the re-ordered list of the retrieved knowledge containers.
- 27. The method of claim 26, further including the step of returning the potentially relevant nodes and knowledge map regions.
- 28. The method of claim 26, wherein the content based retrieval step operates upon one content-based search engine index for all knowledge containers associated with nodes of the knowledge map.
- 29. The method of claim 26, in which the content-based retrieval step operates on at least one distinct content-based search engine index per region, where each index indexes or points to a subset of the knowledge containers associated with nodes of the knowledge map.
- 30. The method of claim 29, wherein for each concept node in at least some of the taxonomies, the knowledge containers whose content is associated with those nodes are indexed by a distinct index.
- 31. The method of claim 29, wherein in the subset of knowledge containers in each index have similarity of vocabulary.
- 32. The method of claim 31, wherein the subsets of knowledge containers in each index are formed by steps of:
aggregating the content indicated by knowledge containers associated with each node into a single block of content; grouping the blocks together based on vocabulary occurring within the blocks, using a text clustering system; and grouping those knowledge containers whose content comprises the forming the knowledge containers from which the blocks in a group originate into a distinct subset.
- 33. The method of claim 26, wherein the content-based retrieval step is performed over a group of indexes for each knowledge map region, wherein the group of indexes for a particular region is based on indexes for nodes in that knowledge-map.
- 34. The method of claim 26, wherein the query processing step further includes the step of augmenting the set of identified nodes with additional nodes as input to the query process.
- 35. The method of claim 29, wherein the content-based retrieval step further includes:
performing an additional search over an index for all knowledge containers associated with concept nodes in the knowledge map.
- 36. The method of claim 26, wherein the list combining step includes the following steps:
modifying the numeric relevance scores; and combining the ordered lists into the single reordered list based on the modified relevance scores; wherein the numeric relevance score for a knowledge container in a particular knowledge map region is modified at least partially based on a quality measure for that knowledge map region.
- 37. The method of claim 36, wherein the quality measure for a particular knowledge-map region is derived from a quality measure for each of the potentially relevant concept nodes around which the knowledge-map region surrounds.
- 38. The method of claim 37, wherein the quality measure for a potentially relevant concept node is based on the weight value determined in the query process step when identifying a node for a potentially relevant conceptual area.
- 39. The method of claim 37, wherein the quality measure for a node for a potentially relevant conceptual area is based on a weight for that node determined in the query process step.
- 40. The method of claim 36, wherein the numeric relevance score for a particular knowledge container is adjusted based on a quality measure for that knowledge container.
- 41. The method of claim 36, wherein the quality measure for a particular knowledge container is based on weights of association of the knowledge container with nodes of the taxonomies.
- 42. The method of claim 36, wherein the quality measure for a particular knowledge container is based at least in part by how many knowledge map regions with which the knowledge container has associated nodes.
- 43. The methods of claim 36, wherein the quality measure for a particular knowledge container is dependent on a taxonomic distance between the nodes in the knowledge map with which the knowledge container is associated and nodes in the knowledge map with which the query is associated.
- 44. The method of claim 36, wherein the query is a present query, and wherein the quality measure for a particular knowledge container is based at least in part on a previously-determined overall quality score for the knowledge container based on from users presented with the knowledge container in response to previous queries.
- 45. The method of claim 26, wherein the query includes taxonomic restrictions limiting the areas of the knowledge map from which a knowledge container is returned in response to the query.
- 46. The method of claim 45, wherein the taxonomic restrictions include:
a) a restriction that all knowledge containers returned must be associated with nodes in a particular one or more of the taxonomies; b) a restriction that all knowledge containers returned must be associated with particular nodes; c) a restrictions that all knowledge containers returned must be associated with nodes either at or taxonomically under a particular node or nodes; and d) a boolean combination of the restrictions a), b) and c).
- 47. The method of claim 46, where said taxonomic restrictions further include a restriction that all knowledge containers returned must be tagged to concept-nodes either at or within a particular taxonomic distance of a particular concept-node or nodes.
- 48. The method of claim 46, where said taxonomic restrictions further include:
a) a restriction that all knowledge containers returned may not be associated with nodes in a particular one or more of the taxonomies; b) a restriction that all knowledge containers returned may not be associated with particular nodes; c) a restrictions that all knowledge containers returned may not be associated with nodes either at or taxonomically under a particular node or nodes; and d) a boolean combination of the restrictions a), b) and c).
- 49. The method of claim 26, further including a step of processing administrative meta-data constraints to limit the knowledge containers included in the result, the administrative meta-data constraints including at least one of:
names of authors of the knowledge containers; date ranges for creation date of the knowledge containers; date ranges for last modified date of the knowledge containers; date ranges for expiration date of the knowledge containers; words or phrases which must be present in the title of the knowledge containers; name of publication or source in which the knowledge containers originally appeared; and name of customers for which the knowledge containers were originally prepared.
- 50. The method of claim 47, further including the step of constructing the taxonomic restrictions.
- 51. The method of claim 47, wherein said constructing step is further comprised of the step of manually interacting with a graphical display of the knowledge map to indicate desired taxonomic restrictions.
- 52. The method of claim 47, wherein the interfacing step includes the step of receiving a textual query from the user.
- 53. The method of claim 47, wherein indications of knowledge experts are associated with nodes for which the conceptual areas represented by the nodes are with the experts' area of expertise, and
wherein information about the experts may be included as part of the result of processing the query.
- 54. The method of claim 26, further including the following steps:
receiving input from a user as to the suitability of particular portions of the returned result; modifying the query in response to the input; and repeating steps (a)-(e), using the modified query.
- 55. The method of claim 26, further comprising the step of generating clarifying questions based on the nodes for potentially relevant knowledge containers, wherein the input is provided at least partially in response to answers from a user to the clarifying questions.
- 56. The retrieval method of claim 26, further comprising the step of generating suggested additional terms for the query based on the nodes for potentially relevant knowledge containers, wherein the query is modified in response to a user choosing from the additional terms.
- 57. The retrieval method of claim 26, further comprising the steps of:
generating parameterized questions from which a user can interactively construct a taxonomic restriction to limit the areas of the knowledge map or construct a query from which result knowledge content is returned in response to the query, said parameterized questions including:
a boolean taxonomy-restriction expression, where the concept nodes in the expression are replaced with variables; text of a previously composed question comprised of a plurality of text selection-list boxes for each variable within the boolean taxonomy-restriction expression, wherein each selection-list box holds lists of names or descriptions of concept-nodes that are potential values for the variable; said lists being assembled using the names or descriptions of concept-nodes returned by the retrieval mechanism in the previous step of the dialog, possibly augmented with other nearby concept-nodes from the same taxonomies; said selection-list boxes optionally having pre-selected as the default choice for the user the specific concept-nodes returned by the retrieval mechanism in the previous step of the dialog, such that when a user selects concept-nodes for each selection-list box within the parameterized question, the boolean taxonomy-restriction expression is instantiated by replacing each of its variables with the corresponding selection-list box selection, and the resulting taxonomic restriction is added to the user s query for the subsequent step of the dialog.
- 58. The method of claim 26, wherein the knowledge container includes other intellectual content or an indication of a person who has knowledge contact is associated.
- 59. The retrieval method of claim 26, wherein:
some of the content associated with the nodes of the knowledge map include an indication of a user and the user's interests; and at least some of the steps of the retrieval process account for the user's interests.
- 60. The retrieval method of claim 59, wherein the steps that account for the user's interests include the list combining step.
- 61. The retrieval method of claim 60, wherein the numerical relevance scores are modified based on a correlation between the user s interests and the nodes with which the retrieved knowledge container is associated.
- 62. The retrieval method of claim 34, wherein the method is initiated from a user application, and wherein information about the user application is provided in the form of concept nodes added to the query.
- 63. The retrieval method of claim 51, wherein the process is initiated from a user application, and wherein information about the user application is provided as the taxonomic restrictions.
- 64. The knowledge retrieval process of claim 26, wherein the process is initiated from a user application, and wherein the list combining step operates based on information about the user application.
- 65. The knowledge retrieval process of claim 26, wherein the list combining step operates at least in part based on an identification of nodes of the knowledge map by a user.
- 66. A method of identifying a knowledge container associated with a knowledge map, wherein the knowledge map includes at least one taxonomy representing a discrete perspective of a knowledge domain, wherein the at least one taxonomy is organized into a group of nodes, the nodes representing conceptual areas within the discrete perspective, and wherein the nodes have an indication of knowledge, including the particular content associated therewith, said method comprising:
processing information about a user to identify nodes in the taxonomy that represent conceptual areas previously indicated to be of interest to a user; identifying knowledge map regions surrounding the at least one of the identified nodes; and performing a content-based retrieval over the knowledge containers associated with the nodes in each identified region, to retrieve an ordered list of potentially relevant knowledge containers, where each retrieved knowledge container is assigned a numerical relevance score representing a quality of association between the retrieved knowledge container and the customer information.
- 67. The knowledge container of claim 11, wherein the indication of the user's interests and information needs includes a query for use by a retrieval method to retrieve objects mapped to the knowledge map.
- 68. The method of claim 66, wherein the information about the customer is processed automatically with any action by the user, and wherein at least one portion of the knowledge container of the re-ordered list is displayed to the user.
- 69. A method for constructing a knowledge map from a corpus of knowledge containers, said method comprising:
identifying a set of root nodes for proposed discrete taxonomies to represent facets of the domain of knowledge; extracting terms and features from the corpus of knowledge containers; assigning the terms amongst the proposed discrete taxonomies; constructing each taxonomy from the terms ascribed to that taxonomy and the corpus of knowledge containers; and testing and refining each constructed taxonomy using a text classification system.
- 70. The method of claim 69, wherein the constructing step includes,
clustering the set of terms ascribed to each taxonomy into multiple groups based on the usage and collocation in the text of the corpus; organizing the term clusters into a hierarchical taxonomy based on statistical correlations among the term clusters, each term cluster becoming a concept-node in the hierarchical taxonomy; assigning each concept-node within the generated taxonomy an appropriate name; and manually reviewing and altering the generated taxonomy.
- 71. The method of claim 70, wherein the clustering step utilizes:
correlations between terms in each cluster; and the number of terms in each cluster; wherein a sequence of clustering steps are carried out with each subsequent step attempting to discover additional clusters and using the highest level clusters as higher level concept-nodes.
- 72. The method of claim 70, wherein the name assigning step includes assigning concept-node names to each term cluster based on human manual review by a human expert within the knowledge domain.
- 73. The method of claim 72, wherein the manual review step includes at least one of:
adding concept-nodes to the generated taxonomy; removing concept-nodes from the generated taxonomy; moving concept-nodes within the generated taxonomy to different locations; and adding or removing terms from the term clusters associated with concept-nodes.
- 74. The method of claim 69, wherein the testing and refining step includes using a trainable text classification system, to perform:
for a subset of knowledge container in the corpus, creating a training set by manually identifying the concept-nodes from the newly generated taxonomy that correspond to topics that appear within the content of the knowledge container; for a subset of knowledge containers in the corpus, creating a test set by manually identifying the concept-nodes from the newly generated taxonomy which correspond to topics that appear within the content of the knowledge container; training the text classification system by using the content of the knowledge containers identified for each concept-node in the training set as example data for the concept node; generating a test on training set report indicating how well the trained text classification system's classification of the knowledge containers in the training set for each concept-node within the taxonomy matches the manually identified classification; based on the test on training set report, refining the taxonomy by at least one of:
adding concept-nodes to the generated taxonomy; removing concept-nodes from the generated taxonomy; moving concept-nodes within the generated taxonomy to different locations; and adding or removing knowledge containers from the manually-identified training set of knowledge containers corresponding to each concept-node; generating a test on test set report indicating how well the trained text classification system's classification of the knowledge containers in the test set for each concept-node within the taxonomy matches the manually identified classification; and based on the test on test set report, refining the taxonomy by one or more of:
adding concept-nodes to the generated taxonomy; removing concept-nodes from the generated taxonomy; moving concept-nodes within the generated taxonomy to different locations; and adding or removing knowledge containers from the manually-identified training set of knowledge containers corresponding to each concept-node.
- 75. A method for constructing a knowledge map from a corpus of knowledge comprising a plurality of knowledge containers, said method comprising the steps of:
(a) identifying a plurality of taxonomies that represent major subject areas found in the plurality of knowledge containers; (b) distributing each of the plurality of knowledge containers into at least one of the plurality of taxonomies; (c) for each taxonomy, identifying a set of concept-nodes that represent major themes found in the plurality of taxonomy knowledge containers; (d) for each concept-node:
distributing the plurality of knowledge containers into each of the plurality of concept-nodes; and designating the concept-node as a taxonomy for the portion of the knowledge map to be constructed; (e) repeating steps (c) and (d) for each taxonomy until the knowledge map cannot be further expanded; and (f) testing and refining each concept-node of said knowledge map using a text classification system.
- 76. The method of claim 75, wherein the step of testing and refining each concept-node includes at least one of the following steps:
adding concept-nodes to the generated taxonomy; removing concept-nodes from the generated taxonomy; and moving concept-nodes within the generated taxonomy to different locations.
- 77. The method of claim 75, wherein the testing and refining step includes using a trainable text classification system, to perform:
for a subset of knowledge containers in the corpus, creating a training set by manually identifying the concept-nodes from the newly generated taxonomy that correspond to topics appearing within the content of the knowledge container; for a subset of knowledge containers in the corpus, creating a test set by manually identifying the concept-nodes from the newly generated taxonomy that correspond to topics appearing within the content of the knowledge container; training the text classification system by using the content of the knowledge containers identified for each concept-node in the training set as example data for the concept node; generating a test on training set report that indicates how well the trained text classification system's classification of the knowledge containers in the training set for each concept-node within the taxonomy matches the manually identified classification.
- 78. The method of claim 77, further comprising the following steps of:
refining the taxonomy by at least one of:
adding concept-nodes to the generated taxonomy; removing concept-nodes from the generated taxonomy; moving concept-nodes within the generated taxonomy to different locations; and adding or removing knowledge containers from the manually-identified training set of knowledge containers corresponding to each concept-node; and generating a test on test set report indicating how well the trained text classification system's classification of the knowledge containers in the test set for each concept-node within the taxonomy matches the manually identified classification.
- 79. The method of claim 78, further comprising the step of refining the taxonomy by at least one of:
adding concept-nodes to the generated taxonomy; removing concept-nodes from the generated taxonomy; moving concept-nodes within the generated taxonomy to different locations; and adding or removing knowledge containers from the manually-identified training set of knowledge containers corresponding to each concept-node.
RELATED APPLICATIONS
[0001] This application is a divisional of U.S. application Ser. No. 09/594,083 filed Jun. 15, 2000, which claims priority under 35 U.S.C. 119(e) from U.S. Provisional Application No. 60/139,509, filed Jun. 15, 1999, which applications are incorporated herein by reference.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60139509 |
Jun 1999 |
US |
Divisions (1)
|
Number |
Date |
Country |
Parent |
09594083 |
Jun 2000 |
US |
Child |
10610994 |
Jul 2003 |
US |