1. Field of the Invention
The invention is generally directed to the management of information (knowledge).
2. Related Art
Many systems exist for organizing documents. Such systems include file management applications (such as Windows Explorer) and document management systems.
Systems also exist for analyzing documents to some extent. Such systems include, for example, various Internet search engines for identifying documents that satisfy some supplied search criteria.
There are many more examples of existing systems for organizing documents, and systems for analyzing documents.
However, systems that effectively and efficiently manage knowledge are rare. Thus, there is a need for such systems. More particularly, there is a need for systems that enable users to organize, process, and otherwise manipulate information contained in any form (text, graphics, multimedia, applications, images, sound, etc.), and that provide diverse and flexible functionality so that users may construct work flows and processes according to their particular needs.
Briefly stated, the invention is directed to a system, method, and computer program product for managing knowledge. The knowledge that is being managed comprises documents of interest to users. Such documents may be in any form, such as but not limited to text, images, graphics, audio, video, multimedia, computer programs/applications, etc., and combinations thereof.
More particularly, the invention is directed to a computer implemented method of enabling a user to organize and analyze information in electronic form. The method operates by searching a first set of documents to thereby generate a second set of documents. The invention automatically creates a first group comprising the second set of documents. The invention analyzes the first group according to one or more analytical functions to thereby generate a third set of documents. The invention automatically creates a second group comprising the third set of documents. The invention enables selective iteration of any of these operational steps.
The invention also generates objects corresponding to process components of this work flow. The user can re-execute the work flow (process) by traversing the objects, or create a new process by modifying the objects.
Further features and advantages of the invention, as well as the structure and operation of various embodiments of the invention, are described in detail below with reference to the accompanying drawings. In the drawings, like reference numbers generally indicate identical, functionally similar, and/or structurally similar elements. Generally, the drawing in which an element first appears is indicated by the leftmost digit(s) in the corresponding reference number.
The present invention will be described with reference to the accompanying drawings, wherein:
1. Overview of the Invention
The invention is directed to a system, method, and computer program product for managing knowledge. The knowledge that is being managed comprises documents of interest to users. Such documents may be in any form, such as but not limited to text, images, graphics, audio, video, multimedia, applications, etc., and combinations thereof.
The invention provides a number of functions for processing and manipulating knowledge. Typically, the functions that are performed are selected by users. More particularly, users select the functions that are to be performed, and the sequence in which those functions are performed. This is referred to as “work flow” or “process.”
According to embodiment of the invention, work flows can be saved. In other words, the invention supports persistent work flows or processes. The invention achieves persistent processes through the use of objects. These objects can be manipulated and re-used to achieve a number of advantages.
The invention shall now be described in greater detail.
2. Description of the Knowledge Management System (KMS)
The system 102 includes a plurality of databases 116 that store documents, such as patent related information and other information (unrelated to patents).
A knowledge management system (KMS) 114 accesses and processes the information in the databases 116. In particular, the KMS 114 includes modules that are capable of semi-automatically and automatically accessing and processing the information in the databases 116 in an document-centric and/or group-oriented manner. Such processing includes, but is not limited to, reporting, analyzing, and planning.
In an embodiment, the KMS 114 is implemented at least in part using an Intellectual Property Asset Manager (IPAM), which is described in U.S. Pat. No. 5,991,751, incorporated herein by reference in its entirety.
In an embodiment, the system 102 includes (but is not limited to) two types of clients, network clients 106 and web clients 104. These clients 104, 106 interact with the KMS 114 to access and process the information in the databases 116.
For example, the clients 104, 106 may request that the KMS 114 retrieve certain information, or automatically analyze certain information. The KMS 114 performs the requested tasks, and sends the results to the requesting clients 104, 106. The clients 104, 106 present these results to their respective operators, for example, and enable the operators to process the results.
Clients 104, 106 may also perform additional processing of data, such as creating a visualization of the data obtained from the KMS 114.
2.1. Example Computer Implementation
In an embodiment of the present invention, the components of the present invention shown in
The computer 502 includes one or more processors (also called central processing units, or CPUs), such as a processor 506. The processor 506 is connected to a communication bus 504. The computer 502 also includes a main or primary memory 508, preferably random access memory (RAM). The primary memory 508 has stored therein control logic 510 (computer software), and data 512.
The computer 502 also includes one or more secondary storage devices 514. The secondary storage devices 514 include, for example, a hard disk drive 516 and/or a removable storage device or drive 518. The removable storage drive 518 represents a floppy disk drive, a magnetic tape drive, a compact disk drive, an optical storage device, tape backup, ZIP drive, a slot, a PCMCIA device, etc.
The removable storage drive 518 interacts with a removable storage unit 520. As will be appreciated, the removable storage unit 520 includes a computer usable or readable storage medium having stored therein computer software (control logic) and/or data. The removable storage drive 518 reads from and/or writes to the removable storage unit 520 in a well known manner.
Removable storage unit 520, also called a program storage device or a computer program product, represents a floppy disk, magnetic tape, compact disk, optical storage disk, ZIP disk, memory card, PCMCIA card, or any other computer data storage device. Program storage devices or computer program products also include any device in which computer programs can be stored, such as hard drives.
In an embodiment, the present invention is directed to computer program products or program storage devices having software that enables the computer 502 to perform any combination of the functions described herein.
Computer programs (also called computer control logic) are stored in main memory 508 and/or the secondary storage devices 514. Such computer programs, when executed, enable the computer 502 to perform the functions of the present invention as discussed herein. In particular, the computer programs, when executed, enable the processor 506 to perform the functions of the present invention. Accordingly, such computer programs represent controllers of the computer 502.
The computer 502 also includes a display unit 522, such as a computer monitor, and one or more input devices 524, such as a keyboard, a mouse, other pointing devices (such as a light pen and trackball), etc.
The computer 502 further includes a communication or network interface 526. The network interface 526 enables the computer 502 to communicate over communication networks, such as networks 108 and 112, which in embodiments use the well known HTTP communication protocol.
The computer 502 can receive signals 527 from any medium via interface 526. Such signals 527 may include data and/or software. Such signals 527 represent another computer program product embodiment, and the invention is directed to such embodiment.
The components of the invention (shown in
2.2. Databases
2.2.1. Document Databases
The document databases 312 include electronic representations of documents of interest to the customer. The document databases 312 represent the customer's repository of documents, and are thus also called the customer's document repository. (The “repository” could alternatively represent all documents represented in the databases 116, whether represented in the document databases 312 or the bibliographic databases 302.)
For example, the patent database 314 includes electronic representations of U.S. and foreign patents of interest to the customer. The patent database 314 preferably has stored therein an image file and a text file for each patent represented in the patent database 314, where the image file and the text file are representations of the patent. Details of an embodiment of the image file and the text file are described in U.S. Pat. Nos. 5,623,681 and 5,623,679, which are both incorporated herein by reference in their entireties.
The document databases 312 also include other documents of interest to the customer.
The documents in the document databases 312 may be text, images, graphics, audio, video, multimedia, applications, etc.
2.2.2. Document Bibliographic Databases
The document bibliographic databases 302 store information about documents (as opposed to the documents themselves). More particularly, the document bibliographic databases 302 store bibliographic information about documents.
For example, the patent bibliographic databases 304 store bibliographic data about U.S. and non-U.S. patents. Such patent bibliographic data includes, but is not limited to, the information on the front page of patents, such as: the patent number, the issue date, the inventors, the title, the assignee, the serial number, the filing date, the U.S. and international classifications, the fields of search, the references cited, the primary examiner, the assistant examiner, the attorney, the agent, the law firm, priority information, related application information, the number of claims, the number of drawing pages, the patent term, the expiration date, etc. The patent bibliographic databases 304 can also include one or more user defined fields that can store large amounts of data.
2.2.3. Notes Database
The present invention supports annotation of the documents in the document databases 312. More particularly, the present invention allows users to create and link annotations (also called notes) to the documents (or portions thereof) in the document databases 312. Such annotations can include text, graphics, images, video, audio, and/or any other information representation that can be stored in electronic form.
The annotations, linkage information (i.e., information that specifies the link between a note and a portion of a document), and information related to the annotations and/or the linkage information (such as the position of the linked portion in the document, the date of creation, the creator, the date of modification, a note title and/or subject, access rights, etc.) are stored in the notes databases 340. Embodiments of the notes databases 340 are described in U.S. Pat. Nos. 5,623,679 and 5,623,681, incorporated by reference herein.
2.2.4. Groups Databases
Information on groups is stored in the group databases 321. Generally, a group is a data structure that includes any number of objects that typically follow a common theme or characteristic (although this is not a mandatory requirement of groups). Groups are said to be document-centric.
There are two classes of groups: predefined groups (also called system defined groups) and user-defined groups (also called arbitrary groups).
The invention also supports other types of groups. For example, the invention supports temporary groups. A temporary group is automatically created by the invention in the course of processing a command. One application of temporary groups involves search operations. Specifically, when conducting a search for documents, a new temporary group is created, and the search results are stored in the temporary group. Groups may be created through the processing of other commands or actions. The invention permits operators to save temporary groups. For example, the invention allows operators to convert temporary groups to predefined groups or user-defined groups.
Groups are further described in U.S. Pat. No. 5,991,751, which is herein incorporated by reference in its entirety.
2.3. Example Block Diagram of the Knowledge Management System
The KMS 114 has a number of modules (collectively called the KMS modules).
A number of the modules interact with the databases 116. A SQL server 226 (such as the Microsoft SQL Server) and/or other well known database servers 228 interact with the databases 116. The KMS modules interact with these servers 226 and 228 and the databases 116 via a database interface module 220, which may represent, for example, an ODBC (object database connectivity) layer.
The Network transport layer or interface 201 is used to receive command request objects from the client 104, 106.
The KMS 114 is a highly secure business decision system. In an embodiment, the specific operations in each command object are checked against the security information maintained about each user in the system. This is logically done through a comprehensive security layer or module 202.
The document storage and retrieval module 208 provides access to documents and information in the databases 116.
The Searching subsystem or module 210 provides for document and information searching. The search layer 210 also encapsulates the specific search engine(s) 224 used in the implementation of the system, which can and will vary based on available search technologies.
The Groups layer or grouping module 212 is responsible for managing groups.
The Notes layer or module 214 is responsible for managing forms of annotations.
The Analysis layer or analysis modules 216 perform analysis in support of specific requests made by various modules in the system.
Finally, the server administration layer or module 218 provides services to manage the configuration of the KMS 114, such as adding or changing the security permissions associated with a specific user.
The modules described above are further described in U.S. Pat. No. 5,991,751, incorporated herein by reference in its entirety.
2.4. Analysis Modules
Example analysis modules 216 are shown in
It should be understood that the invention is adapted and intended to include a wide and varied range of analysis modules 216. The analysis modules 216 shown in
Analysis modules 216 are further described in U.S. Pat. No. 5,991,751, which is herein incorporated by reference in its entirety.
3. Knowledge Management—Example Work Flow Embodiments
As its name indicates, the knowledge management system (KMS) 114 is useful for managing knowledge. The knowledge that is being managed comprises documents of interest to users. Such documents may be in any form, such as but not limited to text, images, graphics, audio, video, multimedia, computer programs/applications, etc., and combinations thereof.
The KMS 114 provides a number of functions for processing and manipulating knowledge. Typically, the functions that are performed are selected by users. More particularly, users select the functions that are to be performed, and the sequence in which those functions are performed. This is referred to as “work flow.”
In step 604 of
The invention supports a number of searching types. These may be based, for example, on morphological processing 606 (involving the understanding of components of words), lexical processing 608 (matching whole words during searches), syntactical processing 610 (taking into consideration the role of words), semantic processing 612 (the meaning of words in sentences), discourse processing 614 (understanding parts of documents), and/or pragmatic processing 616 (using external knowledge). Based on such processing, the invention supports full text search modules 704, boolean search modules 706, clustering modules 708, as well as other search modules (see
The searches of step 604 result in one or more lists of documents. In an embodiment, the lists correspond to groups 618. Accordingly, the searches of step 604 automatically generate groups 618. This is generally true of the invention. That is, processing of documents results in groups, where the groups include documents that were identified, or that resulted, from such processing.
In an embodiment, the groups 618 are temporary. In such embodiments, it is possible to save the groups 618 (step 620). It is also possible to annotate the groups 618, and the documents in the groups 618 (step 622).
The user may wish to conduct further searching within the groups 618. Such further searching is represented by step 624, and this results in new groups 626. Again, such groups 626 may be saved (step 628) and/or annotated (step 630).
In step 632, the user elects to analyze the groups 626 and the documents contained therein. The invention supports a number of analysis modules, such as mapping modules 634, citation or plot lineage modules 636, and reporting modules 638. See also
The citation modules 636 includes a patent citation module 802 (see U.S. Pat. No. 5,991,751), an author citation module 804, and a case law citation module 806.
The author citation module 804 operates to identify documents related to an author. The operation of the author citation module 804 according to an embodiment is shown in
The author citation module 804 as described above is focused on an author. It can be modified to be focused on a particular article (called the based article). For example, references on which the base article is based, and references that cite the base article, could be displayed in step 910.
The case law citation module 806 operates to display cases on which a given case is based, and cases that cite the given case. In this manner, it is similar to the author citation module 804, and its processing is similar to that shown in
Referring again to
The processing performed thus far may have been intended to narrow a large database of documents to a smaller set that is pertinent to certain criteria of interest to the user. This involved various searching (steps 604 and 624) and analysis (step 632). The resulting groups 640 may include documents that the user feels are very pertinent to his criteria of interest. Thus, the user is ready at this point to analyze in detail each document in the groups 640. This is represented by step 642. Such analysis may involve a manual study of the documents, and/or further analytics (such as that performed in step 632).
Groups and documents can be annotated 3024. Also, documents can be individually analyzed 3020.
The work flows described herein can be saved. In other words, the invention supports persistent work flows or processes. The invention achieves persistent processes through the use of objects. These objects can be manipulated, as indicated by step 3022 in
4. Achieving Persistent Processes Through Use of Objects
As mentioned above, the invention enables users to save work flows, which are also referred to as processes. In other words, the invention allows users to make processes persistent.
The invention uses objects to achieve this function. The invention creates objects for components of a process. For example, with regard to the example process of
A number of advantages are achieved by making a process persistent. Some of the advantages (and features of the invention) are shown in
On advantage is a process can be easily fully or partially re-executed by traversing one or more of its objects (i.e., by invoking the objects). This is represented by step 1106.
Another advantage is it is easy to create new processes by modifying old processes. This is represented by steps 1108-1114. In step 1108, a user copies objects corresponding to an old process to create new objects. In step 110, the user modifies the copied objects to create a new process. In step 1112, the user executes the new process by traversing its objects. The user can then compare the new process to the old process by comparing its respective objects (see step 1114).
The invention enables users to organize and manage objects. For example, a user can save objects, delete objects, view objects, modify objects, organize objects in folders, etc. Such operation is represented by step 1116.
The invention supports definitions (or templates) for various objects. The invention supports operation objects (such as query, boolean, import, patent family, corporate family, list exploder objects) and application objects (web reporting, citation tree, patent family tree, corporate family tree, visualization, clustering objects), as well as other objects. As a process is being executed, objects corresponding to the process's components are created (instantiated) based on the applicable object definitions.
Such operation is depicted in the flowchart 2902 of
Many objects have an input and an output. The input represents one or more other objects, which are called upstream objects 2810 (see the example work flow diagram 2802 in
Thus, the features of the invention just described enable users to create, store, manage, manipulate, and share their work - not just results, but the operations and processes for arriving at those results. In addition, these features enable automation of processes to greatly reduce or eliminate the manual effort required to update previously generated results.
Each object type has attributes that help users document and re-use their work. Users can create, manage, and access objects from Object Manager, a tree navigation and management tool similar to Windows File Manager.
With regard to queries (searches), by defining upstream objects (such as a List or database) that will be queried, the Trigger (such as a change to a List or an update to a database source) that could cause the Query to auto-execute, and the downstream object (the output List), a user could set up a number of Queries to feed Lists of common topics (such as competitor activity) that could be shared with perhaps many users. The other types of objects will have similar properties.
Using the defined objects as building blocks connected together in strings, complete processes can be set up. Some simple examples would be:
Database->Query->List->Report
[List A, List B]->Boolean->List C->ThemeScape
One powerful feature of this architecture is that these strings of objects can be defined as objects themselves. Called Process Stream Objects, these objects can be created, stored, and auto- or manually-executed. They can also be copied, modified, and moved, greatly simplifying the task of setting up new processes. A goal in developing these features is to provide the building blocks to enable users to construct, store, utilize, and manage whatever Process Streams they might envision.
Object definitions are described in the following sections. These definitions are provided for illustrative purposes. The invention supports other objects, and the definitions for these other objects will be apparent to persons skilled in the relevant arts based on the teachings contained herein.
4.1.1. Common Object Features
In an embodiment, the following features apply to all objects.
4.1.1.1. Generic Attributes
Object Type
Name
Path
Object ID
Object Version Number
Object Description
Owner
Permissions
Create Date
Modification Date
Object Modification History
Annotations
Related Objects
Upstream Objects
Downstream Objects
Object Triggers
4.1.1.2. Generic Object Views
Permissions View
Connections View (see, for example,
Trigger View (see, for example,
Attribute View (see, for example,
Users can change the Description in the Attribute View.
Modification History View (see, for example,
Annotation View (see, for example,
4.1.2. Boolean Operation Objects
The Boolean Operation Object allows users to perform boolean operations on Lists to generate List output. Possible boolean operations are (in the following, the upstream objects are List A and List B, and the downstream object is List C):
AND
OR
NOT
See also, for example,
4.1.3. Corporate Family Operating Objects
The Corporate Family Operation allows users to define operations based on corporate entity information. For example, this object can sort through a list of patents to identify those that are assigned to a particular corporate entity.
4.1.4. Export Objects
The Export Operation Object allows tracking of exported Lists and their formats. This object corresponds to an export operation where a particular object, such as a list object for example, is exported to a destination. The export operation includes modifying the object according to the specifications of the destination (such as the format of the destination). The user would be prompted to generate or choose an Export Operation Object when beginning the operation. Examples of Export Operation data transactions are exporting to Lotus Notes or to loosely coupled analytic software.
Export specific attributes include the following:
Destination name/provider
Export Format Definition
Export List description
4.1.5. Folder Objects
Folders represent storage locations for other objects, including other Folders. They can be nested in a hierarchy, similar to folders or directories in Windows and DOS. Although in IPAM Folders are implemented in a database rather than in the Windows file system, this is transparent to the user. Folders may be annotated.
Content View is a Folder-Specific Object View. The Content View shows the contents of the Folder, which can include subfolders and Data Objects of any type. Contents are listed in alphabetical order by name, with subfolders listed first, followed by Data Objects. In addition to the names of the contents, the listing can optionally include the Object Description. The Content View is the default view. See, for example,
4.1.6. Import Objects
The Import Operation Object allows tracking of the source of list imports, as well as reformatting input data into Lists. In cases where the Import Operation is driven by the external source, the external software could generate an Import Operation Object. An example of a data transaction generating an Import Operation externally would be an export from SciFinder to IPAM.
In cases where the Import Operation is driven from within IPAM, the user would generate an Import Operation Object. An example of an IPAM-driven data transaction could be an import to IPAM.
Import specific attributes include the following:
Source name/provider
Import Format Definition
Import List description
4.1.7. List Exploder Operation Objects
The List Exploder Operation allows users to easily break an input List into a number of output Lists based on one of several criteria: Publication Year, Inventor, IPC, subject matter, etc.
4.1.8. List Objects
A List Object represents a list of documents, implemented as a list of document GUIDs. Display of the List would result in any additional document fields specified for the List View to be fetched from the Patent or Document database. This object corresponds to the concept of “Group” (described elsewhere herein).
Related Objects
Operation Objects
Application Objects
Permitted Downstream Objects (multiple objects)
Operation Objects
Application Objects
List-Specific Attributes
Document Table
Content Change Type
List Display Parameters
List Display Options View
List Content Views:
Short List View (see, for example,
Full List View (see, for example,
Abstract List View
Shoebox List View (see, for example,
4.1.9. Query Objects
Query operations may be executed against any supported database/search engine. Depending on the database/search engine, queries may be structured in searchscreen format and/or native query strings. For example, an initial Query Object implementation for a Dialog search could provide a screen for entering the Dialog search command string; a later implementation could provide a search form that automatically generates Dialog command strings. See, for example,
Query specific attributes include the following:
Database name/provider
Query string
4.1.10. Patent Family Dedupe
The Patent Family Dedupe operation allows users to dedupe a List of patent documents so as to keep only one member of a patent family in the result List. For example, if a list includes a U.S. patent and its PCT, Japanese, and European counterparts, this operation allows the user to delete all copies of this patent except for one.
Users will select the Family Definition that they wish to use (Inpadoc Family, Derwent Family, Identical Priority Family, or Priority-in-Common Family). Users also select Document Retention Priority, which is the priority order for keeping documents so that the retained doc from each family contains the maximum amount of useful information. An example priority order might be to retain the patents according to the following preference: 1) WPI record, 2) US grant, 3) EP-B publication in English, 4) PCT publication in English, etc.
This object has the following object specific views:
Document Retention Priority View
The Document Retention Priority View allows users to select the priority order for keeping documents so that the retained doc from each family contains the appropriate amount of useful information in the preferred language. For example, if the Output List is to be reviewed by managers who prefer only to read abstracts, then the priority order would be chosen to provide the highest quality abstract, without regard to availability of fulltext. However, if the reviewer is a scientist or engineer, the priority order might reflect the requirement for fulltext. An example priority order might be: 1) WPI record, 2) U.S. grant, 3) EP-B publication in English, 4) PCT publication in English, etc. See, for example,
Family Definition View
The Family Definition View allows the user to select their preferred Family Definition. Possible definitions include the standard definitions: Inpadoc Family or Derwent Family; or user-defined families based on either Identical Priority or Priority-in-Common. For user-defined families, the user must select the specific document-to-document relationships that the user considers to be a Family Relationship. See, for example,
4.1.11. Patent Family Expand Object
The Patent Family Expand operation allows users to expand a List of patent documents so that for each patent family represented in the Input List, all family members are included in the Result List. For example, if a List includes a particular U.S. patent, this operation causes all counterpart patent applications and patents to be inserted into the List.
Membership in a family is determined by the user's choice of Family Definition (Inpadoc Family, Derwent Family, Identical Priority Family, or Priority-in-Common Family). The user may also choose which types of documents to include, i.e., Pseudo-Docs, Primary Patent Docs, and/or Secondary Patent Docs.
This object has an object-specific view called the Document Inclusion View. The Document Inclusion View allows the user to select the Input List, Result List, Document Types to include, and the preferred Family Definition. Possible Family Definitions include the standard definitions (Inpadoc Family or Derwent Family), or custom Family Definitions (based on either Identical Priority or Priority-in-Common). For custom Family Definitions, the user must select the specific document-to-document relationships that the user considers to be a Family Relationship. See, for example,
4.2. Work Flow (Process) Example
In step 1006, a query is conducted on a database 1004. A query object is created for the query.
In step 1008, List A results from the query, and a list object corresponding to List A is created.
In step 1010, duplicate patents are removed from List A. A patent family dedupe object is created corresponding to this operation.
In step 1012, List C results from the operation of step 1010, and a list object corresponding to List C is created.
In step 1016, documents are imported from some source 1014. This creates an import object.
In step 1018, List B results from the operation of step 1016, and a list object corresponding to List B is created.
In step 1020, a boolean operation is performed on List C and List B. This results in a boolean operation object.
In step 1022, List D results from the operation of step 1020, and a list object corresponding to List D is created.
In step 1024, a corporate family analysis is performed to identify patents contained in List D that are assigned to a particular corporate entity. This results in a corporate family operating object.
In step 1026, List E results from the operation of step 1026, and a list object corresponding to List E is created.
Referring now to
As a result of step 1028, List F (step 1030) and List G (step 1032) are created. This results in list objects corresponding to Lists F and G.
In step 1034, some report is performed on List G. This results in an application object being created.
In step 1036, List H results from the operation of step 1034, and a list object corresponding to List H is created.
In step 1038, a mapping operation is performed on the List H. This results in an application object being created.
In step 1040, List I results from the operation of step 1038, and a list object corresponding to List I is created.
In step 1042, List I is exported to some destination 1044. This results in an export object being created.
The work flow of
5. Object Manager and Database
Objects are stored in an object database. The invention also provides an object manager for manipulating objects.
The Object Manager is a tool for visualizing the Folder hierarchy within the Object Database. See, for example,
When the Object Manager is active and a folder is selected, the Work Pane displays the folder contents. When the Object Manager is active and an object is selected, the Work Pane displays that object's Default View.
6. Example Modules of the Invention
6.1. Clustering Tools
Referring to
Generally, some embodiments of the clustering modules operate by parsing the documents to extract particular information. The information may be extracted from unstructured data contained in the document (such as the specification of a patent), and/or from structured data contained in the document (such as meta data, i.e., titles, authors, abstracts, etc.). The documents are then grouped according to this information. The invention supports clustering modules that operate differently than just described.
However, the list 3108 could include hundreds or thousands (or more) of documents, all relating in some way to CDMA. The user might only be interested in a particular CDMA topic, such as GSM.
In the example of
An example of a clustering module that operates as just described is Vivisimo of Pittsburgh, Pa., although the invention is not limited to this example.
6.2. Relevancy Visualization Tool
The invention supports other modules for processing and organizing documents. For example, the invention supports a relevancy visualization (RV) module that extracts key terms from documents (from unstructured and/or structured portions of the documents), assigns them to meaningful categories (a taxonomy), and establishes their inter-relationships. As a result, the RV module generates a highly structured body of information that users can slice as needed. Users can generate patterns related to the documents in a variety of visual forms such as maps, tables and graphs.
More particularly, the RV module analyzes a group of documents to identify how those documents relate to various key terms, and/or relate to each other with regard to (or relative to) those key terms. For example, suppose the key term is “assignee.” In this patent related example, the RV module would analyze a group of patents to determine who their assignees are. The RV performs this task by performing a key word search of the documents, for example.
In embodiments, the RV module operates according to a rule book 3406. The rule book comprises a set of instructions describing specific linguistic patterns relevant to a particular vertical market or horizontal discipline, such as patents. The rulebook tells the module what concepts and relationships to retrieve from the text documents.
In an embodiment, the rule book 3406 includes rules to analyze patents. The rule book may include instructions to analyze patents according to the following key terms: assignee and inventor. When processing a group of patents (as shown in the example of
An example of this module is ClearForest of New York, N.Y., although the invention is not limited to this example.
6.3. Patent Family Expand Features
This section describes features of the invention relating to processing patent families. A patent family is a collection of patent documents that are related to one another in some way. For example, a patent family may include a U.S. provisional application, a corresponding U.S. non-provisional application, a corresponding PCT application, and a Japanese patent that was filed from the PCT application.
Requirements
This section describes requirements needed to support the patent family features according to an embodiment of the invention.
Family Relationship Types
Unstructured Family Relationships
Structured Family Relationships
Family Definitions
Standard Family Definitions
Custom Family Definitions
Document Types
Family Report
Family Tree
Patent Family Expand
Patent Family Dedupe
While various embodiments of the present invention have been described above, it should be understood that they have been presented by way of example only, and not limitation. Thus, the breadth and scope of the present invention should not be limited by any of the above-described exemplary embodiments, but should be defined only in accordance with the following claims and their equivalents.
This is a continuation-in-part application of pending U.S. non-provisional application Ser. No. 08/921,369 filed Aug. 29, 1997 titled “Using Hyperbolic Trees to Visualize Data Generated by Patent-Centric and Group-Oriented Data Processing,” which is a continuation-in-part application of U.S. Pat. No. 5,991,751 titled “System, Method, and Computer Program Product for Patent-Centric and Group-Oriented Data Processing,” both of which are herein incorporated by reference in their entireties.
Number | Name | Date | Kind |
---|---|---|---|
4205780 | Burns et al. | Jun 1980 | A |
4270182 | Asija | May 1981 | A |
4486857 | Heckel | Dec 1984 | A |
4533910 | Sukonick et al. | Aug 1985 | A |
4555775 | Pike | Nov 1985 | A |
4613946 | Forman | Sep 1986 | A |
4622545 | Atkinson | Nov 1986 | A |
4686590 | Kunii et al. | Aug 1987 | A |
4716476 | Okada et al. | Dec 1987 | A |
4719523 | Kutaragi | Jan 1988 | A |
RE32632 | Atkinson | Mar 1988 | E |
4736308 | Heckel | Apr 1988 | A |
4748618 | Brown et al. | May 1988 | A |
4752889 | Rappaport et al. | Jun 1988 | A |
4772882 | Mical | Sep 1988 | A |
4785408 | Britton et al. | Nov 1988 | A |
4788538 | Klein et al. | Nov 1988 | A |
4812834 | Wells | Mar 1989 | A |
4847604 | Doyle | Jul 1989 | A |
4868733 | Fujisawa et al. | Sep 1989 | A |
4873623 | Lane et al. | Oct 1989 | A |
4884223 | Ingle et al. | Nov 1989 | A |
4899136 | Beard et al. | Feb 1990 | A |
4914732 | Henderson et al. | Apr 1990 | A |
4931783 | Atkinson | Jun 1990 | A |
4935865 | Rowe et al. | Jun 1990 | A |
4939507 | Beard et al. | Jul 1990 | A |
4959769 | Cooper et al. | Sep 1990 | A |
4977455 | Young | Dec 1990 | A |
4985863 | Fujisawa et al. | Jan 1991 | A |
4991087 | Burkowski et al. | Feb 1991 | A |
5008853 | Bly et al. | Apr 1991 | A |
5021989 | Fujisawa et al. | Jun 1991 | A |
5029013 | Hiratsuka et al. | Jul 1991 | A |
5062060 | Kolnick | Oct 1991 | A |
5072412 | Henderson, Jr. et al. | Dec 1991 | A |
5120944 | Kern et al. | Jun 1992 | A |
5142674 | Barker et al. | Aug 1992 | A |
5148154 | MacKay et al. | Sep 1992 | A |
5155806 | Hoeber et al. | Oct 1992 | A |
5157768 | Hoeber et al. | Oct 1992 | A |
5163104 | Ghosh et al. | Nov 1992 | A |
5179643 | Homma et al. | Jan 1993 | A |
5183404 | Aldous et al. | Feb 1993 | A |
5206830 | Isobe et al. | Apr 1993 | A |
5222160 | Sakai et al. | Jun 1993 | A |
5228123 | Heckel | Jul 1993 | A |
5237158 | Kern et al. | Aug 1993 | A |
5241671 | Reed et al. | Aug 1993 | A |
5251294 | Abelow | Oct 1993 | A |
5253362 | Nolan et al. | Oct 1993 | A |
5276616 | Kuga et al. | Jan 1994 | A |
5283894 | Deran | Feb 1994 | A |
5319745 | Vinsonneau et al. | Jun 1994 | A |
5327235 | Richards | Jul 1994 | A |
5334030 | Brilliott | Aug 1994 | A |
5349170 | Kern | Sep 1994 | A |
5353059 | Lawlor et al. | Oct 1994 | A |
5359428 | Kubota et al. | Oct 1994 | A |
5359508 | Rossides | Oct 1994 | A |
5381175 | Sudo et al. | Jan 1995 | A |
5392428 | Robins | Feb 1995 | A |
5402336 | Spiegelhoff et al. | Mar 1995 | A |
5404295 | Katz et al. | Apr 1995 | A |
5404514 | Kageneck et al. | Apr 1995 | A |
5428778 | Brookes | Jun 1995 | A |
5430681 | Sugawara et al. | Jul 1995 | A |
5432897 | Tatsumi et al. | Jul 1995 | A |
5434962 | Kyojima et al. | Jul 1995 | A |
5440481 | Kostoff et al. | Aug 1995 | A |
5442778 | Pedersen et al. | Aug 1995 | A |
5444615 | Bennett et al. | Aug 1995 | A |
5444779 | Daniele | Aug 1995 | A |
5452018 | Capitant et al. | Sep 1995 | A |
5481666 | Nguyen et al. | Jan 1996 | A |
5511186 | Carhart et al. | Apr 1996 | A |
5519857 | Kato et al. | May 1996 | A |
5530520 | Clearwater | Jun 1996 | A |
5537526 | Anderson et al. | Jul 1996 | A |
5540597 | Budman et al. | Jul 1996 | A |
5544302 | Nguyen | Aug 1996 | A |
5544352 | Egger | Aug 1996 | A |
5550976 | Henderson et al. | Aug 1996 | A |
5551055 | Matheny et al. | Aug 1996 | A |
5553216 | Yoshioka et al. | Sep 1996 | A |
5557722 | DeRose et al. | Sep 1996 | A |
5557785 | Lacquit et al. | Sep 1996 | A |
5559942 | Gough et al. | Sep 1996 | A |
5568639 | Wilcox et al. | Oct 1996 | A |
5576954 | Driscoll | Nov 1996 | A |
5581686 | Koppolu et al. | Dec 1996 | A |
5583982 | Matheny et al. | Dec 1996 | A |
5584035 | Duggan et al. | Dec 1996 | A |
5592607 | Weber et al. | Jan 1997 | A |
5592608 | Weber et al. | Jan 1997 | A |
5594837 | Noyes | Jan 1997 | A |
5596700 | Darnell et al. | Jan 1997 | A |
5604901 | Kelley et al. | Feb 1997 | A |
5615112 | Sheng et al. | Mar 1997 | A |
5615328 | Hadderman et al. | Mar 1997 | A |
5615362 | Jensen et al. | Mar 1997 | A |
5619632 | Lamping et al. | Apr 1997 | A |
5623679 | Rivette et al. | Apr 1997 | A |
5623681 | Rivette et al. | Apr 1997 | A |
5628003 | Fujisawa et al. | May 1997 | A |
5630125 | Zellweger | May 1997 | A |
5632031 | Velissaropoulos et al. | May 1997 | A |
5634012 | Stefik et al. | May 1997 | A |
5634051 | Thomson | May 1997 | A |
5638519 | Haluska | Jun 1997 | A |
5642502 | Driscoll | Jun 1997 | A |
5692176 | Holt et al. | Nov 1997 | A |
5696963 | Ahn | Dec 1997 | A |
5721910 | Unger et al. | Feb 1998 | A |
5732216 | Logan et al. | Mar 1998 | A |
5748956 | Lafer et al. | May 1998 | A |
5754840 | Rivette et al. | May 1998 | A |
5761497 | Holt et al. | Jun 1998 | A |
5765152 | Erickson | Jun 1998 | A |
5774833 | Newman | Jun 1998 | A |
5787424 | Hill et al. | Jul 1998 | A |
5794257 | Liu et al. | Aug 1998 | A |
5799325 | Rivette et al. | Aug 1998 | A |
5806079 | Rivette et al. | Sep 1998 | A |
5808615 | Hill et al. | Sep 1998 | A |
5809318 | Rivette et al. | Sep 1998 | A |
5826252 | Wolters, Jr. et al. | Oct 1998 | A |
5832476 | Tada et al. | Nov 1998 | A |
5848409 | Ahn | Dec 1998 | A |
5870770 | Wolfe | Feb 1999 | A |
5875431 | Heckman et al. | Feb 1999 | A |
5892900 | Ginter et al. | Apr 1999 | A |
5918236 | Wical | Jun 1999 | A |
5924090 | Krellenstein | Jul 1999 | A |
5933841 | Schumacher et al. | Aug 1999 | A |
5950214 | Rivette et al. | Sep 1999 | A |
5963941 | Hirakawa | Oct 1999 | A |
5990897 | Hanratty | Nov 1999 | A |
5991751 | Rivette et al. | Nov 1999 | A |
5999907 | Donner | Dec 1999 | A |
6003033 | Amano et al. | Dec 1999 | A |
6006257 | Slezak | Dec 1999 | A |
6041323 | Kubota | Mar 2000 | A |
6067528 | Breed et al. | May 2000 | A |
6078913 | Aoki et al. | Jun 2000 | A |
6151595 | Pirolli et al. | Nov 2000 | A |
6169995 | Yoshimura et al. | Jan 2001 | B1 |
6175824 | Breitzman et al. | Jan 2001 | B1 |
6279014 | Schilit et al. | Aug 2001 | B1 |
6282545 | Coats | Aug 2001 | B1 |
6339767 | Rivette et al. | Jan 2002 | B1 |
6389434 | Rivette et al. | May 2002 | B1 |
6393406 | Eder | May 2002 | B1 |
6460034 | Wical | Oct 2002 | B1 |
6499026 | Rivette et al. | Dec 2002 | B1 |
6556992 | Barney et al. | Apr 2003 | B1 |
6581039 | Marple et al. | Jun 2003 | B2 |
6963920 | Hohmann et al. | Nov 2005 | B1 |
7437471 | Hohmann et al. | Oct 2008 | B2 |
7523126 | Rivette et al. | Apr 2009 | B2 |
20020035499 | Germeraad et al. | Mar 2002 | A1 |
20020055924 | Liming | May 2002 | A1 |
20020077835 | Hagelin | Jun 2002 | A1 |
20020082778 | Barnett et al. | Jun 2002 | A1 |
20030204514 | Owens et al. | Oct 2003 | A1 |
20070078886 | Rivette et al. | Apr 2007 | A1 |
20070136116 | Germeraad et al. | Jun 2007 | A1 |
20070208669 | Rivette et al. | Sep 2007 | A1 |
Number | Date | Country |
---|---|---|
0239884 | Oct 1987 | EP |
0 694 829 | Jan 1996 | EP |
5-135109 | Jun 1993 | JP |
6-231141 | Aug 1994 | JP |
8-221435 | Aug 1996 | JP |
WO 9325974 | Dec 1993 | WO |
WO 9414122 | Jun 1994 | WO |
WO 9500896 | Jan 1995 | WO |
WO 9514280 | May 1995 | WO |
WO 9816890 | Apr 1998 | WO |
WO9834179 | Aug 1998 | WO |
WO 9844430 | Oct 1998 | WO |
WO 9855945 | Dec 1998 | WO |
WO 9962005 | Dec 1999 | WO |
WO 0011575 | Mar 2000 | WO |
WO 0052618 | Sep 2000 | WO |
WO 0060495 | Oct 2000 | WO |
WO 0173657 | Oct 2001 | WO |
Number | Date | Country | |
---|---|---|---|
20020007373 A1 | Jan 2002 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 08921369 | Aug 1997 | US |
Child | 09848437 | US | |
Parent | 08867392 | Jun 1997 | US |
Child | 08921369 | US |