Modern data processing systems, such as general purpose computer systems, allow the users of such systems to create a variety of different types of data files. For example, a typical user of a data processing system may create text files with a word processing program such as Microsoft Word or may create an image file with an image processing program such as Adobe's PhotoShop. Numerous other types of files are capable of being created or modified, edited, and otherwise used by one or more users for a typical data processing system. The large number of the different types of files that can be created or modified can present a challenge to a typical user who is seeking to find a particular file which has been created.
Modern data processing systems often include a file management system which allows a user to place files in various directories or subdirectories (e.g. folders) and allows a user to give the file a name. Further, these file management systems often allow a user to find a file by searching for the file's name, or the date of creation, or the date of modification, or the type of file. An example of such a file management system is the Finder program which operates on Macintosh computers from Apple Computer, Inc. of Cupertino, Calif. Another example of a file management system program is the Windows Explorer program which operates on the Windows operating system from Microsoft Corporation of Redmond, Wash. Both the Finder program and the Windows Explorer program include a find command which allows a user to search for files by various criteria including a file name or a date of creation or a date of modification or the type of file. However, this search capability searches through information which is the same for each file, regardless of the type of file. Thus, for example, the searchable data for a Microsoft Word file is the same as the searchable data for an Adobe PhotoShop file, and this data typically includes the file name, the type of file, the date of creation, the date of last modification, the size of the file and certain other parameters which may be maintained for the file by the file management system.
Certain presently existing application programs allow a user to maintain data about a particular file. This data about a particular file may be considered metadata because it is data about other data. This metadata for a particular file may include information about the author of a file, a summary of the document, and various other types of information. A program such as Microsoft Word may automatically create some of this data when a user creates a file and the user may add additional data or edit the data by selecting the “property sheet” from a menu selection in Microsoft Word. The property sheets in Microsoft Word allow a user to create metadata for a particular file or document. However, in existing systems, a user is not able to search for metadata across a variety of different applications using one search request from the user. Furthermore, existing systems can perform one search for data files, but this search does not also include searching through metadata for those files.
Certain existing systems can receive a multiple word search input and automatically add Boolean AND operators between the words of the search input to create a search input which is used to perform a search. These certain existing systems cannot, however, determine whether elements within the multiple word search input are within a same category of data.
Methods for managing data in a data processing system and systems for managing data are described herein. These methods and systems typically receive a first input and convert it into a second input for use in performing a search.
According to one aspect of the inventions, an exemplary method includes receiving an input which represents an initial search query, determining whether elements within the initial search query are within the same category of data, and deriving a machine-interpreted search query in response to the determining. One specific implementation of this method involves grouping elements within the same category of data with a Boolean OR operator, and wherein the category of data is one of a date parameter for a file, a file type parameter, or a person parameter, such as an author or sender of a document. Alternative implementations may group some or all of the elements with other Boolean operators (e.g. AND, NOT, XOR, etc.).
According to another aspect of the inventions, an exemplary method includes receiving a user input which specifies an initial search query, interpreting the initial search query to create a machine interpreted search query, and performing a search using the machine-interpreted search query before advising the user that the search used the machine-interpreted search query which is different than the initial search query.
According to another aspect of the inventions, an exemplary method includes receiving an input which represents a search query which includes at least one word, and performing, in response to the input, a prefix matching search through metadata from a plurality of files created by at least one software application wherein the type of information in metadata for files of a first type differs from the type of information in metadata of a second type. In one specific implementation of this method, at least one of the characters dot (“.”) and dash (“-”) are matched as part of the prefix matching search. In other words, characters which are normally treated as separators between words (and hence not part of a word) are treated as part of a word, such as the beginning of a word, and are considered for matching as part of a prefix matching search.
Other aspects of the present inventions include various data processing systems which perform these methods and machine readable media which perform various methods described herein.
The present invention is illustrated by way of example and not limitation in the figures of the accompanying drawings in which like references indicate similar elements.
The subject invention will be described with reference to numerous details set forth below, and the accompanying drawings will illustrate the invention. The following description and drawings are illustrative of the invention and are not to be construed as limiting the invention. Numerous specific details are described to provide a thorough understanding of the present invention. However, in certain instances, well known or conventional details are not described in order to not unnecessarily obscure the present invention in detail.
The present description includes material protected by copyrights, such as illustrations of graphical user interface images. The owners of the copyrights, including the assignee of the present invention, hereby reserve their rights, including copyright, in these materials. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office file or records, but otherwise reserves all copyrights whatsoever. Copyright Apple Computer, Inc. 2004.
As shown in
It will be apparent from this description that aspects of the present invention may be embodied, at least in part, in software. That is, the techniques may be carried out in a computer system or other data processing system in response to its processor, such as a microprocessor, executing sequences of instructions contained in a memory, such as ROM 107, RAM 105, mass storage 106 or a remote storage device. In various embodiments, hardwired circuitry may be used in combination with software instructions to implement the present invention. Thus, the techniques are not limited to any specific combination of hardware circuitry and software nor to any particular source for the instructions executed by the data processing system. In addition, throughout this description, various functions and operations are described as being performed by or caused by software code to simplify description. However, those skilled in the art will recognize what is meant by such expressions is that the functions result from execution of the code by a processor, such as the microprocessor 103.
Capturing and Use of Metadata Across a Variety of Application Programs
The method of
The method of
One particular field which may be useful in the various metadata formats would be a field which includes an identifier of a plug in or other software element which may be used to capture metadata from a data file and/or export metadata back to the creator application.
Various different software architectures may be used to implement the functions and operations described herein. The following discussion provides one example of such an architecture, but it will be understood that alternative architectures may also be employed to achieve the same or similar results. The software architecture shown in
The software architecture 400 also includes a file system directory 417 for the metadata. This file system directory keeps track of the relationship between the data files and their metadata and keeps track of the location of the metadata object (e.g. a metadata file which corresponds to the data file from which it was extracted) created by each importer. In one exemplary embodiment, the metadata database is maintained as a flat file format as described below, and the file system directory 417 maintains this flat file format. One advantage of a flat file format is that the data is laid out on a storage device as a string of data without references between fields from one metadata file (corresponding to a particular data file) to another metadata file (corresponding to another data file). This arrangement of data will often result in faster retrieval of information from the metadata database 415.
The software architecture 400 of
The method of
It will be appreciated that the notification, if done through the OS kernel, is a global, system wide notification process such that changes to any file will cause a notification to be sent to the metadata processing software. It will also be appreciated that in alternative embodiments, each application program may itself generate the necessary metadata and provide the metadata directly to a metadata database without the requirement of a notification from an operating system kernel or from the intervention of importers, such as the importers 413. Alternatively, rather than using OS kernel notifications, an embodiment may use software calls from each application to a metadata processing software which receives these calls and then imports the metadata from each file in response to the call.
As noted above, the metadata database 415 may be stored in a flat file format in order to improve the speed of retrieval of information in most circumstances. The flat file format may be considered to be a non-B tree, non-hash tree format in which data is not attempted to be organized but is rather stored as a stream of data. Each metadata object or metadata file will itself contain fields, such as the fields shown in the examples of
A flexible query language may be used to search the metadata database in the same way that such query languages are used to search other databases. The data within each metadata file may be packed or even compressed if desirable. As noted above, each metadata file, in certain embodiments, will include a persistent identifier which uniquely identifies its corresponding data file. This identifier remains the same even if the name of the file is changed or the file is modified. This allows for the persistent association between the particular data file and its metadata.
User Interface Aspects
Various different examples of user interfaces for inputting search parameters and for displaying search results are provided herein. It will be understood that some features from certain embodiments may be mixed with other embodiments such that hybrid embodiments may result from these combinations. It will be appreciated that certain features may be removed from each of these embodiments and still provide adequate functionality in many instances.
The combination of text entry region 709 and the search parameter menu bar allow a user to specify a search query or search parameters. Each of the configurable pull down menus presents a user with a list of options to select from when the user activates the pull down menu. As shown in
It will also be appreciated that the various options in the pull down menus may depend upon the fields within a particular type of metadata file. For example, the selection of “images” to be searched may cause the various fields present in the metadata for an image type file to appear in one or more pull down menus, allowing the user to search within one or more of those fields for that particular type of file. Other fields which do not apply to “images” types of files may not appear in these menus in order reduce the complexity of the menus and to prevent user confusion.
Another feature of the present invention is shown in
The window 1001 includes an additional feature which may be very useful while analyzing a search result. A user may select individual files from within the display region 1005 and associate them together as one collection. Each file may be individually marked using a specific command (e.g. pressing the right button on a mouse and selecting a command from a menu which appears on the screen, which command may be “add selection to current group”) or similar such commands. By individually selecting such files or by selecting a group of files at once, the user may associate this group of files into a selected group or a “marked” group and this association may be used to perform a common action on all of the files in the group (e.g. print each file or view each file in a viewer window or move each file to a new or existing folder, etc.). A representation of this marked group appears as a folder in the user-configurable portion 1003A. An example of such a folder is the folder 1020 shown in the user-configurable portion 1003A. By selecting this folder (e.g. by positioning a cursor over the folder 1020 and pressing and releasing a mouse button or by pressing another button) the user, as a result of this selection, will cause the display within the display region 1005 of the files which have been grouped together or marked. Alternatively, a separate window may appear showing only the items which have been marked or grouped. This association or grouping may be merely temporary or it may be made permanent by retaining a list of all the files which have been grouped and by keeping a folder 1020 or other representations of the grouping within the user-configurable side bar, such as the side bar 1003A. Certain embodiments may allow multiple, different groupings to exist at the same time, and each of these groupings or associations may be merely temporary (e.g. they exist only while the search results window is displayed), or they may be made permanent by retaining a list of all the files which have been grouped within each separate group. It will be appreciated that the files within each group may have been created from different applications. As noted above, one of the groupings may be selected and then a user may select a command which performs a common action (e.g. print or view or move or delete) on all of the files within the selected group.
The window 1201 shown in
A column 1211 of window 1201 allows a user to select various search parameters by selecting one of the options which in turn causes the display of a submenu that corresponds to the selected option. In the case of
The window 1301 shown in
The search results user interface shown in
It will be appreciated that this method may employ various alternatives. For example, a window may appear after the command option 2232 or 2233 has been selected, and this window asks for a name for the new folder. This window may display a default name (e.g. “new folder”) in case the user does not enter a new name. Alternatively, the system may merely give the new folder or new storage facility a default path name. Also, the system may merely create the new folder and move or copy the items into the new folder without showing the new window as shown in
Certain embodiments may be implemented using an application program interface, such as the interaction between two or more computer programs. In this instance, one program (e.g. a first program) may provide a user interface by receiving an initial user input and providing the initial user input to another program (e.g. a second program such as a “plug in”) which automatically interprets the initial search input and produces a machine interpreted search input. This machine interpreted search input is then used to perform a search (which may be performed by the second program or by yet another program such as a third program). The results of the search are then conveyed to, for example, the first program which displays the search results to the user. The first program, or another program, may optionally display a control to allow machine interpretation to be turned on or off.
Examples of certain types of machine interpretation of the search queries are described below. In general, the data processing system attempts to interpret the user's intent, and, in general, for certain embodiments, the data processing system avoids attempting to perform sophisticated natural language interpretation. This is typically done by detecting keywords which are recognized to specify a type of information, such as a date parameter, or a type of file or document parameter. An algorithm detects these key words and transforms the query using certain Boolean operators. One approach attempts to break down a search query which would, without machine interpretation, produce an empty set, such as the query “today yesterday”. There can be nothing which happened on both today and yesterday, so this query, if logically ANDed, will return an empty set. Clearly this is not what the user wanted. In this instance, by detecting these disjoint predicates and using a logical OR to join them, instead of a Boolean AND, the user's intent is achieved. Disjoint predicates may also be separated into groups. Two such groups include types of documents and date parameters. Other groups may include authors, senders of e-mails, etc. One example of a general algorithm is: A) gather all key words from the query into groups by type (e.g. dates, file types); B) within each group, map the keywords to their predicates, and OR the predicates together, forming a predicate for the category; C) AND logically the predicate for each category together; D) take the remaining (non-keyword) search terms, map each to a predicate matching against any metadata of files in the metadata database; E) join the predicates from operation D) with ANDs; F) form a predicate on file contents with all of the non-keywords, and OR that with the predicate from operation E); G) AND, through a Boolean operator, the predicates formed in operation F) and operation C) together to get the final query.
An example of a complex search query will now be used to show an example of the method of
In certain embodiments, the parameter in the metadata may define how the parameter is combined logically (e.g. which Boolean operator is used) with other parameters or with the same parameter. For example, the kind parameter in metadata for a file specifies the type of file, and this parameter may be defined to use a Boolean OR operator to combine words which are categorized as being within the kind parameter, such as “movies” or “jpeg” or “message.” Thus, the parameter within a definition of the type of information in metadata for a file type may specify a rule (such as, use a Boolean OR to combine words) which indicates how to interpret a user's search query. The rule may also specify the words which are considered to mach the parameter's data type; for example, the kind parameter may include a rule that specifies that the words “email” or “message” are to be interpreted as requesting a search of email documents, and the rule may further specify that the words “jpeg” or “picture” or “image” or “photoshop” or “psd” or “tiff” are all to be interpreted as requesting a search of a variety of different image files.
Another aspect of the inventions is shown in
This table shows that prefix matching will cause a non-match (e.g., note how “paris” does not find “comparison”). This table also shows how the dot character may serve as a character which defines the beginning of the word, and hence, the input “.test” does not match “testnow”.
In the foregoing specification, the invention has been described with reference to specific exemplary embodiments thereof. It will be evident that various modifications may be made thereto without departing from the broader spirit and scope of the invention as set forth in the following claims. The specification and drawings are, accordingly, to be regarded in an illustrative sense rather than a restrictive sense.
This application is a continuation-in-part of U.S. patent application Ser. No. 10/877,584, filed on Jun. 25, 2004 now U.S. Pat. No. 7,730,012. This application also claims priority to U.S. Provisional Patent Application No. 60/643,087 filed on Jan. 7, 2005, which provisional application is incorporated herein by reference in its entirety; this application claims the benefit of the provisional's filing date under 35 U.S.C. §119(e). This present application hereby claims the benefit of these earlier filing dates under 35 U.S.C. §120.
Number | Name | Date | Kind |
---|---|---|---|
4270182 | Asija | May 1981 | A |
4704703 | Fenwick | Nov 1987 | A |
4736308 | Heckel | Apr 1988 | A |
4939507 | Beard et al. | Jul 1990 | A |
4985863 | Fujisawa et al. | Jan 1991 | A |
5008853 | Bly et al. | Apr 1991 | A |
5072412 | Henderson, Jr. et al. | Dec 1991 | A |
5220625 | Hatakeyama et al. | Jun 1993 | A |
5228123 | Heckel | Jul 1993 | A |
5241671 | Reed et al. | Aug 1993 | A |
5319745 | Vinsonneau et al. | Jun 1994 | A |
5355497 | Cohen-Levy | Oct 1994 | A |
5392428 | Robins | Feb 1995 | A |
5504852 | Thompson-Rohrlich | Apr 1996 | A |
5544360 | Lewak et al. | Aug 1996 | A |
5557793 | Koerber | Sep 1996 | A |
5592608 | Weber et al. | Jan 1997 | A |
5623681 | Rivette et al. | Apr 1997 | A |
5644657 | Capps et al. | Jul 1997 | A |
5659735 | Parrish et al. | Aug 1997 | A |
5710844 | Capps et al. | Jan 1998 | A |
5761678 | Bendert et al. | Jun 1998 | A |
5828376 | Solimene et al. | Oct 1998 | A |
5832500 | Burrows | Nov 1998 | A |
5845301 | Rivette et al. | Dec 1998 | A |
5890147 | Peltonen et al. | Mar 1999 | A |
5966710 | Burrows | Oct 1999 | A |
6012053 | Pant et al. | Jan 2000 | A |
6021409 | Burrows | Feb 2000 | A |
6055543 | Christensen et al. | Apr 2000 | A |
6067541 | Raju et al. | May 2000 | A |
6078916 | Culliss | Jun 2000 | A |
6115717 | Mehrota et al. | Sep 2000 | A |
6119118 | Kain, III et al. | Sep 2000 | A |
6185574 | Howard et al. | Feb 2001 | B1 |
6353823 | Kumar | Mar 2002 | B1 |
6363386 | Soderberg et al. | Mar 2002 | B1 |
6370562 | Page et al. | Apr 2002 | B2 |
6374260 | Hoffert et al. | Apr 2002 | B1 |
6389412 | Light | May 2002 | B1 |
6401097 | McCotter et al. | Jun 2002 | B1 |
6405190 | Conklin | Jun 2002 | B1 |
6408301 | Patton et al. | Jun 2002 | B1 |
6434548 | Emens et al. | Aug 2002 | B1 |
6466237 | Miyao et al. | Oct 2002 | B1 |
6473794 | Guheen et al. | Oct 2002 | B1 |
6480835 | Light | Nov 2002 | B1 |
6532481 | Fassett | Mar 2003 | B1 |
6564225 | Brogliatti et al. | May 2003 | B1 |
6567805 | Johnson et al. | May 2003 | B1 |
6613101 | Mander et al. | Sep 2003 | B2 |
6643640 | Getchius et al. | Nov 2003 | B1 |
6665657 | Dibachi | Dec 2003 | B1 |
6704739 | Craft et al. | Mar 2004 | B2 |
6847959 | Arrouye et al. | Jan 2005 | B1 |
6877002 | Prince | Apr 2005 | B2 |
20010054042 | Watkins et al. | Dec 2001 | A1 |
20020040442 | Ishidera | Apr 2002 | A1 |
20020049738 | Epstein | Apr 2002 | A1 |
20020169771 | Melmon et al. | Nov 2002 | A1 |
20020184195 | Qian | Dec 2002 | A1 |
20020184196 | Lehmeier et al. | Dec 2002 | A1 |
20030018622 | Chau | Jan 2003 | A1 |
20030084087 | Berry | May 2003 | A1 |
20030088547 | Hammond | May 2003 | A1 |
20030088567 | Rosenfelt et al. | May 2003 | A1 |
20030088573 | Stickler | May 2003 | A1 |
20030100999 | Markowitz | May 2003 | A1 |
20030117907 | Kang | Jun 2003 | A1 |
20030122873 | Dieberger et al. | Jul 2003 | A1 |
20030122874 | Dieberger et al. | Jul 2003 | A1 |
20030135828 | Dockter et al. | Jul 2003 | A1 |
20030135840 | Szabo et al. | Jul 2003 | A1 |
20030140035 | Burrows | Jul 2003 | A1 |
20030144990 | Benelisha et al. | Jul 2003 | A1 |
20030158855 | Farnham et al. | Aug 2003 | A1 |
20030196094 | Hillis et al. | Oct 2003 | A1 |
20030200218 | Tijare et al. | Oct 2003 | A1 |
20030200234 | Koppich et al. | Oct 2003 | A1 |
20040177319 | Horn | Sep 2004 | A1 |
20040244039 | Sugahara et al. | Dec 2004 | A1 |
20050004911 | Goldberg et al. | Jan 2005 | A1 |
20060004739 | Anthony et al. | Jan 2006 | A1 |
20060031216 | Semple et al. | Feb 2006 | A1 |
Number | Date | Country |
---|---|---|
WO 0146870 | Jun 2001 | WO |
WO 03060774 | Jul 2003 | WO |
WO 03090056 | Oct 2003 | WO |
Entry |
---|
U.S. Appl. No. 10/407,853, filed Apr. 4, 2003, titled “Method and Apparatus for Tagging and Locating Audio Data”, by inventor Nikhil Bhatt,51 pages (specification and drawings). |
U.S. Appl. No. 10/461,642, filed Jun. 13, 2003, titled “Domain Specific Search Engine”, by inventor Nikhil Bhatt, 72 pages (specification and drawings). |
U.S. Appl. No. 10/873,661, filed Jun. 21, 2004, titled “Methods and Apparatuses for Operating a Data Processing System,” by inventors Bas Ording and Donald Lindsay, 91 pages (specification and drawings). |
Baeza-Yates R.A., et al., “New Approaches to Information Management: Attribute-Centric Data Systems”, String Processing and Information Retrieval, 2000. Spire 2000. Proceedings. Seventh International Symposium on Sep. 27-29, 2000, Piscataway, NJ, USA, IEEE, Sep. 27, 2000, pp. 17-27, XP010517584. |
Bowman, C.M., “A File for Information Management”, Proceedings of the ISMM International Conference. Intelligent Information Management Systems. Proceedings of ISMM Symposium, Jun. 1, 1994, pp. 66-71, XP002224131. |
Caban, Pipo, et al., “Improved Downloading Capabilities for Internet Browsers and File Management Capabilities for Windows Explorer to Minimize User Intervention”, Research Disclosure, Mason Publications, Hampshire, GB, vol. 473, No. 23, Sep. 2003, 3 pages. |
Giampaolo, Dominic, “Practical File System Design with the Be File System”, 1999 Cover Page and Table of Contents (p. 1-7), Chapters 4&5 (pp. 45-98) and Index (pp. 225-247), Morgan Kaufmann Publishers, San Francisco, CA. |
Grimes, Richard, “Revolutionary File Storage System Lets Users Search and Manage Files Based on Content”, MSDN Magazine, Jan. 2004, ( 6 pages). |
Welch, Kevin P., “Find Files Under Presentation Manager and Windows With a Handy Utility”, Microsoft Systems Journal, vol. 4, No. 5, Sep. 1989, pp. 61-68. |
Number | Date | Country | |
---|---|---|---|
20050289106 A1 | Dec 2005 | US |
Number | Date | Country | |
---|---|---|---|
60643087 | Jan 2005 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 10877584 | Jun 2004 | US |
Child | 11112165 | US |