The present invention relates generally to the automatic filing of data. More particularly, this invention relates to automatically filing data based on one or more criteria.
A portion of the disclosure of this patent document contains material that is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever. The following notice applies to the software and data as described below and in the drawings hereto: Copyright © 2004, Apple Computer, Inc., All Rights Reserved.
As the use of computers and the Internet have proliferated, so too has the use of electronic mail (email). Many businesses, advertisers, organizations, and individuals use email as a prominent or even a primary means of communication. Since email is so easy and inexpensive to send compared to other means of communications such as letters, faxes, and telephone calls, the result is a deluge of email, both solicited and unsolicited, received by email users. One method to deal with all this email is for the user to manually select and file the email in destinations, such as boxes, files, databases, folders, subfolders, directories, subdirectories, or other containers.
Although the problem of received email is particularly acute, users also have a need to sort and file all types of data, whether this data is email attachments, faxes, telephone calls, downloaded programs and data, audio, video, scanned images, photographs, instant messages, files, databases, records, blocks of text or any other type of data.
Because users receive so much data, they often experience difficulty in determining the appropriate destination and in remembering which destinations are available. For example, it is common for users to have multiple folders that store very similar data (e.g. folders named “photos,” “myphotos,” and “pictures”). When a new piece of data is received, the users cannot remember that they already have an appropriate folder in which to store the data, so they create a new, superfluous folder.
What is needed is a technique to help users file their data in an appropriate destination.
A method, apparatus, and signal-bearing medium are provided that files data in a destination or assigns metadata indicating a destination for the data based on a criteria. In various embodiments, the data may be email, email attachments, faxes, telephone messages, downloaded data or programs, audio, video, scanned images, photographs, instant messages, files, databases, records, blocks of text, or other data. In an embodiment, a training mode and an automatic mode are provided. During the training mode, a user is presented with data and a recommended destination, and the user provides feedback that is used to train the one or more criteria. During an automatic mode, data is transferred to the destination.
In the following detailed description of exemplary embodiments of the invention, reference is made to the accompanying drawings (where like numbers represent like elements), which form a part hereof, and in which is shown by way of illustration specific exemplary embodiments in which the invention may be practiced. These embodiments are described in sufficient detail to enable those skilled in the art to practice the invention, but other embodiments may be utilized and logical, mechanical, electrical, and other changes may be made without departing from the scope of the present invention. The following detailed description is, therefore, not to be taken in a limiting sense, and the scope of the present invention is defined only by the appended claims.
In the following description, numerous specific details are set forth to provide a thorough understanding of the invention. However, it is understood that the invention may be practiced without these specific details. In other instances, well-known circuits, structures and techniques have not been shown in detail in order not to obscure the invention.
Parts of the description may be presented in terms of operations performed through the execution of programming instructions. As well understood by those skilled in the art, these operations may take the form of electrical, magnetic, or optical signals capable of being stored, transferred, combined, and otherwise manipulated through, for example, electrical components.
As shown in the user interface 100 for an inbox 101, during a training mode, the mail system has analyzed the contents 102 of email 104 and determined that a recommended destination for the email 104 is box2106. The mail system determines the recommended destination based on one or more criteria, such as filters as further described below with reference to
By selecting the “file it and train” button 110, the user gives positive feedback by confirming that the mail system's recommended destination is correct and requests that the mail system transfer, move, or file the email 104 in box2106. The mail system uses this positive confirmation of its recommended destination to train one or more criteria, such as a filter that is used to make future destination recommendations. By selecting the “file it and don't train” button 111, the user requests that the mail system transfer, move, or file the email 104 in box2106, but the criteria are not to be trained. By selecting the “don't file it and train” button 112, the user gives negative feedback and rejects the mail system's recommendation, which trains the filter that the recommendation was not correct. By selecting the “don't file it and don't train” button 113, the user rejects the mail system's recommendation, and the email is not transferred, but the filter is not trained.
Although the confirmation is shown in
The user may also request that any mail be transferred to a destination even in the absence of a recommendation via “file it” button 114. In an embodiment, the “file it” button 114 may also serve for the user to request that the mail system file the selected email in the destination in which an email was most recently filed.
Although the user interface 100 shown in
In another embodiment, when the user drags or otherwise moves the email 104, for example via a pointer 116 and a pointing device, the mail system represents the mail 104 as an icon 118 and presents a recommendation of a destination in the form of indication 120. Although the indication 120 is shown in
When the user releases, drops, or stops dragging the icon 118, the mail system automatically transfers, moves, or files the email 104 in the recommended destination 106. In another embodiment, the mail system only files the email 104 if the user has moved the icon 118 past a specified distance, point, or threshold. For example, in an embodiment, the mail system only automatically files the email in the recommended destination if the drag of the icon has departed the mail system window 122. In another embodiment, the mail system files the email 104 in the recommended destination 106 when the user issues a command, which in an embodiment may be a keyboard key or combination of keys, a pointing-device button, a voice command, or any other appropriate type of command.
Although
In another embodiment the list view is not expanded, but instead the path name of the recommended child is presented. In another embodiment, a folder is sprung open to show subfolders. In yet another embodiment, only destinations visible to the user are recommended, and the user decides which child within the visible destination should be the final destination. In still another embodiment, one or more recommended destinations or metadata attributes are displayed in a pull down message located on or near the message.
In an embodiment, the mail system analyzes all available destinations in order to make a recommendation. In another embodiment, the mail system analyzes a subset of all available destinations where the subset is selected based on the recency (most recent use) of use of the destinations, the frequency of use of the destinations, a combination of recency and frequency, or another other appropriate technique for choosing a subset of the possible destinations. In still another embodiment, the subset of all available destinations or metadata attributes is manually configured by a user.
In one embodiment the system recommends a destination based on metadata about the file itself, or based on actual content in the file, or based on a combination of both. Metadata about the file includes any information about the file. Examples of metadata about the file include, but are not limited to date, time, sender, file type and so on. The actual content of the file can be used to recommend a destination using a variety of techniques for analyzing content of data items. One example technique is latent symantic analysis as described in application Ser. No. 09/881,986, filed Jun. 14, 2001. However, any other suitable techniques may be used to recommend a destination as well.
The mail system also provides an “undo” button 170 to allow the user to request that the mail system undo the most recent transfer of email to a destination and return the email to its original inbox 101.
The mail system provides the menu item “automatic” 188 to allow the user to request that the data filing system be set in automatic mode. While in automatic mode, the mail system automatically categorizes mail to destinations and transfers the mail to those destinations. The automatic mode is further described below with reference to
The mail system provides the menu item “reset training” 190 to allow the user to request that the data filing system be reset to the factory settings. The “reset training” option 190 causes the data filing system to undo all of the previous training. In an embodiment, the training data may be deleted, but in another embodiment, the training data may be kept but ignored.
Embodiments of the invention are not limited to combination of filing options presented in
Some embodiments learn how and where to file items for each user of a system that is used by multiple users. In one such embodiment, a system is configured for two users and recommends storage locations based on the teachings of only that user. Some other embodiments use the teachings of all users on a system. Yet some further embodiments where the present invention is implemented for some or all the users of a network, the teachings of all users on the network are used. In another embodiment, only the teachings of a group of users on the network are used.
In still other embodiments, user interfaces for filing data such as the user interfaces shown in
Control then continues to block 210 where the data is analyzed and a recommended destination for the data is determined. The data may be analyzed based on one or more criteria by comparing the data to existing destinations and determining the destination that is the closest to the data using latent semantic analysis, as described in application Ser. No. 09/881,986, filed Jun. 14, 2001. In other embodiments a vector space or any other appropriate technique for determining the destination of the data may be used.
Control then continues to block 215 where a determination is made whether the system is currently in automatic mode. If the determination at block 215 is true, control continues to block 220 where the data is presented either visually, aurally, or via any other appropriate mechanism. Control then continues to block 225 where the data is transferred, moved, or filed to the destination determined in block 210. In another embodiment, block 220 is optional and control continues directly from block 215 to block 225. In another embodiment, control continues from block 220 to block 225 only after the user has taken an action, such as indicating that the user is done reading, viewing, or listening to the data. In another embodiment, the data is not transferred to the destination until a period of time has elapsed. Control then returns from block 225 to block 205 as previously described above.
If the determination at block 215 is false, then control continues to block 230 where the data is processed in a training mode, as further described below with respect to
Control then continues to block 305 where a selection of the data is detected as previously described above with reference to
If the feedback is negative, so that the recommended destination is not confirmed at block 315, then control continues to block 330 where the criteria or filing filters are optionally trained based on the negative feedback. The criteria or filing filters may be trained or customized as described in application Ser. No. 09/881,986, filed Jun. 14, 2001. Control then continues to block 399 where the function returns.
If the determination at block 415 is false, then control continues to block 430 where the data is processed in a training mode, as further described below with respect to
If the feedback is negative, so that the recommended metadata is not confirmed at block 515, then control continues to block 530 where the criteria or filing filters are optionally trained based on the negative feedback. The criteria or filing filters may be trained or customized as described in application Ser. No. 09/881,986, filed Jun. 14, 2001. Control then continues to block 599 where the function returns.
One method training that is described in application Ser. No. 09/881,986, filed Jun. 14, 2001 is latent semantic analysis (LSA). Latent semantic analysis is a method that automatically uncovers the salient semantic relationships between words and documents in a given corpus. Discrete words are mapped onto a continuous semantic vector space, in which clustering techniques may be applied. Although this example is described by reference to email messages, latent semantic analysis can be applied to train filters for any types of data. This method for filtering data comprises determining a first semantic anchor corresponding with a first group of email messages, for example, legitimate email messages and a second semantic anchor corresponding with a second group of email messages, for example, unsolicited email messages. Determining a vector corresponding with an incoming email message, comparing the vector with at least one of the first semantic anchor and the second semantic anchor to obtain at least one comparison value, and filtering messages based on the comparison value.
An embodiment of the invention may utilize a distributed computing environment, in which program modules may be physically located in different local and remote memory storage devices. Execution of the program modules may occur locally in a stand-alone manner or remotely in a client/server manner. Examples of such distributed computing environments include local area networks, enterprise-wide networks, and the Internet.
Illustrated are a server 601 connected to a computer 602 via a network 610. Although one server 601, one computer 602, and one network 610 are shown, in other embodiments any number or combination of them may be present. Although the server 601 and the network 610 are shown, in another embodiment they may not be present.
The computer 602 may include a processor 630, a storage device 635, an input device 637, and an output device 640, all connected via a bus 680.
The processor 630 may represent a central processing unit of any type of architecture, such as a CISC (Complex Instruction Set Computing), RISC (Reduced Instruction Set Computing), VLIW (Very Long Instruction Word), or a hybrid architecture, although any appropriate processor may be used. The processor 630 may execute instructions and may include that portion of the computer 602 that controls the operation of the entire computer. Although not depicted in
Although the computer 602 is shown to contain only a single processor 630 and a single bus 680, the present invention applies equally to computers that may have multiple processors and to computers that may have multiple buses with some or all performing different functions in different ways.
The storage device 635 represents one or more mechanisms for storing data. For example, the storage device 635 may include read only memory (ROM), random access memory (RAM), magnetic disk storage media, optical storage media, flash memory devices, and/or other machine-readable media. In other embodiments, any appropriate type of storage device may be used. Although only one storage device 635 is shown, multiple storage devices and multiple types of storage devices may be present. Further, although the computer 602 is drawn to contain the storage device 635, it may be distributed across other computers, for example on server 601.
The storage device 635 includes a controller 698, which in an embodiment may include instructions capable of being executed on the processor 630 to carry out the functions of the present invention, as previously described above with reference to
The storage device 635 also contains one or more criteria 699, such as filing filters, which in an embodiment may be a database of destinations and associated data that uses latent semantic content analysis to correlate the data to the database, as described in application Ser. No. 09/881,986, filed Jun. 14, 2001. The filing filter may be customized or trained as described in application Ser. No. 09/881,986, filed Jun. 14, 2001. In other embodiments, a vector space or any other appropriate technique may be used. Of course, the storage device 635 may also contain additional software and/or data (not shown).
The input device 637 may be a keyboard, pointing device, mouse, trackball, touchpad, touchscreen, keypad, microphone, voice recognition device, or any other appropriate mechanism for the user to input data to the computer 602. Although only one input device 637 is shown, in another embodiment any number and type of input devices may be present.
The output device 640 is that part of the computer 602 that communicates output to the user. The output device 640 may be a cathode-ray tube (CRT) based video display well known in the art of computer hardware. But, in other embodiments the output device 640 may be replaced with a liquid crystal display (LCD) based or gas, plasma-based, flat-panel display. In another embodiment, the output device 640 may be a speaker. In still other embodiments, any appropriate output device suitable for presenting data may be used. Although only one output device 640 is shown, in other embodiments, any number of output devices of different types or of the same type may be present.
The bus 680 may represent one or more busses, e.g., PCI, ISA (Industry Standard Architecture), X-Bus, EISA (Extended Industry Standard Architecture), or any other appropriate bus and/or bridge (also called a bus controller).
The computer 602 may be implemented using any suitable hardware and/or software, such as a personal computer or other electronic computing device. Portable computers, laptop or notebook computers, PDAs (Personal Digital Assistants), two-way alphanumeric pagers, keypads, portable telephones, appliances with a computing unit, pocket computers, and mainframe computers are examples of other possible configurations of the computer 602. The hardware and software depicted in
The network 610 may be any suitable network and may support any appropriate protocol suitable for communication between the server 601 and the computer 602. In an embodiment, the network 610 may support wireless communications. In another embodiment, the network 610 may support hard-wired communications, such as a telephone line or cable. In another embodiment, the network 610 may support the Ethernet IEEE (Institute of Electrical and Electronics Engineers) 802.3x specification. In another embodiment, the network 610 may be the Internet and may support IP (Internet Protocol). In another embodiment, the network 610 may be a local area network (LAN) or a wide area network (WAN). In another embodiment, the network 610 may be a hotspot service provider network. In another embodiment, the network 610 may be an intranet. In another embodiment, the network 610 may be a GPRS (General Packet Radio Service) network. In another embodiment, the network 610 may be any appropriate cellular data network or cell-based radio network technology. In another embodiment, the network 610 may be an IEEE 802.11B wireless network. In still another embodiment, the network 610 may be any suitable network or combination of networks. Although one network 610 is shown, in other embodiments any number of networks (of the same or different types) may be present.
As was described in detail above, aspects of an embodiment pertain to specific apparatus and method elements implementable on a computer or other electronic device. In another embodiment, the invention may be implemented as a program product for use with an electronic device. The programs defining the functions of this embodiment may be delivered to an electronic device via a variety of signal-bearing media, which include, but are not limited to:
(1) information permanently stored on a non-rewriteable storage medium, e.g., a read-only memory device attached to or within an electronic device, such as a CD-ROM readable by a CD-ROM drive;
(2) alterable information stored on a rewriteable storage medium, e.g., a hard disk drive or diskette; or
(3) information conveyed to an electronic device by a communications medium, such as through a computer or a telephone network, including wireless communications.
Such signal-bearing media, when carrying machine-readable instructions that direct the functions of the present invention, represent embodiments of the present invention.
It is emphasized that the Abstract is provided to comply with 37 C.F.R. §1.72(b) requiring an Abstract that will allow the reader to quickly ascertain the nature and gist of the technical disclosure. It is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims.
In the foregoing Detailed Description, various features are grouped together in a single embodiment for the purpose of streamlining the disclosure. This method of disclosure is not to be interpreted as reflecting an intention that the claimed embodiments of the invention require more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive subject matter lies in less than all features of a single disclosed embodiment. Thus the following claims are hereby incorporated into the Detailed Description, with each claim standing on its own as a separate preferred embodiment.
It will be readily understood to those skilled in the art that various other changes in the details, material, and arrangements of the parts and method stages which have been described and illustrated in order to explain the nature of this invention may be made without departing from the principles and scope of the invention as expressed in the subjoined claims.
This application is a continuation-in-part of and claims priority to application Ser. No. 09/881,986, filed Jun. 14, 2001, now U.S. Pat. No. 7,076,527 which is hereby incorporated by reference. This application also claims priority as a continuation-in-part to application Ser. No. 10/213,922, filed Aug. 6, 2002, now abandoned which is hereby incorporated by reference. In addition, this application claims priority as a continuation-in-part to application Ser. No. 10/430,694, filed May 5, 2003 now U.S. Pat. No. 7,640,305, which is hereby incorporated by reference.
Number | Name | Date | Kind |
---|---|---|---|
5375244 | McNair | Dec 1994 | A |
5377354 | Scannell et al. | Dec 1994 | A |
5613108 | Morikawa | Mar 1997 | A |
5751287 | Hahn et al. | May 1998 | A |
5877963 | Leung et al. | Mar 1999 | A |
5884033 | Duvall et al. | Mar 1999 | A |
5999932 | Paul | Dec 1999 | A |
6023723 | McCormick et al. | Feb 2000 | A |
6101515 | Wical et al. | Aug 2000 | A |
6161130 | Horvitz et al. | Dec 2000 | A |
6243724 | Mander et al. | Jun 2001 | B1 |
6249805 | Fleming, III | Jun 2001 | B1 |
6421709 | McCormick et al. | Jul 2002 | B1 |
6474008 | Hofman et al. | Nov 2002 | B1 |
6556983 | Altschuler et al. | Apr 2003 | B1 |
6578025 | Pollack et al. | Jun 2003 | B1 |
6592627 | Agrawal et al. | Jul 2003 | B1 |
6654787 | Aronson et al. | Nov 2003 | B1 |
6732149 | Kephart | May 2004 | B1 |
6807566 | Bates et al. | Oct 2004 | B1 |
6816885 | Raghunandan | Nov 2004 | B1 |
6901398 | Horvitz et al. | May 2005 | B1 |
6941466 | Mastrianni | Sep 2005 | B2 |
7058684 | Ueda | Jun 2006 | B1 |
7149778 | Patel et al. | Dec 2006 | B1 |
7158986 | Oliver et al. | Jan 2007 | B1 |
7219148 | Rounthwaite et al. | May 2007 | B2 |
7219302 | O'Shaughnessy et al. | May 2007 | B1 |
7249162 | Rounthwaite et al. | Jul 2007 | B2 |
7284033 | Jhanji | Oct 2007 | B2 |
7299261 | Oliver et al. | Nov 2007 | B1 |
7634736 | Meek | Dec 2009 | B2 |
7640305 | Arthur et al. | Dec 2009 | B1 |
20020016735 | Runge et al. | Feb 2002 | A1 |
20020107925 | Goldschneider et al. | Aug 2002 | A1 |
20020188689 | Michael | Dec 2002 | A1 |
20030046421 | Horvitz et al. | Mar 2003 | A1 |
20030187937 | Yao et al. | Oct 2003 | A1 |
20040025037 | Hair | Feb 2004 | A1 |
20040133434 | Szlam et al. | Jul 2004 | A1 |
20050015451 | Sheldon et al. | Jan 2005 | A1 |
20060026236 | Scian et al. | Feb 2006 | A1 |
20070208856 | Rounthwaite et al. | Sep 2007 | A1 |
Number | Date | Country |
---|---|---|
747846 | Dec 1996 | EP |
Number | Date | Country | |
---|---|---|---|
Parent | 10430694 | May 2003 | US |
Child | 10836472 | US | |
Parent | 10213922 | Aug 2002 | US |
Child | 10430694 | US | |
Parent | 09881986 | Jun 2001 | US |
Child | 10213922 | US |