Data tags are becoming a standard convention for labeling data from feeds. They appear on web sites and in other sources, and much of the tagging of data from such sites is actually performed by users of the sites, who end up “donating” the information while signing up for the tagging services. However, different data sets from different sources (i.e., different web sites) may follow different agreements or conventions about tag naming. When a person receives data feeds from a variety of sources, the different data sets may carry a disparate variety of tags that do not identify the data types consistently.
What is clearly needed is a system and method that can help automatically build a knowledge base for enriching feeds that come from different sources and that have tags of different conventions, by deducing which tags go into various categories of knowledge.
As it is possible to associate tags, it may also be useful and interesting to know how the items to which the tags are attached are related to one another.
What is clearly further needed is a system and method for deducing the relationship between objects, based on the relationships between their tags.
In one embodiment, a knowledge base is automatically built for enriching feeds that come from different sources and that have tags of different conventions, by deducing which tags go into various categories of knowledge. In one embodiment, a relationship between objects is determined based on the relationships between their tags.
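By way of illustration only, the two embodiments summarized above might be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed method: the seed mapping `SEED`, the co-occurrence voting heuristic, the Jaccard overlap used as a relatedness score, and the function names `normalize`, `build_knowledge_base`, and `relatedness` are all hypothetical and do not appear in the specification.

```python
from collections import defaultdict

# Hypothetical seed: a few tags whose category is already known.
SEED = {"baseball": "sports", "mlb": "sports", "nyc": "location"}


def normalize(tag):
    """Lower-case a tag and unify separators so source conventions compare equal."""
    return tag.strip().lower().replace("_", "-").replace(" ", "-")


def build_knowledge_base(feeds, seed=SEED):
    """Map every categorizable tag seen in the feeds to a category.

    feeds is an iterable of (object_name, tags) pairs from any source.
    Heuristic (an assumption of this sketch): an unknown tag takes the
    category that most often co-occurs with it on the same objects.
    """
    kb = {normalize(tag): category for tag, category in seed.items()}
    votes = defaultdict(lambda: defaultdict(int))
    for _name, tags in feeds:
        norm = {normalize(t) for t in tags}
        known_categories = {kb[t] for t in norm if t in kb}
        for t in norm:
            if t not in kb:
                for category in known_categories:
                    votes[t][category] += 1
    for t, counts in votes.items():
        # sorted() makes ties resolve alphabetically, so results are repeatable.
        kb[t] = max(sorted(counts), key=counts.get)
    return kb


def relatedness(tags_a, tags_b, kb):
    """Score two objects by the Jaccard overlap of their tags' categories."""
    cats_a = {kb[normalize(t)] for t in tags_a if normalize(t) in kb}
    cats_b = {kb[normalize(t)] for t in tags_b if normalize(t) in kb}
    union = cats_a | cats_b
    return len(cats_a & cats_b) / len(union) if union else 0.0
```

In this sketch, a tag that never appears alongside an already categorized tag is simply left out of the knowledge base, and two objects whose tags reach no category at all receive a score of 0.0. Other embodiments could use different normalization rules, seed data, or similarity measures.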
In the following detailed description of embodiments of the invention, reference is made to the accompanying drawings in which like references indicate similar elements, and in which is shown by way of illustration specific embodiments in which the invention may be practiced. These embodiments are described in sufficient detail to enable those skilled in the art to practice the invention, and it is to be understood that other embodiments may be utilized and that logical, mechanical, electrical, functional, and other changes may be made without departing from the scope of the present invention. The following detailed description is, therefore, not to be taken in a limiting sense, and the scope of the present invention is defined only by the appended claims.
In some cases, it can also be useful for enriching data sources that do not have existing tags. For instance, tags from one data source could be applied to the same object from another data source. If an object such as “Yankees” (as in the sports team) is received from a data source that is rich in tags, those tags may be used to enrich “Yankees” from another source that has no tags. Enriching one data set with the tags of another allows, for example, the tag taxonomies to be merged or one to be grafted onto the other.
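The “Yankees” example above can be sketched as follows. This is a minimal illustration under stated assumptions: the function name `enrich`, the dictionary layout (object name mapped to a list of tags), and the matching of objects by case-insensitive name are hypothetical choices made for the sketch and are not specified in this description.

```python
def enrich(rich_source, sparse_source):
    """Copy tags onto same-named objects that have no tags of their own.

    Both arguments map an object name to a list of tags. Objects are
    matched by case-insensitive name, which is an assumption of this
    sketch; objects that already carry tags are left unchanged.
    """
    rich_by_key = {
        name.strip().lower(): tags for name, tags in rich_source.items()
    }
    enriched = {}
    for name, tags in sparse_source.items():
        if tags:
            enriched[name] = list(tags)
        else:
            # Fall back to an empty list when no matching object exists.
            enriched[name] = list(rich_by_key.get(name.strip().lower(), []))
    return enriched
```

Matching by name alone can conflate distinct objects that share a name, so a fuller embodiment would likely compare additional attributes before applying tags from one source to another.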
At least some embodiments, and the different structure and functional elements described herein, can be implemented using hardware, firmware, programs of instruction, or combinations of hardware, firmware, and programs of instructions.
In general, routines executed to implement the embodiments can be implemented as part of an operating system or a specific application, component, program, object, module or sequence of instructions referred to as “computer programs.” The computer programs typically comprise one or more instructions set at various times in various memory and storage devices in a computer, and that, when read and executed by one or more processors in a computer, cause the computer to perform operations to execute elements involving the various aspects.
While some embodiments have been described in the context of fully functioning computers and computer systems, those skilled in the art will appreciate that various embodiments are capable of being distributed as a program product in a variety of forms and are capable of being applied regardless of the particular type of machine or computer-readable media used to actually effect the distribution.
Examples of computer-readable media include but are not limited to recordable and non-recordable type media such as volatile and non-volatile memory devices, read only memory (ROM), random access memory (RAM), flash memory devices, floppy and other removable disks, magnetic disk storage media, and optical storage media (e.g., Compact Disk Read-Only Memories (CD-ROMs), Digital Versatile Disks (DVDs), etc.), among others. The instructions can be embodied in digital and analog communication links for electrical, optical, acoustical or other forms of propagated signals, such as carrier waves, infrared signals, digital signals, etc.
A machine readable medium can be used to store software and data which, when executed by a data processing system, cause the system to perform various methods. The executable software and data can be stored in various places including, for example, ROM, volatile RAM, non-volatile memory and/or cache. Portions of this software and/or data can be stored in any one of these storage devices.
In general, a machine readable medium includes any mechanism that provides (i.e., stores and/or transmits) information in a form accessible by a machine (e.g., a computer, network device, personal digital assistant, manufacturing tool, any device with a set of one or more processors, etc.).
Some aspects can be embodied, at least in part, in software. That is, the techniques can be carried out in a computer system or other data processing system in response to its processor, such as a microprocessor, executing sequences of instructions contained in a memory, such as ROM, volatile RAM, non-volatile memory, cache, magnetic and optical disks, or a remote storage device. Further, the instructions can be downloaded into a computing device over a data network in the form of a compiled and linked version.
Alternatively, the logic to perform the processes as discussed above could be implemented in additional computer and/or machine readable media, such as discrete hardware components, for example large-scale integrated circuits (LSIs) or application-specific integrated circuits (ASICs), or firmware such as electrically erasable programmable read-only memories (EEPROMs).
In various embodiments, hardwired circuitry can be used in combination with software instructions to implement the embodiments. Thus, the techniques are not limited to any specific combination of hardware circuitry and software nor to any particular source for the instructions executed by the data processing system.
In this description, various functions and operations are described as being performed by or caused by software code to simplify description. However, those skilled in the art will recognize what is meant by such expressions is that the functions result from execution of the code by a processor, such as a microprocessor.
Although some of the drawings illustrate a number of operations in a particular order, operations which are not order dependent can be reordered and other operations can be combined or broken out. While some reorderings or other groupings are specifically mentioned, others will be apparent to those of ordinary skill in the art, and so the ones mentioned do not present an exhaustive list of alternatives. Moreover, it should be recognized that the stages could be implemented in hardware, firmware, software or any combination thereof.
In the foregoing specification, the disclosure has been described with reference to specific exemplary embodiments thereof. It will be evident that various modifications can be made thereto without departing from the broader spirit and scope of the invention as set forth in the following claims. The specification and drawings are, accordingly, to be regarded in an illustrative sense rather than a restrictive sense.
Number | Date | Country |
---|---|---|
20080005134 A1 | Jan 2008 | US |