Nearly everyone reads the news. Most readers obtain their news from major news publisher websites, such as USA TODAY, CNN, ABC, BBC, and FOX NEWS. However, in today's 24-hour news cycle, news sources chase the latest headlines. News publishers, in other words, focus on breaking news and nearly ignore historic details.
The features, aspects, and advantages of the exemplary embodiments are understood when the following Detailed Description is read with reference to the accompanying drawings, wherein:
The exemplary embodiments will now be described more fully hereinafter with reference to the accompanying drawings. The exemplary embodiments may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. These embodiments are provided so that this disclosure will be thorough and complete and will fully convey the exemplary embodiments to those of ordinary skill in the art. Moreover, all statements herein reciting embodiments, as well as specific examples thereof, are intended to encompass both structural and functional equivalents thereof. Additionally, it is intended that such equivalents include both currently known equivalents as well as equivalents developed in the future (i.e., any elements developed that perform the same function, regardless of structure).
Thus, for example, it will be appreciated by those of ordinary skill in the art that the diagrams, schematics, illustrations, and the like represent conceptual views or processes illustrating the exemplary embodiments. The functions of the various elements shown in the figures may be provided through the use of dedicated hardware as well as hardware capable of executing associated software. Those of ordinary skill in the art further understand that the exemplary hardware, software, processes, methods, and/or operating systems described herein are for illustrative purposes and, thus, are not intended to be limited to any particular named manufacturer.
As used herein, the singular forms “a,” “an,” and “the” are intended to include the plural forms as well, unless expressly stated otherwise. It will be further understood that the terms “includes,” “comprises,” “including,” and/or “comprising,” when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. It will be understood that when an element is referred to as being “connected” or “coupled” to another element, it can be directly connected or coupled to the other element or intervening elements may be present. Furthermore, “connected” or “coupled” as used herein may include wirelessly connected or coupled. As used herein, the term “and/or” includes any and all combinations of one or more of the associated listed items.
It will also be understood that, although the terms first, second, etc. may be used herein to describe various elements, these elements should not be limited by these terms. These terms are only used to distinguish one element from another. For example, a first device could be termed a second device, and, similarly, a second device could be termed a first device without departing from the teachings of the disclosure.
Here, though, exemplary embodiments may historically arrange search results. As the server 22 generates the listing 40 of search results, the server 20 may historically arrange the search results. That is, the server 22 may arrange the listing 40 of search results in an historical arrangement 46. When the user's smartphone 26 processes the listing 40 of search results, the search results are displayed in the historical arrangement 46. Exemplary embodiments, for example, may chronologically arrange the listing 40 of search results. A chronological arrangement allows the reading user to quickly delve into historical articles and details for a much quicker historical context. However, the historical arrangement 46 may arrange the listing 40 of search results according to sequential position, scholarly contribution, intellectual advancement, or any other criterion, as later paragraphs will explain.
Exemplary embodiments are thus an intellectual catch up mechanism. When the user queries for any subject matter, exemplary embodiments may present the historical arrangement 46 of the search results. The search results are thus displayed for historical background, allowing the user to probe backwards in the news cycle for past articles, blogs, websites, or other entries. While
Now that exemplary embodiments have been simply described,
The database 28 of content is thus a corpus of news collected over time. At first the database 28 of content may start small with only a few weeks or months of articles. Over time, though, as more and more data is downloaded, the database 28 of content grows. Eventually the database 28 of content becomes a comprehensive repository of new and historical articles. As
As
The database 28 of content quickly grows. As each single article may be compared to every other article in the database 28 of content, the size of the database 28 of content grows exponentially with the number of articles in the database. Exemplary embodiments may thus use distributed computing to spread the computation across multiple server machines. For example, a computational technique may use a map reduce approach whereby the computation is distributed to a number of other computers (e.g., 20), and the individual results are received and aggregated into a final result. This distributed computation may be performed using a central processing unit (CPU) of each respective computer. As another example, one or more graphic processing units (or GPUs), on a single or on the multiple computers, may be tasked with some or all of the computations. This GPU-approach works well for a finished product because of the small inputs (number of distinct articles), large number of computations to do on that data set (all pairs comparisons) and the small number of outputs (mutually exclusive grouping of articles).
Exemplary embodiments may include crowd-sourced comparisons. Once the features of an individual article are determined, exemplary embodiments may gather some or all other similar articles accessible from the Internet or other source. The classifier 118 may thus be trained with reference to crowd-sourcing data or inputs. Exemplary embodiments may thus use the distributed computing infrastructure to accomplish the similarity comparison in near-real time. A current implementation of the classifier 118 determines about twenty (20) different features for each article, using grammatical and/or non-grammatical combinations. For example, the classifier 118 may inspect the text 116 for noun head phrases and/or verbs. Moreover, the classifier 118 may inspect the text 116 for any non-grammatical combinations, such as a bag of words approach where all words are treated equally. Exemplary embodiments may use a statistical distribution of the values of the features themselves over the entire dataset as part of the criteria for the features, rather than just the values of the features compared to a threshold. Many existing approaches simply use a threshold value for determining what a cutoff value for a particular feature should be. Instead, exemplary embodiments may assign values to the features in statistical terms. For example, rather than simply using a term frequency count, weighted based on its uniqueness of the corpus, exemplary embodiment may further weight this feature based on how many standard deviations it is away from the mean value. By including these derivative features, the classifier 118 is more robust, thus generating varying levels of similarity as well as changes to the nature of the dataset.
Crowd sourcing is also scalable. Conventional machine learning classification systems tend to use either statistical analysis or manual annotation to mark ground truth. However, these conventional schemes only work with large amounts of data. Moreover, other conventional schemes use manual annotation by domain experts. To ensure consistency, the number of experts is typically kept small, but with the obvious scalability and expense issues. Here, though, exemplary embodiments are scalable, both in terms of the number of inputs that can be accommodated (e.g., pairs of articles) but also the levels of output to be mapped (e.g., different levels of the similarity 120). As a simple example, suppose there are five different scores or votes of the similarity 120. A vote of “0” or “1” implies two articles are “not related,” while votes of “2” through “4” may imply varying levels of “related.” A vote of “5” would mean the two articles have the same topic, thus meaning a strong relation. In actual practice, though, there may be many different levels of the similarity 120, thus allowing users to map a large number of articles to perhaps even thousands of varying levels of the similarity 120, depending on how many votes that particular comparison received.
This disclosure now augments the explanation with reference to
The timeline 60 of events may be further configurable. For example, if the number of articles shown is less than the total number of related articles in the database 28 of content, a metric can be used to determine which subset of articles are shown. One metric may sequentially add articles that are classified as “less similar” or even “least similar” to the current group of shown articles. This metric allows construction of a comprehensive set of articles that are both different from one another, but still pertinent to the original article. Another metric may display only a subset of articles that pertain to the user. The metric, in other words, may display links to related articles 132 not yet selected by the user for reading, and/or articles that have been published since the last time the user read about the same event subject matter. Regardless, by selecting any website link the smartphone 26 queries for and retrieves the full text of the article.
The historical arrangement 46 may have different criteria. This disclosure above explains a chronological arrangement, which will perhaps be best understood by most readers. However, exemplary embodiments may include many other measures of historical arrangement. For example, the listing 40 of search results may be historically arranged according to scholarly contribution and/or intellectual advancement. Many endeavors may be viewed as a series of advances, especially in science and medicine. Some efforts may yield more insight and advancement that other efforts. Indeed, some efforts may prove fruitless or even a setback. Exemplary embodiments may thus arrange the listing 40 of search results according to intellectual progress, perhaps presenting a hierarchical march from outlier vision to current implementation. Exemplary embodiments are thus very helpful for users in the science, medicine, legal, and financial professions where scholarly, intellectual advancements are studied and reviewed.
The listing 40 of search results may also have a sequential component. Some subject matter may be viewed as a sequence of developments, starting with some initial act or event. Indeed, many social events may be traced to a local spark or issue that grows and spreads in influence. Exemplary embodiments may thus arrange the listing 40 of search results solely or at least partially based on sequential steps from an initial event. Exemplary embodiments are thus very helpful for users in the social sciences, engineering, manufacturing, and legal professions where procedures and processes are studied.
Exemplary embodiments are also applicable to advertising efforts. Most readers understand that advertisements accompany Internet content. Indeed, the listing 40 of search results may include sponsored advertisements that are related to search keywords. However, exemplary embodiments may also include the historical arrangement 46 of sponsored advertisements. As many advertisers submit bids for placements of advertisements in the listing 40 of search results, over time the advertisements may change as advertiser-bidders come and go. When exemplary embodiments historically arrange the listing 40 of search results, the entries may also include current and/or historical advertisements and website links associated with the same search term or keyword. The advertising may be historically arranged, thus allowing the user to monitor changes in advertising schemes and the competitive bidding as time passes.
Exemplary embodiments are also applicable to archival scanning of library materials. As this disclosure intimates, any subject matter may be viewed, perhaps with hindsight, to discern important or consequential advances. History, science, and law are just some subject matter that may be reconstructed to generate a sequence or timeline of events. For example, as GOOGLE® and others continually scan library archives, papers and words may be annotated and analyzed for the historical arrangement 46. The database 28 of content may include entries that reflect the historical arrangement 46 of archival materials.
Exemplary embodiments thus present many features. As the database 28 of content may store any data on any subject, users may thus retrieve and display historical arrangements of any keyword subject matter, not just the latest headlines. Indeed, the database 28 of content may be tailored for specific subject matter, such as the medical, legal, and engineering professions above explained. Exemplary embodiments thus also include “tracking” an event of interest. As the database 28 of content adds a new entry for some subject matter, notifications may be sent to the user's smartphone 26. For example, the user may wish to be notified when new articles about some topic are published. Website links to these articles may be sent to the network address or IP address of the smartphone, thus allowing quick retrieval. Icons or other graphical features may differentiate previously read articles from new and/or unread articles.
Exemplary embodiments may include similarity features. Some users may only wish to receive links to highly similar subject matter articles. Other users, though, may be receptive to articles that stray or cross-classifications in subject matter. Exemplary embodiments may thus be configured for different values or measures of the similarity 140, such as graphical controls from “highly similar” events, to “less similar,” and perhaps even “dissimilar.” Indeed, given the very large corpus of entries in the database 28 of content, entries may even be included for obscure, off-topic, or “weird” subjects. As the database 28 of content contains entries for articles organized by the similarity 140, exemplary embodiments may also identify “orphan” news articles that are completely unrelated to any other news events. Links to these orphans may be highlighted for the user's enjoyment or presented in a different application entirely.
Exemplary embodiments are socially integrated. The user may share any historical arrangement 46 with others, such as the network addresses of their social friends and contacts. A sharing feature, for example, generates a link to a web app version. Moreover, the historical arrangement 46 may be posted or shared using social media. One aspect of the news that may be relevant is what famous personalities think of the news (e.g., TWITTER feeds). Social “tweets” and other postings may be presented alongside the historical arrangement 46 to give additional context about the event. Along the same vein, social networks may also incorporate opinions posted by friends and family.
Exemplary embodiments include still more configuration parameters. The user may personalized her categories of interest, thus excluding articles having no interest to her. The user, of course, may specify categories or topics of interest, thus tailoring the types of articles she sees for consumption. Exemplary embodiments may also track the user's selections, dwell/read time, and other behavioral metrics to predict or recommend articles and categories.
Exemplary embodiments are applicable to any computing and software platform. Exemplary embodiments, for example, have been developed for the APPLE IOS environment, but a exemplary embodiments may be applied to any mobile OS, wearable device, standalone desktop/web application or as a plug-in into an existing web application or website.
Exemplary embodiments may be physically embodied on or in a processor-readable device or storage medium. For example, exemplary embodiments may include CD-ROM, DVD, tape, cassette, floppy disk, optical disk, memory card, memory drive, and large-capacity disks.
While the exemplary embodiments have been described with respect to various features, aspects, and embodiments, those skilled and unskilled in the art will recognize the exemplary embodiments are not so limited. Other variations, modifications, and alternative embodiments may be made without departing from the spirit and scope of the exemplary embodiments.
This application claims the benefit of U.S. Provisional Application 62/105,973 filed Jan. 21, 2015.
Number | Date | Country | |
---|---|---|---|
62105973 | Jan 2015 | US |