Various embodiments described below relate generally to the electronic transfer of data over a wide area network, and more particularly but not exclusively to an automated system for data acquisition using a Web portal.
The Internet is an extremely efficient mechanism for disseminating information or documents to very many users, even around the world. Indeed, today most sophisticated software developers integrate the Internet into their software products in some fashion. In fact, many software developers today make available over the Internet some mechanism for upgrading or enhancing their software products. One example of such a mechanism is used today where templates or other standardized documents are made available for download by users of a particular developer's suite of productivity software. A user of the productivity software may desire a particular form or template that was not included with the productivity software as purchased. The software developer may make available additional templates or forms at a location on the Internet that is accessible through the productivity suite. In one example, supplemental calendar templates or special budgeting forms may be made available online.
Users of the productivity suite appreciate the additional content that is made available by the software developer. This type of follow-on service helps to make the users appreciate their investment in the software, thus providing a competitive advantage to the developer. In many cases, the content being provided is easily generated by the developer. For example, the maker of an anti-virus program is likely the best candidate to create new virus definitions. However, in some cases the developer is not the best source of the content. Take for example the case of a software developer that offers an office productivity solution. The developer may desire to offer supplemental business templates to enhance the software users' efficiency. However, the developer of the productivity suite, being a programmer by trade, is likely not the best source for generating the business templates. In those cases, the developer may partner with third-party providers, who may be individuals, that are experts in particular subject areas to provide specialized content. These arrangements leverage specialized knowledge by allowing each party to focus on their particular area of expertise, developing software or creating high-quality content. Often these partnerships create symbiotic relationships by exposing content providers to consumers that would otherwise be unaware of their services, and by allowing software developers to provide high-quality content at little or no charge.
Although mostly beneficial, these arrangements do suffer from at least one problem: the current state of technology renders the process of accumulating content from several disparate sources particularly onerous and labor-intensive. In other words, a software developer that chooses to partner with content providers must engage actual humans to interface with humans at each potential content provider. The actual exchange of content is handled using ad-hoc mechanisms for data exchange. In other words, e-mail messaging may be the communication means of choice with one content provider, while another may choose to deliver data on a digital medium, such as a Compact Disk. Having to handle each relationship in a different manner renders the process inefficient. Moreover, the delivery of content is routinely coupled with a licensing agreement between the parties. These licenses typically change often, so it can sometimes be difficult to determine exactly what terms are attached to a particular submission because it can sometimes be difficult to determine exactly when the submission occurred. In addition, the process relies on the diligence of a human to simultaneously couple the delivery of the content with the execution of a license, and humans are given to err.
A mechanism that avoids the labor-intensive process of manually accumulating content for inclusion in a software developers online distribution service has eluded those skilled in the art, until now.
The present invention is directed at techniques and mechanisms that implement an automated process for content acquisition using a Web portal. Briefly stated, the process allows a content provider to visit a location on a wide area network and interact with one or more resources to upload a document. Within the process, several tasks are automated that would otherwise be performed by humans. For example, the document is automatically coupled with any legal terms corresponding to the submission and archived at the time of the submission. In this way, the uncertainty of which terms apply to the submission is avoided, and descriptive metadata about the submission is automatically collected from the content provider at the time of submission. In addition, the content is automatically connected to an editorially managed content review and publishing system.
Non-limiting and non-exhaustive embodiments are described with reference to the following figures, wherein like reference numerals refer to like parts throughout the various views unless otherwise specified.
The following description is directed at an automated content acquisition system. Generally stated, a Web portal is provided through which content providers may submit documents to an entity (e.g. a software developer), and the submission is archived together with a then-current version of the applicable terms of submission. Specific implementations of this general concept will now be described.
The client computing system 102 includes an installed software application 104 that is manufactured and marketed by the software developer. In one example, the software application 104 is a productivity suite that includes a word processing program, a spreadsheet program, or the like. In addition, the software application 104 is configured to retrieve enhanced content from a content server 180 maintained by the software developer. Examples of the enhanced content include new templates or forms, or the like.
The content provider 108 is a computing system under the control of an entity or individual that provides content to the software developer for inclusion in the content server 180. Throughout this document, the term “content provider” may be used interchangeably to indicate either the content provider computing system 108 or the entity or individual operating the content provider computer system 108. This is to simplify this discussion, and no importance or relevance is intended or should be inferred by the interchangeable usage of the term.
In this example, the content provider 108 provides to the software developer a document 111 that represents enhanced content. The content provider typically has particular expertise in some area, and provides enhanced content to the software developer in that area. In one particular example, the content provider may have particular expertise in the legal industry, and may provide to the software developer legal forms or templates. Similarly, the content provider may have particular expertise in the real estate industry, and may provide to the software developer real estate forms, or the like. In this example, the document 111 represents the enhanced content that the content provider 108 provides to the software developer. Although referred to here as a “document,” the content being provided may take many different forms, such as clipart, textual help information, and the like. For the purpose of this application, the term “document” means any digital file or compilation of data that includes content that may be served from a Web site.
The developer site 150 is a location available over the Internet 112 to other computing systems to access certain information made available by the software developer. For instance, the software developer makes available enhanced content for the software application 104 from the content server 180. As mentioned above, the content may take the form of templates or other forms that may be used by the software application 104 but which were not installed with the software application 104 originally. The content server 180 in this example may be accessible by general users of the Internet 112, such as the client computing system 102. In addition, a user of the software application 104 on the client computing system 102 may access the content on the content server 180 through the ordinary use of the software application 104.
The developer site 150 also includes a web portal 152 that several content providers, such as content provider 108, may use to submit documents to the software developer for inclusion on the content server 180. Unlike the content server 180, the web portal 152 is accessible over the Internet 112 by select users, perhaps using an authentication or login mechanism. The web portal 152 may be used to make available development information about the software application to the content providers. For instance, the software developer may make available a software development kit (“SDK”) for download from the web portal 152, and a frequently asked questions area that answers many development-related questions. In addition, the web portal 152 may provide an upload mechanism that allows the content provider 108 to upload content to the developer site 150.
The developer site 150 also includes a backend server 156 which is a computing system that is not directly accessible over the Internet 112. The backend server 156 includes a process manager 158 that is operative to control the automated process of acquiring content. The backend server 156 is coupled to both the web portal 152 and the content server 180 using conventional networking mechanisms. The backend server 156 includes archival storage 159 permanently for storing information that is not intended for public access over the Internet 112.
Although generally introduced here, each of the components of the developer site 150 will be discussed in greater detail later in conjunction with
A submission component 210 is available to the content providers to upload content to the web portal 152. The submission component 210 may be one or more controls or other applets embedded on a markup language page available at the web portal 152. Activating the submission component 210 may present to the content provider 108 certain contractual terms 214 to which the content provider must agree in conjunction with submitting the content. The contractual terms 214 are available for viewing by the content provider in conjunction with or prior to allowing the content provider to a document 111 to the web portal 152.
The web portal 152 may also present the content provider with a metadata form 212 for the purpose of gathering certain metadata about the content provider's submission. For example, the metadata form 212 may be used to prompt the content provider for the content provider's name, various information about the document 111, keywords that may be used in a word search for the document, and the like. One particular example of the metadata form 212 is illustrated in
Activating a link or other button on the submission component 210 causes the document 111, the information gathered using the metadata form 212, and the contractual terms 214 to be submitted to the process manager 158. In response, the process manager 158 archives a copy of the document 111 as well as at least the contractual terms 214 into a permanent archival storage 159. In this way, there can be no future confusion about which particular terms attached to the submission of the document 111 or when the submission formally occurred. Advantageously, the automatic archiving of the contractual terms 214 with the document 111 contemporaneously with the submission of the document 111 overcomes the need for a human to remember to collect and store the then-current terms in connection with the particular submission.
More specifically, the process manager 158 initiates the publishing process by notifying a product management component 320 of the submission. The product management component 320 is responsible for programmatically scheduling particular “tasks” to be performed in conjunction with the development of the software application. The product management component 320 includes a work order database 325 in which are created entries that equate to particular tasks that need to be performed.
It will be appreciated that the creation of a task within the work order database 325 causes a notification to be generated for a particular individual responsible for handling tasks of that type. In other words, creating a task in the work order database 325 operates as a notification to a human being to evaluate the document 111 that has been submitted. The product management component 320 may use the metadata gathered through the automated acquisition process to help identify the particular individual to assign to the task. The product management component 320 maintains resource information 327 that describes the capabilities of individuals associated with the software developer, as well as other information, that may be used to match individuals to documents that are submitted.
At this point, human intervention 330 could be employed to evaluate the document 111 for inclusion in the content server 180. In other words, prior to allowing the document 111 to be exposed by the content server 180, it may be beneficial to involve a person with particular expertise to evaluate the appropriateness of the document. It should be noted that the use of human intervention 330 at this point (i.e., during the publishing process) differs from the use of human intervention to facilitate the submission of the document 111 itself (i.e., during the acquisition process). For instance, at this juncture, it can be assumed that the document 111 that has been submitted is archived in conjunction with any contractual terms that have attached to its submission. In addition, a standard set of metadata has already been collected for the document 111, and that metadata can be used to help identify the individual best suited to perform the human intervention process 330. The human intervention 330 may include such events as reformatting the document 111, minor or major edits to the document 111, a subjective evaluation of the quality of the docket 111, and the like.
At the conclusion of the human intervention 330, the document 111 may be passed to a content manager 350 that is responsible for making any final publishing modifications to the document 111, and for publishing the document 111 to the content server 180 for access by users of the Internet 112. At this point, the document 111 may be downloaded by the general public, and more specifically, by users of the software application 104.
The submission form 401 also includes a document information section 407 where information is collected that describes the document or submission itself. For example, the document information may identify the particular application to which the document pertains (e.g., a word processor, spreadsheet program, database program, contact management program, or the like), the language of the document, keywords that may be used to describe the content of the document, and perhaps some miscellaneous comments that the submitter wishes to be included or considered along with the submission.
An attached file portion 409 is provided to allow the user to navigate to a location on the user's local computer or network and identify the particular location of the document being submitted. The attached file portion 409 once the submission form 401 is complete, the user may perform the submission by activating a submit button 411.
At block 503, the content provider is presented with contractual terms that apply to the submission of the document. Typically, the content provider may be required to read and accept the terms of a license agreement, or the like, prior to proceeding with the submission of the document. For example, the contractual terms may address ownership of the document and the right to modify and distribute the document, and the like.
At block 505, the content provider is prompted for metadata about the submission. The metadata generally falls into two categories: information about the content provider, and information about the submission. The information about the content provider may include the content provider's name, the name of the individual providing the content, contact information for the content provider, and the like. The information about the submission may include the identity of the software application to which the submission pertains, the language (if applicable) of the submission, and the like.
At block 507, the software developer receives the submission. It is envisioned that the content provider provides the document using ordinary file or content transfer mechanisms.
At block 509 the document is archived in conjunction with the contractual terms that apply to the submission. As mentioned above, automatically archiving the then-current contractual terms together with a copy of the submission not only simplifies the amount of work that a human must perform, it also avoids later disputes over which particular version of the terms apply, if the terms later change.
At block 511, an evaluation of the submission is performed and any appropriate modifications to the document are performed as part of a publishing process. The modifications may include formatting of the document, or otherwise putting the submission inappropriate format. The publishing process may be performed, at least partially, by a human.
At block 512, the appropriately formatted or modified submission is published to the content server where it is made available for download over the Internet.
Although the above processes are illustrated and described sequentially, in other embodiments, the operations described in the blocks may be performed in different orders, multiple times, and/or in parallel.
The various embodiments described above may be implemented in general computing systems adapted as either servers or clients. An example computer environment suitable for use in implementation of the invention is described below in conjunction with
Illustrative Computing Environment
Computing device 600 may have additional features or functionality. For example, computing device 600 may also include additional data storage devices (removable and/or non-removable) such as, for example, magnetic disks, optical disks, or tape. Such additional storage is illustrated in
Computing device 600 may also contain communication connections 616 that allow the device to communicate with other computing devices 618, such as over a network. Communication connections 616 is one example of communication media. Communication media may typically be embodied by computer readable instructions, data structures, program modules, or other data in a modulated data signal, such as a carrier wave or other transport mechanism, and includes any information delivery media. The term “modulated data signal” means a signal that has one or more of its characteristics set or changed in such a manner as to encode information in the signal. By way of example, and not limitation, communication media includes wired media such as a wired network or direct-wired connection, and wireless media such as acoustic, RF, infrared and other wireless media. The term computer readable media as used herein includes both storage media and communication media.
While example embodiments and applications have been illustrated and described, it is to be understood that the invention is not limited to the precise configuration and resources described above. Various modifications, changes, and variations apparent to those skilled in the art may be made in the arrangement, operation, and details of the methods and systems of the present invention disclosed herein without departing from the scope of the claimed invention.