To facilitate an understanding of the concepts underlying the present invention, they are described hereinafter with reference to their implementation in the context of accessing information contained in XBRL-formatted documents. It will be appreciated, however, that this implementation is but one example of the practical applications of the invention. More generally, the invention is applicable to the retrieval of information that is presented in a format containing metadata that identifies each element of information. In particular, the invention is applicable to collections of XML-formatted documents, as well as each of the specific implementations of XML, such as XBRL. The following discussion should therefore be viewed as illustrative, without limiting the scope of the invention.
XBRL is comprised of two fundamental components, namely an instance document 16, which contains business and financial facts, and a collection of Taxomomies, which define metadata about these facts. Each business fact 18 comprises a single value. In addition to facts, an instance document might contain contexts, which define the entity to which the fact applies, the period of time to which it pertains, and/or whether the fact is actual, projected, budgeted, etc. The instance document might also contain units that define the unit of measurement for the numeric facts that are presented within the document, as well as footnotes providing additional information about the fact, and references to Taxonomies.
The Taxonomies comprise a collection of XML Schema documents 20 and XLink linkbase documents 22. A schema defines facts by means of elements 24. For example, an element might indicate what type of data a fact contains, e.g., monetary, numeric, textual, etc.
A linkbase is a collection of links. A link contains locators, that provide arbitrary labels for elements, and arcs 26, which indicate that an element links to another element, by referencing the labels defined by the locators.
A more detailed view of the dynamic processor is illustrated in
The dynamic processor contains a number of pre-fabricated algorithms that are executed by an algorithm manager 28. Each algorithm is designed to retrieve information in response to a particular type of request. In essence, each algorithm implements a particular type of search strategy. For example, one algorithm can function to retrieve all items from a collection of documents, e.g., all data relating to a particular company. Another algorithm can function to retrieve the metadata associated with a particular fact.
The algorithms perform multi-step processes to first examine the metadata to obtain information about the semantics and structure of the instance documents, and then retrieve the appropriate metadata and data items from the XBRL documents that are responsive to the request. An illustrative example of the process performed by the algorithms is set forth hereinafter in the context of a request to provide the balance sheet of a designated entity.
1. In response to the request, the algorithm which corresponds to that type of request sends a query, for example using an XQuery language component 30, to a presentation linkbase in the Taxonomies, to locate presentation links that correspond to the sections of a balance sheet. It should be noted that, due to the extensible nature of XBRL, the Taxonomies that are applicable to a given filing could comprise multiple sets of Taxonomy documents. There could be a standard Taxonomy that is associated with the entity to which filings are presented. For instance, the SEC might establish a standard Taxonomy containing presentation links for balance sheet data. The documents for this standard Taxonomy might be stored in a known location within the repository. In addition, the entity submitting a filing could include custom Taxonomy documents with the instance documents that it submits. The custom Taxonomy constitutes an extension of the standard Taxonomy established by the SEC. In operation, the algorithm first goes to the standard Taxonomy to locate the appropriate presentation links.
2. Once the presentation links have been located, the algorithm then identifies concepts that are referenced by the presentation links, e.g. assets, current assets, non-current assets, etc.
3. Using these concepts and entities, and any other qualifiers such as specific date or date range, the algorithm employs an XML document retriever 32 to locate corresponding items in the instance documents.
4. As a result of these steps, the algorithm discovers instance documents that contain the relevant data. In some cases, these documents may point to links in custom Taxonomies. In such a situation, these custom links are merged with the standard links, to obtain additional concepts.
5. Using the concepts, presentation links and preferred label attributes contained in the presentation links, the algorithm locates labels for the data in a label linkbase.
6. The algorithm returns the labels, presentation structure and data, e.g. numbers, to the API, to be formatted and presented to the user.
As an alternative to using XQuery, the dynamic processor can employ a different technology such as SAX (Simple API for XML) or XML Pull Parsing, or a combination of such technologies, to retrieve information from the XBRL instance documents and Taxonomy documents.
The dynamic processor preferably includes a cache 33 for storing information that has been retrieved and returned via the API. This cached data can be used to reduce the time needed to respond to subsequent requests that seek some, or all, of the information that was returned in response to a previous request, and thereby eliminate duplicate processing. When a request is received, the algorithm manager 28 first checks the cache, to determine if a valid response to the request is present. If so, the response is retrieved from the cache, and immediately provided to the API in response to the request.
Examples of responses that might be displayed to a user are illustrated in
In addition to retrieving data items that are contained in the instance documents and providing them in a view such as those shown in
The dynamic processor can be implemented within different software environments. In one implementation, the dynamic processor can reside as a stand alone desktop application, which communicates with one or more repositories of XBRL documents that are accessible via a desktop computer, for example through a network. In another implementation, the dynamic processor can be implemented as a client-server program. For instance, the components illustrated in
In addition to the processing of XBRL documents to retrieve data that is responsive to a request, the technology that underlies the invention can also be employed to generate forms that can be used to create XBRL documents. An example of an architecture for a dynamic form generator is illustrated in
A form is generated on the basis of a particular taxonomy that is designated by the user. In generating a form, no assumptions are made about the structure of the taxonomy, other than the fact that it conforms to an XML-based specification, e.g. XBRL. Once the user has designated a particular taxonomy 36, and a name for the form, a dynamic form generator 38 within the dynamic processor examines the schema in the taxonomy, using suitable algorithms, to obtain labels that are relevant to the form to be generated. The form 40 is generated with data entry fields 42 that correspond to each label that was obtained from the taxonomy. In addition, the form is provided with XML tags 44 that are associated with each input field, as described by the taxonomy 36.
Once the form is generated, it is resident as a live form, e.g. an XForm, on a network, such as the Internet. This form can then be accessed by a form-enabled application 46, via which a user can enter input data into each field 42, e.g. financial and business data in the case of an XBRL form. The completed form can then be submitted as a new XML instance document 48, and stored at a location designated by the user.
Thus it can be seen that the present invention provides dynamic evaluation of XML documents in response to a request, notwithstanding the diverse amount of metadata that can result with an extensible language. This is accomplished by analyzing the metadata to learn about the structure and semantics that are employed for any given set of XML documents. As a result, the need to pre-parse documents to derive data from them is avoided. Furthermore, forms for creating XML documents can be automatically generated without requiring manual input to designate fields or tags, or to publish the forms.
It will be appreciated by those of ordinary skill in the art that the invention described herein can be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The disclosed implementations are considered in all respects to be illustrative, and not restrictive. The scope of the invention as indicated by the appended claims, rather than the foregoing description, and all changes that come within the meaning and range of equivalents thereof are intended to be embraced therein.
Number | Date | Country | |
---|---|---|---|
60824062 | Aug 2006 | US |