The present invention relates to a system and method of providing a platform for enabling drill-down analysis of tabular data.
Today, large volumes of data are aggregated in the process of executing various business functions. Much of this data is tabular data created by numerous applications/sources such as spreadsheets, webpages, and databases. In many instances, a user may desire to run an analysis report on the tabular data to illustrate specific details on some or all of the tabular data. However, it becomes quite challenging to a user to analyze data in a top-down, exploratory manner in any great detail.
Embodiments of a system and method are disclosed of providing a platform for enabling drill-down analysis of tabular data.
In accordance with an embodiment of the present disclosure, a method disclosed of providing a platform for enabling drill-down analysis of tabular data, the method executing on one or more processors the steps of: (a) receiving a first request to perform a first analysis on a data set of the tabular data; and (b) generating a first link between the data set and the first analysis, the link configured to be activated by a user to cause the first analysis to be performed on the data set, wherein the first link includes a first parameter associated with first drill down criteria to cause an analysis of a plurality of analyses to be performed on one or more data items of the data set that satisfy the first drill down criteria.
In accordance with another embodiment of the present disclosure, a method is disclosed of providing a platform for enabling drill-down analysis of tabular data, method executing on one or more processors the steps of: (a) receiving a first request to perform a first analysis on a data set of the tabular data; (b) generating a first link between the data set and the first analysis, the link being configured to be activated by a user to cause the first analysis to be performed on the data set, wherein the link includes a first parameter associated with first drill down criteria to cause the analysis of a plurality of analyses to be performed on one or more items of the data set that satisfy the first drill down criteria; and (c) performing, upon activation of the first link by a user, the analysis of the plurality of analyses on the one or more data items that satisfies the first drill down criteria.
In accordance with yet another embodiment of the present disclosure, a system is disclosed for providing a platform for enabling drill-down analysis of tabular data from a data source, the system comprising: (a) a data store to store the tabular data from the data source; and (b) one or more servers coupled to the data store and programmed to execute one or more modules, the modules comprising: (1) an analysis request event handler for generating a user request to perform a first analysis on a data set of the tabular data, wherein the request includes the data set to be analyzed, a link identifying the first analysis to be performed on the data set and a parameter associated with drill down criteria to be applied to one or more data items of the data set prior to the analysis; and (2) an analysis engine for (a) performing the a second analysis on the one or more data items that satisfies the drill down criteria and (b) outputting an analysis report of aggregated data based on the second analysis.
In yet another embodiment of the present disclosure, a method is disclosed of providing a platform for enabling drill-down analysis of tabular data, the method executing on one or more processors the steps of: (a) receiving a request to perform an analysis on a data set of the tabular data; and (b) generating a link between the data set and the analysis, the link configured to be activated by a user to cause the analysis to be performed on the data set, wherein the first link includes a first parameter associated with first drill down criteria to act as a filter on the data set for subsequent analysis.
Embodiments of the present disclosure are described herein with reference to the drawing figures.
Each example client 100, 106, 108, 110 includes a personal computer and a monitor. However, clients 100, 106, 108, 110 may be smartphones, cellular telephones, tablets, PDAs, or other devices equipped with industry standard (e.g., HTML, HTTP etc.) browsers or any other application having wired (e.g., Ethernet) or wireless access (e.g., cellular, Bluetooth, IEEE 802.11b etc.) via networking (e.g., TCP/IP) to nearby and/or remote computers, peripherals, and appliances, etc. TCP/IP (transfer control protocol/Internet protocol) is the most common means of communication today between clients or between clients and central system 102 or other systems (i.e., one or more servers), each client having an internal TCP/IP/hardware protocol stack, where the “hardware” portion of the protocol stack could be Ethernet, Token Ring, Bluetooth, IEEE 802.IIb, or whatever software protocol is needed to facilitate the transfer of IP packets over a local area network.
As described in more detail below, central system 102 includes one or more servers. The one or more servers may include a web server. Each server includes several internal components (e.g., processor, memory, drives, etc.), databases, software modules and applications (e.g., browser) as known to those skilled in the art. The server is an example of a general-purpose computer as shown in
As shown in
In brief, platform 200 is a framework that enables a user to create an analysis report(s) of aggregated figures (i.e., aggregated data) displayed in a spreadsheet, wherein each aggregated figure within the report may then be acted upon (such as mouse-clicked) to create a subsequent analysis report (a “drill-down” analysis report) whose input is limited to that subset of data that corresponds to such acted-upon figure. (An aggregated figure or figures as used herein is an aggregation of one or more items of information/data (e.g., a row or more) of tabular data obtained from a spreadsheet or other data source described above. The figure may be a number, letter or other reference or symbol in a cell or part of a cell or one or more rows.) Platform 200 enables nesting of drill-down analyses that is not limited by the size or complexity of the drill-down criteria or level of nesting. Platform 200 also enables separation and independent (1) analysis functionality and (2) drill-down criteria so that the same set of instructions can support the analysis of any permutation without having to prepare any analysis in advance, of a user-defined dataset, analysis report and/or drill-down criteria. Platform 200 comprises one or more software applications, modules, computer instruction set and/or data from databases or other sources (as a data store) including analysis engine 200-1, analysis request event handler 200-2 analysis response handler 200-3, drill-down criteria database (DDPD) 200-4 and tabular data 200-5 (from a source such as spreadsheets). These modules/data of platform 200 may be implemented in one or more servers located locally or remotely across network 112. The modules/data are described in detail below.
Analysis engine 200-1 is the software application or module (or set of computer instructions) that receives requests for analysis (with or without drill-down), and in response, provides the analysis report (results). This module receives at least three inputs including 1) a particular analysis (i.e., particular aggregation report, such as “Quarterly Revenue by State”) requested by a user to run, 2) the raw data (or reference thereto) upon which the analysis is to be run, and 3) drill-down criteria, if any. Analysis engine 200-1 will extract a subset of raw data that meets/satisfies the drill-down criteria (or extract all of the raw data, if no drill-down input is present) as required to run the analysis (as described below) and output an analysis report that includes aggregated data and, for each drill-down enabled figure, a related drill-down criteria that may be used for further subsequent drill-down analysis requests. (The drill down criteria acts as a filter of the tabular data for subsequent analysis.) Aggregated data refers to an output of any level of aggregation, including a level of aggregation that may be so granular as to aggregate each input data item into its own unique group.
Analysis request event handler 200-2 is a software module (or set of computer instructions) that is executed when a user selects data to be analyzed (aggregated). The data may or may be subject to drill-down analysis (i.e., analysis on aggregated data generated by a prior analysis or more). Analysis request event handler 200-2 specifically creates the analysis request and sends it to analysis engine 200-1. Prior to creating and sending the request, if desired, analysis request event handler 200-2 may present a menu to a user to select which particular analysis report to generate. However, selection may be made in other ways known to those skilled in the art. To create the request, the event handler 200-2 will draw information from 1) a link such as a hyperlink, if any, associated with the triggered event (e.g., click on a link), and 2) other information associated with the cell, if any, associated with the triggered event. This information will be used to create the request that is passed on to analysis engine 200-1.
In this context, the hyperlink information is a property of a cell (or its contents) in a spreadsheet. Upon a mouse click, key press or other user-initiated action taken on such cell, the activated hyperlink (URL) triggers or calls on an analysis request event handler 200-2 by way of a hyperlink event handler (part of application as known to those skilled in the art, e.g., Worksheet_FollowHyperlink used in Excel). The hyperlink event handler is a code that can access hyperlink or other related information (i.e., data to be analyzed, drill down criteria and/or reference to either or both) that the analysis response handler had associated with a figure that triggered the request. (More specifically, the information could be contained in the hyperlink text itself, or could be contained in other properties (such as a cell's comment property) of the cell to which the hyperlink belongs. Drill-down criteria is an expression or formula that may be stored in DDPD 200-4 (in which case each unique drill-down criteria will have a unique drill-down reference code or reference number). When applied to any underlying data item within a provided input data set, the expression or formula (which evaluates to either true or false) determines whether such underlying data item shall be included in the subset of total raw input data on which the requested analysis will be performed. A drill-down parameter may be the expression or formula or a reference to the expression or formula as known to those skilled in the art.
Analysis response handler 200-3 is a software module (or set of computer instructions or script) that receives and processes the response from analysis engine 200-1 and interacts with the spreadsheet program to set up the interface, display the analysis report (results) to a user and establish the hyperlinks for each displayed drill-down-enabled figure.
Drill-down parameter database (DDPD) 200-4 is used when the native drill-down criteria representation, for each cell desired to be drill-down enabled, exceeds any limit that the spreadsheet application may impose on the size of (amount of) hyperlink information that may be associated with any one hyperlink in the spreadsheet. For example, some or all versions of Microsoft Excel limit the size of any cell's hyperlink text to a preset number of characters. In this case, the drill-down criteria is stored in DDPD 200-4, the related reference code (or reference number) is passed from analysis engine 200-1 to analysis response handler 200-3 and in turn the reference code is incorporated into the hyperlink that is passed to the spreadsheet application. For any subsequent request, analysis engine 200-1, upon receiving a drill-down criteria reference code, de-references the code into the actual drill-down criteria using a lookup table known to those skilled in the art.
Tabular data 200-5 are data obtained from a data source such as a spreadsheet, database or other source as known to those skilled in the art. The data is typically obtained in the form of rows and columns (including as may be implicit in structured formats such as XML or JSON), such as from a spreadsheet, text, HTML or CSV file, or a database table.
Analysis engine 200-1, analysis request event handler 200-2 and analysis response handler 200-3 may each be a) a separate application, module or object or linked library, residing on the same server as or on different servers from the server on which the spreadsheet program is operating and with which the user is directly interacting, and/or b) a spreadsheet application extension (such as a “plug-in” for Microsoft Excel or OpenOffice, or a “Google API Script” for Google Sheets).
The operation is briefly described as follows. When a user requests a drill-down-enabled analysis (report), analysis engine 200-1 provides output which includes, for each drill-down enabled figure, information that can be used to determine the underlying data from which such figure was derived. This information is essentially the criteria applied to the underlying data set in order to determine the data that was used to calculate such figure's value (i.e., the value of the aggregated result that a user may now drilled down into).
For example, if a drill-down-enabled output figure (analyzed or aggregated data derived from one or more items of data) represents the revenue in the month of October from all stores located in Texas, the drill-down criteria will indicate that the underlying data set for that figure relates only to the month of October and the state of Texas. If the data's month and state information are referred to as “transactionMonth” and “transactionState” and if the embodiment is implemented to use syntax similar to that used for WWW queries, then the drill-down criteria might be embodied as “transactionMonth=10&transactionState=TX.” Furthermore, if the analysis in which such figure existed was itself the result of a drill-down analysis report. (For example, where “transactionCategory=5”), then the drill-down criteria used to generate such report must be included in such figure's drill-down criteria. (For example, the entire drill-down criteria for such figure might be “transactionMonth=10&transactionState=TX&category=5”).
The drill-down criteria corresponding to each figure in the analysis engine 200-1 output, or information from which such criteria can be derived, is assumed to be present in the analysis engine 200-1 output. Analysis response handler 200-3 then interacts with the spreadsheet program to present the resulting report to the user, and for each drill-down enabled figure, analysis response handler 200-3 builds a hyperlink that it then associates, through the spreadsheet program, with that figure or its containing cell. (For example, in Microsoft Excel, a figure of 1,000 with drill down criteria of transactionMonth=10&transactionState=TX&category=5 might be implemented in a cell by setting the cell's “Value” property to 1,000 and setting the cell's “Hyperlink” property to http://drilldown?transactionMonth=10&transactionState=TX&category=5).
Using a built-in event handler such as Worksheet_FollowHyperlink or Worksheet_KeyPress in Microsoft Excel, analysis request event handler 200-2 is invoked upon the event of any user action designated to trigger drill-down (such as, but not limited to, a mouse click or a key stroke), and handler 200-2 fetches the drill-down criteria from the active cell's hyperlink. In the event that DDPD 200-4 is needed (because of a spreadsheet program's limitations is hyperlink size), the drill-down parameter information that is provided by the analysis engine will be represented by an identification parameter or code (such as a reference number) that is stored in DDPD 200-4 and associated with drill-down criteria (via a look-up table) also stored in DDPD 200-4.
Execution proceeds to step 304 wherein it is determined if there are any raw data items that are part of the analysis request. Raw data items may not be present at this stage. For example, the data received may include column headers (e.g., “Name, Date, Balance”) with or without any rows of real data. Continuing the example, analysis request event handler 200-2 might receive data that consists of a single column with three rows including “Property,” “State,” “NY” and “CA,” but analysis request event handler 200-2 might not be configured to know whether the data consists of a header “Property” followed by three lines of body data (i.e., raw body data), a header of “Property State CA NY” followed by no body data, or some other combination of header rows and body data. Analysis request event handler 200-2 may merely pass data to analysis engine 200-1. If body data (i.e., raw data) is not present for subsequent processing at step 304, then execution proceeds to step 306 as described in more detail below.
If raw data items are present, execution proceeds to step 308 wherein analysis engine 200-1 extracts the next subject (unprocessed) data item from the raw data and then proceeds to decision step 310 wherein it is determined if a drill-down parameter is present in the analysis request information (such as the URL). If a drill-down parameter is present, it will be processed as described below with respect to step 312. If no drill-down parameter is present, then execution moves directly to step 314 wherein data will be analyzed by analysis engine 200-1.
Now, all extracted data is present at step 312, but if a drill-down parameter is present in the request, then only the data satisfying the drill-down criteria defined by the drill-down parameter, as evaluated at step 312, is passed on to analysis engine 200-1 for analysis at step 314. That is, for each data item to be analyzed (e.g. each row in a table of input data), if the drill-down criteria as applied to such item evaluates to true at decision step 312, or if no drill-down criteria was present in the request at decision step 302 (and queried at decision step 310), then execution moves to step 314 wherein such data item is analyzed and incorporated into the result as requested (That is, the data items are analyzed and consolidated (i.e., reduced) at step 314.) These data items are already in whatever standardized format that may be required by the analysis engine, as known to those skilled in the art.
Then execution returns to step 304 wherein it is determined if there are any more raw data items remaining for processing under the request. If yes, execution repeats again starting with step 308. If not, execution proceeds to step 306 wherein an analysis report is sent to analysis response handler 200-3 once the analysis engine completes the analysis. (At step 306, there may be some final formatting or processing of the report as known to those skilled in the art.) This analysis report (drill-down or non-drill-down) sent includes a series of analysis outputs, each of which includes one or more of the following (or information that can be used to derive the following): a) a value that resulted from the application of a function on zero or more of the input data items (for example, a sum, weighted average, rating, color code, or other numeric or non-numeric figure), b) a location on a two-dimensional display (or information used to derive a row and column), which may include row, column, or both (if location is not included, c) drill-down parameter specific to, but not necessarily unique to, this output, d) a handle or reference to the data source that was submitted in the request from which this output was created, and e) other information such as formatting instructions. This information could be sent in Extendible Markup Language (XML), Hypertext Markup Language (HTML), text, or other open or proprietary format, and enable analysis response handler 200-3 to populate specific cells with values from the analysis output, along with the data source handle (i.e., a reference or pointer) or reference and any related drill-down information in the cell's hyperlink (which in turn enables the user to click (to activate) to request a drill-down analysis (cause the performance of) on the subset of data that contributed to such cell's value). For example, in a drill-down analysis report (e.g. “Revenue in the U.S.”) applied to 50 data items, where an output value corresponds to a subset of only 10 out of those 50 data items (e.g. “Revenue in Nevada”), the hyperlink could enable the user to request, at the click of a mouse, a drill-down analysis as applied only to those 10 data items provided in the original analysis report. When clicked, analysis request event handler 200-2 will proceed as described above. (Note that the hyperlink/URL is one form of link used, but those skilled in the art know that a JSON file or any other link may be used in the embodiments disclosed herein.)
Execution then proceeds to step 316 wherein analysis response handler 200-3 presents the analysis report. In particular, handler 200-3 will read the outputs and transport hyperlink and data to the appropriate cells for display. In addition, it may perform other tasks such as applying formats. As indicated above, in the event that the drill-down parameter size could result in hyperlinks whose size exceeds the spreadsheet program hyperlink size allowance, then the drill-down criteria will be stored by the analysis engine in DDPD 102-4 (or other database) and the analysis output will include the related reference code or number in lieu of the actual hyperlink criteria.
At this point, the user will then have the option to execute further drill-down analysis at decision step 318. That is, with decision step 318, it is determined if a user requests a subsequent analysis. If yes, execution returns to step 300. The user may perform as many drill-down analyses (i.e., nested or un-nested aggregation analyses) as he/she desires on any prior analysis. If no addition request is received, then execution ends.
There are several alternative embodiments of this disclosure. These are described below.
In a first alternative embodiment, a user may open a spreadsheet that includes data within one or more cells. The user would then select specific data with desired cell(s) to analyze (run an analysis) by initially copying his or her data into a predefined file or template. The file or template may have certain analysis request event handlers and other code already embedded within it for processing and transmission. The data would then be sent to analysis engine 200-1 for analysis and an analysis report would be returned as described above for review.
In another embodiment, the input data set is not necessarily sent with each request. Instead of sending the input data, the analysis request may include a reference or handle to the input data that is to be used, and the analysis engine then derives the input data using such reference. Such reference may be generated by the analysis engine itself and returned in analysis output, in which case the analysis response handler may embed it in the displayed output's drill-down information. In yet another embodiment of this disclosure, instead of sending the raw data from the analysis request handler to the analysis engine, a reference to the data is sent (such as a file location or a database query definition), or, a coded reference to such reference is sent (e.g. instead the file name or database query definition is replaced with an alphanumeric code) and the data reference is obtained by looking up the related information in a lookup table. In any of these circumstances, platform 200 could, furthermore, store the data source information (whether the data itself, or a reference thereto) and the drill-down criteria information in DDPD 200-4 that is associated with a single reference code (or number). In this respect, a short reference numeral or symbol within the hyperlink will refer to or be associated with the combination of both the raw data (or reference thereto) and drill-down criteria. In doing so, the data (information) required to be stored in each drill-down enabled cell of a spreadsheet may be substantially reduced, and the data in the DDPD 200-4 may be substantially increased. (A lookup table will continue to be used to retrieve the desired analysis (e.g., formula) stored in DDPD 200-4 associated with the desired analysis.) Furthermore, the data source information can also be shortened via the same method employed by DDPD 200-4 (e.g., a database) that can convert the “long” information into a short code, such as a number, or visa versa).
In yet another embodiment of the disclosure, platform 200 employs a different analysis request event handler (than the specific one described above). For example, instead of using Worksheet_FollowHyperlink, which is triggered when the user clicks on a link in a cell and populates the cell's hyperlink value to enable the trigger, platform 200 may employ an event handler that would trap, for example, key presses or mouse clicks, and checks if the selected cell is drill-down enabled and proceeds to process the drill-down request.
The above alternative embodiments are not mutually exclusive, and can be combined in any manner.
It is to be understood that the disclosure teaches examples of the illustrative embodiments and that many variations of the invention can easily be devised by those skilled in the art after reading this disclosure and that the scope of the present invention is to be determined by the claim(s) below.
This application claims priority to U.S. provisional application No. 62/134,261, filed Mar. 17, 2015, entitled System and Method of Providing a Platform For Enabling Drill-down Analysis of Tabular Data, which is incorporated by reference herein.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US16/22498 | 3/15/2016 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62134261 | Mar 2015 | US |