Providing full data provenance visualization for versioned datasets

Information

  • Patent Grant
  • 9996595
  • Patent Number
    9,996,595
  • Date Filed
    Monday, August 3, 2015
  • Date Issued
    Tuesday, June 12, 2018
Abstract
Systems and methods for providing full data provenance visualization for versioned datasets. A method includes receiving selection of a versioned dataset that is within a data pipeline system. The method also includes determining the full data provenance of the selected versioned dataset. The full data provenance may comprise a set of versioned datasets. The method further includes providing for display of a visualization of the full data provenance of the selected versioned dataset. The visualization comprises a graph. The graph comprises a compound node for the selected versioned dataset and for each versioned dataset in the set of versioned datasets. The graph further comprises edges connecting the compound nodes. Each edge represents a derivation dependency between versions of the versioned datasets represented by the compound nodes connected by the edge.
Description
CROSS-REFERENCE TO RELATED APPLICATION

This application is related to U.S. patent application Ser. No. 14/533,433, entitled “HISTORY PRESERVING DATA PIPELINE SYSTEM AND METHOD,” and filed Nov. 5, 2014, the entire contents of which is hereby incorporated by reference as if fully set forth herein.


TECHNICAL FIELD

The subject innovations relate to graphical user interfaces for computer systems and, in particular, relate to providing full data provenance visualization for versioned datasets.


BACKGROUND

Computers are very powerful tools for processing data. A computerized data pipeline is a useful mechanism for processing large amounts of data. A typical data pipeline is an ad-hoc collection of computer software scripts and programs for processing data extracted from “data sources” and for providing the processed data to “data sinks”. As an example, a data pipeline for a large insurance company that has recently acquired a number of smaller insurance companies might extract policy and claim data from the individual database systems of the smaller insurance companies, transform and validate the insurance data in some way, and provide validated and transformed data to various analytical platforms for assessing risk management, compliance with regulations, fraud, etc.


Between the data sources and the data sinks, a data pipeline system is typically provided as a software platform to automate the movement and transformation of data from the data sources to the data sinks. In essence, the data pipeline system shields the data sinks from having to interface with the data sources or even being configured to process data in the particular formats provided by the data sources. Typically, data from the data sources received by the data sinks is processed by the data pipeline system in some way. For example, a data sink may receive data from the data pipeline system that is a combination (e.g., a join) of data from multiple data sources, all without the data sink being configured to process the individual constituent data formats.


One purpose of a data pipeline system is to execute data transformation steps on data obtained from data sources to provide the data in the format expected by the data sinks. A data transformation step may be defined as a set of computer commands or instructions (e.g., a database query) which, when executed by the data pipeline system, transforms one or more input datasets to produce one or more output or “target” datasets. Data that passes through the data pipeline system may undergo multiple data transformation steps. Such a step can have dependencies on the step or steps that precede it. One example of a computer system for carrying out data transformation steps in a data pipeline is the well-known MapReduce system. See, e.g., Dean, Jeffrey, et al., “MapReduce: Simplified Data Processing on Large Clusters”, Google, Inc., 2004. Another more recent example of a computer system for carrying out data transformation steps in a data pipeline is the Spark system. See, e.g., Zaharia, et al., “Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing”, 9th USENIX Symposium on Networked Systems Design and Implementation, 2012.


An important issue for users of data pipeline systems is provenance. In the context of data pipeline systems, provenance is metadata that describes the origins and history of datasets in their life cycles. Such metadata (sometimes also called “lineage”) is important for many data pipeline tasks. In particular, provenance is important to users to help them judge whether a given dataset produced by the data pipeline system is trustworthy.


Given the increasing amount of data collected by businesses and other organizations, processing data of all sorts through data pipeline systems can only be expected to increase. This trend is coupled with a need for users to be able to visualize the provenance of datasets produced by data pipeline systems.


The approaches described in this section are approaches that could be pursued, but not necessarily approaches that have been previously conceived or pursued. Therefore, unless otherwise indicated, it should not be assumed that any of the approaches described in this section qualify as prior art merely by virtue of their inclusion in this section.


SUMMARY

In one aspect, the subject innovations are embodied in a method for providing full data provenance visualization of versioned datasets. The method is performed at one or more computing devices having one or more processors and memory storing one or more programs executed by the one or more processors to perform the method. The method includes receiving selection of a versioned dataset that is within a data pipeline system. The method also includes determining full data provenance of the selected versioned dataset. The full data provenance comprises a set of versioned datasets. The method also includes providing for display of a visualization of the full data provenance of the selected versioned dataset. The visualization comprises a graph. The graph comprises a compound node for the selected versioned dataset and for each versioned dataset in the set of versioned datasets. The graph further comprises edges connecting the compound nodes. Each edge represents a derivation dependency between versions of the versioned datasets represented by the compound nodes connected by the edge.


These and other embodiments of the subject innovations include one or more of the following features: The compound node of the selected versioned dataset may indicate a name or identifier of the selected versioned dataset. The compound node for each versioned dataset in the set of versioned datasets may indicate a name or identifier of that versioned dataset. The compound node of the selected versioned dataset may comprise a sub-entry representing a particular version of the selected versioned dataset. The compound node for each versioned dataset in the set of versioned datasets may comprise at least one sub-entry representing a version of that versioned dataset in the full data provenance of the selected versioned dataset. A sub-entry of the compound node for a particular versioned dataset in the set of versioned datasets may be visually distinguished in the graphical user interface from other sub-entries of compound nodes of the graph to indicate that a version of the particular versioned dataset represented by the sub-entry has been flagged in a database as containing invalid data. An edge in the graph representing a derivation dependency of a first version of a first versioned dataset in the set of versioned datasets on a second version of a second versioned dataset in the set of versioned datasets may be visually distinguished from other edges in the graph to indicate that the first version of the first versioned dataset potentially contains invalid data as a result of the derivation dependency. At least one version of a versioned dataset in the set of versioned datasets may contain data generated as a result of a Spark system executing a derivation program taking at least one version of another versioned dataset as input. At least one version of a versioned dataset in the set of versioned datasets may contain data generated as a result of a MapReduce system executing a derivation program taking at least one version of another versioned dataset as input.


In one aspect, the subject innovations are embodied in one or more non-transitory computer-readable media storing one or more programs. The one or more programs comprise instructions for receiving selection of a versioned dataset that is within a data pipeline system. The one or more programs further comprise instructions for determining full data provenance of the selected versioned dataset. The full data provenance comprises a set of versioned datasets. The one or more programs further comprise instructions for providing for display of a visualization of the full data provenance of the selected versioned dataset. The visualization comprises a graph. The graph comprises a compound node for the selected versioned dataset and for each versioned dataset in the set of versioned datasets. The graph further comprises edges connecting the compound nodes. Each edge represents a derivation dependency between versions of the versioned datasets represented by the compound nodes connected by the edge.


In one aspect, the subject innovations are embodied in a system comprising memory, one or more processors, and one or more programs stored in the memory and configured for execution by the one or more processors. The one or more programs comprise instructions for receiving selection of a versioned dataset that is within a data pipeline system. The one or more programs further comprise instructions for determining full data provenance of the selected versioned dataset. The full data provenance comprises a set of versioned datasets. The one or more programs further comprise instructions for providing for display of a visualization of the full data provenance of the selected versioned dataset. The visualization comprises a graph. The graph comprises a compound node for the selected versioned dataset and for each versioned dataset in the set of versioned datasets. The graph further comprises edges connecting the compound nodes. Each edge represents a derivation dependency between versions of the versioned datasets represented by the compound nodes connected by the edge.


It is understood that other configurations of the subject innovations will become readily apparent to those skilled in the art from the following detailed description, wherein various configurations of the subject innovations are shown and described by way of illustration. As will be realized, the subject innovations are capable of other and different configurations and their several details are capable of modification in various other respects, all without departing from the scope of the subject innovations. Accordingly, the drawings and detailed description are to be regarded as illustrative in nature and not as restrictive.





BRIEF DESCRIPTION OF THE DRAWINGS

The features of the subject innovations are set forth in the appended claims. However, for purpose of explanation, several aspects of the disclosed subject matter are set forth in the following figures.



FIG. 1 illustrates an example of a computer system configured to provide full data provenance visualization of versioned datasets.



FIG. 2 illustrates an example graphical user interface configured to provide full data provenance visualization of versioned datasets.



FIG. 3 illustrates an example graphical user interface configured to provide full data provenance visualization of versioned datasets.



FIG. 4 illustrates an example process by which full data provenance visualization for versioned datasets is provided.



FIG. 5 is a very general block diagram of a computing device in which software-implemented processes of the subject innovations may be embodied.



FIG. 6 is a block diagram of a basic software system for controlling the operation of the computing device.





DETAILED DESCRIPTION

The detailed description set forth below is intended as a description of various configurations of the subject innovations and is not intended to represent the only configurations in which the subject innovations may be practiced. The appended drawings are incorporated herein and constitute a part of the detailed description. The detailed description includes specific details for the purpose of providing a thorough understanding of the subject innovations. However, the subject innovations are not limited to the specific details set forth herein and may be practiced without these specific details. In some instances, some structures and components are shown in block diagram form in order to avoid obscuring the concepts of the subject innovations.


Glossary

The following definitions are offered for purposes of illustration, not limitation, in order to assist with understanding the discussion that follows.


MapReduce: MapReduce is a programming model and an associated implementation for processing and generating large datasets with a parallel, distributed algorithm on a cluster. See, e.g., Dean, Jeffrey, et al., “MapReduce: Simplified Data Processing on Large Clusters”, Google, Inc., 2004, the entire contents of which is hereby incorporated by reference as if fully set forth herein. APACHE HADOOP is a well-known open source implementation of MapReduce.


Spark: Like MapReduce, Spark is a programming model and an associated implementation for processing and generating large datasets with a parallel, distributed algorithm on a cluster. However, Spark is optimized for data-intensive applications that reuse a working set across multiple parallel operations including iterative jobs and interactive analytics. See, e.g., Zaharia, et al., “Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing”, 9th USENIX Symposium on Networked Systems Design and Implementation, 2012, the entire contents of which is hereby incorporated by reference as if fully set forth herein. APACHE SPARK is a well-known open source implementation of Spark.


General Overview


As noted above, it may be useful to a user of a data pipeline system to visualize the full data provenance of a versioned dataset. As used herein, the term “full data provenance” of a given versioned dataset encompasses at least all other versioned datasets from which the given versioned dataset is derived and may also include any further versioned datasets in the full data provenance of the other versioned datasets from which the given versioned dataset is derived. For example, if versioned dataset A is derived from versioned datasets B and C, versioned dataset C is derived from versioned dataset D, and versioned dataset D is derived from versioned dataset E, then the full data provenance of versioned dataset A encompasses at least versioned datasets B and C but may also include versioned dataset D and/or versioned dataset E.
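The definition above can be illustrated with a minimal sketch, assuming the derivation relationships are available as a simple mapping from each dataset name to the datasets from which it is derived. The mapping DERIVED_FROM and the function full_data_provenance are hypothetical names introduced only for illustration; they are not part of the described system. The sketch returns the most inclusive reading of the definition, that is, every dataset reachable through derivation links.

```python
from collections import deque

# Hypothetical derivation relationships from the example above:
# A is derived from B and C, C from D, and D from E.
DERIVED_FROM = {
    "A": ["B", "C"],
    "C": ["D"],
    "D": ["E"],
}


def full_data_provenance(dataset, derived_from):
    """Return every dataset reachable from `dataset` via derivation links."""
    seen = set()
    queue = deque(derived_from.get(dataset, []))
    while queue:
        current = queue.popleft()
        if current in seen:
            continue  # already visited through another derivation path
        seen.add(current)
        queue.extend(derived_from.get(current, []))
    return seen


provenance_of_a = full_data_provenance("A", DERIVED_FROM)
print(sorted(provenance_of_a))
```

A dataset with no recorded inputs, such as E in this mapping, yields an empty set.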


As a practical matter, a version of a dataset in a typical data pipeline system may be derived from versions of one or more other datasets, and those datasets each derived from versions of one or more further datasets, and so forth creating a situation where the validity of the dataset version depends on the validity of tens, hundreds, or more other dataset versions. It may be difficult within existing data pipeline systems to discover the full data provenance of a given dataset version and determine whether the dataset version is based on an invalid dataset version. For example, user Alice may flag dataset version D1 as invalid and user Bob may want to know if dataset version X1 is based on dataset version D1. A data pipeline system vendor or other software vendor may wish to assist Bob in discovering the full data provenance of dataset version X1 and help Bob determine if dataset version X1 is based on an invalid dataset version D1. As the foregoing illustrates, an approach for providing a visualization of the full data provenance of a dataset version may be desirable.


The subject innovations relate to providing a visualization of the full data provenance of a dataset version that is within a data pipeline system. In one implementation, a server may receive selection of a dataset within the data pipeline system. For example, a user may direct user input to a graphical user interface at a client computing device that selects the dataset from among other possible selectable datasets, and the selection may be transmitted to the server. The server may determine the full data provenance of the selected dataset. The full data provenance of the selected dataset may include a set of zero or more other datasets. The set may include at least any other datasets from which a version of the selected dataset is derived, in addition to any datasets from which those datasets are derived, and so on. The server may provide for display of a visualization of the full data provenance of the version of the selected dataset. The visualization may be displayed in a graphical user interface at the client computing device. For example, the visualization may be displayed within a web browser window or within an application window.


The visualization may include a graph. The graph may include a compound node for the selected dataset and a compound node for each dataset in the set of datasets of the full data provenance of the selected dataset. For example, the compound node for a dataset may include a sub-entry for each version of the dataset involved in the full data provenance of the selected dataset. For example, version one of selected dataset E might be derived from version one of dataset C and version twenty-nine of dataset D. Version one of dataset C might be derived, at least in part, from version five of dataset B. And version twenty-nine of dataset D might be derived, at least in part, from version twelve of dataset B. In this case, the compound node for dataset B may have at least two sub-entries: one sub-entry representing version five of dataset B and another sub-entry representing version twelve of dataset B.
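The grouping of versions into compound nodes can be sketched as follows, assuming version-level derivation dependencies are available as a mapping keyed by (dataset, version) pairs. The names VERSION_INPUTS and build_compound_nodes are hypothetical and serve only to illustrate the example above.

```python
from collections import defaultdict

# Hypothetical version-level dependencies taken from the example above.
# Each key is a (dataset, version) pair; each value lists its input versions.
VERSION_INPUTS = {
    ("E", 1): [("C", 1), ("D", 29)],
    ("C", 1): [("B", 5)],
    ("D", 29): [("B", 12)],
}


def build_compound_nodes(selected, version_inputs):
    """Group every dataset version in the provenance of `selected` by dataset."""
    nodes = defaultdict(set)
    visited = set()
    stack = [selected]
    while stack:
        dataset, version = stack.pop()
        if (dataset, version) in visited:
            continue
        visited.add((dataset, version))
        nodes[dataset].add(version)
        stack.extend(version_inputs.get((dataset, version), []))
    # One compound node per dataset, with its sub-entries in sorted order.
    return {name: sorted(versions) for name, versions in nodes.items()}


compound_nodes = build_compound_nodes(("E", 1), VERSION_INPUTS)
print(compound_nodes["B"])
```

In this sketch the compound node for dataset B ends up with two sub-entries, one for version five and one for version twelve, matching the example in the text.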


The graph further comprises edges connecting the compound nodes. Each edge represents a derivation dependency between versions of the versioned datasets represented by the compound nodes connected by the edge. For example, an edge connecting a sub-entry of the compound node for dataset B and a sub-entry of the compound node for dataset D may represent a derivation dependency between version twenty-nine of dataset D and version twelve of dataset B.


The sub-entry of a compound node corresponding to a dataset version that has been flagged or marked invalid may be highlighted or visually distinguished in the visualization. For example, assume version twelve of dataset B fails a dataset validation process and as a result is flagged or marked invalid in a database. In this case, the sub-entry of the compound node representing version twelve of dataset B, the edge connecting that sub-entry to the sub-entry of the compound node representing version twenty-nine of dataset D, the sub-entry of the compound node representing version twenty-nine of dataset D, the edge connecting the sub-entry of the compound node representing version twenty-nine of dataset D to the sub-entry representing version one of dataset E, and the compound node representing dataset E all may be colored red or visually distinguished in some way to indicate that version twelve of dataset B contains invalid data and that version twenty-nine of dataset D and version one of dataset E may contain invalid data as a result of version twelve of dataset B containing invalid data.
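One way to decide which sub-entries and edges to visually distinguish is sketched below, assuming the same version-level dependency mapping as in the earlier example and a set of versions flagged invalid in the database. The names VERSION_INPUTS, FLAGGED_INVALID, and highlighted_elements are hypothetical; the fixed-point loop is only one of several ways the propagation could be computed.

```python
# Hypothetical version-level dependencies from the example above.
VERSION_INPUTS = {
    ("E", 1): [("C", 1), ("D", 29)],
    ("C", 1): [("B", 5)],
    ("D", 29): [("B", 12)],
}
# Versions flagged invalid in the database (assumed input).
FLAGGED_INVALID = {("B", 12)}


def highlighted_elements(version_inputs, flagged):
    """Return the sub-entries and edges that should be visually distinguished."""
    tainted = set(flagged)
    edges = set()
    changed = True
    # Repeat until nothing new is marked, so invalidity flows downstream.
    while changed:
        changed = False
        for target, inputs in version_inputs.items():
            for source in inputs:
                if source not in tainted:
                    continue
                if (source, target) not in edges:
                    edges.add((source, target))
                    changed = True
                if target not in tainted:
                    tainted.add(target)
                    changed = True
    return tainted, edges


tainted_versions, tainted_edges = highlighted_elements(VERSION_INPUTS, FLAGGED_INVALID)
print(sorted(tainted_versions))
```

Version one of dataset C and version five of dataset B are left unmarked because neither depends on the flagged version.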


Advantageously, in some implementations of the subject innovations, a user is informed of the full data provenance of a selected dataset version. As a result, the user may more easily identify derivation dependencies between dataset versions including derivation dependencies involving invalid dataset versions.


Example Computer System



FIG. 1 illustrates an example of a computer system 100 configured to provide full data provenance visualization of versioned datasets. As shown, the computer system 100 includes a client computing device 112 used by a human user 110, a server 130, a database 140, a data pipeline system 150, and a distributed file system 160. The client computing device 112 and the server 130 may be configured to communicate with one another via a network 120. The network 120 may include the Internet, an intranet, a local area network, a wide area network, a wired network, a wireless network, or a virtual private network (VPN).


The client computing device 112 may be a laptop computer, a desktop computer, a mobile phone, a personal digital assistant (PDA), a tablet computer, a netbook, a television with one or more processors embedded therein or coupled thereto, a physical machine, or a virtual machine. The client computing device 112 may include one or more of a keyboard, a mouse, a display 114, or a touch screen (of which display 114 may be a part). For example, the client computing device 112 may be composed of hardware components like those of basic computing device 500 described below with respect to FIG. 5 and configured with a basic software system like software system 600 described below with respect to FIG. 6. The client computing device 112 may also include a web browser or a client application configured to display, in a graphical user interface 116 on the display 114 of the client computing device 112, a visualization of the full data provenance of a selected dataset version in accordance with this disclosure of the subject innovations. The graphical user interface 116 may be a web browser window, a client application window, an operating system window, or other computer graphical user interface window. While only one user 110 and one client computing device 112 are illustrated in FIG. 1, the subject innovations may be implemented in conjunction with one or more users 110 and one or more client computing devices 112.


The server 130 may include a full data provenance visualization module to provide a visualization of the full data provenance of a selected dataset version, based on provenance metadata 142 stored in database 140. The server 130 may be implemented as a single server computing device or as multiple server computing devices arranged in a distributed or clustered computing arrangement. Each such server computing device may be composed of hardware components like those of basic computing device 500 described below with respect to FIG. 5 and configured with a basic software system like software system 600 described below with respect to FIG. 6.


The server 130 may include one or more processors (e.g., CPUs), a network interface, and memory. The processor(s) may be configured to execute computer instructions that are stored in one or more computer-readable media, for example, the memory of the server 130. The server 130 may include a network interface that is configured to allow the server 130 to transmit and receive data in a network, e.g., network 120 of FIG. 1. The network interface may include one or more network interface cards (NICs). The memory of the server 130 may store data or instructions. The instructions stored in the memory may include the full data provenance visualization module.


The database 140 may include a database server module for storing and retrieving database data including provenance metadata 142 and derivation programs 144. The database 140 may be implemented as a single server computing device or as multiple server computing devices arranged in a distributed or clustered computing arrangement. Each such server computing device may be composed of hardware components like those of basic computing device 500 described below with respect to FIG. 5 and configured with a basic software system like software system 600 described below with respect to FIG. 6.


The database 140 may include one or more processors (e.g., CPUs), a network interface, and memory. The processor(s) may be configured to execute computer instructions that are stored in one or more computer-readable media, for example, the memory of the database 140. The database 140 may include a network interface that is configured to allow the database 140 to transmit and receive data in one or more networks, e.g., a network connecting the server 130 and the database 140 and a network connecting the data pipeline system 150 to the database 140, which may be the same or different network as the network that connects the server 130 and the database 140. The network interface may include one or more network interface cards (NICs). The memory of the database 140 may store data or instructions. The instructions stored in the memory may include the database server module.


The data pipeline system 150 may include a dataset derivation module to derive dataset 162 versions from other dataset 162 versions by executing derivation programs 144. The data pipeline system 150 may also include a provenance metadata update module for updating provenance metadata 142 in database 140 when new dataset 162 versions are derived. The data pipeline system 150 may be implemented as a single server computing device or as multiple server computing devices arranged in a distributed or clustered computing arrangement. Each such server computing device may be composed of hardware components like those of basic computing device 500 described below with respect to FIG. 5 and configured with a basic software system like software system 600 described below with respect to FIG. 6.


The data pipeline system 150 may include one or more processors (e.g., CPUs), a network interface, and memory. The processor(s) may be configured to execute computer instructions that are stored in one or more computer-readable media, for example, the memory of the data pipeline system 150. The data pipeline system 150 may include a network interface that is configured to allow the data pipeline system 150 to transmit and receive data in a network, e.g., a network connecting the data pipeline system 150 and the database 140 and a network connecting the data pipeline system 150 to the distributed file system 160, which may be the same or different network as the network that connects the data pipeline system 150 and the database 140. The network interface may include one or more network interface cards (NICs). The memory of the data pipeline system 150 may store data or instructions. The instructions stored in the memory may include the dataset derivation module and the provenance metadata update module. In an exemplary non-limiting embodiment, the dataset derivation module is implemented at least in part by an implementation of the MapReduce system, for example, APACHE HADOOP. In an exemplary non-limiting embodiment, the dataset derivation module is implemented at least in part by an implementation of the Spark system, for example, APACHE SPARK.


The distributed file system 160 may include a distributed file system module to provide distributed file system services to the data pipeline system 150 over a network that connects the distributed file system 160 and the data pipeline system 150. The distributed file system 160 may be implemented as a single server computing device or as multiple server computing devices arranged in a distributed or clustered computing arrangement. Each such server computing device may be composed of hardware components like those of basic computing device 500 described below with respect to FIG. 5 and configured with a basic software system like software system 600 described below with respect to FIG. 6.


The distributed file system 160 may include one or more processors (e.g., CPUs), a network interface, and memory. The processor(s) may be configured to execute computer instructions that are stored in one or more computer-readable media, for example, the memory of the distributed file system 160. The distributed file system 160 may include a network interface that is configured to allow the distributed file system 160 to transmit and receive data in a network, e.g., a network connecting distributed file system 160 and the data pipeline system 150. The network interface may include one or more network interface cards (NICs). The memory of the distributed file system 160 may store data or instructions. The instructions stored in the memory may include the distributed file system module. In an exemplary non-limiting embodiment, the distributed file system module is implemented by the APACHE HADOOP Distributed File System (HDFS) configured on a cluster of commodity server computing devices.


The full data provenance visualization module of the server 130 is configured to provide a visualization of the full data provenance of a selected versioned dataset 162, based on provenance metadata 142 stored in database 140. The selected versioned dataset 162 may be stored within the data pipeline system 150. The data pipeline system 150 includes the distributed file system 160. The visualization may include a graph. The graph may comprise compound nodes and edges connecting the compound nodes in the graph. Each of the compound nodes may represent the selected dataset 162 or a dataset 162 in the full data provenance of the selected dataset 162. Each edge represents a derivation dependency between a version of a dataset 162 and a version of another dataset 162.


A dataset 162 is a logical collection of highly structured, semi-structured, or unstructured data. A non-limiting example of highly structured data is data that conforms to a standardized or well-known data model, for example, a relational model or other table-based data model. A non-limiting example of semi-structured data is data that has self-describing structure, for example, eXtensible Markup Language (XML) data or Javascript Object Notation (JSON) data. A non-limiting example of unstructured data is data that is not highly structured or semi-structured data, for example, some text data or log data. Each dataset 162 version may be stored in one or more files in the distributed file system 160.


A derivation dependency may exist between two versions of two datasets 162 if one of the two versions was derived from the other of the two versions within the data pipeline system 150. In particular, a version of a “target” dataset 162 may be derived by the data pipeline system 150 from one or more versions of one or more “input” datasets 162. In doing so, the data pipeline system 150 may provide the version(s) of the input dataset(s) 162 as input to the derivation program 144. The derivation program 144, in conjunction with the data pipeline system 150, may produce the version of the target dataset 162 as output. In this case, the version of the target dataset 162 has a derivation dependency on each of the version(s) of the input dataset(s) 162. Such derivation dependencies may be stored by the data pipeline system 150 in database 140 as part of the provenance metadata 142.
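The recording of derivation dependencies described above can be sketched as follows, assuming a derivation program is modeled as a plain Python callable and the provenance metadata as an in-memory dictionary. The function run_derivation, the dataset names, and the dictionary layout are all hypothetical; an actual data pipeline system would persist this information in the database 140.

```python
def run_derivation(program, inputs, target, provenance):
    """Run `program` on the input versions and record the dependencies.

    `inputs` maps (dataset, version) pairs to their data, and `target` is the
    (dataset, version) pair being produced. All names are illustrative.
    """
    output = program(list(inputs.values()))
    # The target version depends on every input version it was derived from.
    provenance[target] = sorted(inputs.keys())
    return output


provenance_metadata = {}
combined = run_derivation(
    program=lambda datasets: [row for data in datasets for row in data],
    inputs={("input_a", 3): ["a-row"], ("input_b", 7): ["b-row"]},
    target=("target", 1),
    provenance=provenance_metadata,
)
print(provenance_metadata[("target", 1)])
```

Recording the dependency at the same point where the derivation runs keeps the metadata in step with the dataset versions actually produced.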


Database 140 may store one or more derivation programs 144. A derivation program 144 may include instructions for extracting (e.g., selecting) and transforming data from version(s) of one or more datasets 162 input to derivation program 144. The extracted and transformed data may be stored as a new dataset 162 version in the distributed file system 160. The derivation program 144 itself may specify the versions of the dataset(s) 162 that are to be the input to the derivation program 144 when executed. Alternatively, a user may specify the versions of the dataset(s) 162 that are to be the input to an execution of the derivation program 144. The derivation program 144 may be executed by the dataset derivation module of the data pipeline system 150. The derivation program 144 may include a variety of different high-level query language instructions depending on whether the dataset derivation module is MapReduce-based or Spark-based. For example, if the dataset derivation module is MapReduce-based, then the derivation program 144 may include, for example, MapReduce instructions that invoke an APACHE HADOOP MapReduce Application Programming Interface (API), APACHE PIG instructions, APACHE HIVE instructions, Jaql instructions, or other instructions for carrying out MapReduce operations on datasets 162. If the dataset derivation module is Spark-based, then the derivation program 144 may include, for example, Scala, Java, Python, Clojure, or R instructions for carrying out Spark transformations on datasets 162. While derivation programs 144 are shown in FIG. 1 as being stored in database 140, derivation programs 144 may be stored in another location, for example, in the distributed file system 160 or in a different database.


Provenance metadata 142 comprises information about the full data provenance of dataset 162 versions. For a given dataset 162 within the data pipeline system 150, provenance metadata 142 may include all of the following information about the given dataset 162, or a subset or a superset thereof:

    • A name or unique identifier of the given dataset 162.
    • An identifier of each version of the given dataset 162 within the data pipeline system 150.
    • An identifier of the current version of the given dataset 162 within the data pipeline system 150.


For each version of the given dataset 162 within the data pipeline system 150, the provenance metadata 142 may include all of the following information about the given dataset 162 version, or a subset or a superset thereof:

    • The identifier of the given dataset 162 version.
    • If the given dataset 162 version was derived from one or more other dataset 162 versions, then, for each such other dataset 162, the name or unique identifier of the other dataset 162 and the identifier of the version of the other dataset 162.
    • If the given dataset 162 version was derived from one or more other dataset 162 versions, the name or identifier of the derivation program 144 executed by the data pipeline system 150 to derive the given dataset 162 version. In some implementations, derivation programs 144 are versioned and the provenance metadata 142 includes the identifier of the version of the derivation program 144 executed by the data pipeline system 150 to derive the given dataset 162 version.
    • A flag (e.g., a dirty bit) that indicates that the given dataset 162 version contains invalid data. The flag may be set as a result of the given dataset 162 version failing a data validation process, for example.
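
By way of illustration only, the per-dataset and per-version information listed above might be held in records such as the following. This is a minimal sketch and not a required schema: the class names, the field names, the use of integers as version identifiers, and the program name "derive_c" are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

# (dataset name, version identifier); a hypothetical key format.
VersionRef = Tuple[str, int]


@dataclass
class VersionRecord:
    """Per-version provenance metadata (hypothetical layout)."""
    version_id: int
    # Dataset versions this version was derived from, if any.
    inputs: List[VersionRef] = field(default_factory=list)
    # Name or identifier of the derivation program, if the version was derived.
    derivation_program: Optional[str] = None
    # Only meaningful in implementations where derivation programs are versioned.
    derivation_program_version: Optional[int] = None
    # The "dirty bit": True if the version contains invalid data.
    invalid: bool = False


@dataclass
class DatasetRecord:
    """Per-dataset provenance metadata (hypothetical layout)."""
    name: str
    versions: Dict[int, VersionRecord] = field(default_factory=dict)
    current_version: Optional[int] = None

    def add_version(self, record: VersionRecord, make_current: bool = True) -> None:
        """Register a version and optionally mark it as the current one."""
        self.versions[record.version_id] = record
        if make_current:
            self.current_version = record.version_id


# Version one of dataset C, derived from version one of A and version five of B.
dataset_c = DatasetRecord(name="C")
dataset_c.add_version(
    VersionRecord(version_id=1, inputs=[("A", 1), ("B", 5)],
                  derivation_program="derive_c")
)
```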


In one example, the full data provenance visualization module of the server 130 is implemented in software. The full data provenance visualization module may include code for receiving selection of a versioned dataset 162 within the data pipeline system 150. The data pipeline system 150 includes the distributed file system 160. The selection may be received by the full data provenance visualization module of the server 130 over network 120 from client computing device 112 (e.g., in an HTTP or HTTPS request) as a result of the user 110 interacting with a graphical user interface 116 presented on the display 114. The selection may be for just a dataset 162 or for a particular version of a dataset 162. If the selection is for just a dataset 162, then a particular version of the dataset 162 may be selected by the full data provenance visualization module based on the selection. For example, the full data provenance visualization module may select, as the particular version of the dataset 162 for which to provide a full data provenance visualization, the current version of the selected dataset 162 or the most recent version of the selected dataset 162 as indicated in the provenance metadata 142.


The full data provenance visualization module may further include code for determining full data provenance of the particular version of the selected versioned dataset 162. The full data provenance may include a set of zero or more versioned datasets 162. The set may include no versioned datasets 162 if the particular version of the selected dataset 162 is not derived from any other datasets 162. For example, the particular version of the selected dataset 162 may have been stored in the distributed file system 160 by an external data source and not generated by the data pipeline system 150 as a result of executing a derivation program 144. As another example, the particular version of the selected dataset 162 may have been generated by the data pipeline system 150 as a result of executing a derivation program 144 that did not accept any other dataset 162 versions as input.


To determine the full data provenance of the particular version of the selected dataset 162, the full data provenance visualization module may consult the provenance metadata 142 in the database 140. In particular, the full data provenance visualization module may start the determination with an empty set of dataset 162 versions representing the full data provenance of the particular version of the selected dataset 162. The determination may then include the full data provenance visualization module consulting the provenance metadata 142 to determine all dataset 162 versions from which the particular version of the selected dataset 162 was derived and adding those dataset 162 versions to the set of versioned datasets 162 representing the full data provenance of the particular version of the selected dataset 162. The full data provenance visualization module may then repeat this determination for each of those dataset 162 versions just added to the set and so on in a recursive or iterative manner, adding any dataset 162 versions from which a dataset 162 version in the full data provenance of the particular version of the selected dataset 162 was derived to the set of dataset 162 versions representing the full data provenance of the particular version of the selected dataset 162. The recursion or iteration may end when all dataset 162 versions, according to the provenance metadata 142, in the full data provenance of the particular version of the selected dataset 162 have been determined and added to the set, or when a stop condition is reached. The stop condition may be based on a threshold degree of derivation between the particular version of the selected dataset 162 and a dataset 162 version in the full data provenance of the particular version of the selected dataset 162. For example, if the threshold degree of derivation is ten, then only dataset 162 versions in the full data provenance of the particular version of the selected dataset 162 that are within ten degrees of derivation of the particular version of the selected dataset 162 will be added to the set of versioned datasets 162 representing the full data provenance of the particular version of the selected dataset 162.
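
By way of illustration only, the determination described above might be sketched as an iterative, breadth-first traversal. This is one possible approach under stated assumptions, not a required implementation: the dictionary DERIVED_FROM is a hypothetical in-memory stand-in for the provenance metadata 142, the function and parameter names are invented for the example, and the sample dependencies are those described for FIG. 2.

```python
from collections import deque
from typing import Dict, List, Optional, Set, Tuple

VersionRef = Tuple[str, int]  # (dataset name, version identifier)

# Hypothetical stand-in for provenance metadata: each dataset version maps
# to the dataset versions from which it was directly derived.
DERIVED_FROM: Dict[VersionRef, List[VersionRef]] = {
    ("E", 1): [("C", 1), ("D", 29)],
    ("C", 1): [("A", 1), ("B", 5)],
    ("D", 29): [("B", 7), ("B", 12)],
    ("B", 5): [("A", 2)],
    ("B", 7): [("A", 2)],
    ("B", 12): [("A", 2)],
}


def full_provenance(
    selected: VersionRef,
    derived_from: Dict[VersionRef, List[VersionRef]],
    max_degree: Optional[int] = None,
) -> Set[VersionRef]:
    """Collect every dataset version the selected version was derived from.

    max_degree is the optional stop condition: versions more than
    max_degree derivation steps from the selected version are not added.
    """
    provenance: Set[VersionRef] = set()      # starts as the empty set
    queue = deque([(selected, 0)])           # (version, degree of derivation)
    while queue:
        version, degree = queue.popleft()
        if max_degree is not None and degree >= max_degree:
            continue                         # stop condition reached
        for parent in derived_from.get(version, []):
            if parent not in provenance:     # visit each version only once
                provenance.add(parent)
                queue.append((parent, degree + 1))
    return provenance


everything = full_provenance(("E", 1), DERIVED_FROM)
direct_only = full_provenance(("E", 1), DERIVED_FROM, max_degree=1)
```

A breadth-first order is used here so that each version is first reached at its smallest degree of derivation, which keeps the threshold check simple. A recursive formulation would serve equally well.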


The full data provenance visualization module may further include code for providing for display (e.g., via a web browser on the client computing device 112) of a visualization of the determined full data provenance of the particular version of the selected dataset 162. The visualization may include a graph. The graph may include a compound node for the selected dataset 162 and for each dataset 162 in the set of dataset 162 versions representing the full data provenance of the particular version of the selected dataset 162. The graph may include directed edges connecting the compound nodes. Each directed edge may represent a derivation dependency between versions of the versioned datasets 162 represented by the compound nodes connected by the edge. Each compound node may include a sub-entry for each version of the dataset 162 represented by the compound node in the set of dataset 162 versions representing the full data provenance of the particular version of the selected dataset 162.
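
By way of illustration only, the grouping of dataset versions into compound nodes with sub-entries might be sketched as follows. The function name, the dictionary-based graph representation, and the sample data are assumptions made for the example. Edges are emitted here from input version to output version, which is only one of the possible edge directions.

```python
from collections import defaultdict
from typing import Dict, List, Set, Tuple

VersionRef = Tuple[str, int]          # (dataset name, version identifier)
Edge = Tuple[VersionRef, VersionRef]  # (input version, output version)

# Hypothetical direct derivation dependencies, keyed by derived version.
derived_from: Dict[VersionRef, List[VersionRef]] = {
    ("E", 1): [("C", 1), ("D", 29)],
    ("C", 1): [("A", 1), ("B", 5)],
    ("D", 29): [("B", 7), ("B", 12)],
    ("B", 5): [("A", 2)],
    ("B", 7): [("A", 2)],
    ("B", 12): [("A", 2)],
}


def build_graph(
    selected: VersionRef,
    provenance: Set[VersionRef],
    dependencies: Dict[VersionRef, List[VersionRef]],
) -> Tuple[Dict[str, List[int]], List[Edge]]:
    """Return (compound nodes, directed edges) for the visualization.

    Each compound node is keyed by dataset name and lists one sub-entry
    (a version identifier) per version of that dataset in the graph.
    """
    members = set(provenance) | {selected}
    compound_nodes: Dict[str, List[int]] = defaultdict(list)
    for name, version in sorted(members):
        compound_nodes[name].append(version)
    edges: List[Edge] = [
        (inp, out)
        for out in sorted(members)
        for inp in dependencies.get(out, [])
        if inp in members                 # skip versions cut off by a threshold
    ]
    return dict(compound_nodes), edges


provenance = {("C", 1), ("D", 29), ("A", 1), ("A", 2), ("B", 5), ("B", 7), ("B", 12)}
nodes, edges = build_graph(("E", 1), provenance, derived_from)
```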


Example Graphical User Interfaces



FIG. 2 illustrates an example graphical user interface window 200 (e.g., a web browser window) configured to provide full data provenance visualization of versioned datasets.


The window 200 may be displayed via the display 114 (e.g., a screen) of the client computing device 112. As shown, the window 200 includes a graph 202.


The graph 202 represents the determined full data provenance of version one of selected dataset E. As indicated by the graph 202, version one of selected dataset E has a derivation dependency on version one of dataset C and version twenty nine of dataset D. Version one of dataset C has a derivation dependency on version one of dataset A and version five of dataset B. Version twenty nine of dataset D has a derivation dependency on versions seven and twelve of dataset B. Versions five, seven, and twelve of dataset B each have a derivation dependency on version two of dataset A. Since there are three versions of dataset B in the full data provenance of version one of dataset E, there are three sub-entries of the compound node representing dataset B in the graph 202. The three sub-entries represent the three versions, respectively. The remaining compound nodes have only one sub-entry as only one version of each of the remaining datasets is in the full data provenance of version one of selected dataset E.


A compound node in a graph representing a dataset may indicate the name or identifier of the dataset. For example, the compound node representing dataset B in graph 202 is labeled with the text “B”.


The compound node in a graph representing the selected dataset may indicate the selected dataset. For example, the compound node representing selected dataset E in graph 202 is labeled with the text “(TARGETED NODE)” to indicate that dataset E is the selected dataset for which the full data provenance is visualized in GUI 200. The compound node representing the selected dataset may be colored differently (or otherwise visually distinguished) from the other compound nodes in the graph to indicate the selected dataset.


A sub-entry of a compound node representing a version of a dataset may include metadata about the version of the dataset. For example, the sub-entry of the compound node representing version one of dataset C in graph 202 indicates the version number (e.g., “V 1”), the name of a user of a data pipeline system that caused the data pipeline system to create version one of dataset C (e.g., “Jane Smith”), and the date version one of dataset C was created (e.g., “Monday”).



FIG. 3 illustrates an example graphical user interface window 300 (e.g., a web browser window) configured to provide full data provenance visualization of versioned datasets.


The window 300 may be displayed via the display 114 (e.g., a screen) of the client computing device 112. As shown, the window 300 includes a graph 302.


The graph 302 represents the determined full data provenance of version one of selected dataset E. As indicated by the graph 302, version one of selected dataset E has a derivation dependency on version one of dataset C and version twenty nine of dataset D. Version one of dataset C has a derivation dependency on version one of dataset A and version five of dataset B. Version twenty nine of dataset D has a derivation dependency on versions seven and twelve of dataset B. Versions five, seven, and twelve of dataset B each have a derivation dependency on version two of dataset A. Since there are three versions of dataset B in the full data provenance of version one of dataset E, there are three sub-entries of the compound node representing dataset B in the graph 302. The three sub-entries represent the three versions, respectively. The remaining compound nodes have only one sub-entry as only one version of each of the remaining datasets is in the full data provenance of version one of selected dataset E.


In this example, version five of dataset B has been flagged as invalid. For example, version five of dataset B may have failed a data validation process. As a result, the sub-entry representing version five of dataset B is colored differently (or otherwise visually distinguished) from other sub-entries in the graph 302 to indicate that the dataset version contains invalid data.


Other sub-entries representing “downstream” dataset versions may also be colored differently (or otherwise visually distinguished) to indicate that they may also contain invalid data as a result of invalid data in a dataset version. For example, since version one of dataset C has a derivation dependency on invalid version five of dataset B and version one of dataset E has a derivation dependency on potentially invalid version one of dataset C, the sub-entries representing version one of dataset C and version one of dataset E may be colored differently (or otherwise visually distinguished) to indicate that they may potentially contain invalid data as a result of the invalid data in version five of dataset B.
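
By way of illustration only, the set of "downstream" dataset versions that may contain invalid data might be computed as sketched below. The function name and the dictionary representation of derivation dependencies are assumptions made for the example, and the sample data mirrors the dependencies described for graph 302.

```python
from typing import Dict, List, Set, Tuple

VersionRef = Tuple[str, int]  # (dataset name, version identifier)

# Hypothetical direct derivation dependencies, keyed by derived version.
derived_from: Dict[VersionRef, List[VersionRef]] = {
    ("E", 1): [("C", 1), ("D", 29)],
    ("C", 1): [("A", 1), ("B", 5)],
    ("D", 29): [("B", 7), ("B", 12)],
    ("B", 5): [("A", 2)],
    ("B", 7): [("A", 2)],
    ("B", 12): [("A", 2)],
}


def potentially_invalid_versions(
    dependencies: Dict[VersionRef, List[VersionRef]],
    invalid: Set[VersionRef],
) -> Set[VersionRef]:
    """Return versions that depend, directly or indirectly, on an invalid version.

    Versions that are themselves flagged invalid are excluded from the result.
    """
    memo: Dict[VersionRef, bool] = {}

    def is_tainted(version: VersionRef) -> bool:
        if version in memo:
            return memo[version]
        memo[version] = False  # provisional value guards against cyclic metadata
        tainted = any(
            parent in invalid or is_tainted(parent)
            for parent in dependencies.get(version, [])
        )
        memo[version] = tainted
        return tainted

    all_versions = set(dependencies) | {
        parent for parents in dependencies.values() for parent in parents
    }
    return {v for v in all_versions if v not in invalid and is_tainted(v)}


# Version five of dataset B has been flagged as invalid.
suspect = potentially_invalid_versions(derived_from, {("B", 5)})
```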


An edge connecting a sub-entry representing a potentially invalid dataset version to a sub-entry representing an invalid dataset version may be colored differently (or otherwise visually distinguished) from other edges in a graph to indicate that the potentially invalid dataset version has a derivation dependency on an invalid dataset version. For example, the edge in graph 302 connecting the sub-entry representing version one of dataset C with the sub-entry representing version five of dataset B may be colored differently (or otherwise visually distinguished) from other edges in the graph 302 to indicate that the potentially invalid version one of dataset C has a derivation dependency on an invalid version five of dataset B.


An edge connecting a sub-entry representing a potentially invalid dataset version to a sub-entry representing another potentially invalid dataset version may be colored differently (or otherwise visually distinguished) from other edges in a graph to indicate that the potentially invalid dataset version has a derivation dependency on another potentially invalid dataset version. For example, the edge in graph 302 between the sub-entry representing version one of dataset E and the sub-entry representing version one of dataset C may be colored differently (or otherwise visually distinguished) from other edges in the graph 302 to indicate that the potentially invalid version one of dataset E has a derivation dependency on potentially invalid version one of dataset C.
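
By way of illustration only, the two kinds of visually distinguished edges might be assigned as sketched below. The style labels, the function name, and the input sets are hypothetical. The sets of invalid and potentially invalid versions are assumed to have been determined already.

```python
from typing import Dict, List, Set, Tuple

VersionRef = Tuple[str, int]          # (dataset name, version identifier)
Edge = Tuple[VersionRef, VersionRef]  # (input version, output version)

# Hypothetical direct derivation dependencies, keyed by derived version.
derived_from: Dict[VersionRef, List[VersionRef]] = {
    ("E", 1): [("C", 1), ("D", 29)],
    ("C", 1): [("A", 1), ("B", 5)],
    ("D", 29): [("B", 7), ("B", 12)],
}


def edge_styles(
    dependencies: Dict[VersionRef, List[VersionRef]],
    invalid: Set[VersionRef],
    potentially_invalid: Set[VersionRef],
) -> Dict[Edge, str]:
    """Assign a style label to every derivation-dependency edge."""
    styles: Dict[Edge, str] = {}
    for output, inputs in dependencies.items():
        for inp in inputs:
            if inp in invalid:
                # The output depends on a version flagged as invalid.
                styles[(inp, output)] = "depends-on-invalid"
            elif inp in potentially_invalid:
                # The output depends on a version that may itself be invalid.
                styles[(inp, output)] = "depends-on-potentially-invalid"
            else:
                styles[(inp, output)] = "normal"
    return styles


styles = edge_styles(
    derived_from,
    invalid={("B", 5)},
    potentially_invalid={("C", 1), ("E", 1)},
)
```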


While in some embodiments as exemplified in FIG. 3, an edge in a graph representing a derivation dependency between dataset versions is directed from a sub-entry of a compound node representing an input dataset version to a derivation program to a sub-entry of a compound node representing the output dataset version from the derivation program, the edge is directed from the sub-entry representing the output dataset version to the sub-entry representing the input dataset version in other embodiments.


Example Process



FIG. 4 illustrates an example process 400 by which full data provenance visualization for versioned datasets is provided. Process 400 may be performed by software when executed by one or more computing devices. For example, process 400 may be performed by one or more applications 602 executing on one or more computing devices 500, each configured with a software system like software system 600. (See FIGS. 5 and 6 and associated description below). The one or more computing devices on which process 400 executes can be, for example, client 112, server 130, or a combination of client 112 and server 130.


The process 400 begins at step 410, where a server (e.g., server 130) receives selection of a versioned dataset (e.g., 162-1). That versioned dataset may be within a data pipeline system (e.g., data pipeline system 150 and distributed file system 160). In fact, a number of versioned datasets may be within the data pipeline system. Some, but not necessarily all, of the dataset versions within the data pipeline system may be “derived” datasets in that the dataset version is generated by the data pipeline system executing a derivation program (e.g., 144), or a version of a derivation program. When executed, the derivation program may accept one or more other dataset versions as input. In this way, the generated dataset version is derived from the input dataset version(s). In some instances, a dataset version within the data pipeline system is generated as a result of a Spark system executing a derivation program taking a version of at least one other dataset as input to the derivation program. In some instances, a dataset version within the data pipeline system is generated as a result of a MapReduce system executing a derivation program taking a version of at least one other dataset as input to the derivation program.


The server may receive selection of the versioned dataset over a network (e.g., 120). The server may receive the selection from a client computing device (e.g., 112). The received selection may identify the versioned dataset selected. In addition, the received selection may identify a particular version of the selected dataset for which to determine the full data provenance. If the selection does not identify a particular version of the selected dataset, then the server may assume a default version of the selected dataset. The default version can be selected by the server based on provenance metadata (e.g., 142) for the selected dataset. In some instances, the server selects a current version of the selected dataset as the default version. In some instances, the server selects the latest (most recent) version of the selected dataset as the default version.
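
By way of illustration only, the choice of a default version might be sketched as follows. The metadata layout, the function name, and the "prefer" parameter are hypothetical, and the sketch assumes that version identifiers are integers that increase over time so that the latest version is the largest identifier.

```python
from typing import Dict, Optional


def resolve_version(
    metadata: Dict,
    requested: Optional[int] = None,
    prefer: str = "current",
) -> int:
    """Pick the dataset version whose full data provenance will be determined."""
    versions = metadata["versions"]
    if requested is not None:
        # The selection identified a particular version; honor it.
        if requested not in versions:
            raise KeyError(f"unknown version {requested} of {metadata['name']}")
        return requested
    if prefer == "current" and metadata.get("current_version") is not None:
        return metadata["current_version"]
    # Fall back to the latest version (assumes orderable identifiers).
    return max(versions)


# Hypothetical provenance metadata for dataset B.
dataset_b = {"name": "B", "versions": [5, 7, 12], "current_version": 7}

default_current = resolve_version(dataset_b)
default_latest = resolve_version(dataset_b, prefer="latest")
explicit = resolve_version(dataset_b, requested=5)
```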


In step 420, the server determines the full data provenance of the particular version of the selected dataset. The full data provenance may comprise a set of zero or more other datasets within the data pipeline system. For example, GUI 300 of FIG. 3 shows that datasets A, B, C, and D are in the full data provenance of version one of dataset E. In particular, version twenty nine of dataset D, version one of dataset C, versions five, seven, and twelve of dataset B, and versions one and two of dataset A are in the full data provenance of version one of dataset E. The determination of the full data provenance of the particular version of the selected dataset may be based on the provenance metadata for datasets as maintained by the data pipeline system.


In step 430, the server provides for display (e.g., via a web browser window on the client computing device) of a visualization of the full data provenance of the particular version of the selected dataset. The visualization comprises a graph (e.g., 202 and 302). The graph may comprise a compound node for the selected dataset and a compound node for each versioned dataset in the set of versioned datasets determined in step 420. The graph may further comprise edges connecting the compound nodes where each edge represents a derivation dependency between versions of the versioned datasets represented by the compound nodes connected by the edge.


In some instances, if a particular version of a particular dataset in the full data provenance of the particular version of the selected dataset is flagged or marked as invalid in the provenance metadata, then the sub-entry of the compound node representing the particular version of the particular dataset may be colored differently or otherwise visually distinguished in the graph from other sub-entries to indicate that the particular version of the particular dataset contains invalid data.


In some instances, if a particular version of a first dataset in the full data provenance of the particular version of the selected dataset has a derivation dependency on a particular version of a second dataset in the full data provenance of the particular version of the selected dataset and the particular version of the second dataset is flagged or marked as invalid in the provenance metadata, then the edge in the graph connecting the sub-entry of the compound node for the particular version of the first dataset to the sub-entry of the compound node for the particular version of the second dataset may be colored differently or otherwise visually distinguished in the graph from other edges to indicate that the particular version of the first dataset potentially contains invalid data as a result of the derivation dependency on the particular version of the second dataset.


In some instances, if a particular version of a first dataset in the full data provenance of the particular version of the selected dataset has a derivation dependency on a particular version of a second dataset in the full data provenance of the particular version of the selected dataset and the particular version of the second dataset potentially contains invalid data as a result of a derivation dependency on a version of a dataset that contains or potentially contains invalid data, then the edge in the graph connecting the sub-entry of the compound node for the particular version of the first dataset to the sub-entry of the compound node for the particular version of the second dataset may be colored differently or otherwise visually distinguished in the graph from other edges to indicate that the particular version of the first dataset potentially contains invalid data as a result of the derivation dependency on the particular version of the second dataset.


Basic Computing Device


Referring now to FIG. 5, it is a block diagram that illustrates a basic computing device 500 in which software-implemented processes of the subject innovations may be embodied. Computing device 500 and its components, including their connections, relationships, and functions, are meant to be exemplary only, and are not meant to limit implementations of the subject innovations. Other computing devices suitable for implementing the subject innovations may have different components, including components with different connections, relationships, and functions.


Computing device 500 may include a bus 502 or other communication mechanism for addressing main memory 506 and for transferring data between and among the various components of device 500.


Computing device 500 may also include one or more hardware processors 504 coupled with bus 502 for processing information. A hardware processor 504 may be a general purpose microprocessor, a system on a chip (SoC), or other processor suitable for implementing the subject innovations.


Main memory 506, such as a random access memory (RAM) or other dynamic storage device, also may be coupled to bus 502 for storing information and instructions to be executed by processor(s) 504. Main memory 506 also may be used for storing temporary variables or other intermediate information during execution of software instructions to be executed by processor(s) 504.


Such software instructions, when stored in non-transitory storage media accessible to processor(s) 504, render computing device 500 into a special-purpose computing device that is customized to perform the operations specified in the instructions. The terms “instructions”, “software”, “software instructions”, “program”, “computer program”, “computer-executable instructions”, and “processor-executable instructions” are to be broadly construed to cover any machine-readable information, whether or not human-readable, for instructing a computing device to perform specific operations, and including, but not limited to, application software, desktop applications, scripts, binaries, operating systems, device drivers, boot loaders, shells, utilities, system software, JAVASCRIPT, web pages, web applications, plugins, embedded software, microcode, compilers, debuggers, interpreters, virtual machines, linkers, and text editors.


Computing device 500 also may include read only memory (ROM) 508 or other static storage device coupled to bus 502 for storing static information and instructions for processor(s) 504.


One or more mass storage devices 510 may be coupled to bus 502 for persistently storing information and instructions on fixed or removable media, such as magnetic, optical, solid-state, magnetic-optical, flash memory, or any other available mass storage technology. The mass storage may be shared on a network, or it may be dedicated mass storage. Typically, at least one of the mass storage devices 510 (e.g., the main hard disk for the device) stores a body of program and data for directing operation of the computing device, including an operating system, user application programs, driver and other support files, as well as other data files of all sorts.


Computing device 500 may be coupled via bus 502 to display 512, such as a liquid crystal display (LCD) or other electronic visual display, for displaying information to a computer user. In some configurations, a touch sensitive surface incorporating touch detection technology (e.g., resistive, capacitive, etc.) may be overlaid on display 512 to form a touch sensitive display for communicating touch gesture (e.g., finger or stylus) input to processor(s) 504.


An input device 514, including alphanumeric and other keys, may be coupled to bus 502 for communicating information and command selections to processor 504. In addition to or instead of alphanumeric and other keys, input device 514 may include one or more physical buttons or switches such as, for example, a power (on/off) button, a “home” button, volume control buttons, or the like.


Another type of user input device may be a cursor control 516, such as a mouse, a trackball, or cursor direction keys for communicating direction information and command selections to processor 504 and for controlling cursor movement on display 512. This input device typically has two degrees of freedom in two axes, a first axis (e.g., x) and a second axis (e.g., y), that allows the device to specify positions in a plane.


While in some configurations, such as the configuration depicted in FIG. 5, one or more of display 512, input device 514, and cursor control 516 are external components (i.e., peripheral devices) of computing device 500, some or all of display 512, input device 514, and cursor control 516 are integrated as part of the form factor of computing device 500 in other configurations.


Functions of the disclosed systems, methods, and modules may be performed by computing device 500 in response to processor(s) 504 executing one or more programs of software instructions contained in main memory 506. Such instructions may be read into main memory 506 from another storage medium, such as storage device(s) 510. Execution of the software program instructions contained in main memory 506 causes processor(s) 504 to perform the functions of the disclosed systems, methods, and modules.


While in some implementations, functions of the disclosed systems and methods are implemented entirely with software instructions, hard-wired or programmable circuitry of computing device 500 (e.g., an ASIC, a FPGA, or the like) may be used in place of or in combination with software instructions to perform the functions, according to the requirements of the particular implementation at hand.


The term “storage media” as used herein refers to any non-transitory media that store data and/or instructions that cause a computing device to operate in a specific fashion. Such storage media may comprise non-volatile media and/or volatile media. Non-volatile media includes, for example, non-volatile random access memory (NVRAM), flash memory, optical disks, magnetic disks, or solid-state drives, such as storage device 510. Volatile media includes dynamic memory, such as main memory 506. Common forms of storage media include, for example, a floppy disk, a flexible disk, hard disk, solid-state drive, magnetic tape, or any other magnetic data storage medium, a CD-ROM, any other optical data storage medium, any physical medium with patterns of holes, a RAM, a PROM, an EPROM, a FLASH-EPROM, NVRAM, flash memory, or any other memory chip or cartridge.


Storage media is distinct from but may be used in conjunction with transmission media. Transmission media participates in transferring information between storage media. For example, transmission media includes coaxial cables, copper wire and fiber optics, including the wires that comprise bus 502. Transmission media can also take the form of acoustic or light waves, such as those generated during radio-wave and infra-red data communications.


Various forms of media may be involved in carrying one or more sequences of one or more instructions to processor(s) 504 for execution. For example, the instructions may initially be carried on a magnetic disk or solid-state drive of a remote computer. The remote computer can load the instructions into its dynamic memory and send the instructions over a telephone line using a modem. A modem local to computing device 500 can receive the data on the telephone line and use an infra-red transmitter to convert the data to an infra-red signal. An infra-red detector can receive the data carried in the infra-red signal and appropriate circuitry can place the data on bus 502. Bus 502 carries the data to main memory 506, from which processor(s) 504 retrieves and executes the instructions. The instructions received by main memory 506 may optionally be stored on storage device(s) 510 either before or after execution by processor(s) 504.


Computing device 500 also may include one or more communication interface(s) 518 coupled to bus 502. A communication interface 518 provides a two-way data communication coupling to a wired or wireless network link 520 that is connected to a local network 522 (e.g., Ethernet network, Wireless Local Area Network, cellular phone network, Bluetooth wireless network, or the like). Communication interface 518 sends and receives electrical, electromagnetic, or optical signals that carry digital data streams representing various types of information. For example, communication interface 518 may be a wired network interface card, a wireless network interface card with an integrated radio antenna, or a modem (e.g., ISDN, DSL, or cable modem).


Network link(s) 520 typically provide data communication through one or more networks to other data devices. For example, a network link 520 may provide a connection through a local network 522 to a host computer 524 or to data equipment operated by an Internet Service Provider (ISP) 526. ISP 526 in turn provides data communication services through the world wide packet data communication network now commonly referred to as the “Internet” 528. Local network(s) 522 and Internet 528 use electrical, electromagnetic or optical signals that carry digital data streams. The signals through the various networks and the signals on network link(s) 520 and through communication interface(s) 518, which carry the digital data to and from computing device 500, are example forms of transmission media.


Computing device 500 can send messages and receive data, including program code, through the network(s), network link(s) 520 and communication interface(s) 518. In the Internet example, a server 530 might transmit a requested code for an application program through Internet 528, ISP 526, local network(s) 522 and communication interface(s) 518.


The received code may be executed by processor 504 as it is received, and/or stored in storage device 510, or other non-volatile storage for later execution.


Basic Software System



FIG. 6 is a block diagram of a basic software system 600 that may be employed for controlling the operation of computing device 500. Software system 600 and its components, including their connections, relationships, and functions, are meant to be exemplary only, and are not meant to limit implementations of the subject innovations. Other software systems suitable for implementing the subject innovations may have different components, including components with different connections, relationships, and functions.


In various embodiments, software system 600 is provided for directing the operation of computing device 500. Software system 600, which may be stored in system memory (RAM) 506 and on fixed storage (e.g., hard disk or flash memory) 510, includes a kernel or operating system (OS) 610. The OS 610 manages low-level aspects of computer operation, including managing execution of processes, memory allocation, file input and output (I/O), and device I/O. One or more application programs, represented as 602A, 602B, 602C . . . 602N in FIG. 6, may be “loaded” (e.g., transferred from fixed storage 510 into memory 506) for execution by the system 600. The applications or other software intended for use on device 500 may also be stored as a set of downloadable computer-executable instructions, for example, for downloading and installation from an Internet location (e.g., a Web server).


Software system 600 may include a graphical user interface (GUI) 615, for receiving user commands and data in a graphical (e.g., “point-and-click” or “touch gesture”) fashion. These inputs, in turn, may be acted upon by the system 600 in accordance with instructions from operating system 610 and/or application(s) 602. The GUI 615 also serves to display the results of operation from the OS 610 and application(s) 602, whereupon the user may supply additional inputs or terminate the session (e.g., log off).


OS 610 can execute directly on the bare hardware 620 (e.g., processor(s) 504) of device 500. Alternatively, a hypervisor or virtual machine monitor (VMM) 630 may be interposed between the bare hardware 620 and the OS 610. In this configuration, VMM 630 acts as a software “cushion” or virtualization layer between the OS 610 and the bare hardware 620 of the device 500.


VMM 630 instantiates and runs one or more virtual machine instances (“guest machines”). Each guest machine comprises a “guest” operating system, such as OS 610, and one or more applications, such as application(s) 602, designed to execute on the guest operating system. The VMM 630 presents the guest operating systems with a virtual operating platform and manages the execution of the guest operating systems.


In some instances, the VMM 630 may allow a guest operating system to run as if it is running on the bare hardware 620 of device 500 directly. In these instances, the same version of the guest operating system configured to execute on the bare hardware 620 directly may also execute on VMM 630 without modification or reconfiguration. In other words, VMM 630 may provide full hardware and CPU virtualization to a guest operating system in some instances.


In other instances, a guest operating system may be specially designed or configured to execute on VMM 630 for efficiency. In these instances, the guest operating system is “aware” that it executes on a virtual machine monitor. In other words, VMM 630 may provide para-virtualization to a guest operating system in some instances.


The above-described basic computer hardware and software is presented for purpose of illustrating the basic underlying computer components that may be employed for implementing the subject innovations. The subject innovations, however, are not necessarily limited to any particular computing environment or computing device configuration. Instead, the subject innovations may be implemented in any type of system architecture or processing environment that one skilled in the art, in light of this disclosure, would understand as capable of supporting the features and functions of the subject innovations as presented herein.


Extensions and Alternatives


It is understood that any specific order or hierarchy of steps in the processes disclosed is an illustration of example approaches. Based upon design preferences, it is understood that the specific order or hierarchy of steps in the processes may be rearranged, or that all illustrated steps be performed. Some of the steps may be performed simultaneously. For example, in certain circumstances, multitasking and parallel processing may be advantageous. Moreover, the separation of various system components illustrated above should not be understood as requiring such separation, and it should be understood that the described program components and systems can generally be integrated together in a single software product or packaged into multiple software products.
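As a minimal illustrative sketch of the point that some steps may be performed simultaneously, the following Python fragment runs two independent steps first in sequence and then concurrently and checks that the results agree. The step functions `validate_policies` and `validate_claims`, and the sample rows, are hypothetical and are not part of the disclosed system; the sketch assumes only that neither step depends on the other's output.

```python
from concurrent.futures import ThreadPoolExecutor


def validate_policies(rows):
    # Hypothetical step: keep only rows that carry a non-empty policy id.
    return [r for r in rows if r.get("policy_id")]


def validate_claims(rows):
    # Hypothetical step: keep only rows with a non-negative claim amount.
    return [r for r in rows if r.get("amount", 0) >= 0]


policies = [{"policy_id": "P1"}, {"policy_id": ""}]
claims = [{"amount": 120.0}, {"amount": -5.0}]

# One possible order: run the two steps one after the other.
sequential = (validate_policies(policies), validate_claims(claims))

# Alternative: submit both steps so they may run at the same time.
# This is safe here only because neither step reads the other's output.
with ThreadPoolExecutor(max_workers=2) as pool:
    policies_future = pool.submit(validate_policies, policies)
    claims_future = pool.submit(validate_claims, claims)
    parallel = (policies_future.result(), claims_future.result())

# Either ordering yields the same result for independent steps.
assert parallel == sequential
```

Steps that consume the output of an earlier step would still need to wait for that step to finish, so only mutually independent steps are candidates for this kind of rearrangement.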


Various modifications to these aspects will be readily apparent, and the generic principles defined herein may be applied to other aspects. Thus, the claims are not intended to be limited to the aspects shown herein, but are to be accorded the full scope consistent with the language of the claims, where reference to an element in the singular is not intended to mean “one and only one” unless specifically so stated, but rather “one or more.” Unless specifically stated otherwise, the term “some” refers to one or more. Unless specifically stated otherwise, the term “may” is used to express one or more non-limiting possibilities. Headings and subheadings, if any, are used for convenience only and do not limit the subject innovations.


A phrase, for example, an “aspect”, an “embodiment”, a “configuration”, or an “implementation” does not imply that the aspect, the embodiment, the configuration, or the implementation is essential to the subject innovations or that the aspect, the embodiment, the configuration, or the implementation applies to all aspects, embodiments, configurations, or implementations of the subject innovations. A disclosure relating to an aspect, an embodiment, a configuration, or an implementation may apply to all aspects, embodiments, configurations, or implementations, or one or more aspects, embodiments, configurations, or implementations. A phrase, for example, an aspect, an embodiment, a configuration, or an implementation may refer to one or more aspects, embodiments, configurations, or implementations and vice versa.

Claims
  • 1. A method, comprising: at one or more computing devices having one or more processors and memory storing one or more programs executed by the one or more processors to perform the method, performing the operations of: storing an input dataset and provenance metadata identifying one or more previous versions of the input dataset; using a derivation program, transforming data in the input dataset and storing the transformed data as a versioned dataset; updating the provenance metadata to identify the input dataset in addition to the one or more previous versions of the input dataset; receiving selection of the versioned dataset that is within a data pipeline system; determining full data provenance of the selected versioned dataset, the full data provenance comprising a set of versioned datasets, by identifying, in the provenance metadata, at least the input dataset and the one or more previous versions of the input dataset; providing for display of a visualization of the full data provenance of the selected versioned dataset, the visualization comprising a graph, the graph comprising a compound node for the selected versioned dataset and a compound node for each versioned dataset in the set of versioned datasets, the graph further comprising edges connecting the compound nodes, each edge representing a derivation dependency between versions of the versioned datasets represented by the compound nodes connected by the edge; wherein a sub-entry of the compound node for a particular versioned dataset in the set of versioned datasets is visually distinguished in the graphical user interface from other compound node sub-entries of the graph to indicate that a version of the particular versioned dataset represented by the sub-entry has been flagged in a database as containing invalid data; wherein an edge in the graph representing a derivation dependency of a first version of a first versioned dataset in the set of versioned datasets on a second version of a second versioned dataset in the set of versioned datasets is visually distinguished from other edges in the graph to indicate that the first version of the first versioned dataset potentially contains invalid data as a result of the derivation dependency.
  • 2. The method of claim 1, wherein the compound node of the selected versioned dataset indicates a name or identifier of the selected versioned dataset; and wherein the compound node for each versioned dataset in the set of versioned datasets indicates a name or identifier of the each versioned dataset.
  • 3. The method of claim 1, wherein the compound node for each versioned dataset in the set of versioned datasets comprises at least one sub-entry representing a version of the each versioned dataset in the full data provenance of the selected versioned dataset.
  • 4. The method of claim 1, wherein at least one version of a versioned dataset in the set of versioned datasets contains data generated as a result of one or more Spark systems executing a derivation program taking at least one version of another versioned dataset as input to the derivation program.
  • 5. The method of claim 1, wherein at least one version of a versioned dataset in the set of versioned datasets contains data generated as a result of one or more MapReduce systems executing a derivation program taking at least one version of another versioned dataset as input to the derivation program.
  • 6. One or more non-transitory computer-readable media storing one or more programs, the one or more programs comprising instructions for: storing an input dataset and provenance metadata identifying one or more previous versions of the input dataset; using a derivation program, transforming data in the input dataset and storing the transformed data as a versioned dataset; updating the provenance metadata to identify the input dataset in addition to the one or more previous versions of the input dataset; receiving selection of a versioned dataset that is within a data pipeline system; determining full data provenance of the selected versioned dataset, the full data provenance comprising a set of versioned datasets, by identifying, in the provenance metadata, at least the input dataset and the one or more previous versions of the input dataset; providing for display of a visualization of the full data provenance of the selected versioned dataset, the visualization comprising a graph, the graph comprising a compound node for the selected versioned dataset and for each versioned dataset in the set of versioned datasets, the graph further comprising edges connecting the compound nodes, each edge representing a derivation dependency between versions of the versioned datasets represented by the compound nodes connected by the edge; wherein a sub-entry of the compound node for a particular versioned dataset in the set of versioned datasets is visually distinguished in the graphical user interface from other compound node sub-entries of the graph to indicate that a version of the particular versioned dataset represented by the sub-entry has been flagged in a database as containing invalid data; wherein an edge in the graph representing a derivation dependency of a first version of a first versioned dataset in the set of versioned datasets on a second version of a second versioned dataset in the set of versioned datasets is visually distinguished from other edges in the graph to indicate that the first version of the first versioned dataset potentially contains invalid data as a result of the derivation dependency.
  • 7. The one or more non-transitory computer-readable media of claim 6, wherein the compound node of the selected versioned dataset indicates a name or identifier of the selected versioned dataset; and wherein the compound node for each versioned dataset in the set of versioned datasets indicates a name or identifier of the each versioned dataset.
  • 8. The one or more non-transitory computer-readable media of claim 6, wherein the compound node for each versioned dataset in the set of versioned datasets comprises at least one sub-entry representing a version of the each versioned dataset in the full data provenance of the selected versioned dataset.
  • 9. The one or more non-transitory computer-readable media of claim 6, wherein at least one version of a versioned dataset in the set of versioned datasets contains data generated as a result of one or more Spark systems executing a derivation program taking at least one version of another versioned dataset as input to the derivation program.
  • 10. The one or more non-transitory computer-readable media of claim 6, wherein at least one version of a versioned dataset in the set of versioned datasets contains data generated as a result of one or more MapReduce systems executing a derivation program taking at least one version of another versioned dataset as input to the derivation program.
  • 11. A system comprising: memory; one or more processors; one or more programs stored in the memory and configured for execution by the one or more processors, the one or more programs comprising instructions for: storing an input dataset and provenance metadata identifying one or more previous versions of the input dataset; using a derivation program, transforming data in the input dataset and storing the transformed data as a versioned dataset; updating the provenance metadata to identify the input dataset in addition to the one or more previous versions of the input dataset; receiving selection of a versioned dataset that is within a data pipeline system; determining full data provenance of the selected versioned dataset, the full data provenance comprising a set of versioned datasets, by identifying, in the provenance metadata, at least the input dataset and the one or more previous versions of the input dataset; providing for display of a visualization of the full data provenance of the selected versioned dataset, the visualization comprising a graph, the graph comprising a compound node for the selected versioned dataset and for each versioned dataset in the set of versioned datasets, the graph further comprising edges connecting the compound nodes, each edge representing a derivation dependency between versions of the versioned datasets represented by the compound nodes connected by the edge; wherein a sub-entry of the compound node for a particular versioned dataset in the set of versioned datasets is visually distinguished in the graphical user interface from other compound node sub-entries of the graph to indicate that a version of the particular versioned dataset represented by the sub-entry has been flagged in a database as containing invalid data; wherein an edge in the graph representing a derivation dependency of a first version of a first versioned dataset in the set of versioned datasets on a second version of a second versioned dataset in the set of versioned datasets is visually distinguished from other edges in the graph to indicate that the first version of the first versioned dataset potentially contains invalid data as a result of the derivation dependency.
  • 12. The system of claim 11, wherein the compound node of the selected versioned dataset indicates a name or identifier of the selected versioned dataset; and wherein the compound node for each versioned dataset in the set of versioned datasets indicates a name or identifier of the each versioned dataset.
  • 13. The system of claim 11, wherein the compound node for each versioned dataset in the set of versioned datasets comprises at least one sub-entry representing a version of the each versioned dataset in the full data provenance of the selected versioned dataset.
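By way of illustration only, the operations recited in the independent claims can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: the class `ProvenanceStore`, the functions `full_provenance` and `build_graph`, the dictionary-based graph representation, and the dataset names are all hypothetical. The sketch also assumes that "potentially contains invalid data" propagates transitively from any flagged ancestor version, which is one possible reading.

```python
from collections import defaultdict
from dataclasses import dataclass, field

# A dataset version is identified here by (dataset name, version id).
Version = tuple[str, str]


@dataclass
class ProvenanceStore:
    """Hypothetical in-memory stand-in for the provenance metadata and
    for the database of versions flagged as containing invalid data."""
    parents: dict[Version, list[Version]] = field(default_factory=dict)
    flagged_invalid: set[Version] = field(default_factory=set)

    def record_derivation(self, output: Version, inputs: list[Version]) -> None:
        # Update the metadata so the output version identifies its inputs.
        self.parents.setdefault(output, []).extend(inputs)


def full_provenance(store: ProvenanceStore, selected: Version) -> set[Version]:
    """Collect every version the selected version transitively derives from."""
    seen: set[Version] = set()
    stack = [selected]
    while stack:
        current = stack.pop()
        for parent in store.parents.get(current, []):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def build_graph(store: ProvenanceStore, selected: Version):
    """Return (compound_nodes, edges) for the selected version."""
    versions = full_provenance(store, selected) | {selected}

    # One compound node per dataset; each sub-entry is one version of it.
    compound_nodes = defaultdict(list)
    for name, version_id in sorted(versions):
        compound_nodes[name].append({
            "version": version_id,
            "flagged_invalid": (name, version_id) in store.flagged_invalid,
        })

    # Assumption: a version is suspect if it, or any ancestor, is flagged.
    suspect = {
        v for v in versions
        if v in store.flagged_invalid
        or full_provenance(store, v) & store.flagged_invalid
    }

    # One edge per derivation dependency; highlight edges from suspect inputs.
    edges = [
        {"from": parent, "to": child, "highlight": parent in suspect}
        for child in sorted(versions)
        for parent in store.parents.get(child, [])
    ]
    return dict(compound_nodes), edges


# Usage with hypothetical dataset names.
store = ProvenanceStore()
store.record_derivation(("claims_clean", "v2"),
                        [("claims_raw", "v1"), ("claims_raw", "v2")])
store.record_derivation(("risk_report", "v1"), [("claims_clean", "v2")])
store.flagged_invalid.add(("claims_raw", "v1"))
nodes, edges = build_graph(store, ("risk_report", "v1"))
```

In this usage, the compound node for `claims_raw` has two sub-entries, of which only `v1` is marked as flagged. The edge from `claims_raw` `v1` and the downstream edge into `risk_report` `v1` are highlighted, while the edge from `claims_raw` `v2` is not. A renderer would map the `flagged_invalid` and `highlight` fields onto whatever visual distinction the user interface uses.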
20120019559 Siler et al. Jan 2012 A1
20120022945 Falkenborg et al. Jan 2012 A1
20120036013 Neuhaus et al. Feb 2012 A1
20120036434 Oberstein Feb 2012 A1
20120050293 Carlhian et al. Mar 2012 A1
20120054284 Rakshit Mar 2012 A1
20120059853 Jagota Mar 2012 A1
20120066166 Curbera et al. Mar 2012 A1
20120066296 Appleton et al. Mar 2012 A1
20120072825 Sherkin et al. Mar 2012 A1
20120075324 Cardno et al. Mar 2012 A1
20120079363 Folting et al. Mar 2012 A1
20120084117 Tavares et al. Apr 2012 A1
20120084118 Bai et al. Apr 2012 A1
20120084287 Lakshminarayan et al. Apr 2012 A1
20120102006 Larson Apr 2012 A1
20120106801 Jackson May 2012 A1
20120117082 Koperda et al. May 2012 A1
20120123989 Yu et al. May 2012 A1
20120124179 Cappio et al. May 2012 A1
20120130937 Leon et al. May 2012 A1
20120131512 Takeuchi et al. May 2012 A1
20120137235 Ts et al. May 2012 A1
20120144335 Abeln et al. Jun 2012 A1
20120159307 Chung et al. Jun 2012 A1
20120159362 Brown et al. Jun 2012 A1
20120159399 Bastide et al. Jun 2012 A1
20120170847 Tsukidate Jul 2012 A1
20120173381 Smith Jul 2012 A1
20120173985 Peppel Jul 2012 A1
20120191446 Binsztok et al. Jul 2012 A1
20120196557 Reich et al. Aug 2012 A1
20120196558 Reich et al. Aug 2012 A1
20120197651 Robinson et al. Aug 2012 A1
20120203708 Psota et al. Aug 2012 A1
20120208636 Feige Aug 2012 A1
20120215784 King et al. Aug 2012 A1
20120221511 Gibson et al. Aug 2012 A1
20120221553 Wittmer et al. Aug 2012 A1
20120221580 Barney Aug 2012 A1
20120226523 Weiss Sep 2012 A1
20120245976 Kumar et al. Sep 2012 A1
20120246148 Dror Sep 2012 A1
20120254129 Wheeler et al. Oct 2012 A1
20120284345 Costenaro et al. Nov 2012 A1
20120290527 Yalamanchilli Nov 2012 A1
20120290879 Shibuya et al. Nov 2012 A1
20120296907 Long et al. Nov 2012 A1
20120304150 Leithead et al. Nov 2012 A1
20120311684 Paulsen et al. Dec 2012 A1
20120323888 Osann, Jr. Dec 2012 A1
20120330973 Ghuneim et al. Dec 2012 A1
20130006426 Healey et al. Jan 2013 A1
20130006725 Simanek et al. Jan 2013 A1
20130006916 McBride et al. Jan 2013 A1
20130006947 Olumuyiwa et al. Jan 2013 A1
20130016106 Yip et al. Jan 2013 A1
20130018796 Kolhatkar et al. Jan 2013 A1
20130024268 Manickavelu Jan 2013 A1
20130024731 Shochat et al. Jan 2013 A1
20130046635 Grigg et al. Feb 2013 A1
20130046842 Muntz et al. Feb 2013 A1
20130050217 Armitage Feb 2013 A1
20130054306 Bhalla Feb 2013 A1
20130057551 Ebert et al. Mar 2013 A1
20130060742 Chang et al. Mar 2013 A1
20130060786 Serrano et al. Mar 2013 A1
20130061169 Pearcy et al. Mar 2013 A1
20130073377 Heath Mar 2013 A1
20130073454 Busch Mar 2013 A1
20130078943 Biage et al. Mar 2013 A1
20130086482 Parsons Apr 2013 A1
20130091084 Lee Apr 2013 A1
20130096988 Grossman et al. Apr 2013 A1
20130097130 Bingol et al. Apr 2013 A1
20130097482 Marantz et al. Apr 2013 A1
20130110746 Ahn May 2013 A1
20130110822 Ikeda et al. May 2013 A1
20130110877 Bonham et al. May 2013 A1
20130111320 Campbell et al. May 2013 A1
20130117011 Ahmed et al. May 2013 A1
20130117651 Waldman et al. May 2013 A1
20130124193 Holmberg May 2013 A1
20130101159 Rosen Jun 2013 A1
20130150004 Rosen Jun 2013 A1
20130151148 Parundekar et al. Jun 2013 A1
20130151388 Falkenborg et al. Jun 2013 A1
20130151453 Bhanot et al. Jun 2013 A1
20130157234 Gulli et al. Jun 2013 A1
20130166348 Scotto Jun 2013 A1
20130166480 Popescu et al. Jun 2013 A1
20130166550 Buchmann et al. Jun 2013 A1
20130176321 Mitchell et al. Jul 2013 A1
20130179420 Park et al. Jul 2013 A1
20130185245 Anderson Jul 2013 A1
20130185307 El-Yaniv et al. Jul 2013 A1
20130198565 Mancoridis et al. Aug 2013 A1
20130224696 Wolfe et al. Aug 2013 A1
20130225212 Khan Aug 2013 A1
20130226318 Procyk Aug 2013 A1
20130226879 Ring et al. Aug 2013 A1
20130226953 Markovich et al. Aug 2013 A1
20130238616 Rose et al. Sep 2013 A1
20130246170 Gross et al. Sep 2013 A1
20130246316 Talukder et al. Sep 2013 A1
20130246537 Gaddala Sep 2013 A1
20130246560 Feng et al. Sep 2013 A1
20130246597 Iizawa et al. Sep 2013 A1
20130251233 Yang et al. Sep 2013 A1
20130262403 Milousheff Oct 2013 A1
20130262527 Hunter et al. Oct 2013 A1
20130263019 Castellanos et al. Oct 2013 A1
20130267207 Hao et al. Oct 2013 A1
20130268520 Fisher et al. Oct 2013 A1
20130275446 Jain et al. Oct 2013 A1
20130279757 Kephart Oct 2013 A1
20130282696 John et al. Oct 2013 A1
20130290011 Lynn et al. Oct 2013 A1
20130290825 Arndt et al. Oct 2013 A1
20130297619 Chandarsekaran et al. Nov 2013 A1
20130304770 Boero et al. Nov 2013 A1
20130311375 Priebatsch Nov 2013 A1
20140012796 Petersen et al. Jan 2014 A1
20140019423 Liensberger et al. Jan 2014 A1
20140019936 Cohanoff Jan 2014 A1
20140032506 Hoey et al. Jan 2014 A1
20140033010 Richardt et al. Jan 2014 A1
20140040371 Gurevich et al. Feb 2014 A1
20140047319 Eberlein Feb 2014 A1
20140047357 Alfaro et al. Feb 2014 A1
20140058914 Song et al. Feb 2014 A1
20140059038 McPherson et al. Feb 2014 A1
20140067611 Adachi et al. Mar 2014 A1
20140068487 Steiger et al. Mar 2014 A1
20140095273 Tang et al. Apr 2014 A1
20140095509 Patton Apr 2014 A1
20140108068 Williams Apr 2014 A1
20140108380 Gotz et al. Apr 2014 A1
20140108985 Scott et al. Apr 2014 A1
20140123279 Bishop et al. May 2014 A1
20140129261 Bothwell et al. May 2014 A1
20140136285 Carvalho May 2014 A1
20140143009 Brice et al. May 2014 A1
20140149436 Bahrami et al. May 2014 A1
20140156527 Grigg et al. Jun 2014 A1
20140156617 Tomkins Jun 2014 A1
20140157172 Peery et al. Jun 2014 A1
20140164502 Khodorenko et al. Jun 2014 A1
20140181833 Bird et al. Jun 2014 A1
20140189536 Lange et al. Jul 2014 A1
20140195515 Baker et al. Jul 2014 A1
20140195887 Ellis et al. Jul 2014 A1
20140222521 Chait Aug 2014 A1
20140222793 Sadkin et al. Aug 2014 A1
20140229554 Grunin et al. Aug 2014 A1
20140244388 Manouchehri et al. Aug 2014 A1
20140258246 Lo Faro et al. Sep 2014 A1
20140267294 Ma Sep 2014 A1
20140267295 Sharma Sep 2014 A1
20140279824 Tamayo Sep 2014 A1
20140279979 Yost et al. Sep 2014 A1
20140310266 Greenfield Oct 2014 A1
20140316911 Gross Oct 2014 A1
20140324876 Konik et al. Oct 2014 A1
20140324929 Mason Oct 2014 A1
20140333651 Cervelli et al. Nov 2014 A1
20140337772 Cervelli et al. Nov 2014 A1
20140344230 Krause et al. Nov 2014 A1
20140351070 Christner et al. Nov 2014 A1
20140358829 Hurwitz Dec 2014 A1
20140366132 Stiansen et al. Dec 2014 A1
20150012509 Kirn Jan 2015 A1
20150019394 Unser et al. Jan 2015 A1
20150039886 Kahol et al. Feb 2015 A1
20150046481 Elliot Feb 2015 A1
20150046870 Goldenberg et al. Feb 2015 A1
20150073929 Psota et al. Mar 2015 A1
20150073954 Braff Mar 2015 A1
20150089353 Folkening Mar 2015 A1
20150089424 Duffield et al. Mar 2015 A1
20150095773 Gonsalves et al. Apr 2015 A1
20150100559 Nassar Apr 2015 A1
20150100897 Sun et al. Apr 2015 A1
20150100907 Erenrich et al. Apr 2015 A1
20150106379 Elliot et al. Apr 2015 A1
20150112641 Faraj Apr 2015 A1
20150112998 Shankar Apr 2015 A1
20150134666 Gattiker et al. May 2015 A1
20150135256 Hoy et al. May 2015 A1
20150142766 Jain et al. May 2015 A1
20150169709 Kara et al. Jun 2015 A1
20150169726 Kara et al. Jun 2015 A1
20150170077 Kara et al. Jun 2015 A1
20150178877 Bogomolov et al. Jun 2015 A1
20150186821 Wang et al. Jul 2015 A1
20150187036 Wang et al. Jul 2015 A1
20150188715 Castellucci et al. Jul 2015 A1
20150188872 White Jul 2015 A1
20150212663 Papale et al. Jul 2015 A1
20150213043 Ishii et al. Jul 2015 A1
20150213134 Nie et al. Jul 2015 A1
20150242397 Zhuang Aug 2015 A1
20150261817 Harris et al. Sep 2015 A1
20150261847 Ducott et al. Sep 2015 A1
20150324868 Kaftan et al. Nov 2015 A1
20150338233 Cervelli et al. Nov 2015 A1
20150341467 Lim et al. Nov 2015 A1
20150347903 Saxena et al. Dec 2015 A1
20150378996 Kesin et al. Dec 2015 A1
20150379413 Robertson et al. Dec 2015 A1
20160004667 Chakerian et al. Jan 2016 A1
20160004764 Chakerian et al. Jan 2016 A1
20160034545 Shankar et al. Feb 2016 A1
20160062555 Ward et al. Mar 2016 A1
20160098173 Slawinski et al. Apr 2016 A1
20160125000 Meacham et al. May 2016 A1
20160147730 Cicerone May 2016 A1
20160179828 Ellis Jun 2016 A1
20170068698 Tolnay et al. Mar 2017 A1
20170083595 Tolnay et al. Mar 2017 A1
20170097950 Meacham et al. Apr 2017 A1
Foreign Referenced Citations (77)
Number Date Country
2014206155 Dec 2015 AU
2014250678 Feb 2016 AU
2666364 Jan 2015 CA
102546446 Jul 2012 CN
103167093 Jun 2013 CN
102054015 May 2014 CN
102014103482 Sep 2014 DE
102014204827 Sep 2014 DE
102014204830 Sep 2014 DE
102014204840 Sep 2014 DE
202014204834 Sep 2014 DE
102014213036 Jan 2015 DE
102014215621 Feb 2015 DE
0652513 May 1995 EP
1566758 Aug 2005 EP
1672527 Jun 2006 EP
1962222 Aug 2008 EP
2221725 Aug 2010 EP
2487610 Aug 2012 EP
2551799 Jan 2013 EP
2560134 Feb 2013 EP
2778913 Sep 2014 EP
2778914 Sep 2014 EP
2778977 Sep 2014 EP
2778986 Sep 2014 EP
2835745 Feb 2015 EP
2835770 Feb 2015 EP
2838039 Feb 2015 EP
2846241 Mar 2015 EP
2851852 Mar 2015 EP
2858014 Apr 2015 EP
2858018 Apr 2015 EP
2863326 Apr 2015 EP
2863346 Apr 2015 EP
2869211 May 2015 EP
2881868 Jun 2015 EP
2884439 Jun 2015 EP
2884440 Jun 2015 EP
2889814 Jul 2015 EP
2891992 Jul 2015 EP
2892197 Jul 2015 EP
2897051 Jul 2015 EP
2911078 Aug 2015 EP
2963595 Jan 2016 EP
2993595 Mar 2016 EP
3018553 May 2016 EP
3128447 Feb 2017 EP
3142027 Mar 2017 EP
2366498 Mar 2002 GB
2513007 Oct 2014 GB
2516155 Jan 2015 GB
2517582 Feb 2015 GB
2518745 Apr 2015 GB
2012778 Nov 2014 NL
2013134 Jan 2015 NL
2013306 Feb 2015 NL
2011642 Aug 2015 NL
624557 Dec 2014 NZ
WO 2000009529 Feb 2000 WO
WO 2002035376 May 2002 WO
WO 2002065353 Aug 2002 WO
WO 2003060751 Jul 2003 WO
WO 2005010685 Feb 2005 WO
WO 2005104736 Nov 2005 WO
WO 2005116851 Dec 2005 WO
WO 2008064207 May 2008 WO
WO 2009061501 May 2009 WO
WO 2010000014 Jan 2010 WO
WO 2010030913 Mar 2010 WO
WO 20100098958 Sep 2010 WO
WO 2011017289 May 2011 WO
WO 2011071833 Jun 2011 WO
WO 2012025915 Mar 2012 WO
WO 2012079836 Jun 2012 WO
WO 2013010157 Jan 2013 WO
WO 2013067077 May 2013 WO
WO 20130102892 Jul 2013 WO
Non-Patent Literature Citations (406)
Entry
Glaab et al., “EnrichNet: Network-Based Gene Set Enrichment Analysis,” Bioinformatics 28.18 (2012): pp. i451-i457.
Hur et al., “SciMiner: web-based literature mining tool for target identification and functional enrichment analysis,” Bioinformatics 25.6 (2009): pp. 838-840.
Official Communication for New Zealand Patent Application No. 622513 dated Apr. 3, 2014.
Official Communication for Israel Patent Application No. 198253 dated Nov. 24, 2014.
Official Communication for Great Britain Patent Application No. 1413935.6 dated Jan. 27, 2015.
Official Communication for Australian Patent Application No. 2014201506 dated Feb. 27, 2015.
Official Communication for Australian Patent Application No. 2014201507 dated Feb. 27, 2015.
Official Communication for Australian Patent Application No. 2014201580 dated Feb. 27, 2015.
Zheng et al., “GOEAST: a web-based software toolkit for Gene Ontology enrichment analysis,” Nucleic acids research 36.suppl 2 (2008): pp. W358-W363.
Horrocks et al., “The Effects of Weather on Crime”, dated 2008, 40 pages.
Aldor-Noiman et al., “Spatio-Temporal Low Count Processes with Application to Violent Crimes Events”, dated Apr. 23, 2013, 44 pages.
Azavea Journal, HunchLab: Heat Map and Kernel Density Calculation for Crime Analysis, dated 2009, 2 pages.
Bowers et al., “Prospective Hot Spotting”, The Future of Crime Mapping?, Advance Access Publication dated May 7, 2004, 18 pages.
Butke et al., “An Analysis of the Relationship Between Weather and Aggressive Crime in Cleveland, Ohio”, American Meteorological Society, dated Apr. 2010, 13 pages.
Caplan, Joel M., “Mapping the Spatial Influence of Crime Correlates: A Comparison of Operationalization Schemes and Implications for Crime Analysis and Criminal Justice Practice”, dated 2011, 27 pages.
Chainey et al., “The Utility of Hotspot Mapping for Predicting Spatial Patterns of Crime”, Security Journal, dated 2008, 25 pages.
Gorr et al., Crime Hot Spot Forecasting: Modeling and Comparative Evaluation, dated 2002, 37 pages.
“CrimeStat Statistics Program”, http://www.icpsr.umich.edi/NACJD/crimestat.html, accessed Mar. 31, 2014, 5 pages.
Groff et al., “Forecasting the Future of Predictive Crime Mapping”, Crime Prevention Studies, dated 2002 vol. 13, 29 pages.
Valentini et al., Ensembles of Learning Machines, dated 2002, 18 pages.
Jacob et al., “The Dynamics of Criminal Behavior: Evidence From Weather Shocks”, NBER Working Paper Series, National Bureau of Economic Research, dated Sep. 2004, 59 pages.
Karuppannan et al., “Crime Analysis Mapping in India”: A GIS Implementation in Chennai City, dated 2000, 25 pages.
Kong, Steve, “Return of the Burglar”, Masters Course in Crime Science, dated Sep. 2005, 52 pages.
Mohler et al., “Self-Exciting Point Process Modeling of Crime”, American Statistical Association, dated 2011, 9 pages.
Olligschlaeger, Andreas, “Artificial Neural Networks and Crime Mapping”, Carnegie Mellon University, dated 1997, 35 pages.
Ravi et al., “Soft Computing System for Bank Performance Prediction”, dated 2007, 11 pages.
Rayment, “Spatial and Temporal Crime Analysis Techniques”, dated 1995, 12 pages.
Short et al., “Measuring and Modeling Repeat and Near-Repeat Burglary Effects”, Springerlink.com, dated 2009, 15 pages.
Gorr, “Proposed Crime Early Warning System Software”, dated 2003, 4 pages.
Official Communication for European Patent Application No. 14158977.0 dated Apr. 16, 2015.
Official Communication for European Patent Application No. 14158958.0 dated Apr. 16, 2015.
Official Communication for Netherlands Patent Application No. 2013306 dated Apr. 24, 2015.
“A Tour of Pinboard,” <http://pinboard.in/tour> as printed May 15, 2014, 6 pages.
“BackTuIt—JD Edwards One World Version Control System,” printed Jul. 23, 2007, 1 page.
Boytsov et al., “Drake: The Data Processing Workflow Tool (A.K.A. “Make for Data”)”, Specification and User Manual, working spec as of Jan. 21, 2013, 61 pages.
Delicious, <http://delicious.com/> as printed May 15, 2014 in 1 page.
Geiger, Jonathan G., “Data Quality Management, The Most Critical Initiative You Can Implement,” Data Warehousing, Management and Quality, Paper 098-29, SUGI 29, Intelligent Solutions, Inc., Boulder, CO, pp. 14, accessed Oct. 3, 2013.
Johnson, Maggie, “Introduction to YACC and Bison”, Handout 13, dated Jul. 8, 2005, 11 pages.
Kahan et al., “Annotea: an Open RDF Infrastructure for Shared Web Annotations”, Computer Networks, Elsevier Science Publishers B.V., vol. 39, No. 5, dated Aug. 5, 2002, pp. 589-608.
Klemmer et al., “Where Do Web Sites Come From? Capturing and Interacting with Design History,” Association for Computing Machinery, CHI 2002, Apr. 20-25, 2002, Minneapolis, MN, pp. 8.
Kokossi et al., “D7-Dynamic Ontology Management System (Design),” Information Societies Technology Programme, Jan. 10, 2002, pp. 1-27.
Miklau et al., “Securing History: Privacy and Accountability in Database Systems,” 3rd Biennial Conference on Innovative Data Systems Research (CIDR), Jan. 7-10, 2007, Asilomar, California, pp. 387-396.
Morrison et al., “Converting Users to Testers: An Alternative Approach to Load Test Script Creation, Parameterization and Data Correlation,” CCSC Southeastern Conference, JCSC 28, 2, Dec. 2012, pp. 188-196.
Niepert et al., “A Dynamic Ontology for a Dynamic Reference Work”, Joint Conference on Digital Libraries, Jun. 17-22, 2007, Vancouver, British Columbia, Canada, pp. 1-10.
Nivas, Tuli, “Test Harness and Script Design Principles for Automated Testing of non-GUI or Web Based Applications,” Performance Lab, Jun. 2011, pp. 30-37.
Official Communication for European Patent Application No. 14159629.6 dated Jul. 31, 2014.
Official Communication for Great Britain Patent Application No. 1404479.6 dated Aug. 12, 2014.
Official Communication for New Zealand Patent Application No. 622497 dated Mar. 26, 2014.
Official Communication for New Zealand Patent Application No. 622389 dated Mar. 20, 2014.
Official Communication for New Zealand Patent Application No. 622414 dated Mar. 24, 2014.
Official Communication for European Patent Application No. 14158958.0 dated Jun. 3, 2014.
Official Communication for European Patent Application No. 14158977.0 dated Jun. 10, 2014.
Official Communication for New Zealand Patent Application No. 622404 dated Mar. 20, 2014.
Official Communication for New Zealand Patent Application No. 622484 dated Apr. 2, 2014.
Official Communication for Canadian Patent Application No. 2666364 dated Jun. 4, 2012.
Official Communication for New Zealand Patent Application No. 622497 dated Jun. 19, 2014.
Palantir, “Extracting and Transforming Data with Kite,” Palantir Technologies, Inc., Copyright 2010, pp. 38.
Palantir, “Kite,” https://docs.palantir.com/gotham/3.11.1.0/adminreference/datasources.11 printed Aug. 30, 2013 in 2 pages.
Palantir, “Kite Data-Integration Process Overview,” Palantir Technologies, Inc., Copyright 2010, pp. 48.
Palantir, “Kite Operations,” Palantir Technologies, Inc., Copyright 2010, p. 1.
Palantir, “The Repository Element,” https://docs.palantir.com/gotham/3.11.1.0/dataguide/kite_config_file.04 printed Aug. 30, 2013 in 2 pages.
Palantir, “Write a Kite Configuration File in Eclipse,” Palantir Technologies, Inc., Copyright 2010, pp. 2.
Palantir, https://docs.palantir.com/gotham/3.11.1.0/dataguide/baggage/KiteSchema.xsd printed Apr. 4, 2014 in 4 pages.
Palermo, Christopher J., “Memorandum,” [Disclosure relating to U.S. Appl. No. 13/916,447, filed Jun. 12, 2013, and related applications], Jan. 31, 2014 in 3 pages.
Wollrath et al., “A Distributed Object Model for the Java System,” Conference on Object-Oriented Technologies and Systems, Jun. 17-21, 1996, pp. 219-231.
European Claims in application No. 15192965.0-1957, dated Mar. 2016, 3 pages.
European Patent Office, “Search Report” in application No. 15192965.0-1957, dated Mar. 17, 2016, 7 pages.
Notice of Acceptance for Australian Patent Application No. 2014203669 dated Jan. 21, 2016.
Official Communication for Netherlands Patent Application No. 2013134 dated Apr. 20, 2015.
Official Communication for Great Britain Patent Application No. 1411984.6 dated Jan. 8, 2016.
Official Communication for European Patent Application No. 15165244.3 dated Aug. 27, 2015.
Wright et al., “Palantir Technologies VAST 2010 Challenge Text Records—Investigations into Arms Dealing,” Oct. 29, 2010, pp. 1-10, retrieved from the internet http://hcil2.cs.umd.edu/newvarepository/VAST%20Challenge%202010/challenges/MC1%20-%20Investigations%20into%20Arms%20Dealing/entries/Palantir%20Technologies/retrieved on Aug. 20, 2015.
Palantir Technologies, “Palantir Labs—Timeline,” Oct. 1, 2010, retrieved from the internet https://www.youtube.com/watch?v=JCgDW5bru9M retrieved on Aug. 19, 2015.
Gesher, Ari, “Palantir Screenshots in the Wild: Swing Sightings,” The Palantir Blog, Sep. 11, 2007, pp. 1-12, retrieved from the internet https://www.palantir.com/2007/09/palantir-screenshots/ retrieved on Aug. 18, 2015.
About 80 Minutes, “Palantir in a Number of Parts—Part 6—Graph,” Mar. 21, 2013, pp. 1-6, retrieved from the internet http://about80minutes.blogspot.nl/2013/03/palantir-in-number-of-parts-part-6-graph.html retrieved on Aug. 18, 2015.
Official Communication for European Patent Application No. 14200246.8 dated May 29, 2015.
Official Communication for Netherlands Patent Application No. 2012417 dated Sep. 18, 2015.
Wikipedia, “Multimap,” Jan. 1, 2013, https://en.wikipedia.org/w/index.php?title=Multimap&oldid=530800748.
Official Communication for Netherlands Patent Application No. 2012421 dated Sep. 18, 2015.
Official Communication for European Patent Application No. 15184764.7 dated Dec. 14, 2015.
Official Communication for Netherlands Patent Application No. 2012438 dated Sep. 21, 2015.
Chaudhuri et al., “An Overview of Business Intelligence Technology,” Communications of the ACM, Aug. 2011, vol. 54, No. 8.
Official Communication for European Patent Application No. 15166137.8 dated Sep. 14, 2015.
Jelen, Bill, “Excel 2013 in Depth, Video Enhanced Edition,” Jan. 25, 2013.
Zaharia et al., “Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing”, dated 2012, 14 pages.
Osterweil et al., “Capturing, Visualizing and Querying Scientific Data Provenance”, http://www.mtholyoke.edu/-blerner/dataprovenance/ddg.html, dated May 20, 2015, 3 pages.
Dean et al., “MapReduce: Simplified Data Processing on Large Clusters”, OSDI 2004, 13 pages.
Official Communication for New Zealand Patent Application No. 624557 dated May 14, 2014.
Official Communication for New Zealand Patent Application No. 628585 dated Aug. 26, 2014.
Official Communication for European Patent Application No. 14158861.6 dated Jun. 16, 2014.
Official Communication for New Zealand Patent Application No. 622517 dated Apr. 3, 2014.
Official Communication for New Zealand Patent Application No. 627061 dated Jul. 14, 2014.
Official Communication for New Zealand Patent Application No. 628263 dated Aug. 12, 2014.
Official Communication for Great Britain Patent Application No. 1404457.2 dated Aug. 14, 2014.
Official Communication for New Zealand Patent Application No. 627962 dated Aug. 5, 2014.
Official Communication for European Patent Application No. 14159464.8 dated Jul. 31, 2014.
Official Communication for European Patent Application No. 14159464.8 dated Aug. 20, 2014.
Official Communication for European Patent Application No. 14159464.8 dated Sep. 22, 2014.
Official Communication for New Zealand Patent Application No. 628840 dated Aug. 28, 2014.
Official Communication for New Zealand Patent Application No. 628495 dated Aug. 19, 2014.
Official Communication for Great Britain Patent Application No. 1408025.3 dated Nov. 6, 2014.
Official Communication for New Zealand Patent Application No. 628161 dated Aug. 25, 2014.
Official Communication for Great Britain Patent Application No. 1404574.4 dated Dec. 18, 2014.
Official Communication for Great Britain Patent Application No. 1411984.6 dated Dec. 22, 2014.
Official Communication for European Patent Application No. 14180281.9 dated Jan. 26, 2015.
Official Communication for European Patent Application No. 14187996.5 dated Feb. 12, 2015.
Official Communication for European Patent Application No. 14180142.3 dated Feb. 6, 2015.
Official Communication for European Patent Application No. 14186225.0 dated Feb. 13, 2015.
Official Communication for European Patent Application No. 14189344.6 dated Feb. 20, 2015.
Official Communication for Australian Patent Application No. 2014201511 dated Feb. 27, 2015.
Official Communication for European Patent Application No. 14189347.9 dated Mar. 4, 2015.
Official Communication for European Patent Application No. 14199182.8 dated Mar. 13, 2015.
Official Communication for Australian Patent Application No. 2014202442 dated Mar. 19, 2015.
Official Communication for European Patent Application No. 14180321.3 dated Apr. 17, 2015.
Official Communication for European Patent Application No. 14197879.1 dated Apr. 28, 2015.
Official Communication for European Patent Application No. 14197895.7 dated Apr. 28, 2015.
Official Communication for European Patent Application No. 14189802.3 dated May 11, 2015.
Official Communication for European Patent Application No. 14191540.5 dated May 27, 2015.
Official Communication for Australian Patent Application No. 2014213553 dated May 7, 2015.
Official Communication for Australian Patent Application No. 2014203669 dated May 29, 2015.
Official Communication for Australian Patent Application No. 2014210604 dated Jun. 5, 2015.
Official Communication for Australian Patent Application No. 2014210614 dated Jun. 5, 2015.
Official Communication for Australian Patent Application No. 2014250678 dated Jun. 17, 2015.
Official Communication for European Patent Application No. 14180432.8 dated Jun. 23, 2015.
Official Communication for European Patent Application No. 14199180.2 dated Jun. 22, 2015.
Official Communication for European Patent Application No. 14187739.9 dated Jul. 6, 2015.
Wikipedia, “Federated Database System,” Sep. 7, 2013, retrieved from the internet on Jan. 27, 2015 http://en.wikipedia.org/w/index.php?title=Federated_database_system&oldid=571954221.
Yang et al., “HTML Page Analysis Based on Visual Cues,” 2001, pp. 859-864.
APPACTS, “Smart Thinking for Super Apps,” <http://www.appacts.com> Printed Jul. 18, 2013 in 4 pages.
APSALAR, “Data Powered Mobile Advertising,” “Free Mobile App Analytics” and various analytics related screen shots <http://apsalar.com> Printed Jul. 18, 2013 in 8 pages.
Capptain—Pilot Your Apps, <http://www.capptain.com> Printed Jul. 18, 2013 in 6 pages.
Cohn et al., “Semi-supervised Clustering with User Feedback,” Constrained Clustering: Advances in Algorithms, Theory, and Applications 4.1, 2003, pp. 17-32.
Countly Mobile Analytics, <http://count.ly/> Printed Jul. 18, 2013 in 9 pages.
DISTIMO—App Analytics, <http://www.distimo.com/app-analytics> Printed Jul. 18, 2013 in 5 pages.
Flurry Analytics, <http://www.flurry.com/> Printed Jul. 18, 2013 in 14 pages.
Google Analytics Official Website—Web Analytics & Reporting, <http://www.google.com/analytics.index.html> Printed Jul. 18, 2013 in 22 pages.
Gu et al., “Record Linkage: Current Practice and Future Directions,” Jan. 15, 2004, pp. 32.
Hua et al., “A Multi-attribute Data Structure with Parallel Bloom Filters for Network Services” HiPC 2006, LNCS 4297, pp. 277-288, 2006.
“HunchLab: Heat Map and Kernel Density Calculation for Crime Analysis,” Azavea Journal, printed from www.azavea.com/blogs/newsletter/v4i4/kernel-density-capabilities-added-to-hunchlab/ on Sep. 9, 2014, 2 pages.
Kontagent Mobile Analytics, <http://www.kontagent.com/> Printed Jul. 18, 2013 in 9 pages.
Localytics—Mobile App Marketing & Analytics, <http://www.localytics.com/> Printed Jul. 18, 2013 in 12 pages.
Mixpanel—Mobile Analytics, <https://mixpanel.com/> Printed Jul. 18, 2013 in 13 pages.
Open Web Analytics (OWA), <http://www.openwebanalytics.com/> Printed Jul. 19, 2013 in 5 pages.
Piwik—Free Web Analytics Software. <http://piwik.org/> Printed Jul. 19, 2013 in 18 pages.
StatCounter—Free Invisible Web Tracker, Hit Counter and Web Stats, <http://statcounter.com/> Printed Jul. 19, 2013 in 17 pages.
TestFlight—Beta Testing on the Fly, <http://testflightapp.com/> Printed Jul. 18, 2013 in 3 pages.
trak.io, <http://trak.io/> printed Jul. 18, 2013 in 3 pages.
UserMetrix, <http://usermetrix.com/android-analytics> printed Jul. 18, 2013 in 3 pages.
Vose et al., “Help File for ModelRisk Version 5,” 2007, Vose Software, pp. 349-353. [Uploaded in 2 Parts].
Wang et al., “Research on a Clustering Data De-Duplication Mechanism Based on Bloom Filter,” IEEE 2010, 5 pages.
Official Communication for New Zealand Patent Application No. 622473 dated Mar. 27, 2014.
Official Communication for New Zealand Patent Application No. 622473 dated Jun. 19, 2014.
Official Communication for Great Britain Patent Application No. 1404499.4 dated Aug. 20, 2014.
Official Communication for Great Britain Patent Application No. 1404486.1 dated Aug. 27, 2014.
Official Communication for Great Britain Patent Application No. 1404489.5 dated Aug. 27, 2014.
Official Communication for Great Britain Patent Application No. 1404499.4 dated Sep. 29, 2014.
Official Communication for Great Britain Patent Application No. 1404489.5 dated Oct. 6, 2014.
Official Communication for European Patent Application No. 14197938.5 dated Apr. 28, 2015.
Official Communication for European Patent Application No. 14200298.9 dated May 13, 2015.
Official Communication for Great Britain Patent Application No. 1404486.1 dated May 21, 2015.
Official Communication for Great Britain Patent Application No. 1404489.5 dated May 21, 2015.
Official Communication for Great Britain Patent Application No. 1404499.4 dated Jun. 11, 2015.
Official Communication for European Patent Application No. 14199180.2 dated Aug. 31, 2015.
Official Communication for European Patent Application No. 15181419.1 dated Sep. 29, 2015.
U.S. Appl. No. 14/533,433, filed Nov. 5, 2014, Office Action, dated Feb. 26, 2015.
U.S. Appl. No. 14/319,161, filed Jun. 30, 2014, Final Office Action, dated Jan. 23, 2015.
U.S. Appl. No. 14/490,612, filed Sep. 18, 2014, Office Action, dated Jan. 27, 2015.
U.S. Appl. No. 14/508,696, filed Oct. 7, 2014, Office Action, dated Mar. 2, 2015.
U.S. Appl. No. 14/319,161, filed Jun. 30, 2014, Notice of Allowance, dated May 4, 2015.
U.S. Appl. No. 14/044,800, filed Oct. 2, 2013, Notice of Allowance, dated Sep. 2, 2014.
U.S. Appl. No. 14/148,568, filed Jan. 6, 2014, Final Office Action, dated Oct. 22, 2014.
U.S. Appl. No. 14/148,568, filed Jan. 6, 2014, Office Action, dated Mar. 26, 2015.
U.S. Appl. No. 14/025,653, filed Sep. 12, 2013, Office Action Interview, dated Oct. 6, 2015.
U.S. Appl. No. 14/134,558, filed Dec. 19, 2013, Office Action, dated Oct. 7, 2015.
U.S. Appl. No. 14/025,653, filed Sep. 12, 2013, Interview Summary, dated Mar. 3, 2016.
U.S. Appl. No. 14/874,690, filed Oct. 5, 2015, Notice of Allowance, dated Oct. 5, 2016.
U.S. Appl. No. 14/533,433, filed Nov. 5, 2014, Notice of Allowance, dated Sep. 1, 2015.
U.S. Appl. No. 14/879,916, filed Oct. 9, 2015, Notice of Allowance, dated Jun. 22, 2016.
U.S. Appl. No. 14/526,066, filed Oct. 28, 2014, Final Office Action, dated May 6, 2016.
U.S. Appl. No. 14/504,103, filed Oct. 1, 2014, Notice of Allowance, dated Sep. 9, 2014.
U.S. Appl. No. 14/306,138, filed Jun. 16, 2014, Final Office Action, dated Sep. 14, 2015.
U.S. Appl. No. 13/196,788, filed Aug. 2, 2011, Interview Summary, dated Nov. 25, 2015.
U.S. Appl. No. 14/923,374, filed Oct. 26, 2015, Notice of Allowance, dated May 8, 2014.
U.S. Appl. No. 14/134,558, filed Dec. 19, 2013, Final Office Action, dated May 16, 2016.
U.S. Appl. No. 13/196,788, filed Aug. 2, 2011, Notice of Allowance, dated Dec. 18, 2015.
U.S. Appl. No. 14/319,161, filed Jun. 30, 2014, Office Action, dated Sep. 25, 2014.
U.S. Appl. No. 13/557,100, filed Jul. 24, 2012, Final Office Action, dated Apr. 7, 2016.
U.S. Appl. No. 14/631,633, filed Feb. 25, 2015, First Office Interview, dated Feb. 3, 2016.
U.S. Appl. No. 14/148,568, filed Jan. 6, 2014, Notice of Allowance, dated Aug. 26, 2015.
U.S. Appl. No. 14/508,696, filed Oct. 7, 2014, Notice of Allowance, dated Jul. 27, 2015.
U.S. Appl. No. 14/526,066, filed Oct. 28, 2014, Office Action, dated Jan. 21, 2016.
U.S. Appl. No. 14/726,211, filed May 29, 2015, Office Action, dated Apr. 5, 2016.
U.S. Appl. No. 14/734,772, filed Jun. 9, 2015, Notice of Allowance, dated Apr. 27, 2016.
U.S. Appl. No. 14/841,338, filed Aug. 31, 2015, Office Action, dated Feb. 18, 2016.
U.S. Appl. No. 14/580,218, filed Dec. 23, 2014, Office Action, dated Jun. 7, 2016.
U.S. Appl. No. 14/961,830, filed Dec. 7, 2015, Office Action, dated May 20, 2016.
U.S. Appl. No. 14/849,454, filed Sep. 9, 2015, Notice of Allowance, dated May 25, 2016.
U.S. Appl. No. 14/996,179, filed Jan. 14, 2016, First Office Action Interview, dated May 20, 2016.
U.S. Appl. No. 14/578,389, filed Dec. 20, 2014, Office Action, dated Apr. 22, 2016.
U.S. Appl. No. 13/196,788, filed Aug. 2, 2011, Office Action, dated Oct. 23, 2015.
U.S. Appl. No. 14/734,772, filed Jun. 9, 2015, First Office Action Interview, dated Oct. 30, 2015.
U.S. Appl. No. 14/578,389, filed Dec. 20, 2014, Office Action, dated Oct. 21, 2015.
U.S. Appl. No. 14/879,916, filed Oct. 9, 2015, First Office Action Interview, dated Apr. 15, 2016.
U.S. Appl. No. 14/278,963, filed May 15, 2014, Notice of Allowance, dated Sep. 2, 2015.
U.S. Appl. No. 15/287,715, filed Oct. 6, 2016, Office Action, dated Aug. 16, 2017.
U.S. Appl. No. 14/954,680, filed Nov. 30, 2015, Office Action, dated May 12, 2016.
U.S. Appl. No. 15/369,753, filed Dec. 5, 2016, First Office Action Interview, dated Aug. 28, 2017.
U.S. Appl. No. 14/874,690, filed Oct. 5, 2015, Office Action, dated Jun. 1, 2016.
U.S. Appl. No. 13/922,437, filed Jun. 20, 2013, Notice of Allowance, dated Jul. 3, 2014.
U.S. Appl. No. 15/262,207, filed Sep. 12, 2016, Final Office Action, dated Jun. 8, 2017.
U.S. Appl. No. 15/262,207, filed Sep. 12, 2016, Office Action, dated Feb. 21, 2017.
“A First Look: Predicting Market Demand for Food Retail using a Huff Analysis,” TRF Policy Solutions, Jul. 2012, pp. 30.
“A Quick Guide to UniProtKB Swiss-Prot & TrEMBL,” Sep. 2011, pp. 2.
“A Word About Banks and the Laundering of Drug Money,” Aug. 18, 2012, http://www.golemxiv.co.uk/2012/08/a-word-about-banks-and-the-laundering-of-drug-money/.
Acklen, Laura, “Absolute Beginner's Guide to Microsoft Word 2003,” Dec. 24, 2003, pp. 15-18, 34-41, 308-316.
Amnet, “5 Great Tools for Visualizing Your Twitter Followers,” posted Aug. 4, 2010, http://www.amnetblog.com/component/content/article/115-5-grate-tools-for-visualizing-your-twitter-followers.html.
Ananiev et al., “The New Modality API,” http://web.archive.org/web/20061211011958/http://java.sun.com/developer/technicalArticles/J2SE/Desktop/javase6/modality/ Jan. 21, 2006, pp. 8.
Bluttman et al., “Excel Formulas and Functions for Dummies,” 2005, Wiley Publishing, Inc., pp. 280, 284-286.
Boyce, Jim, “Microsoft Outlook 2010 Inside Out,” Aug. 1, 2010, retrieved from the internet https://capdtron.files.wordpress.com/2013/01/outlook-2010-inside_out.pdf.
Bugzilla@Mozilla, “Bug 18726—[feature] Long-click means of invoking contextual menus not supported,” http://bugzilla.mozilla.org/show_bug.cgi?id=18726 printed Jun. 13, 2013 in 11 pages.
Canese et al., “Chapter 2: PubMed: The Bibliographic Database,” The NCBI Handbook, Oct. 2002, pp. 1-10.
Celik, Tantek, “CSS Basic User Interface Module Level 3 (CSS3 UI),” Section 8 Resizing and Overflow, Jan. 17, 2012, retrieved from internet http://www.w3.org/TR/2012/WD-css3-ui-20120117/#resizing-amp-overflow retrieved on May 18, 2015.
Chen et al., “Bringing Order to the Web: Automatically Categorizing Search Results,” CHI 2000, Proceedings of the SIGCHI conference on Human Factors in Computing Systems, Apr. 1-6, 2000, The Hague, The Netherlands, pp. 145-152.
Chung, Chin-Wan, “Dataplex: An Access to Heterogeneous Distributed Databases,” Communications of the ACM, Association for Computing Machinery, Inc., vol. 33, No. 1, Jan. 1, 1990, pp. 70-80.
Conner, Nancy, “Google Apps: The Missing Manual,” May 1, 2008, pp. 15.
Definition “Identify” downloaded Jan. 22, 2015, 1 page.
Definition “Overlay” downloaded Jan. 22, 2015, 1 page.
Delcher et al., “Identifying Bacterial Genes and Endosymbiont DNA with Glimmer,” BioInformatics, vol. 23, No. 6, 2007, pp. 673-679.
Dramowicz, Ela, “Retail Trade Area Analysis Using the Huff Model,” Directions Magazine, Jul. 2, 2005 in 10 pages, http://www.directionsmag.com/articles/retail-trade-area-analysis-using-the-huff-model/123411.
GIS-NET 3 Public—Department of Regional Planning. Planning & Zoning Information for Unincorporated LA County. Retrieved Oct. 2, 2013 from http://gis.planning.lacounty.gov/GIS-NET3_Public/Viewer.html.
Goswami, Gautam, “Quite Writly Said!,” One Brick at a Time, Aug. 21, 2005, pp. 7.
Griffith, Daniel A., “A Generalized Huff Model,” Geographical Analysis, Apr. 1982, vol. 14, No. 2, pp. 135-144.
Hansen et al., “Analyzing Social Media Networks with NodeXL: Insights from a Connected World,” Chapter 4, pp. 53-67 and Chapter 10, pp. 143-164, published Sep. 2010.
Hardesty, “Privacy Challenges: Analysis: It's Surprisingly Easy to Identify Individuals from Credit-Card Metadata,” MIT News on Campus and Around the World, MIT News Office, Jan. 29, 2015, 3 pages.
Hibbert et al., “Prediction of Shopping Behavior Using a Huff Model Within a GIS Framework,” Healthy Eating in Context, Mar. 18, 2011, pp. 16.
Hogue et al., “Thresher: Automating the Unwrapping of Semantic Content from the World Wide Web,” 14th International Conference on World Wide Web, WWW 2005: Chiba, Japan, May 10-14, 2005, pp. 86-95.
Huff et al., “Calibrating the Huff Model Using ArcGIS Business Analyst,” ESRI, Sep. 2008, pp. 33.
Huff, David L., “Parameter Estimation in the Huff Model,” ESRI, ArcUser, Oct.-Dec. 2003, pp. 34-36.
Johnson, Steve, “Access 2013 on demand,” Access 2013 on Demand, May 9, 2013, Que Publishing.
Kahan et al., “Annotea: an open RDF infrastructure for shared WEB annotations,” Computer Networks 39, pp. 589-608, 2002.
Keylines.com, “An Introduction to KeyLines and Network Visualization,” Mar. 2014, <http://keylines.com/wp-content/uploads/2014/03/KeyLines-White-Paper.pdf> downloaded May 12, 2014 in 8 pages.
Keylines.com, “KeyLines Datasheet,” Mar. 2014, <http://keylines.com/wp-content/uploads/2014/03/KeyLines-datasheet.pdf> downloaded May 12, 2014 in 2 pages.
Keylines.com, “Visualizing Threats: Improved Cyber Security Through Network Visualization,” Apr. 2014, <http://keylines.com/wp-content/uploads/2014/04/Visualizing-Threats1.pdf> downloaded May 12, 2014 in 10 pages.
Kitts, Paul, “Chapter 14: Genome Assembly and Annotation Process,” The NCBI Handbook, Oct. 2002, pp. 1-21.
Li et al., “Interactive Multimodal Visual Search on Mobile Device,” IEEE Transactions on Multimedia, vol. 15, No. 3, Apr. 1, 2013, pp. 594-607.
Liu, Tianshun, “Combining GIS and the Huff Model to Analyze Suitable Locations for a New Asian Supermarket in the Minneapolis and St. Paul, Minnesota USA,” Papers in Resource Analysis, 2012, vol. 14, pp. 8.
Madden, Tom, “Chapter 16: The BLAST Sequence Analysis Tool,” The NCBI Handbook, Oct. 2002, pp. 1-15.
Manno et al., “Introducing Collaboration in Single-user Applications through the Centralized Control Architecture,” 2010, pp. 10.
Manske, “File Saving Dialogs,” <http://www.mozilla.org/editor/ui_specs/FileSaveDialogs.html>, Jan. 20, 1999, pp. 7.
Map of San Jose, CA. Retrieved Oct. 2, 2013 from http://maps.bing.com.
Map of San Jose, CA. Retrieved Oct. 2, 2013 from http://maps.google.com.
Map of San Jose, CA. Retrieved Oct. 2, 2013 from http://maps.yahoo.com.
Microsoft—Developer Network, “Getting Started with VBA in Word 2010,” Apr. 2010, <http://msdn.microsoft.com/en-us/library/ff604039%28v=office.14%29.aspx> as printed Apr. 4, 2014 in 17 pages.
Microsoft Office—Visio, “About connecting shapes,” <http://office.microsoft.com/en-us/visio-help/about-connecting-shapes-HP085050369.aspx> printed Aug. 4, 2011 in 6 pages.
Microsoft Office—Visio, “Add and glue connectors with the Connector tool,” <http://office.microsoft.com/en-us/visio-help/add-and-glue-connectors-with-the-connector-tool-HA010048532.aspx?CTT=1> printed Aug. 4, 2011 in 1 page.
Mizrachi, Ilene, “Chapter 1: GenBank: The Nucleotide Sequence Database,” The NCBI Handbook, Oct. 2002, pp. 1-14.
Nierman, “Evaluating Structural Similarity in XML Documents,” 2002, 6 pages.
Olanoff, Drew, “Deep Dive with the New Google Maps for Desktop with Google Earth Integration, It's More than Just a Utility,” May 15, 2013, pp. 1-6, retrieved from the internet: http://web.archive.org/web/20130515230641/http://techcrunch.com/2013/05/15/deep-dive-with-the-new-google-maps-for-desktop-with-google-earth-integration-its-more-than-just-a-utility/.
Palmas et al., “An Edge-Bundling Layout for Interactive Parallel Coordinates,” 2014 IEEE Pacific Visualization Symposium, pp. 57-64.
Pythagoras Communications Ltd., “Microsoft CRM Duplicate Detection,” Sep. 13, 2011, https://www.youtube.com/watch?v=j-7Qis0D0Kc.
“Potential Money Laundering Warning Signs,” snapshot taken 2003, https://web.archive.org/web/20030816090055/http:/finsolinc.com/ANTI-MONEY%20LAUNDERING%20TRAINING%20GUIDES.pdf.
“Refresh CSS Ellipsis When Resizing Container—Stack Overflow,” Jul. 31, 2013, retrieved from internet http://stackoverflow.com/questions/17964681/refresh-css-ellipsis-when-resizing-container, retrieved on May 18, 2015.
Rouse, Margaret, “OLAP Cube,” <http://searchdatamanagement.techtarget.com/definition/OLAP-cube>, Apr. 28, 2012, pp. 16.
Sigrist, et al., “PROSITE, a Protein Domain Database for Functional Characterization and Annotation,” Nucleic Acids Research, 2010, vol. 38, pp. D161-D166.
Sirotkin et al., “Chapter 13: The Processing of Biological Sequence Data at NCBI,” The NCBI Handbook, Oct. 2002, pp. 1-11.
“The FASTA Program Package,” fasta-36.3.4, Mar. 25, 2011, pp. 29.
Thompson, Mick, “Getting Started with GEO,” Getting Started with GEO, Jul. 26, 2011.
Umagandhi et al., “Search Query Recommendations Using Hybrid User Profile with Query Logs,” International Journal of Computer Applications, vol. 80, No. 10, Oct. 1, 2013, pp. 7-18.
U.S. Appl. No. 14/746,671, filed Jun. 22, 2015, Notice of Allowance, dated Jan. 21, 2016.
U.S. Appl. No. 14/225,006, filed Mar. 25, 2014, Advisory Action, dated Dec. 21, 2015.
U.S. Appl. No. 14/306,147, filed Jun. 16, 2014, Final Office Action, dated Dec. 24, 2015.
U.S. Appl. No. 14/800,447, filed Jul. 15, 2012, First Office Action Interview, dated Dec. 10, 2010.
U.S. Appl. No. 14/306,138, filed Jun. 16, 2014, Interview Summary, dated Dec. 24, 2015.
U.S. Appl. No. 14/225,084, filed Mar. 25, 2014, Interview Summary, dated Jan. 4, 2016.
U.S. Appl. No. 14/306,138, filed Jun. 16, 2014, Interview Summary, dated Dec. 3, 2015.
U.S. Appl. No. 14/746,671, filed Jun. 22, 2015, First Office Action Interview, dated Nov. 12, 2015.
U.S. Appl. No. 14/948,009, filed Nov. 20, 2015, First Action Interview, dated Feb. 25, 2016.
U.S. Appl. No. 14/645,304, filed Mar. 11, 2015, Office Action, dated Jan. 25, 2016.
U.S. Appl. No. 14/874,690, filed Oct. 5, 2015, First Action Interview, dated Dec. 21, 2015.
U.S. Appl. No. 14/319,765, filed Jun. 30, 2014, Office Action, dated Feb. 1, 2016.
U.S. Appl. No. 13/247,987, filed Sep. 28, 2011, Notice of Allowance, dated Mar. 17, 2016.
U.S. Appl. No. 14/877,229, filed Oct. 7, 2015, Office Action, dated Mar. 22, 2016.
U.S. Appl. No. 14/306,138, filed Jun. 16, 2014, Office Action, dated Mar. 17, 2016.
U.S. Appl. No. 14/306,154, filed Jun. 16, 2014, Office Action, dated Mar. 17, 2016.
U.S. Appl. No. 14/323,935, filed Jul. 3, 2014, Notice of Allowance, dated Oct. 1, 2015.
U.S. Appl. No. 14/948,009, filed Nov. 20, 2015, Notice of Allowance, dated May 6, 2016.
U.S. Appl. No. 14/094,418, filed Dec. 2, 2013, Notice of Allowance, dated Jan. 25, 2016.
U.S. Appl. No. 14/223,918, filed Mar. 24, 2014, Notice of Allowance, dated Jan. 6, 2016.
U.S. Appl. No. 14/849,545, filed Sep. 9, 2015, Office Action, dated Jan. 29, 2016.
U.S. Appl. No. 14/849,545, filed Sep. 9, 2015, Interview Summary, dated Feb. 24, 2016.
U.S. Appl. No. 14/102,394, filed Dec. 10, 2013, Notice of Allowance, dated Aug. 25, 2014.
U.S. Appl. No. 14/108,187, filed Dec. 16, 2013, Notice of Allowance, dated Aug. 29, 2014.
U.S. Appl. No. 14/135,289, filed Dec. 19, 2013, Notice of Allowance, dated Oct. 14, 2014.
U.S. Appl. No. 14/268,964, filed May 2, 2014, Notice of Allowance, dated Dec. 3, 2014.
U.S. Appl. No. 14/616,080, filed Feb. 6, 2015, Notice of Allowance, dated Apr. 2, 2015.
U.S. Appl. No. 14/486,994, filed Sep. 15, 2014, Notice of Allowance, dated May 1, 2015.
U.S. Appl. No. 14/225,084, filed Mar. 25, 2014, Notice of Allowance, dated May 4, 2015.
U.S. Appl. No. 14/504,103, filed Oct. 1, 2014, Notice of Allowance, dated May 18, 2015.
U.S. Appl. No. 14/289,596, filed May 28, 2014, First Office Action Interview, dated Jul. 18, 2014.
U.S. Appl. No. 14/289,599, filed May 28, 2014, First Office Action Interview, dated Jul. 22, 2014.
U.S. Appl. No. 14/294,098, filed Jun. 2, 2014, First Office Action Interview, dated Aug. 15, 2014.
U.S. Appl. No. 14/148,568, filed Jan. 6, 2014, Final Office Action, dated Oct. 22, 2014.
U.S. Appl. No. 14/294,098, filed Jun. 2, 2014, Final Office Action, dated Nov. 6, 2014.
U.S. Appl. No. 14/306,147, filed Jun. 16, 2014, First Office Action Interview, dated Sep. 9, 2014.
U.S. Appl. No. 14/306,154, filed Jun. 16, 2014, First Office Action Interview, dated Sep. 9, 2014.
U.S. Appl. No. 14/319,765, filed Jun. 30, 2014, First Office Action Interview, dated Nov. 25, 2014.
U.S. Appl. No. 14/323,935, filed Jul. 3, 2014, First Office Action Interview, dated Nov. 28, 2014.
U.S. Appl. No. 14/326,738, filed Jul. 9, 2014, First Office Action Interview, dated Dec. 2, 2014.
U.S. Appl. No. 14/225,160, filed Mar. 25, 2014, First Office Action Interview, dated Oct. 22, 2014.
U.S. Appl. No. 14/289,596, filed May 28, 2014, Final Office Action, dated Jan. 26, 2015.
U.S. Appl. No. 14/306,154, filed Jun. 16, 2014, Final Office Action, dated Mar. 11, 2015.
U.S. Appl. No. 13/247,987, filed Sep. 28, 2011, Office Action, dated Apr. 2, 2015.
U.S. Appl. No. 14/196,814, filed Mar. 4, 2014, Office Action, dated May 5, 2015.
U.S. Appl. No. 14/639,606, filed Mar. 5, 2015, First Office Action Interview, dated May 18, 2015.
U.S. Appl. No. 14/579,752, filed Dec. 22, 2014, First Office Action Interview, dated May 26, 2015.
U.S. Appl. No. 14/306,154, filed Jun. 16, 2014, Advisory Action, dated May 15, 2015.
U.S. Appl. No. 14/289,599, filed May 28, 2014, Final Office Action, dated May 29, 2015.
U.S. Appl. No. 14/225,160, filed Mar. 25, 2014, Advisory Action, dated May 20, 2015.
U.S. Appl. No. 14/289,596, filed May 28, 2014, Advisory Action, dated Apr. 30, 2015.
U.S. Appl. No. 14/319,765, filed Jun. 30, 2014, Final Office Action, dated Jun. 16, 2015.
U.S. Appl. No. 14/306,138, filed Jun. 16, 2014, Office Action, dated May 26, 2015.
U.S. Appl. No. 13/835,688, filed Mar. 15, 2013, First Office Action Interview, dated Jun. 17, 2015.
U.S. Appl. No. 14/323,935, filed Jul. 3, 2014, Office Action, dated Jun. 22, 2015.
U.S. Appl. No. 14/225,160, filed Mar. 25, 2014, First Office Action Interview, dated Jul. 29, 2014.
U.S. Appl. No. 14/225,084, filed Mar. 25, 2014, First Office Action Interview, dated Sep. 2, 2014.
U.S. Appl. No. 14/225,006, filed Mar. 25, 2014, First Office Action Interview, dated Sep. 10, 2014.
U.S. Appl. No. 14/225,160, filed Mar. 25, 2014, Office Action, dated Aug. 12, 2015.
U.S. Appl. No. 14/225,006, filed Mar. 25, 2014, First Office Action Interview, dated Feb. 27, 2015.
U.S. Appl. No. 14/225,084, filed Mar. 25, 2014, First Office Action Interview, dated Feb. 20, 2015.
U.S. Appl. No. 14/225,160, filed Mar. 25, 2014, Final Office Action, dated Feb. 11, 2015.
U.S. Appl. No. 14/473,860, filed Aug. 29, 2014, Notice of Allowance, dated Jan. 5, 2015.
U.S. Appl. No. 14/192,767, filed Feb. 27, 2014, Notice of Allowance, dated Dec. 16, 2014.
U.S. Appl. No. 14/294,098, filed Jun. 2, 2014, Notice of Allowance, dated Dec. 29, 2014.
U.S. Appl. No. 14/473,552, filed Aug. 29, 2014, Notice of Allowance, dated Jul. 24, 2015.
U.S. Appl. No. 14/306,138, filed Jun. 16, 2014, First Office Action Interview, dated Sep. 23, 2014.
U.S. Appl. No. 14/486,991, filed Sep. 15, 2014, Office Action, dated Mar. 10, 2015.
U.S. Appl. No. 13/831,791, filed Mar. 15, 2013, Office Action, dated Mar. 4, 2015.
U.S. Appl. No. 14/323,935, filed Jul. 3, 2014, First Office Action Interview, dated Mar. 31, 2015.
U.S. Appl. No. 14/326,738, filed Jul. 9, 2014, First Office Action Interview, dated Mar. 31, 2015.
U.S. Appl. No. 14/504,103, filed Oct. 1, 2014, First Office Action Interview, dated Mar. 31, 2015.
U.S. Appl. No. 14/504,103, filed Oct. 1, 2014, First Office Action Interview, dated Feb. 5, 2015.
U.S. Appl. No. 14/319,765, filed Jun. 30, 2014, First Office Action Interview, dated Feb. 4, 2015.
U.S. Appl. No. 14/306,138, filed Jun. 16, 2014, Final Office Action, dated Feb. 18, 2015.
U.S. Appl. No. 14/306,147, filed Jun. 16, 2014, Final Office Action, dated Feb. 19, 2015.
U.S. Appl. No. 14/473,552, filed Aug. 29, 2014, First Office Action Interview, dated Feb. 24, 2015.
U.S. Appl. No. 14/268,964, filed May 2, 2014, Office Action, dated Sep. 3, 2014.
U.S. Appl. No. 13/839,026, filed Mar. 15, 2013, Restriction Requirement, dated Apr. 2, 2015.
U.S. Appl. No. 12/556,318, filed Sep. 9, 2009, Office Action, dated Jul. 2, 2015.
U.S. Appl. No. 13/839,026, filed Mar. 15, 2013, Office Action, dated Aug. 4, 2015.
U.S. Appl. No. 14/306,154, filed Jun. 16, 2014, Office Action, dated Jul. 6, 2015.
U.S. Appl. No. 14/639,606, filed Mar. 5, 2015, First Office Action Interview, dated Jul. 24, 2015.
U.S. Appl. No. 14/326,738, filed Jul. 9, 2014, Final Office Action, dated Jul. 31, 2015.
U.S. Appl. No. 14/306,147, filed Jun. 16, 2014, Office Action, dated Aug. 7, 2015.
U.S. Appl. No. 14/579,752, filed Dec. 22, 2014, Final Office Action, dated Aug. 19, 2015.
U.S. Appl. No. 14/479,863, filed Sep. 8, 2014, Notice of Allowance, dated Mar. 31, 2015.
U.S. Appl. No. 14/552,336, filed Nov. 24, 2014, Notice of Allowance, dated Nov. 3, 2015.
U.S. Appl. No. 14/326,738, filed Jul. 9, 2014, Notice of Allowance, dated Nov. 18, 2015.
U.S. Appl. No. 14/451,221, filed Aug. 4, 2014, Office Action, dated Oct. 21, 2014.
U.S. Appl. No. 14/463,615, filed Aug. 19, 2014, First Office Action Interview, dated Nov. 13, 2014.
U.S. Appl. No. 13/827,491, filed Mar. 14, 2013, Office Action, dated Dec. 1, 2014.
U.S. Appl. No. 14/479,863, filed Sep. 8, 2014, First Office Action Interview, dated Dec. 26, 2014.
U.S. Appl. No. 14/483,527, filed Sep. 11, 2014, First Office Action Interview, dated Jan. 28, 2015.
U.S. Appl. No. 14/463,615, filed Aug. 19, 2014, First Office Action Interview, dated Jan. 28, 2015.
U.S. Appl. No. 14/571,098, filed Dec. 15, 2014, First Office Action Interview, dated Mar. 11, 2015.
U.S. Appl. No. 14/463,615, filed Aug. 19, 2014, Final Office Action, dated May 21, 2015.
U.S. Appl. No. 13/827,491, filed Mar. 14, 2013, Final Office Action, dated Jun. 22, 2015.
U.S. Appl. No. 14/483,527, filed Sep. 11, 2014, Final Office Action, dated Jun. 22, 2015.
U.S. Appl. No. 14/552,336, filed Nov. 24, 2014, First Office Action Interview, dated Jul. 20, 2015.
U.S. Appl. No. 14/676,621, filed Apr. 1, 2015, First Office Action Interview, dated Jul. 30, 2015.
U.S. Appl. No. 14/571,098, filed Dec. 15, 2014, First Office Action Interview, dated Aug. 5, 2015.
U.S. Appl. No. 14/225,006, filed Mar. 25, 2014, Final Office Action, dated Sep. 2, 2015.
U.S. Appl. No. 14/631,633, filed Feb. 25, 2015, First Office Action Interview, dated Sep. 10, 2015.
U.S. Appl. No. 14/463,612, filed Aug. 19, 2014, Advisory Action, dated Sep. 10, 2015.
U.S. Appl. No. 14/306,138, filed Jun. 16, 2014, First Office Action Interview, dated Sep. 23, 2015.
U.S. Appl. No. 14/676,621, filed Apr. 1, 2015, Final Office Action, dated Oct. 29, 2015.
U.S. Appl. No. 14/319,765, filed Jun. 30, 2014, Advisory Action, dated Sep. 10, 2015.
U.S. Appl. No. 14/571,098, filed Dec. 15, 2014, First Office Action Interview, dated Aug. 24, 2015.
U.S. Appl. No. 14/225,084, filed Mar. 25, 2014, Office Action, dated Sep. 11, 2015.
U.S. Appl. No. 14/562,524, filed Dec. 5, 2014, First Office Action Interview, dated Sep. 14, 2015.
U.S. Appl. No. 14/813,749, filed Jul. 30, 2015, Office Action, dated Sep. 28, 2015.
U.S. Appl. No. 14/746,671, filed Jun. 22, 2015, First Office Action Interview, dated Sep. 28, 2015.
U.S. Appl. No. 14/141,252, filed Dec. 26, 2013, Office Action, dated Oct. 8, 2015.
U.S. Appl. No. 13/827,471, filed Mar. 14, 2013, Office Action, dated Oct. 9, 2015.
U.S. Appl. No. 14/483,527, filed Sep. 11, 2014, Office Action, dated Oct. 28, 2015.
U.S. Appl. No. 14/571,098, filed Dec. 15, 2014, First Office Action Interview, dated Nov. 10, 2015.
U.S. Appl. No. 14/562,524, filed Dec. 5, 2014, First Office Action Interview, dated Nov. 10, 2015.
U.S. Appl. No. 14/306,154, filed Jun. 16, 2014, Final Office Action, dated Nov. 16, 2015.
U.S. Appl. No. 14/842,734, filed Sep. 1, 2015, First Office Action Interview, dated Nov. 19, 2015.
Official Communication for European Patent Application No. 16182336.4 dated Dec. 23, 2016.
Official Communication for Great Britain Patent Application No. 1413935.6 dated Dec. 21, 2015.
Official Communication for Netherlands Patent Application No. 2012436 dated Nov. 6, 2015.
Official Communication for Israel Patent Application No. 198253 dated Jan. 12, 2016.
QUEST, “Toad for ORACLE 11.6—Guide to Using Toad,” Sep. 24, 2012, pp. 1-162.
Official Communication for European Patent Application No. 15155845.9 dated Oct. 6, 2015.
Official Communication for European Patent Application No. 14159464.8 dated Feb. 18, 2016.
Official Communication for European Patent Application No. 16188060.4 dated Feb. 6, 2017.
Official Communication for Netherlands Patent Application No. 2012434 dated Jan. 8, 2016.
Official Communication for European Patent Application No. 14158977.0 dated Mar. 11, 2016.
Symantec Corporation, “E-Security Begins with Sound Security Policies,” Announcement Symantec, Jun. 14, 2001.
Official Communication for European Patent Application No. 14158958.0 dated Mar. 11, 2016.
Official Communication for European Patent Application No. 15183721.8 dated Nov. 23, 2015.
Official Communication for Great Britain Patent Application No. 1404479.6 dated Jul. 9, 2015.
European Claims in application No. 16182336.4-1952, dated Dec. 2016, 3 pages.
European Claims in application No. 16194936.7-1871, dated Mar. 3, 2017, 3 pages.
European Patent Office, “Search Report” in application No. 16182336.4-1952, dated Dec. 23, 2016, 10 pages.
European Patent Office, “Search Report” in application No. 16194936.7-1871, dated Mar. 9, 2017, 8 pages.
Related Publications (1)
Number Date Country
20170039253 A1 Feb 2017 US