Businesses must process large amounts of data to make strategic decisions and be successful. The data is often provided in formats such as reports. To build a meaningful report, businesses are relying on multi-tenanted software as a service (SAAS) analytic companies. Building and providing meaningful analytics typically require a large amount of resources and have a high cost.
In order to reduce cost, more and more businesses are adapting to cloud based SAAS application models. For example, businesses may store sales data in “Salesforce” applications, accounting data in “NetSuite” applications, and billing data in “Zuora” applications. It is important to have detailed information about a company's performance and positions, both present and past. Unfortunately, most services that process SAAS data do not keep track of past data, but rather overwrite past data with the most current information available. What is needed is an improved data collection system.
The present system fetches consistent datasets in batches for a given period of time and provides the ability to retrieve each batch. Batches of data may be fetched for an interval of time. The present system may fetch new or changed data from different cloud/on-premise applications. It will store this data in the cloud or on-premise to build data history. As the system fetches new data, existing batches of data will not be overwritten. New batches of data are created as new versions so that change history is preserved. Past batches of data for a past time period may be provided to one or more tenants.
In an embodiment, a method for collecting data may begin with collecting a first batch of data having a first plurality of data elements associated with a first period of time. The data may be collected by a server from one or more tenant applications. A second batch of data with a second plurality of data elements may also be collected by the server from the one or more tenant applications. The second batch of data may be associated with a second period of time subsequent to the first period of time. The first plurality of data elements and the second plurality of elements may have a set of intersecting data elements, and at least a portion of the set of intersecting data elements may have an updated value in the second plurality of elements. A request for the first batch of data may be received. The request may be initiated by a first tenant of one or more tenants. The first batch of data may then be reported.
In an embodiment, a system for collecting data may include a memory, a processor and one or more modules stored in memory and executable by the processor. The modules may be executable to collect a first batch of data and a second batch of data, each having a plurality of data elements. The second batch of data may be associated with a second period of time subsequent to the first period of time. The first plurality of data elements and the second plurality of elements may have a set of intersecting data elements, and at least a portion of the set of intersecting data elements may have an updated value in the second plurality of elements. The modules may further be executed to request for the first batch of data may be received, wherein the request may be initiated by a first tenant of one or more tenants, and report the first batch of data.
The present system fetches consistent datasets in batches for a given period of time and provides the ability to retrieve each batch. Batches of data may be fetched for an interval of time. The present system may fetch new or changed data from different cloud/on-premise applications. It will store this data in the cloud or on-premise to build data history. As the system fetches new data, existing batches of data will not be overwritten. New batches of data are created as new versions so that change history is preserved. Past batches of data for a past time period may be provided to one or more tenants.
Servers 110 and 115 and client device 120 may each be associated with a tenant (client organization) in a multitenancy. Each tenant of the multi-tenancy may include one or more servers and client devices. Each server and client may include data to be collected by data collection server 130 via integration server 125. In embodiments, integration server 125 may communicate with different SAAS providers, whether provided from a cloud or a particular machine, and communicate with data collection server 130. Client 120 may be implemented as a desktop, laptop, notebook, tablet computer, smart phone, or some other computing device.
Data collection server 130 may collect data from one or more tenant applications on devices 110-120 through integration server 125 and store the data in a batch data store 135. The Data collection server may send batch instructions to integration server 125 in response to receiving a start batch request. Data collection server may provide any portion of the retrieved batch data to batch data store 135, for example periodically or upon receiving a request from batch data store 135. When data is collected, it is stored as a separate batch in batch data store 135. Batches of data are not overwritten with newly collected data.
Batch data store 145 may receive data from data collection server 130. When data is loaded into batch data store 135, the data may be stored in a star schema and maintained. Previous batches of data do not overwritten when new batches of data are retrieved. This allows the system to provide batches of data for a period of time in the past.
A batch log 140 may be stored at batch data store 135. The batch log may be updated and maintained to track information about each batch of data and may be used to retrieve previous batches of data for reporting or providing as back-up data. The batch log may be stored in table format and may include attributes for each batch such as batch ID, tenant ID, data start date and time, data end date and time, DCS processing status, and other data. The DCS processing status may include not started, in-progress, success and failure. The batch log may be updated batch data store 135, and other servers of the system of
Though illustrated as one server or one device, each of the servers and clients of the system of
A second batch of data is collected at step 220. The second batch of data may include the same data objects as the first batch (sales information, opportunity information, and so forth), changes and additions to the data objects, or other data, but will cover a different period of time. The second batch of data objects and first batch of data objects may include objects occurring in both batches but with different values, thereby forming an intersecting set of data objects that changes between the two batches. In some embodiments, the second batch will automatically include data with a start time just after the end time of the previous successful batch. Collecting a second batch of data is performed as described with respect to
A request is received for the first batch of data at step 230. Though the second batch of data is the current batch of data, the request may be for a previous batch of data. For example, a tenant may wish to access previous data to determine if there was a problem or error in their operations. The request may include information such as batch number or identifier, tenant ID, application ID, other information relating to the batch and stored in the batch log, the time period for which data is requested, and other data. The timer period may cover one or more entire batches or a portion of a batch.
The requested first batch of data is reported at step 240. Reporting the requested batch may include transmitting the data to a tenant network service, tenant computing device, or other destination. The data may be reported by batch data store 135 through data collection server 130.
In response to the request, the DCS 130 transmits batch instructions to integration server 125 at step 320. The batch instructions may indicate the data start time and date, data end time and date, the data to be collected, and the batch ID. For example, the batch instructions may indicate to collect employee records, sales records, and revenue records created or changed during a time period of Jan. 1, 2013 at 8:00 AM to Jan. 1, 2013 at 10:00 AM, and to call the data batch no. 001. The batch log may be updated by DCS 130 to indicate the batch ID and that DCS processing of the batch is “not started.”
DCS 130 receives batch data at step 330. In some embodiments, DCS 130 may receive all batch data requested, a portion of the data, or none of the data. While data is received from integration server 125 by DCS 130, the DCS processing status may indicate “in-progress.” Once the batch data has been provided to DCS server 130, integration server 125 provides a batch end message to DCS 130 at step 340. The request for a batch of data may specify that all new data and changed data maintained by a tenant be collected. If no tenant data has changed or been updated for the specified period of time, in some embodiments, no data will be provided and no new batch is created.
DCS sever 130 may store the collected data for the batch at batch data store 135 at step 350. A determination is then made by DCS 130 if the batch data storage has failed or succeeded. The batch data storage is marked as “successful” in batch log 140 at step 380 if all batch data received by DCS 130 is stored or loaded into batch data store 135. If any portion of the batch data is not loaded into batch data store 135, the batch status is set to “failure” at step 370. If a batch is listed as a failure, the batch is removed from the batch log and the next batch will attempt to collect the same data for the same time period. In some embodiments, the batch log may be updated by script generated and executed by DCS 130 or other parts of the system of
After a change occurring on Aug. 1, 2012 is detected, the original batch of row 1 is replaced (hence, the strikeout of the data in row 1) with two batches, as indicated in the second row and third row of data in the batch log. The second row of data indicates that the business key is 1, the amount is 500, the data begins on Jan. 1, 1900 and ends at Jul. 31, 2012, the batch ID is 1 and that the batch is not the current record. The third column indicates a business key of 1, an amount of 1000, a start date of Aug. 1, 2012, an end date of Dec. 31, 2099, a batch ID of 2 and that the batch is the current record.
The components shown in
Storage device 530, which may include mass storage implemented with a magnetic disk drive or an optical disk drive, may be a non-volatile storage device for storing data and instructions for use by processor unit 510. Storage device 530 can store the system software for implementing embodiments of the present invention for purposes of loading that software into main memory 510.
Portable storage device of storage 530 operates in conjunction with a portable non-volatile storage medium, such as a floppy disk, compact disk or Digital video disc, to input and output data and code to and from the computer system 500 of
Antenna 540 may include one or more antennas for communicating wirelessly with another device. Antenna 516 may be used, for example, to communicate wirelessly via Wi-Fi, Bluetooth, with a cellular network, or with other wireless protocols and systems. The one or more antennas may be controlled by a processor 510, which may include a controller, to transmit and receive wireless signals. For example, processor 510 execute programs stored in memory 512 to control antenna 540 transmit a wireless signal to a cellular network and receive a wireless signal from a cellular network.
The system 500 as shown in
Display system 570 may include a liquid crystal display (LCD), LED display, or other suitable display device. Display system 570 receives textual and graphical information, and processes the information for output to the display device.
Peripherals 580 may include any type of computer support device to add additional functionality to the computer system. For example, peripheral device(s) 580 may include a modem or a router.
The components contained in the computer system 500 of
The foregoing detailed description of the technology herein has been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the technology to the precise form disclosed. Many modifications and variations are possible in light of the above teaching. The described embodiments were chosen in order to best explain the principles of the technology and its practical application to thereby enable others skilled in the art to best utilize the technology in various embodiments and with various modifications as are suited to the particular use contemplated. It is intended that the scope of the technology be defined by the claims appended hereto.
The present application is a continuation and claims the priority benefit of U.S. patent application Ser. No. 13/764,173 filed Feb. 11, 2013, issuing as U.S. Pat. No. 9,191,432, the disclosure of which is incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
5325519 | Long et al. | Jun 1994 | A |
5729743 | Squibb | Mar 1998 | A |
6035298 | McKearney | Mar 2000 | A |
6092083 | Brodersen et al. | Jul 2000 | A |
6212524 | Weissman et al. | Apr 2001 | B1 |
6321374 | Choy | Nov 2001 | B1 |
6367077 | Brodersen et al. | Apr 2002 | B1 |
6405219 | Saether et al. | Jun 2002 | B2 |
6493744 | Emens et al. | Dec 2002 | B1 |
6573907 | Madrane | Jun 2003 | B1 |
6631374 | Klein et al. | Oct 2003 | B1 |
6711593 | Gordon et al. | Mar 2004 | B1 |
6721765 | Ghosh et al. | Apr 2004 | B2 |
6721767 | De Meno et al. | Apr 2004 | B2 |
6732095 | Warshavsky et al. | May 2004 | B1 |
6775681 | Ballamkonda et al. | Aug 2004 | B1 |
7076496 | Ruizandrade | Jul 2006 | B1 |
7191183 | Goldstein | Mar 2007 | B1 |
7225249 | Barry et al. | May 2007 | B1 |
7249118 | Sandler et al. | Jul 2007 | B2 |
7290166 | Rothman et al. | Oct 2007 | B2 |
7487173 | Medicke et al. | Feb 2009 | B2 |
7546312 | Xu et al. | Jun 2009 | B1 |
7640264 | Chaulk et al. | Dec 2009 | B1 |
7657887 | Kothandaraman et al. | Feb 2010 | B2 |
7752172 | Boylan et al. | Jul 2010 | B2 |
7779039 | Weissman et al. | Aug 2010 | B2 |
7827350 | Jiang et al. | Nov 2010 | B1 |
7895474 | Collins et al. | Feb 2011 | B2 |
8161010 | Weissman et al. | Apr 2012 | B2 |
8200628 | An et al. | Jun 2012 | B2 |
8335264 | Suzumura | Dec 2012 | B2 |
8423524 | Rana et al. | Apr 2013 | B1 |
8825593 | Dodds et al. | Sep 2014 | B2 |
8832651 | Kibbar | Sep 2014 | B2 |
8874508 | Mittal | Oct 2014 | B1 |
8972405 | Chaulk et al. | Mar 2015 | B1 |
9141680 | Bengali | Sep 2015 | B2 |
9191432 | Bengali | Nov 2015 | B2 |
9442993 | Tung | Sep 2016 | B2 |
20030046422 | Narayanan et al. | Mar 2003 | A1 |
20040039879 | Gaither | Feb 2004 | A1 |
20040078516 | Henderson et al. | Apr 2004 | A1 |
20040236786 | Medicke et al. | Nov 2004 | A1 |
20040254964 | Kodama et al. | Dec 2004 | A1 |
20060047780 | Patnude | Mar 2006 | A1 |
20070250480 | Najork | Oct 2007 | A1 |
20070282806 | Hoffman et al. | Dec 2007 | A1 |
20080077613 | Hay et al. | Mar 2008 | A1 |
20080120618 | Collins et al. | May 2008 | A1 |
20080276239 | Collins et al. | Nov 2008 | A1 |
20080281918 | Kirkwood | Nov 2008 | A1 |
20080285738 | Misra et al. | Nov 2008 | A1 |
20090024915 | Cudich et al. | Jan 2009 | A1 |
20090049288 | Weissman | Feb 2009 | A1 |
20090055439 | Pai et al. | Feb 2009 | A1 |
20090063557 | Macpherson | Mar 2009 | A1 |
20090064147 | Beckerle et al. | Mar 2009 | A1 |
20090171927 | Nesamoney et al. | Jul 2009 | A1 |
20090279613 | Suzumura | Nov 2009 | A1 |
20090285067 | Chen et al. | Nov 2009 | A1 |
20090299987 | Willson | Dec 2009 | A1 |
20090313436 | Krishnaprasad et al. | Dec 2009 | A1 |
20090327311 | Becker | Dec 2009 | A1 |
20100005013 | Uriarte | Jan 2010 | A1 |
20100005055 | An et al. | Jan 2010 | A1 |
20100087935 | Pettus et al. | Apr 2010 | A1 |
20100138615 | Klaiber et al. | Jun 2010 | A1 |
20100211548 | Ott et al. | Aug 2010 | A1 |
20100229082 | Karmarkar et al. | Sep 2010 | A1 |
20110072212 | Kojima | Mar 2011 | A1 |
20110125705 | Aski et al. | May 2011 | A1 |
20110126168 | Ilyayev | May 2011 | A1 |
20110145499 | Ananthanarayanan et al. | Jun 2011 | A1 |
20110161946 | Thomson et al. | Jun 2011 | A1 |
20110246449 | Collins et al. | Oct 2011 | A1 |
20110258178 | Eidson et al. | Oct 2011 | A1 |
20110302583 | Abadi et al. | Dec 2011 | A1 |
20120005153 | Ledwich et al. | Jan 2012 | A1 |
20120023109 | Sternemann et al. | Jan 2012 | A1 |
20120110566 | Park | May 2012 | A1 |
20120150791 | Willson | Jun 2012 | A1 |
20120221608 | An et al. | Aug 2012 | A1 |
20120246118 | Feng et al. | Sep 2012 | A1 |
20120254111 | Carmichael | Oct 2012 | A1 |
20120259852 | Aasen et al. | Oct 2012 | A1 |
20120259894 | Varley et al. | Oct 2012 | A1 |
20130018904 | Mankala et al. | Jan 2013 | A1 |
20130019235 | Tamm | Jan 2013 | A1 |
20130055232 | Rajan et al. | Feb 2013 | A1 |
20130073513 | Kemper et al. | Mar 2013 | A1 |
20130073573 | Huang et al. | Mar 2013 | A1 |
20130080413 | Chen et al. | Mar 2013 | A1 |
20130086353 | Colgrove et al. | Apr 2013 | A1 |
20130212042 | Rosenberg | Aug 2013 | A1 |
20130238641 | Mandelstein et al. | Sep 2013 | A1 |
20130275612 | Voss et al. | Oct 2013 | A1 |
20140006580 | Raghu | Jan 2014 | A1 |
20140006581 | Raghu | Jan 2014 | A1 |
20140013315 | Genevski et al. | Jan 2014 | A1 |
20140019488 | Wo et al. | Jan 2014 | A1 |
20140074771 | He et al. | Mar 2014 | A1 |
20140149494 | Markley et al. | May 2014 | A1 |
20140149591 | Bhattacharya et al. | May 2014 | A1 |
20140156806 | Karpistsenko et al. | Jun 2014 | A1 |
20140172775 | Niehoff et al. | Jun 2014 | A1 |
20140223100 | Chen | Aug 2014 | A1 |
20140229423 | Bengali | Aug 2014 | A1 |
20140229511 | Tung | Aug 2014 | A1 |
20140229577 | Bengali | Aug 2014 | A1 |
20140229628 | Mandal | Aug 2014 | A1 |
20140359771 | Dash et al. | Dec 2014 | A1 |
20160085794 | Bengali | Mar 2016 | A1 |
Number | Date | Country |
---|---|---|
WO 2014123564 | Aug 2014 | WO |
WO 2014123565 | Aug 2014 | WO |
Entry |
---|
U.S. Appl. No. 14/862,007, filed Sep. 22, 2015, Ketan Bengali. |
Aulbach, Stefan, et al., “A comparison of Flexible Schemas for Software as a Service”, SIGMOD '09, Providence, RI, Jun. 29-Jul. 2, 2009, pp. 881-888. |
Aulbach, Stefan, et al., “Multi-Tenant Databases for Software as a Service: Schema-Mapping Techniques”, SIGMOD '08, Vancouver, BC, Canada, Jun. 9-12, 2008, pp. 1195-1206. |
Bobrowski, Steve, “Optimal Multi-tenant Designs for Cloud Apps”, CLOUD 2011, Washington, DC, Jul. 4-9, 2011, pp. 654-659. |
Brandt, Cynthia A., et al.; “Meta-driven creation of data marts from EAV-Modeled clinical research database”, International Journal of Medical Informatics, vol. 65, Issue 3, Nov. 12, 2002. pp. 225-241. |
Casati, Frank, et al., “A Generic solution for Warehousing Business Process Data”, VLDB '07, Vienna, Austria, Sep. 23-28, 2007. pp. 1128-1137. |
Chaudhuri, Surajit, et al., “An Overview of Business Intelligence Technology”, Communications of the ACM, vol. 54, No. 8, Aug. 2011, pp. 88-98. |
Chong, Frederick, et al., “Multi-Tenant Data Architecture”, Microsoft Corp., Jun. 2006, pp. 1-15. |
Curino, Carlo, et al., “Automating Database Schema Evolution in Information System Upgrades”, HotSWUp '09, Orlando, FL, Oct. 25, 2009, 5 pages. |
Domingo, Enrique Jimenez, et al., “CLOUDIO: A Cloud Computing-oriented Multi-Tenant Architecture for Business Information Systems”, 2010 IEEE 3rd Intl Conf. on Cloud Computing, IEEE Computer Society, © 2010, pp. 532-533. |
Gao, Bo, et al., “A Non-Intrusive Multi-tenant Database for Large Scale SaaS Applications”, ICEBE 2011, Beijing, China, Oct. 19-21, 2011, pp. 324-328. |
Google Scholar, “Streaming data cloud metadata” Date of download: Nov. 3, 2014 http://scholar.googl.com/scholar?=streaming+data+cloud+metadata&btnG=&hl=en&as—sdt=0%C47. |
Grund, Martin, et al., “Shared Table Access Pattern Analysis for Multi-Tenant Applications”, AMIGE 2008, Tianjin, China, 2008, pp. 1-5. |
Han, Jung-Soo, et al.; “Integration Technology of Literature Contents based on SaaS”, ICISA 2011, Jeju Island, Korea, Apr. 26-29, 2011, pp. 1-5. |
Phil, “Clarification on Cloud, SaaS and Multi-tenant Language”, e-Literate, Sep. 10, 2012, pp. 1-7. |
Jun, Yang, “A Modern Service Oriented Unit-Based Distributed Storage Model for Peer Nodes”, IC-BNMT 2009, Beijing, China, Oct. 18-20, 2009, pp. 659-663. |
Kwok, Thomas, et al., “A Software as a Service with Multi-Tenancy Support for an Electronic Contract Management Application”, 2008 IEEE Intl Conf. on Service Computing, IEEE Computer Society, © 2008, pp. 179-186. |
Momm, Christof, et al., “A Qualitative Discussion of Different Approaches for Implementing Multi-Tenant SaaS Offerings”, Software Engineering (Workshops), vol. 11, © 2011, pp. 139-150. |
“multi-tenancy”, Whatls.com, Apr. 5, 2011, 1 page. |
“Multitenancy”, Wikipedia, downloaded from: en.wikipedia.org/wiki/Multi-tenant on Oct. 3, 2014, pp. 1-5. |
Nadkami, Parkash M., “Metadata for Data Warehousing”, Meta-Driven Software Systems in Biomedicine, Health Informatics 2011, Apr. 29, 2011, pp. 359-372. |
Park, Kyounghyun, et al., “SaaSpia Platform: Integrating and Customizing On-Demand Applications Supporting Multi-tenancy”, ICACT 2012, PyeongChang, Korea, Feb. 19-22, 2012, pp. 961-964. |
Schaffner, Jan. et al., “Towards Analytics-as-a-Service Using an In-Memory Column Database”, Information and Software as Services, LNBIP 74, Springer-Verlag, Berlin, Germany, © 2011, pp. 257-282. |
“schema”, Microsoft Computer Dictionary, 5th Edition, Microsoft Press, Redmond, WA, © 2002, p. 465. |
“Software as a service”, Wikipedia, downloaded Aug. 2, 2014, pp. 1-10. |
Tsai, Wei-Tek, et al., “Towards a Scalable and Robust Multi-Tenancy SaaS”, Internetware 2010, Suzhou, China, Nov. 3-4, 2010, Article No. 8, pp. 1-15. |
Weissman, Craid D., et al., “The Design of the Force.com Multitenant Internet Application Development Platform”, SIGMOD Providence, RI, Jun. 29-Jul. 2, 2009, pp. 889-896. |
Xue, Wang, et al., “Multiple Sparse Tables Based on Pivot Table for Multi-Tenant Data Storage in SaaS”, Proc, of the IEEE Int'l Conf. on Information and Automation, Shenzhen, China, Jun. 2011, pp. 634-637. |
Xuxu, Zheng, et al., “A Data Storage Architecture Supporting Multi-Level Customization for SaaS”, WISA 2010, Hothot, China, Aug. 20-22, 2010, pp. 106-109. |
Yaish, Haitham, et al., “An Elastic Multi-tenant Database Schema for Softare as a Service”, DASC 2011, Sydney, NSW, Australia, Dec. 12-14, 2011, pp. 737-743. |
PCT Application No. PCT/US2013/046277 International Search Report and Written Opinion mailed Jan. 7, 2014. |
PCT Application No. PCT/US2013/046280 International Search Report and Written Opinion mailed Dec. 6, 2013. |
U.S. Appl. No. 13/764,384; Final Office Action mailed Oct. 8, 2015. |
U.S. Appl. No. 13/764,384; Office Action mailed May 7, 2015. |
U.S. Appl. No. 13/764,384; Final Office Action mailed Oct. 9, 2014. |
U.S. Appl. No. 13/764,384; Office Action mailed Aug. 14, 2014. |
U.S. Appl. No. 13/762,028; Final Office Action mailed May 21, 2015. |
U.S. Appl. No. 13/762,028; Office Action mailed Oct. 30, 2014. |
U.S. Appl. No. 13/764,173; Office Action mailed Jan. 27, 2015. |
U.S. Appl. No. 13/763,520; Office Action mailed Nov. 5, 2015. |
U.S. Appl. No. 13/763,520; Final Office Action mailed Apr. 9, 2015. |
U.S. Appl. No. 13/763,520; Office Action mailed Nov. 18, 2014. |
U.S. Appl. No. 13/764,446; Office Action mailed Feb. 2, 2015. |
U.S. Appl. No. 13/764,446; Office Action mailed Sep. 11, 2014. |
Liu, Hui, et al.; “Data Storage Schema Upgrade via Metadata Evolution in Seas”, CECNet 2012, Yichang, China, Apr. 21-23, 2012, pp. 3148-3151. |
U.S. Appl. No. 13/762,028; Office Action mailed Mar. 31, 2016. |
U.S. Appl. No. 14/862,007, filed Sep. 22, 2015, Ketan Bengali, Data Consistency and Rollback for Cloud Analytics. |
European Patent Application No. 13874570.8 Extended EP Search Report dated Jul. 27, 2016. |
U.S. Appl. No. 15/263,884, David Tung, Metadata Manager for Analytics System. |
U.S. Appl. No. 13/762,028; Final Office Action mailed Sep. 1, 2016. |
Number | Date | Country | |
---|---|---|---|
20160065651 A1 | Mar 2016 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 13764173 | Feb 2013 | US |
Child | 14936503 | US |