The present invention relates generally to the data processing field, and more particularly, relates to a method, system and computer program product for implementing intelligent standard deviation inserts into a relational database management system (RDBMS).
In the world of Internet of Things (IoT) and Big Data in general an emphasis is often placed on finding and doing something about outlying information. Outlying information currently is not simply a static relationship of values but an evolving relationship such that one can think of deviations within a given window of time.
A novel database system is needed to effectively address outlying information to provide enhanced and improved database operation.
A need exists for an efficient and effective mechanism for implementing intelligent standard deviation inserts into a relational database management system (RDBMS).
Principal aspects of the present invention are to provide a method, system and computer program product for implementing intelligent standard deviation inserts into a relational database management system (RDBMS). Other important aspects of the present invention are to provide such method, system and computer program product substantially without negative effects and that overcome many of the disadvantages of prior art arrangements.
In brief, a method, system and computer program product are provided for implementing intelligent standard deviation inserts into a relational database management system (RDBMS). A trigger program type logic is provided within a database for processing outlier data based upon deviation constraints. The processed outlier data is used for automatically taking appropriate action including preventing insertion of outliers into a Relational Database Management System (RDBMS).
In accordance with features of the invention, processing outlier data based upon deviation constraints ensures that the outlier data is reasonable before insertion into the database.
In accordance with features of the invention, processing outlier data based upon deviation constraints includes when inserting data checking whether data values of a given column or columns are within 1, 2 or X amount of deviations.
In accordance with features of the invention, deviation checks are configurable by a database administrator.
In accordance with features of the invention, processing outlier data based upon deviation constraints ensures that the outlier data records being inserted have reasonable values, given a sliding time window.
In accordance with features of the invention, when the data value being inserted has a higher value than a specified deviation constraint, then an appropriate action to reject the insert, provide warning messages on the insert or take another form of corrective action.
In accordance with features of the invention, when the data value being inserted has a higher value than a specified deviation constraint, then the insert optionally is stored in a holding data structure, and not return the insert with normal queries.
In accordance with features of the invention, an insert optionally stored in a holding data structure are returned with normal queries when a specified number of deviant values occur within a specified time frame.
In accordance with features of the invention, processing outlier data based upon deviation constraints includes providing deviation constraints and deviation values within a given window of time.
The present invention together with the above and other objects and advantages may best be understood from the following detailed description of the preferred embodiments of the invention illustrated in the drawings, wherein:
In the following detailed description of embodiments of the invention, reference is made to the accompanying drawings, which illustrate example embodiments by which the invention may be practiced. It is to be understood that other embodiments may be utilized and structural changes may be made without departing from the scope of the invention.
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used herein, the singular forms “a”, “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms “comprises” and/or “comprising,” when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
In accordance with features of the invention, a method and system are provided for implementing intelligent standard deviation inserts into a relational database management system (RDBMS).
Having reference now to the drawings, in
Computer system 102 includes a system memory 106 including an operating system 108 and relational database management systems (RDBMS) 110. System memory 106 is a random-access semiconductor memory for storing data, including programs. System memory 106 is comprised of, for example, a dynamic random access memory (DRAM), a synchronous direct random access memory (SDRAM), a current double data rate (DDRx) SDRAM, non-volatile memory, optical storage, and other storage devices.
Computer system 102 includes a storage 112 including a database 114 and a network interface 116. Computer system 102 includes an I/0 interface 118 for transferring data to and from computer system components including CPU 104, memory 106 including the operating system 108 and RDBMS 110, storage 112 including database 114, and network interface 116 and a network 120 and a client system and application 122.
In accordance with features of the invention, the new computer system 102 of the preferred embodiment implements special processing of outlying data to ensure data to be inserted into the database 114 is reasonable, effectively addressing outlying information to provide enhanced and improved database operation.
In accordance with features of the invention, when the value being inserted has a higher value from a deviation perspective then an appropriate action is taken including a selected one of: reject the insert, provide warning messages on the insert and take some other form of corrective action. For example, the database can hold the outlier values in a separate data structure, possibly holding table, and not return the outlier values to normal queries. The outlier values in a separate data structure optionally are returned to normal queries when a specified number of deviant values occur within a specified time frame. For example, once an hour is probably a glitch, but two a minute is a problem, the outlier values optionally are included in the data or a corrective action is taken. How to handle held results is configured, for example by the application programmer and the database administrator.
In accordance with features of the invention, partitioning data into a held structure has uses that go beyond just holding data for deviant values. When inserting blocks of records it can be ensured that all records are within a specified deviation value. This means that a group of records can be inserted within a single statement or an insert could be done via sub-select query. This gives a natural grouping of records to act upon. The deviation can be made to be computed by looking at records inserted within a given time frame such as a last 5 minutes, last 30 minutes, the last day, and the like. Also a specified X amount of previously inserted records can make up a window of records that are used to decide what the deviation values to be computed with. Furthermore the deviation can be broken into where the data is being inserted from and or who is doing the inserting.
In accordance with features of the invention, for example a drone may report its position as part of its information packet. Its position, while changing, cannot change more than a certain amount during a given time frame. Thus an information packet that contains a position value that deviates dramatically from the previous window of positions is suspect and should either be ignored or analyzed more carefully. An interesting aspect of this example, is that the deviation can also be dependent on a time aspect of the data, for example, where either the arrival time of the information packet, such as row insertion time or a timestamp provided in the information packet, data within the row.
Referring to
Referring to
Referring now to
Referring now to
Computer readable program instructions 404, 406, 408, and 410 described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network. The computer program product 400 may include cloud based software residing as a cloud application, commonly referred to by the acronym (SaaS) Software as a Service. The network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer readable program instructions 404, 406, 408, and 410 from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
A sequence of program instructions or a logical assembly of one or more interrelated modules defined by the recorded program means 404, 406, 408, and 410, direct the system 100 for implementing intelligent standard deviation inserts into a relational database management system (RDBMS) of the preferred embodiment.
While the present invention has been described with reference to the details of the embodiments of the invention shown in the drawing, these details are not intended to limit the scope of the invention as claimed in the appended claims.
Number | Date | Country | |
---|---|---|---|
62690130 | Jun 2018 | US |