This application claims the priority of the Chinese patent application No. 200910028106.1 filed on Sep. 1, 2009, which application is incorporated herein by reference.
The invention represents a method used in the telecommunication operation support system, especially the rapid mass data manipulation.
Along with the rapid development of the telecom industry and business users, how to deal with millions of phone call data quickly has become difficult and top-priority for the telecom operators. Application of the current system needs to enquire, update and delete huge amounts of data frequently existing in physical memory of computer systems. Obviously, the data index key algorithm will greatly affect the efficiency of the computer running speed.
The existing one-way hash function refers to the value of fixed-length output algorithm based on the input information (any byte string, such as text strings, Word documents, JPG files, etc.), the output value, is also known as “hashed value” or “message abstract”, and its length depends on the algorithm used, usually between 128˜256. One-way hash function aims at creating the short message abstract to validate integrity of the messages. In TPC/IP communication protocol, testing and CRC (Cyclic Redundancy Check) are often used to verify the integrity of the news.
The purpose of the invention is to announce the quick mass data manipulation method based on two-dimension hash, used for telecom operation system, which requires massive database, quick response, stable and self-maintained. This invention is designed to resolve the following issues:
Highly efficient data searching when the managed data can be well-proportioned distributed based on keywords searching result, it can even addressing directly and returns with a keywords related records list. No need to recreate the index if data records update, also can be expanded dynamically. With data index structure of this invention, efficiency of data searching for millions of data records can be raised to microsecond level. It greatly satisfies the technological request from the telecom operation system.
Technical proposal of this invention: The quick mass data manipulation method is based on two-dimension hash. First, it uses hash algorithms to set the data records into specific sequence and form a specific mapping relations between indexed keywords and indexed address sequence, here one-dimension hash structure is set up to store the data; when the mapping relations between indexed keywords and index sequence address cannot addressing for data records, a two-dimension hash link sheet would be constructed based on the same index keywords or not, and link it with the hash in the first layer of each node of the queue as an node expansion of two-dimension hash queue to distinguish the index field values.
When the data operation according to the keywords index is needed, according to the same hashing algorithm, reversing mapping from the one-dimension, to obtain corresponding address of the keyword index data record and rapidly addressing; if two-dimension hash link sheet is found under one-dimension hash node queue, then look up the data record address based on the keywords value through the two-dimension hash link sheet.
Create Index Interface:
In order to realize the conversion of specific mapping between the index keywords and the index sequence, subscript value of hash queue needs to be calculated according to the keywords; If the one-to-one corresponding relationship cannot be matched between the index key words of each data record and the subscript value based on the hashing algorithm, a 2-dimension hash link table would be extended to link to the hash in the first layer of each node queue to distinguish the index field values, make sure the conflicts would disappear. According to the mapping relationship above, quick data sets index structure is available.
Query Interface:
When operating the data set by using the index key words, firstly the data set index access which has already been created needs to be found, using the same hash algorithm to calculate the subscript value and reverse mapping from the 1 dimension hash queue to acquire the corresponding data record address with the index keywords and rapidly addressing; if two-dimension hash table is found under the one-dimension hash node queue, then search the data records address from two-dimension hash table according to the enquired keywords value; Finally, return the result.
This invention is mainly divided into two parts: hash algorithm and two-dimension hash algorithm.
Hash Algorithm
Two-Dimensional Hash Algorithm
The effective practice of this invention: the invention has been successfully applied in memory data management products, and also has become the main technical proposals of critical business data management in the core telecom operators in China deployed in the background of business processing system in expense accounts which has contributed to a 50%˜80% improvement in business treatment.
Currently, the invention is embedded in the memory data management of the index management module and also be an independent package to adapt to other modules as a third-party plug-in adapter. Standard software module. Here is one module applied inside of the index management, which is shown in
Create Index Interface
Query Interface
Number | Date | Country | Kind |
---|---|---|---|
200910028106.1 | Jan 2009 | CN | national |