The present disclosure relates to the field of database technologies, and in particular, to a method and an apparatus for querying service data, and a database system.
Business intelligence (BI) is also referred to as business wisdom, a BI database can effectively execute an operation, such as querying, analyzing, and multidimensional presentation, on service data using such excellent technologies of the BI database as a database technology, online analytical processing (OLAP), and data mining, so as to help an enterprise to make a wise business operation decision. In an actual application, a BI database generally stores data in a column manner (that is, a columnar database), and therefore can meet a quick data query requirement of an analytical service that is based on a rule. However, for data of a transactional service that is based on event triggering, a query requirement of an excessively high throughput generally needs to be met, and therefore, data needs to be stored in a BI database in a row manner (that is, a row database). Apparently, the BI database that stores data in the column manner cannot meet a data query requirement of a transactional service.
Embodiments of the present disclosure disclose a method and an apparatus for querying service data, and a database system, which can meet a data query requirement of a transactional service.
A first aspect of the embodiments of the present disclosure discloses a method for querying service data, where a database memory includes a column buffer and a row buffer, where a column of the column buffer is used to load column data of a database, a row of the row buffer is used to load column data of the column buffer or the column data of the database, and the method includes, when receiving a transactional service data query request that is sent by a transactional service system and that includes a target column identifier and a first query condition, detecting whether a row index corresponding to the target column identifier exists; and if the row index corresponding to the target column identifier exists, querying, based on the first query condition, column data that corresponds to the target column identifier, and that is in a row that is in the row buffer and corresponds to the row index, to obtain a query result; and returning the result to the transactional service system.
In a first possible implementation manner of the first aspect of the embodiments of the present disclosure, if the row index corresponding to the target column identifier does not exist, the method further includes detecting whether the column data corresponding to the target column identifier is loaded in the column buffer, and if the column data corresponding to the target column identifier is loaded in the column buffer, reading the column data corresponding to the target column identifier from the column buffer; loading the read column data corresponding to the target column identifier to a row in the row buffer; and establishing a correspondence between the target column identifier and a row index of the row, and performing the step of querying, based on the first query condition, column data that corresponds to the target column identifier, and that is in a row that is in the row buffer and corresponds to the row index, to obtain a query result; and returning the result to the transactional service system.
With reference to the first possible implementation manner of the first aspect of the embodiments of the present disclosure, in a second possible implementation manner of the first aspect of the embodiments of the present disclosure, if the column data corresponding to the target column identifier is not loaded in the column buffer, before the loading the read column data corresponding to the target column identifier to a row in the row buffer, the method further includes reading the column data corresponding to the target column identifier from the database.
In a third possible implementation manner of the first aspect of the embodiments of the present disclosure, before the receiving a transactional service data query request that is sent by a transactional service system and that includes a target column identifier and a first query condition, the method further includes, when receiving a query registration request that is sent by the transactional service system and that includes the target column identifier, detecting whether the column data corresponding to the target column identifier is loaded in the row buffer; if the column data corresponding to the target column identifier is not loaded in the row buffer, detecting whether the column data corresponding to the target column identifier is loaded in the column buffer; and if the column data corresponding to the target column identifier is loaded in the column buffer, reading the column data corresponding to the target column identifier from the column buffer, or if the column data corresponding to the target column identifier is not loaded in the column buffer, reading the column data corresponding to the target column identifier from the database; loading the read column data corresponding to the target column identifier to a row in the row buffer; and establishing a correspondence between the target column identifier and a row index of the row.
With reference to any one of the first to the third possible implementation manners of the first aspect of the embodiments of the present disclosure, in a fourth possible implementation manner of the first aspect of the embodiments of the present disclosure, the row buffer includes several sub-buffers, each sub-buffer includes several rows having a same length, the sub-buffers have a same quantity of rows, and rows of a same rank in the several sub-buffers share a same row index; and the loading the read column data corresponding to the target column identifier to a row in the row buffer includes dividing, according to the quantity of the rows of the sub-buffer in the row buffer, the read column data corresponding to the target column identifier, and loading one by one each partial column data obtained by means of division to a same location of each row included in any one of the sub-buffers.
With reference to the fourth possible implementation manner of the first aspect of the embodiments of the present disclosure, in a fifth possible implementation manner of the first aspect of the embodiments of the present disclosure, the querying, based on the first query condition, column data that corresponds to the target column identifier, and that is in a row that is in the row buffer and corresponds to the row index, to obtain a query result includes determining each row that is in the row buffer and corresponds to the row index and that is located in any one of the sub-buffers; and querying, based on the first query condition, each partial column data that corresponds to the target column identifier, and that is loaded in a same location in each row that is located in any one of the sub-buffers and corresponds to the row index, to obtain the query result.
With reference to the first or the second possible implementation manner of the first aspect of the embodiments of the present disclosure, in a sixth possible implementation manner of the first aspect of the embodiments of the present disclosure, when receiving an analytical service data query request that is sent by an analytical service system and that includes a specified column identifier and a second query condition, detecting whether column data corresponding to the specified column identifier is loaded in the column buffer, and if yes, querying, based on the second query condition, the column data that is corresponding to the specified column identifier and that is in the column buffer, to obtain a query result, and returning the result to the analytical service system.
With reference to the sixth possible implementation manner of the first aspect of the embodiments of the present disclosure, in a seventh possible implementation manner of the first aspect of the embodiments of the present disclosure, if the column data corresponding to the specified column identifier is not loaded in the column buffer, the method further includes reading the column data corresponding to the specified column identifier from the database; and loading the read column data corresponding to the specified column identifier to a column of the column buffer, and performing the step of querying, based on the second query condition, the column data corresponding to the specified column identifier from the column buffer, to obtain a query result; and returning the result to the analytical service system.
A second aspect of the embodiments of the present disclosure discloses an apparatus for querying service data, where a database memory includes a column buffer and a row buffer, where a column of the column buffer is used to load column data of a database, a row of the row buffer is used to load column data of the column buffer or the column data of the database, and the apparatus includes a first detection unit configured to, when receiving a transactional service data query request that is sent by a transactional service system and that includes a target column identifier and a first query condition, detect whether a row index corresponding to the target column identifier exists; a first query unit configured to: when the first detection unit detects that the row index corresponding to the target column identifier exists, query, based on the first query condition, column data that corresponds to the target column identifier, and that is in a row that is in the row buffer and corresponds to the row index, to obtain a query result; and output the result to an interaction unit; and the interaction unit configured to return the query result obtained by the first query unit to the transactional service system.
In a first possible implementation manner of the second aspect of the embodiments of the present disclosure, the apparatus further includes a second detection unit configured to, when the first detection unit detects that the row index corresponding to the target column identifier does not exist, detect whether the column data corresponding to the target column identifier is loaded in the column buffer; a first reading unit configured to, when the second detection unit detects that the column data corresponding to the target column identifier is loaded in the column buffer, read the column data corresponding to the target column identifier from the column buffer; a first loading unit configured to load the column data that corresponds to the target column identifier and that is read by the first reading unit to a row in the row buffer; and a first establishing unit configured to establish a correspondence between the target column identifier and a row index of the row, and trigger the first query unit to execute the operation of querying, based on the first query condition, column data that corresponds to the target column identifier, and that is in a row that is in the row buffer and corresponds to the row index, to obtain a query result.
With reference to the first possible implementation manner of the second aspect of the embodiments of the present disclosure, in a second possible implementation manner of the second aspect of the embodiments of the present disclosure, the first reading unit is further configured to, when the second detection unit detects that the column data corresponding to the target column identifier is not loaded in the column buffer, read the column data corresponding to the target column identifier from the database.
In a third possible implementation manner of the second aspect of the embodiments of the present disclosure, the first detection unit is further configured to, before receiving the transactional service data query request that is sent by the transactional service system and that includes the target column identifier and the first query condition, when receiving a query registration request that is sent by the transactional service system and that includes the target column identifier, detect whether the column data corresponding to the target column identifier is loaded in the row buffer; a third detection unit configured to, when the first detection unit detects that the column data corresponding to the target column identifier is not loaded in the row buffer, detect whether the column data corresponding to the target column identifier is loaded in the column buffer; a second reading unit configured to, when the third detection unit detects that the column data corresponding to the target column identifier is loaded in the column buffer, read the column data corresponding to the target column identifier from the column buffer; and further configured to, when the third detection unit detects that the column data corresponding to the target column identifier is not loaded in the column buffer, read the column data corresponding to the target column identifier from the database; a second loading unit configured to load the column data that corresponds to the target column identifier and that is read by the second reading unit to a row in the row buffer; and a second establishing unit configured to establish a correspondence between the target column identifier and a row index of the row.
With reference to any one of the first to the third possible implementation manner of the second aspect of the embodiments of the present disclosure, in a fourth possible implementation manner of the second aspect of the embodiments of the present disclosure, where the row buffer includes several sub-buffers, each sub-buffer includes several rows having a same length, the sub-buffers have a same quantity of rows, and rows of a same rank in the several sub-buffers share a same row index, where the manner of loading the read column data corresponding to the target column identifier to the row in the row buffer is: dividing, according to the quantity of the rows of the sub-buffer in the row buffer, the read column data corresponding to the target column identifier, and loading one by one each partial column data obtained by means of division to a same location of each row included in any one of the sub-buffers.
With reference to the fourth possible implementation manner of the second aspect of the embodiments of the present disclosure, in a fifth possible implementation manner of the second aspect of the embodiments of the present disclosure, the first query unit includes a determining subunit configured to determine each row that is in the row buffer and corresponds to the row index and that is located in any one of the sub-buffers; and a query subunit configured to query, based on the first query condition, partial column data that corresponds to the target column identifier, and that is loaded in a same location in each row that is located in any one of the sub-buffers and corresponds to the row index, to obtain a query result.
With reference to the first or the second possible implementation manner of the second aspect of the embodiments of the present disclosure, in a sixth possible implementation manner of the second aspect of the embodiments of the present disclosure, the first detection unit is further configure to, when receiving an analytical service data query request that is sent by an analytical service system and that includes a specified column identifier and a second query condition, detect whether column data corresponding to the specified column identifier is loaded in the column buffer; and the first query unit is further configured to, when the first detection unit detects that the column data corresponding to the specified column identifier is loaded in the column buffer, query, based on the second query condition, the column data corresponding to the specified column identifier from the column buffer, to obtain a query result; and output the result to the interaction unit.
With reference to the sixth possible implementation manner of the second aspect of the embodiments of the present disclosure, in a seventh possible implementation manner of the second aspect of the embodiments of the present disclosure, the first reading unit is further configured to, when the first detection unit detects that the column data corresponding to the specified column identifier is not loaded in the column buffer, read the column data corresponding to the specified column identifier from the database; and the first loading unit is further configured to load the column data that corresponds to the specified column identifier and that is read by the first reading unit to a column of the column buffer, and trigger the first query unit to execute the operation of querying, based on the second query condition, the column data corresponding to the specified column identifier from the column buffer, to obtain a query result; and output the result to the interaction unit.
A third aspect of the embodiments of the present disclosure discloses a database system, including any apparatus for querying service data disclosed in the second aspect of the embodiments of the present disclosure, a database memory, and a database, where the database memory includes a column buffer and a row buffer, a column of the column buffer is used to load column data of the database, and a row of the row buffer is used to load column data of the column buffer or the column data of the database.
In the embodiments of the present disclosure, a database memory not only includes a column buffer used to load column data of a database, but also includes a row buffer used to load column data of the column buffer or the column data of the database. In this way, when a transactional service data query request that is sent by a transactional service system and that includes a target column identifier and a first query condition is received, if it is detected that a row index corresponding to the target column identifier exists, column data that corresponds to the target column identifier, and that is in a row that is in the row buffer and corresponds to the row index may be directly queried based on the first query condition, to obtain a query result and the result is returned to the transactional service system, so that data of a transactional service can be queried in a database that stores data in a column manner. In addition, in the embodiments of the present disclosure, it is unnecessary to create two sets of databases for an analytical service and a transactional service, and maintenance of data synchronization between the two sets of databases can be omitted when the two sets of databases are used, reducing difficulty in maintenance of a database. In addition, database system complexity can also be effectively reduced in a manner of not using two sets of databases.
To describe the technical solutions in the embodiments of the present disclosure more clearly, the following briefly describes the accompanying drawings required for describing the embodiments. The accompanying drawings in the following description show merely some embodiments of the present disclosure, and a person of ordinary skill in the art may still derive other drawings from these accompanying drawings without creative efforts.
The following clearly describes the technical solutions in the embodiments of the present disclosure with reference to the accompanying drawings in the embodiments of the present disclosure. The described embodiments are merely a part rather than all of the embodiments of the present disclosure. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present disclosure without creative efforts shall fall within the protection scope of the present disclosure.
The embodiments of the present disclosure disclose a method and an apparatus for querying service data, and a database system, so that data of a transactional service can be queried in a database that stores data in a column manner. In addition, it is unnecessary to create two sets of databases for an analytical service and a transactional service, and maintenance of data synchronization between the two sets of databases can be omitted when the two sets of databases are used, reducing difficulty in maintenance of a database. In addition, database system complexity can also be effectively reduced in a manner of not using two sets of databases. Details are separately described below.
To better understand the embodiments of the present disclosure, a network architecture for querying service data disclosed in an embodiment of the present disclosure is first described below.
Referring to
The analytical service system is mainly used to query data of an analytical service, and the data query of an analytical service generally needs to meet a requirement of quick query, and requires data to be stored in a column manner. However, the transactional service system is mainly used to query data of a transactional service, and the data query of a transactional service generally needs to meet a query requirement of an excessively high throughput, and requires data to be stored in a row manner. The data query of an analytical service generally refers to query operations such as filtering and segment statistics collection performed on data based on a condition defined by a user, to help the user to study a pattern of the data, so that a user behavior feature may be explored, providing a basis for specifying a market strategy and a marketing principle. The data query of a transactional service refers to that when an event of a user happens (for example, arriving at a predefined region, dialing a predefined number, or visiting a web page that meets a predefined rule), a suitable response feature may be provided. As shown in
In the network architecture for querying service data shown in
(1) When a data query of an analytical service is executed, the analytical service system sends an analytical service data query request including a specified column identifier and a second query condition to the database system, and the database system may directly detect whether column data corresponding to the specified column identifier is loaded in the column buffer of the database memory, and if yes, query the column data corresponding to the specified column identifier from the column buffer based on the second query condition, to obtain a query result; and return the result to the analytical service system.
(2) When the database system detects that the column data corresponding to the specified column identifier is not loaded in a column buffer of the database memory, the database system may read the column data corresponding to the specified column identifier from the database, and load the read column data corresponding to the specified column identifier to a column of the column buffer. Based on this, the database system may query the column data corresponding to the specified column identifier from the column buffer based on the second query condition, to obtain a query result; and return the result to the analytical service system. If necessary, the database system may remove column data that is not used for the longest time from the column buffer.
(3) When a query registration request is executed, the transactional service system may send the query registration request including a target column identifier to the database system, the database system may detect whether a row index corresponding to the target column identifier exists, and if yes, it indicates that column data corresponding to the target column identifier is already loaded in the row buffer.
(4) If the database system detects that the row index corresponding to the target column identifier does not exist, it indicates that the column data corresponding to the target column identifier is not loaded in the row buffer. Further, the database system may detect whether the column data corresponding to the target column identifier is loaded in the column buffer, and if it is detected that the column data corresponding to the target column identifier is loaded in the column buffer, the database system may read the column data corresponding to the target column identifier from the column buffer, and load the read column data corresponding to the target column identifier to a row in the row buffer (that is, column-to-row conversion storage). Further, the database system may establish a correspondence between the target column identifier and a row index of the row (that is, the row, to which the column data corresponding to the target column identifier is loaded, in the row buffer), so as to detect, according to the correspondence, whether the row index corresponding to the target column identifier exists next time.
(5) When the database system detects that the column data corresponding to the target column identifier is not loaded in the column buffer, the database system may read the column data corresponding to the target column identifier from the database, and load the read column data corresponding to the target column identifier to a row in the row buffer (that is, column-to-row conversion storage). Further, the database system may establish a correspondence between the target column identifier and a row index of the row (that is, the row, to which the column data corresponding to the target column identifier is loaded, in the row buffer), so as to detect, according to the correspondence, whether the row index corresponding to the target column identifier exists next time.
(6) When a transactional service data query request is executed, the transactional service system may send the transactional service data query request including a target column identifier and a first query condition to the database system, and correspondingly, the database system may detect whether a row index corresponding to the target column identifier exists, and if yes, query, based on the first query condition, column data that corresponds to the target column identifier, and that is in a row that is in the row buffer of the database memory and corresponds to the row index, to obtain a query result; and return the result to the transactional service system.
When step 1) to step 6) are implemented, data of a transactional service can be queried in a database that stores data in a column manner. In addition, it is unnecessary to create two sets of databases for an analytical service and a transactional service, and maintenance of data synchronization between the two sets of databases can be omitted when the two sets of databases are used, reducing difficulty in maintenance of a database. In addition, database system complexity can also be effectively reduced in a manner of not using two sets of databases.
In the network architecture for querying service data shown in
As shown in Table 1, the database system may divide data of the column 1 according to a quantity of rows (that is, three rows) in the row buffer, to obtain each partial data, which has a fixed length, of the column 1, and load one by one each partial data of the column 1 that is obtained by means of division to a same location of each row of the row buffer (that is, implementing column-to-row conversion of the data of the column 1). The database system may divide data of the column 2 according to the quantity of the rows (that is, three rows) in the row buffer, to obtain each partial data, which has a fixed length, of the column 2, and load one by one each partial data of the column 2 that is obtained by means of division to a same location of each row in the row buffer (that is, implementing column-to-row conversion of the data of the column 2) The database system may divide data of the column 5 according to the quantity of the rows (that is, three rows) in the row buffer, to obtain each partial data, which has a fixed length, of the column 5, and load one by one each partial data of the column 5 that is obtained by means of division to a same location of each row in the row buffer (that is, implementing column-to-row conversion of data of the column 5).
In this embodiment of the present disclosure, for example, it may be learned from Table 1 that, each partial data of the column 1 into which the data of the column 1 is divided has a same length, and each partial data of the column 1 into which the data of the column 1 is divided is in the same location in each row of the row buffer. Each partial data of the column 2 into which the data of the column 2 is divided has a same length, and each partial data of the column 2 into which the data of the column 2 is divided is in the same location in each row of row buffer. The length of each partial data of the column 1 and the length of each partial data of the column 2 may be different, that is, lengths into which different column data is divided may be different, but lengths or storage locations of a same piece of column data are the same in different rows.
In this embodiment of the present disclosure, using the structure of the row buffer shown in Table 1 as an example, partial column data in the row buffer may be removed from the row buffer by the database system because the partial column data are not used for a long time and the like, and new column data may continue to be loaded to the row buffer by the database system. In this case, for example, the structure of the row buffer may be shown in Table 2 below:
As shown in Table 2, assuming that the data of the column 2 is not used for a long time, the database system may remove the data of the column 2 from the row buffer, and data of the column 6 to data of the column 8 are used as new column data and may be loaded to the row buffer by the database system.
In this embodiment of the present disclosure, locations of rows that are occupied by partial column data removed from the row buffer by the database system may be reused, to load new column data. Using the structure of the row buffer shown in Table 2 as an example, after the database system removes the data of the column 2 from the row buffer, locations of rows that are occupied by the data of the column 2 may be reused, to load data of a column 9, and in this case, the structure of the row buffer may be shown in Table 3 below:
In this embodiment of the present disclosure, when a row of the row buffer does not have loadable space, the database system may create several sub-buffers for the row buffer, where each sub-buffer includes several rows having a same length, the sub-buffers have a same quantity of rows, and rows of a same rank in the several sub-buffers share a same row index, that is, the rows of a same rank in the several sub-buffers belong to a same row. As shown in Table 4 and Table 5, Table 4 and Table 5 are two sub-buffers created by the database system for the row buffer, where both of the sub-buffers shown in Table 4 and Table 5 include three rows, a first row in the sub-buffer in Table 4 and a first row in the sub-buffer in Table 5 belong to a same row, a second row in the sub-buffer in Table 4 and a second row in the sub-buffer in Table 5 belong to a same row, and a third row in the sub-buffer in Table 4 and a third row in the sub-buffer in Table 5 belong to a same row.
For example, as shown in Table 4, data of the column 1 is divided into three pieces of partial data, which has a same length, of the column 1, and the three pieces of partial data of the column 1 into which the data of the column 1 is divided are loaded one by one to a same location of each row included in the sub-buffer shown in the Table 4. As shown in Table 5, data of the column 12 is divided into three pieces of partial data, which has a same length, of the column 12, and the three pieces of partial data of the column 12 into which the data of the column 12 is divided are loaded one by one to a same location of each row included in the sub-buffer shown in Table 5.
In this embodiment of the present disclosure, when there is a relatively large quantity of sub-buffers in the row buffer, the database system may execute combination of the sub-buffers and increase the length of the row of the row buffer, as shown in Table 6:
In this embodiment of the present disclosure, some column data whose length cannot be fixed cannot be directly loaded to the row buffer, and in this case, the database system may add a location having a fixed length to a same location of each row in the row buffer to store index information (that is, a starting storage location of each column data) of the column data whose length cannot be fixed, and then store the column data whose length cannot be fixed to the column buffer. In this case, when the column data is queried, the column data may be positioned quickly from the column buffer according to the index information of the column data.
In this embodiment of the present disclosure, in the network architecture for querying service data shown in
Referring to
In this embodiment of the present disclosure, the input module 201 may be a wired or wireless receiver, and is configured to receive a service data query request sent by a service system (for example, an analytical service system and a transactional service system), and output the request to the processor 202. The output module 203 may be a wired or wireless transmitter, and is configured to send the service data found by the processor 202 to the service system.
In this embodiment of the present disclosure, the memory 204 may be a high-speed random access memory (RAM) memory, or a non-volatile memory, for example, at least one magnetic disk storage. Optionally, the memory 204 may further be at least one storage apparatus that is located far away from the processor 202. As shown in
In this embodiment of the present disclosure, the BI database memory module 205 is used as a buffer located between a BI database and an external interface, and may be configured to create a column buffer and a row buffer, where the column buffer stores column data using a column storage architecture, and is mainly applicable to a fast query; and the row buffer stores column data using a row storage architecture, and is mainly applicable to a query of an excessively high throughput.
In the apparatus for querying service data shown in
As an optional implementation manner, the processor 202 may further invoke the software program stored in the memory 204, to execute the following operations: if detecting that the column data corresponding to the specified column identifier is not loaded in the column buffer of the BI database memory module 205, reading the column data corresponding to the specified column identifier from the BI database; and loading the read column data corresponding to the specified column identifier to a column of the column buffer.
As an optional implementation manner, the processor 202 may further invoke the software program stored in the memory 204, to execute the following operations: if detecting that the row index corresponding to the target column identifier does not exist in the memory 204, detecting whether the column data corresponding to the target column identifier is loaded in the column buffer, and if the column data corresponding to the target column identifier is loaded in the column buffer, reading the column data corresponding to the target column identifier from the column buffer; or if the column data corresponding to the target column identifier is not loaded in the column buffer, reading the column data corresponding to the target column identifier from the BI database; loading the read column data corresponding to the target column identifier to a row in the row buffer; and establishing a correspondence between the target column identifier and a row index of the row (that is, the row, to which the column data corresponding to the target column identifier is loaded, in the row buffer), storing the correspondence in the memory 204, and executing the operation of querying, based on the first query condition, column data that corresponds to the target column identifier, and that is in a row that is in the row buffer of the BI database memory module 205 and corresponds to the row index, to obtain a query result; and returning the result to the transactional service system using the output module 203.
As an optional implementation manner, the processor 202 may further invoke the software program stored in the memory 204, to execute the following operations: before receiving, using the inputting module 201, the transactional service data query request that is sent by the transactional service system and that includes the target column identifier and the first query condition, if a query registration request that is sent by the transactional service system and that includes the target column identifier is received using the inputting module 201, detecting whether the column data corresponding to the target column identifier is loaded in the row buffer of the BI database memory module 205, if the column data corresponding to the target column identifier is not loaded in the row buffer, detecting whether the column data corresponding to the target column identifier is loaded in the column buffer, and if the column data corresponding to the target column identifier is loaded in the column buffer, reading the column data corresponding to the target column identifier from the column buffer, or if the column data corresponding to the target column identifier is not loaded in the column buffer, reading the column data corresponding to the target column identifier from the BI database; loading the read column data corresponding to the target column identifier to a row in the row buffer; and establishing a correspondence between the target column identifier and a row index of the row (that is, the row, to which the column data corresponding to the target column identifier is loaded, in the row buffer), storing the correspondence in the memory 204, and executing the operation of querying, based on the first query condition, column data that corresponds to the target column identifier, and that is in a row that is in the row buffer of the BI database memory module 205 and corresponds to the row index, to obtain a query result; and returning the result to the transactional service system using the output module 203.
In an embodiment, the row buffer may include several sub-buffers, each sub-buffer includes several rows having a same length, the sub-buffers have a same quantity of rows, and rows of a same rank in the several sub-buffers share a same row index, that is, the rows of a same rank in the several sub-buffers belong to a same row; and correspondingly, a manner in which the processor 202 loads the read column data corresponding to the target column identifier to the buffer row in the row buffer may be: dividing, according to the quantity of the rows of the sub-buffer in the row buffer, the read column data corresponding to the target column identifier, and loading one by one each partial column data obtained by means of division to a same location of each row included in any one of the sub-buffers.
In an embodiment, a manner in which the processor 202 queries, based on the first query condition, the column data that corresponds to the target column identifier, and that is in the row that is in the row buffer of the BI database memory and corresponds to the row index, to obtain the query result is: determining each row that is in the row buffer of the BI database memory and corresponds to the row index and that is located in any one of the sub-buffers; and querying, based on the first query condition, partial column data that corresponds to the target column identifier, and that is loaded in a same location in each row that is located in any one of the sub-buffers and corresponds to the row index, to obtain the query result.
In this embodiment of the present disclosure, in the apparatus for querying service data shown in
Referring to
301: A BI database system identifies a received service data query request, and when the service data query request is an analytical service data query request that is sent by an analytical service system and that includes a specified column identifier and a second query condition, step 302 is performed, or when the service data query request is a transactional service data query request that is sent by a transactional service system and that includes a target column identifier and a first query condition, step 306 is performed.
In an embodiment, the BI database system may use different ports to receive the analytical service data query request sent by the analytical service system and the transactional service data query request sent by the transactional service system. Further, the BI database system may identify, according to a port using which a service data query request is received, whether the service data query request is the analytical service data query request that is sent by the analytical service system and that includes the specified column identifier and the second query condition, or is the transactional service data query request that is sent by the transactional service system and that includes the target column identifier and the first query condition.
In another embodiment, the analytical service data query request sent by the analytical service system and the transactional service data query request sent by the transactional service system may separately carry different service type identifiers. Further, the BI database system may identify, according to a service type identifier carried by a received service data query request, whether the service data query request is the analytical service data query request that is sent by the analytical service system and that includes the specified column identifier and the second query condition, or is the transactional service data query request that is sent by the transactional service system and that includes the target column identifier and the first query condition.
In this embodiment of the present disclosure, the specified column identifier and the target column identifier may also be referred to as customer identifiers because all data of each customer may be used as column data for storage in the BI database.
In this embodiment of the present disclosure, the first query condition and the second query condition are mainly used to perform querying operations such as filtering and statistics collection on the column data.
302: The BI database system detects whether column data corresponding to the specified column identifier is loaded in a column buffer of a BI database memory, and if not, step 303 is performed, or if yes, step 304 and step 305 are performed.
303: The BI database system reads the column data corresponding to the specified column identifier from a BI database, and loads the read column data corresponding to the specified column identifier to a column of the column buffer, and step 304 and step 305 are performed.
The BI database system may load in advance the column data corresponding to the specified column identifier from the BI database to a column of the column buffer.
304: The BI database system queries, based on the second query condition, the column data corresponding to the specified column identifier from the column buffer, to obtain a query result.
For example, it is assumed that Table 6 and Table 7 are created below in the BI database.
Table 6 and Table 7 may be stored in a file form, as shown in Table 8:
Based on Table 6 and Table 7 that are created in the BI database, it is further assumed that the analytical service data query request that is received by the BI database system and that is sent by the analytical service system may include the following second query condition, that is:
select avg(t1.attr1) from t1, t2 where t1.attr0=t2.attr0 and t2.f1>30.5, where t1.attr1, t1.attr0, t2.attr0, and t2.f1 all represent the specified column identifier, and meaning of the second query condition in an SQL syntax is: when t1 is associated with t2, if column data corresponding to t2.f1 is greater than 30.5, an average value of column data corresponding to t1.attr1 is obtained.
Correspondingly, the BI database system may detect whether column data corresponding to t1.attr1, t1.attr0, t2.attr0, and t2.f1 is loaded in the column buffer of the BI database memory, and if yes, queries the column data corresponding to t1.attr1, t1.attr0, t2.attr0, and t2.f1 from the column buffer based on the second query condition, to obtain a query result, or if not, the BI database system reads the column data corresponding to t1.attr1, t1.attr0, t2.attr0, and t2.f1 from the BI database, loads the read column data to a column of the column buffer, and queries, based on the second query condition, the column data corresponding to t1.attr1, t1.attr0, t2.attr0, and t2.f1 from the column buffer, to obtain a query result.
In this embodiment of the present disclosure, the query result may include various service data, such as a traffic value, a sales value, a sales target, and transaction data.
305: The BI database system returns the query result to the analytical service system, and ends the process.
306: The BI database system detects whether a row index corresponding to the target column identifier exists, and if it is detected that the row index corresponding to the target column identifier does not exist, step 307 is performed, or if it is detected that the row index corresponding to the target column identifier exists, step 312 and step 313 are performed.
307: The BI database system detects whether column data corresponding to the target column identifier is loaded in the column buffer, and if the column data corresponding to the target column identifier is loaded in the column buffer, step 308 is performed, or if the column data corresponding to the target column identifier is not loaded in the column buffer, step 309 is performed.
308: The BI database system reads the column data corresponding to the target column identifier from the column buffer, and step 310 is performed.
309: The BI database system reads the column data corresponding to the target column identifier from the BI database, and step 310 is performed.
310: The BI database system loads the read column data corresponding to the target column identifier to a row in a row buffer.
In this embodiment of the present disclosure, the row buffer may include several sub-buffers, each sub-buffer includes several rows having a same length, the sub-buffers have a same quantity of rows, and rows of a same rank in the several sub-buffers share a same row index, that is, the rows of a same rank in the several sub-buffers belong to a same row.
Correspondingly, in step 310, a manner in which the BI database system loads the read column data corresponding to the target column identifier to the row in the row buffer may be: dividing, by the BI database system, according to the quantity of the rows of the sub-buffer in the row buffer, the read column data corresponding to the target column identifier, and loading one by one each partial column data obtained by means of division to a same location of each row included in any one of the sub-buffers.
In this embodiment of the present disclosure, based on Table 6 and Table 7 that are created in the BI database, it is further assumed that the transactional service data query request that is received by the BI database system and that is sent by the transactional service system may include the following first query condition, that is:
select t1.attr0, t1.attr1, t1.attr2, t1.attr3, t1.attr4, t1.attr5, t1.attr6, t2.f6, t2.f7, t2.f8, t2.f9 from t1, t2 where t1.attr0=t2.attr0 and t1.attr0=?,
where t1.attr0, t1.attr1, t1.attr2, t1.attr3, t1.attr4, t1.attr5, t1.attr6, t2.f6, t2.f7, t2.f8, and t2.f9 represent the target column identifier, and meaning of the first query condition in the SQL syntax is: when t1, t2, and the like are associated, column data corresponding to the target column identifier, such as t1.attr0, t1.attr1, t1.attr2, t1.attr3, t1.attr4, t1.attr5, t1.attr6, t2.f6, t2.f7, t2.f8, and t2.f9 is selected.
Correspondingly, the BI database system may detect whether a row index corresponding to the target column identifier such as t1.attr0, t1.attr1, t1.attr2, t1.attr3, t1.attr4, t1.attr5, t1.attr6, t2.f6, t2.f7, t2.f8, and t2.f9, exists, and if an row index corresponding to t1.attr0 and t2.f6 exists, it indicates that column data corresponding to t1.attr0 and t2.f6 is already loaded to a row of the row buffer, or if an row index corresponding to t1.attr1, t1.attr2, t1.attr3, t1.attr4, t1.attr5, t1.attr6, t2.f7, t2.f8, and t2.f9 does not exist, it indicates that column data corresponding to t1.attr1, t1.attr2, t1.attr3, t1.attr4, t1.attr5, t1.attr6, t2.f7, t2.f8, and t2.f9 is not loaded to a row of the row buffer. Correspondingly, the BI database system may detect whether the column data corresponding to t1.attr1, t1.attr2, t1.attr3 t1.attr4, t1.attr5, t1.attr6, t2.f7, t2.f8, and t2.f9 is loaded in the column buffer, and if the column data corresponding to t1.attr1, t1.attr2, t1.attr3 t1.attr4, t1.attr5, t1.attr6, t2.f7, t2.f8, and t2.f9 is loaded in the column buffer, the BI database system may read the column data corresponding to t1.attr1, t1.attr2, t1.attr3, t1.attr4, t1.attr5, t1.attr6, t2.f7, t2.f8, and t2.f9 from the column buffer, or if the column data corresponding to t1.attr1, t1.attr2, t1.attr3 t1.attr4, t1.attr5, t1.attr6, t2.f7, t2.f8, and t2.f9 is not loaded in the column buffer, the BI database system may read the column data corresponding to t1.attr1, t1.attr2, t1.attr3 t1.attr4, t1.attr5, t1.attr6, t2.f7, t2.f8, and t2.f9 from the BI database. Further, the BI database system may load the read column data corresponding to t1.attr1, t1.attr2, t1.attr3 t1.attr4, t1.attr5, t1.attr6, t2.f7, t2.f8, and t2.f9 to a row in the row buffer (that is, column-to-row conversion).
In this embodiment of the present disclosure, if storage space of the row buffer is sufficient, but space is insufficient because of a memory fragment, memory defragmentation may be considered. Alternatively, if the storage space of the row buffer is insufficient, a new sub-buffer may be established for the row buffer. When there is a relatively large quantity of sub-buffers in the row buffer (for example, more than three), an operation of combining the sub-buffers is executed.
311: The BI database system establishes a correspondence between the target column identifier and a row index of the row, to which the column data corresponding to the target column identifier is loaded, in the row buffer, and step 312 is performed.
In this embodiment of the present disclosure, after the BI database system establishes the correspondence between the target column identifier and the row index of the row, to which the column data corresponding to the target column identifier is loaded, in the row buffer, the correspondence may be used to indicate that the column data corresponding to the target column identifier is already loaded to the row of the row buffer. In this way, the BI database system may detect, according to the correspondence, that the row index corresponding to the target column identifier exists next time.
312: The BI database system queries, based on the first query condition, the column data that corresponds to the target column identifier, and that is in the row that is in the row buffer of the BI database memory and corresponds to the row index, to obtain a query result.
In this embodiment of the present disclosure, in step 312, a manner in which the BI database system queries, based on the first query condition, the column data that corresponds to the target column identifier, and that is in the row that is in the row buffer of the BI database memory and corresponds to the row index, to obtain the query result may be: determining, by the BI database system, each row that is in the row buffer of the BI database memory and corresponds to the row index and that is located in any one of the sub-buffers, and querying, based on the first query condition, each partial column data that corresponds to the target column identifier, and that is loaded in a same location of each row that is located in any one of the sub-buffers and corresponds to the row index, to obtain the query result.
For example, in this embodiment of the present disclosure, after all column data corresponding to t1.attr0, t1.attr1, t1.attr2, t1.attr3 t1.attr4, t1.attr5, t1.attr6, t2.f6, t2.f7, t2.f8, and t2.f9 is loaded to a row of the row buffer, and assuming that the row index is 13800138000, the BI database system may determine each row that corresponds to the row index 13800138000 in the row buffer of the BI database memory and that is located in any one of the sub-buffers, and query, based on the first query condition, each partial column data that corresponds to t1.attr0, t1.attr1, t1.attr2, t1.attr3 t1.attr4, t1.attr5, t1.attr6, t2.f6, t2.f7, t2.f8, and t2.f9, and that is located in the same location in each row that is located in any one of the sub-buffers and corresponds to the row index 13800138000, to obtain a query result.
313: The BI database system returns the query result to the transactional service system, and the process ends.
In this embodiment of the present disclosure, in the method for querying service data shown in
Referring to
401: When receiving a query registration request that is sent by a transactional service system and that includes a target column identifier, a BI database system detects whether column data corresponding to the target column identifier is loaded in a row buffer of a BI database memory, and if the column data corresponding to the target column identifier is not loaded in the row buffer, step 402 is performed, or if the column data corresponding to the target column identifier is already loaded in the row buffer, step 407 is performed.
For example, the query registration request that is sent by transactional service system, that includes the target column identifier, and that is received by the BI database system may be:
registration query 910 as “select t1.attr0, t1.attr1, t1.attr2, t1.attr3 t1.attr4, t1.attr5, t1.attr6, t2.f6, t2.f7, t2.f8, t2.f9 from t1, t2 where t1.attr0=t2.attr0 and t1.attr0=?”,
where t1.attr0, t1.attr1, t1.attr2, t1.attr3 t1.attr4, t1.attr5, t1.attr6, t2.f6, t2.f7, t2.f8, and t2.f9 represent the target column identifier, and the BI database system may detect whether column data corresponding to t1.attr0, t1.attr1, t1.attr2, t1.attr3 t1.attr4, t1.attr5, t1.attr6, t2.f6, t2.f7, t2.f8, and t2.f9 is loaded in the row buffer of the BI database memory, and if not, step 402 is performed, or if yes, step 407 is performed.
402: The BI database system detects whether the column data corresponding to the target column identifier is loaded in a column buffer, and if the column data corresponding to the target column identifier is loaded in the column buffer, step 403 is performed, or if the column data corresponding to the target column identifier is not loaded in the column buffer, step 404 is performed.
403: The BI database system reads the column data corresponding to the target column identifier from the column buffer, and step 405 is performed.
404: The BI database system reads the column data corresponding to the target column identifier from a BI database, and step 405 is performed.
405: The BI database system loads the read column data corresponding to the target column identifier to a row in the row buffer.
In this embodiment of the present disclosure, the row buffer may include several sub-buffers, each sub-buffer includes several rows having a same length, the sub-buffers have a same quantity of rows, and rows of a same rank in the several sub-buffers share a same row index, that is, the rows of a same rank in the several sub-buffers belong to a same row.
Correspondingly, in step 405, a manner in which the BI database system loads the column data corresponding to the target column identifier to the row in the row buffer may be: dividing, by the BI database system, according to the quantity of the rows of the sub-buffer in the row buffer, the read column data corresponding to the target column identifier, and loading one by one each partial column data obtained by means of division to a same location of each row included in any one of the sub-buffers.
406: The BI database system establishes a correspondence between the target column identifier and a row index of the row.
407: The BI database system returns a query registration result to the transactional service system.
The query registration result may also be referred to as metadata, and is mainly used to indicate that the column data corresponding to the target column identifier is already loaded in the row buffer.
408: The BI database system identifies a received service data query request, and when the service data query request is an analytical service data query request that is sent by an analytical service system and that includes a specified column identifier and a second query condition, step 409 is performed, or when the service data query request is a transactional service data query request that is sent by a transactional service system and that includes a target column identifier and a first query condition, step 413 is performed.
409: The BI database system detects whether column data corresponding to the specified column identifier is loaded in the column buffer of the BI database memory, and if not, step 410 is performed, or if yes, step 411 to step 412 are performed.
410: The BI database system reads the column data corresponding to the specified column identifier from the BI database, and loads the read column data corresponding to the specified column identifier to a column of the column buffer, and step 411 to step 412 are performed.
411: The BI database system queries, based on the second query condition, the column data corresponding to the specified column identifier from the column buffer, to obtain a query result.
412: The BI database system returns the query result to the analytical service system.
413: The BI database system detects whether a row index corresponding to the target column identifier exists, and if it is detected that the row index corresponding to the target column identifier does not exist, the processing ends, or if it is detected that the row index corresponding to the target column identifier exists, step 414 and step 415 are performed.
The BI database system may detect, according to the correspondence established in step 406, whether the row index corresponding to the target column identifier exists.
414: The BI database system queries, based on the first query condition, the column data corresponding to the target column identifier in the row corresponding to the row index in the row buffer of the BI database memory, to obtain a query result.
In this embodiment of the present disclosure, in step 414, a manner in which the BI database system queries, based on the first query condition, the column data corresponding to the target column identifier in the row corresponding to the row index in the row buffer of the BI database memory, to obtain the query result may be: determining, by the BI database system, each row that is in the row buffer of the BI database memory and corresponds to the row index and that is located in any one of the sub-buffers, and querying, based on the first query condition, each partial column data that corresponds to the target column identifier, and that is loaded in a same location of each row that is located in any one of the sub-buffers and corresponds to the row index, to obtain the query result.
415: The BI database system returns the query result to the transactional service system.
In this embodiment of the present disclosure, in the method for querying service data shown in
Referring to
Referring to
In the apparatus for querying service data described in
Correspondingly, in the apparatus for querying service data described in
In this embodiment of the present disclosure, in the apparatus for querying service data described in
Referring to
Correspondingly, when receiving the transactional service data query request that is sent by the transactional service system and that includes the target column identifier and the first query condition, the first detection unit 501 may detect whether the row index corresponding to the target column identifier exists in the correspondence established by the second establishing unit 511.
In the apparatus for querying service data described in
Correspondingly, in the apparatus for querying service data described in
In this embodiment of the present disclosure, in the apparatus for querying service data described in
In this embodiment of the present disclosure, in the apparatus for querying service data shown in
Referring to
In this embodiment of the present disclosure, by implementing the system described in
The network architecture, the apparatus, the method, and the system that are disclosed in the embodiments of the present disclosure are described in detail. To better present effects of the present disclosure, the present disclosure and some optional solutions are compared below, and a comparison result is shown in Table 9:
A sequence of the steps of the method in the embodiments of the present disclosure may be adjusted, and certain steps may also be merged and removed according to an actual requirement.
The units or subunits in the terminal or device in the embodiments of the present disclosure may be adjusted, divided, and removed according to an actual requirement.
A person of ordinary skill in the art may understand that, all or a part of the steps in each method of the foregoing embodiments may be implemented by a program instructing related hardware. The program may be stored in a computer readable storage medium. The storage medium includes a read-only memory (ROM), a RAM, a programmable read-only memory (PROM), an erasable programmable read-only memory (EPROM), a one-time programmable read-only memory (OTPROM), an electrically-erasable programmable read-only memory, (EEPROM), a compact disc read-only memory (CD-ROM), or another optical disk memory, magnetic disk memory, magnetic tape memory, or any other computer readable medium that can be configured to carry or store data.
The method and apparatus for querying service data and the database system disclosed in the embodiments of the present disclosure are described above in detail. Although the principles and implementation manners of the present disclosure are described using specific embodiments in this specification, the foregoing descriptions of the embodiments are only intended to help understand the method and core idea of the method of the present disclosure. Meanwhile, a person of ordinary skill in the art may make modifications to the specific implementation manners and application range according to the idea of the present disclosure. In conclusion, the content of the specification should not be construed as a limitation to the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
201410804281.6 | Dec 2014 | CN | national |
This application is a continuation of International Application No. PCT/CN2015/089802, filed on Sep. 16, 2015, which claims priority of Chinese Patent Application No. 201410804281.6, filed on Dec. 22, 2014. The disclosures of the aforementioned applications are incorporated herein by reference in their entireties.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2015/089802 | Sep 2015 | US |
Child | 15630195 | US |