The present disclosure relates to the technical field of data distribution, and in particular, relates to a data distribution method and an electronic device.
In the current production environments, the amount of data that needs to be distributed is increasing, and the real-time distribution of large-scale data is usually realized by using a distributed message queue system. The distributed message queue system is composed of a plurality of servers. The plurality of servers distribute and store messages in a plurality of message queues, and the plurality of servers perform distributed storage and real-time distribution of the messages in the plurality of message queues.
Embodiments of the present disclosure provide a data distribution method and an electronic device.
According to an aspect of the embodiments of the present disclosure, a data distribution method is provided, which is performed by a storage server of a data distribution system. The data distribution system further includes a set of subscription servers. The method includes: acquiring a subscription timestamp of each of the subscription servers, wherein the subscription timestamp is configured to indicate a subscription progress of the subscription server; determining a sorting result of the subscription servers based on the subscription timestamps of the subscription servers; determining a first subscription server and a target subscription server in the set of subscription servers based on the sorting result, wherein the target subscription server has subscribed to and stored a target message, the target message being a message that the first subscription server needs to subscribe to; and sending a notification message to the first subscription server, wherein the notification message is configured to instruct the first subscription server to subscribe to the target message from the target subscription server.
According to another aspect of the embodiments of the present disclosure, a data distribution method is provided, which is performed by a subscription server of a data distribution system. The data distribution system further includes a storage server. The method includes: receiving a notification message from the storage server, wherein the notification message is configured to instruct the subscription server to subscribe to a message from a target subscription server; wherein the target subscription server is determined, by the storage server, for the subscription server through determining a sorting result of subscription servers in a set of subscription servers based on subscription timestamps of the subscription servers and determining the subscription server from the subscription servers based on the sorting result; the subscription timestamp is configured to indicate a subscription progress of each of the subscription servers; the target subscription server has subscribed to and stored a target message, the target message being a message that the subscription server needs to subscribe to; and subscribing to the target message from the target subscription server based on the notification message.
According to still yet another aspect of the embodiments of the present disclosure, an electronic device is provided. The electronic device includes: a processor; and a memory configured to store one or more instructions executable by the processor; wherein the processor, when executing the one or more instructions, is caused to: acquire a subscription timestamp of each subscription server in a set of subscription servers, wherein the subscription timestamp is configured to indicate a subscription progress of the subscription server; determine a sorting result of the subscription servers based on the subscription timestamps of the subscription servers; determine a first subscription server and a target subscription server in the set of subscription servers based on the sorting result, wherein the target subscription server has subscribed to and stored a target message, the target message being a message that the first subscription server needs to subscribe to; and send a notification message to the first subscription server, wherein the notification message is configured to instruct the first subscription server to subscribe to the target message from the target subscription server.
The terms “first,” “second,” and the like in the description, claims, and accompanying drawings of the present disclosure are configured to distinguish similar objects instead of indicating a specific sequence or order. It should be understood that the data used in such a way is interchangeable under proper conditions to make it possible to implement the described embodiments of the present disclosure in sequences except those illustrated or described herein.
User information involved in the present disclosure is information authorized by a user or fully authorized by various parties.
In some embodiments, the meaning of A and/or B includes: A and B, A, and B.
In the related art, message data published by a server that publishes messages (hereinafter referred to as a production server) is uniformly stored in a distributed message queue system, and a plurality of servers that request to subscribe to messages (hereinafter referred to as subscription servers) need to subscribe to messages from message queues in the distributed message queue system. When a plurality of subscription servers subscribe to messages in the message queue simultaneously, the distributed message queue system needs to communicate with the plurality of subscription servers simultaneously, resulting in high network communication load and consuming many machine resources.
For clear illustration of embodiments of the present disclosure, an example of an application scenario of a data distribution method in the embodiments of the present disclosure is introduced.
Referring to
In the distributed message queue system, the production server publishes a produced message to a storage server based on the storage server configured in a local configuration file, and publishes a message queue identifier configured in the local configuration file to the storage server. The storage server stores the message published by the production server in a corresponding message queue based on the message queue identifier.
During the process of generating each message, the production server generates a time point corresponding to the message (i.e., the time point of the message), that is, each message generated by the production server includes the time point corresponding to the message. Upon generating each message, the production server sends the generated message to the storage server, and the storage server sequentially stores the message based on the sequence of the time point corresponding to each message. The message with the earliest time point (i.e., the smallest time point) is ranked at the top of the message queue.
In some embodiments, time unit of the time point includes: year, month, day, hour, minute, second, millisecond, and the like. The time unit of the time point is not limited in the embodiments of the present disclosure. In some embodiments, a number of storage servers, a number of subscription servers, and a number of production servers are determined based on business requirements, which are not limited in the embodiment of the present disclosure.
During the data distribution process, each of the subscription servers subscribes to a message in a corresponding message queue from the storage server based on the storage server and the message queue identifier configured in the local configuration file, and consumes the subscribed message (consuming the message means using the message). The message is data. For example, in a recommendation system, the production server acquires user data from a terminal, processes the acquired user data, generates a log (a type of message), and sends the log to the storage server for storage. The storage server is a server that stores logs. The subscription server subscribes to the logs from the storage server, and trains a model by using the subscribed logs to obtain a trained model.
Referring to
For example, the production server generates 1 to 30 messages, a time point corresponding to a first message is 001, a time point corresponding to a second message is 002, a time point corresponding to a third message is 003, and so on. The time points corresponding to the 1st to 30th messages are 001 to 030 respectively. The production server randomly sends the 30 messages whose time points are 001 to 030 to the storage server 1 or the storage server 2.
The 30 messages whose time points are 001 to 030 being sent to the storage server 1 is taken as an example. Upon receiving the 30 messages whose time points are 001 to 030, the storage server 1 stores the received messages sequentially in the message queue 1 of the storage server 1 based on the time point corresponding to each message in a time sequence from 001 to 030.
Similarly, upon generating 30 messages whose time points are from 031 to 060, the production server randomly sends the 30 messages whose time points are from 031 to 060 to the storage server 1 or the storage server 2. The 30 messages whose time points are from 031 to 060 being sent to the storage server 2 is taken as an example. The storage server 2 stores the 30 messages whose time points are from 031 to 060 in the message queue 1 of the storage server 2.
In the case that a message queue 2 exists, the processes that the storage server 1 and the storage server 2 store messages in the message queue 2 are the same as the processes for storing the messages in the message queue 1, which is not repeated herein.
For example, assuming that the message queues configured in the local configuration files of the subscription server 1, the subscription server 2, the subscription server 3, and the subscription server 4 are all the message queue 1, then the subscription server 1, the subscription server 2, the subscription server 3, and the subscription server 4 subscribe to a message in the message queue 1 from the storage server 1 respectively, wherein the message is at least one of the 30 messages whose time points are 001 to 030. Or the subscription server 1, the subscription server 2, the subscription server 3 and the subscription server 4 subscribe to a message in the message queue 1 from the storage server 2, wherein the message is at least one of the 30 messages whose time points are 031 to 060.
In some embodiments, the storage server maintains blocks, message queues, and subscription timestamps. A block is a fixed-size memory space used to store a plurality of messages in sequence. A message queue is a linked list composed of a plurality of blocks, and the plurality of blocks are arranged in order based on the creation time. A distributed message queue system supports a plurality of message queues simultaneously.
In some embodiments, the storage server further maintains a binary tree for the message queues. For example, a plurality of subscription servers subscribing to the same message queue are organized into a binary tree. The subscription server with a newer subscription timestamp has a node in the complete binary tree closer to the root node.
In some embodiments, with respect to a scenario where a plurality of subscription servers subscribe to the same message queue, during subscribing to a message in the message queue, one subscription server may first attempt to subscribe to the message from a subscription server corresponding to its parent node. In the case that the subscription fails, the subscription server may continue to attempt to subscribe to the message from a subscription server corresponding to its ancestor node. In the case that the subscription still fails, the subscription server may finally subscribe to the message from the storage server. In addition, the subscription server caches the subscribed message locally for its descendant nodes to subscribe to. The way of caching the subscribed message for other subscription servers to subscribe to effectively disperses the subscription pressure of a plurality of subscription servers to the distributed message queue system, thereby greatly reducing the network communication load of the distributed message queue system, and reducing the consumption of machine resources.
In some embodiments, the data distribution method according to the embodiments of the present disclosure is performed by a data distribution apparatus, wherein the data distribution apparatus is usually achieved in software and/or hardware. The data distribution apparatus is deployed in each storage server in the distributed message queue system. The method includes the following processes 201-203.
In 201, a subscription timestamp of each subscription server in the set of subscription servers is acquired.
The subscription timestamp is configured to indicate a subscription progress of the subscription server and marks a message that the subscription server has subscribed to and a message that needs to be subscribed to next time. In some embodiments, the subscription timestamp is a maximum time point among the time points of the messages subscribed by the subscription server each time. The set of subscription servers refers to a set composed of a plurality of subscription servers.
For example, with reference to
In the embodiments of the present disclosure, the storage server acquires the subscription timestamp of each of the subscription servers to determine the subscription progress of each of the subscription servers. For example, with reference to
In some embodiments, the subscription server subscribes to messages from the storage server 1 and the storage server 2. In some other embodiments, the subscription server subscribes to messages from other subscription servers. The storage server may acquire the subscription timestamp of each of the subscription servers by any suitable method, which is not limited in the embodiments of the present disclosure.
In 202, based on the subscription timestamps of the subscription servers, a first subscription server is determined from the set of subscription servers and a target subscription server of the first subscription server is determined.
The first subscription server subscribes to messages from other subscription servers. The target subscription server has subscribed to and stored the messages that the first subscription server needs to subscribe to.
In the embodiments of the present disclosure, upon acquiring the subscription timestamp of each of the subscription servers, the storage server determines the subscription progress of each of the subscription servers based on the subscription timestamp of the subscription server, and based on the subscription progress of each of the subscription servers, determines the first subscription server from the set of subscription servers and determines the target subscription server of the first subscription server.
In some embodiments, the storage server determines a sorting result of the subscription servers based on the subscription timestamps of the subscription servers; and determines a first subscription server and a target subscription server in the set of subscription servers based on the sorting result.
For example, in conjunction with 201, the subscription server 1 has subscribed to the messages whose time points are 001 to 005, the subscription server 1 needs to subscribe to 5 messages whose time points are 006 to 010 next time, and the subscription server 2, the subscription server 3 and the subscription server 4 have subscribed to and stored the 5 messages whose time points are 006 to 010, then the subscription server 1 subscribes to the messages whose time points are 006 to 010 from the subscription server 2, the subscription server 3, and the subscription server 4. In this case, the storage server 1 determines that the subscription server 1 is the first subscription server, and at the same time determines that the target subscription server of the subscription server 1 is at least one of the subscription server 2, the subscription server 3, or the subscription server 4.
Similarly, next time in the case that the subscription server 2 subscribes to 5 messages whose time points are 011 to 015 from the subscription server 3 and the subscription server 4, the storage server 1 determines that the subscription server 2 is the first subscription server and determines that the target subscription server of the subscription server 2 is the subscription server 3 and/or the subscription server 4.
Similarly, next time in the case that the subscription server 3 subscribes to 5 messages whose time points are 016 to 020 from the subscription server 4, the storage server 1 determines that the subscription server 3 is the first subscription server, and determines that the target subscription server of the subscription server 3 is the subscription server 4.
Because the subscription server 4 needs to subscribe to the 5 messages whose time points are from 021 to 025 next time, and the subscription server 1, the subscription server 2, and the subscription server 3 have not subscribed and stored the 5 messages whose time points are from 021 to 025, the subscription server 4 is not the first subscription server.
In some embodiments, one or more target subscription servers are determined for each first subscription server. For example, only one target subscription server is determined for each first subscription server. The number of target subscription servers determined for the first subscription server is determined based on business requirements, which is not limited in the embodiments of the present disclosure.
In 203, the first subscription server is instructed to subscribe to a message from the target subscription server.
In the embodiments of the present disclosure, upon determining the first subscription server and the target subscription server of the first subscription server, the storage server instructs the first subscription server to subscribe to a message from the corresponding target subscription server.
In some embodiments, the storage server sends a notification message to the first subscription server, wherein the notification message is configured to instruct the first subscription server to subscribe to the target message from the target subscription server. The target message is a message that the first subscription server needs to subscribe to, and the target subscription server has subscribed to and stored the target message.
For example, in conjunction with 202, in a scenario where the subscription server 1 is the first subscription server, the storage server 1 instructs the subscription server 1 to subscribe to 5 messages whose time points are 006 to 010 from the target subscription server (any one of the subscription server 2, the subscription server 3, or the subscription server 4).
In some embodiments, the storage server may instruct the first subscription server to subscribe to a message from the target subscription server by any suitable method, which is not limited in the embodiments of the present disclosure.
It should be noted that in the embodiments of the present disclosure, the subscription server 4 is not determined to be the first subscription server. In this case, the subscription server 4 directly subscribes to messages from the storage server 1 or the storage server 2.
The storage server 2 is the same as the storage server 1. As shown in
The embodiments of the present disclosure realize that the first subscription server subscribes to messages from the target subscription server, such that the number of subscription servers accessing the storage server simultaneously is reduced, and the number of subscription servers accessing the distributed message queue system simultaneously is reduced, thereby effectively dispersing the subscription pressure of a plurality of subscription servers to the distributed message queue system, greatly reducing the network communication load of the distributed message queue system, and reducing the consumption of machine resources.
In 301, a storage server acquires a subscription timestamp of each subscription server in a set of subscription servers.
In embodiments of the present disclosure, the storage server acquires the subscription timestamp of each of the subscription servers by any one of the following methods or a combination of a plurality of following methods.
In a first method, a subscription request sent by the subscription server is received, and the subscription timestamp of the subscription server is determined based on the subscription request.
In some embodiments, in the case that the subscription server subscribes to a message from the storage server, the subscription server sends a subscription request to the storage server, requesting to subscribe to the message in a message queue stored in the storage server.
In some embodiments, the subscription request includes a starting time point of the subscription message and a number of subscription messages. Upon receiving the subscription request, the storage server parses the subscription request to determine the specific message that the subscription server needs to subscribe to.
For example, in conjunction with 201 to 203, in a scenario where 5 messages whose time points are 001 to 005 are subscribed by the subscription server 1 from the storage server 1, the subscription server 1 sends a subscription request to the storage server 1. In some embodiments, the subscription request includes the starting time point 001 and the number of messages that need to be subscribed to which is 5. Upon receiving the subscription request, the storage server 1 parses the subscription request and determines that the subscription server 1 needs to subscribe to 5 messages whose time points are 001 to 005. In this case, the storage server 1 determines that the subscription timestamp of the subscription server 1 is 005. The subscription timestamp 005 indicates that the subscription server 1 is subscribing to the messages at the time point 005 and before the time point 005. Upon receiving the subscription request, the storage server 1 sends the 5 messages whose time points are 001 to 005 to the subscription server 1.
Similarly, the storage server 1 determines the subscription timestamps of the subscription server 2, the subscription server 3, and the subscription server 4 based on the subscription requests sent by the subscription server 2, the subscription server 3, and the subscription server 4, respectively.
It should be noted that in the case that the subscription server 3 and the subscription server 4 do not send a subscription request to the storage server 1, the subscription timestamps of the subscription server 3 and the subscription server 4 can be determined by a following second method.
The processes that the subscription server sends a subscription request to the storage server, the storage server parses the subscription request, and sends corresponding messages to the subscription server based on the subscription request are not limited in the embodiments of the present disclosure.
In the second method, feedback information sent by the subscription server is received, and the subscription timestamp of the subscription server is determined based on the feedback information.
The feedback information is sent from the subscription server to the storage server after the subscription server subscribes to a message from other subscription servers. For example, the feedback information is sent by the subscription server to the storage server after the subscription server subscribes to a message from the target subscription server. The feedback information includes the subscription timestamp.
In some embodiments, after the subscription server subscribes to a message from other subscription servers (such as target subscription server), the subscription server determines its own subscription timestamp based on the subscribed message, and sends feedback information including the subscription timestamp to the storage server.
For example, in conjunction with 201 to 203, in a scenario where the subscription server 1 subscribes to messages whose time points are 006 to 010 from any one of the subscription server 2, the subscription server 3, and the subscription server 4, the subscription server 1 determines that the 5 messages whose time points are 006 to 010 have been subscribed to and stored, determines that its own subscription timestamp is 010, and sends feedback information including the subscription timestamp 010 to the storage server 1 and the storage server 2. In this case, the storage server 1 determines the subscription timestamp of the subscription server 1 based on the feedback information sent by the subscription server 1.
The process that the storage server 2 determines the subscription timestamps of the subscription server 1, the subscription server 2, the subscription server 3, and the subscription server 4 is the same as that of the subscription server 1, which is not repeated herein.
In 302, based on the subscription timestamps of the subscription servers, the storage server determines a first subscription server from the set of subscription servers and determines a target subscription server of the first subscription server.
In some embodiments, the storage server sorts the subscription servers based on the subscription timestamps of the subscription servers to acquire a sorting result of the subscription servers; and based on the sorting result, determines the first subscription server from the subscription servers and determines the target subscription server of the first subscription server.
In some embodiments, the sorting result is a binary tree sorting result.
In some embodiments, the storage server sorts the subscription servers based on the subscription timestamps of the subscription servers to acquire the binary tree sorting result.
For example, in conjunction with 301, referring to
In some other embodiments, in the case that the sorting result is a binary tree sorting result, based on the sorting result, determining the first subscription server from the set of subscription servers and determining the target subscription server of the first subscription server includes: determining subscription servers other than a second subscription server as the first subscription servers based on the binary tree sorting result; selecting the subscription servers corresponding to some or all of ancestor nodes of a specified node as the target subscription servers of the first subscription servers. The second subscription server is the subscription server corresponding to the root node of the binary tree.
The specified node corresponds to the first subscription server.
In the embodiments of the present disclosure, in the case that the sorting result is a binary tree sorting result, because the subscription timestamp of the subscription server corresponding to the root node is the greatest, other subscription servers can subscribe to a message from the subscription server corresponding to the root node, and the subscription server corresponding to the root node can only subscribe to a message from the storage server. Therefore, subscription servers other than the subscription server corresponding to the root node are determined as the first subscription servers. As shown in
As for the process for determining the target subscription server of the first subscription server, since among ancestor nodes of a binary tree node corresponding to the first subscription server, the subscription timestamp of the subscription server corresponding to each ancestor node is greater than the subscription timestamp of the subscription server corresponding to the node, the subscription server corresponding to at least one ancestor node is selected from the ancestor nodes of the binary tree node corresponding to the first subscription server as the target subscription server of the first subscription server.
As shown in
Similarly, for the subscription server 2 and the subscription server 3, the subscription server 4 corresponding to the root node is selected as the target subscription server of the subscription server 2 and the subscription server 3.
In the embodiments of the present disclosure, in the case that the sorting result is a binary tree sorting result, an ancestor node of a node corresponding to the first subscription server is determined, and the subscription server corresponding to the ancestor node serves as the target subscription server of the first subscription server. During the process of determining the target subscription server, the same subscription server serving as the target subscription server of a plurality of first subscription servers is avoided, and the communication load and machine resource consumption of the subscription server are reduced. For example, the subscription server 3 serves as the target subscription server of the subscription server 1 and the subscription server 2 simultaneously, and only the subscription server 4 corresponding to the subscription server 2 serves as the target subscription server of the subscription server 2, thereby avoiding the subscription server 3 serving as the target subscription server of the subscription server 1 and the subscription server 2 simultaneously, and reducing the communication load and resource consumption of the subscription server 3. The node corresponding to the subscription server 4 is the ancestor node of the descendant node of the node corresponding to the subscription server 4.
In some other embodiments, in the case that the sorting result is a binary tree sorting result, based on the sorting result, determining the first subscription server from the set of subscription servers and determining the target subscription server of the first subscription server includes: determining other subscription server except a second subscription server as the first subscription server based on the binary tree sorting result, taking a parent node of a specified node as a target node, and taking a subscription server corresponding to the target node as the target subscription server of the first subscription server.
For example, referring to
Similarly, for the subscription server 2, the parent node (the root node) of the binary tree node corresponding to the subscription server 2 is determined as the target node, and the subscription server 4 corresponding to the target node is taken as the target subscription server of the subscription server 2.
In some embodiments, in the case that the sorting result is a binary tree sorting result, the parent node of the binary tree node corresponding to the first subscription server is taken as the target node, and the subscription server corresponding to the target node is taken as the target subscription server of the first subscription server. During the process of determining the target subscription server, the same subscription server serving as the target subscription server of a plurality of first subscription servers is avoided, thereby reducing the communication load and machine resource consumption of the subscription server. Additionally, the target subscription server of the first subscription server is quickly determined, thereby improving the operating efficiency of the entire system.
As shown in
In some embodiments, in the case that the sorting result is a binary tree, the binary tree is a complete binary tree or a full binary tree. In the complete binary tree, each layer except the last layer has reached the maximum number of nodes in the layer, and all nodes in the last layer are in the left row. In the full binary tree, the number of nodes in each layer reaches the maximum. The complete binary tree or the full binary tree has a complete structure, such that the target subscription server of each first subscription server can be quickly and conveniently determined.
In some embodiments, the target subscription server corresponds to an ancestor node of a specified node; or the target subscription server corresponds to a parent node of a specified node. The specified node corresponds to the first subscription server. The first subscription server performs the following processes: subscribing to a target message from the target subscription server corresponding to the parent node; in the case that the target message is not subscribed to from the target subscription server corresponding to the parent node, subscribing to the target message from the target subscription server corresponding to the ancestor node; and in the case that the target message is not subscribed to from the target subscription server corresponding to the ancestor node, subscribing to the target message from the storage server.
The storage server determines a second subscription server corresponding to the root node of the binary tree based on the binary tree sorting result; determines the first subscription server based on the second subscription server, wherein the first subscription server is a subscription server other than the second subscription server; and determines the target subscription server based on the specified node corresponding to the first subscription server. The target subscription server is the subscription server corresponding to the ancestor node or parent node of the specified node.
In some embodiments, the sorting result is a linear sorting result.
In some embodiments, the storage server sorts the subscription servers based on the subscription timestamps of subscription servers to acquire a linear sort result of the subscription servers. For example, in conjunction with 201 to 203, referring to
As shown in
In some embodiments, the linear sorting result is in order or in reverse order. For example, the linear sorting result determined by the storage server 1 based on the subscription timestamps is: the subscription server 1 <the subscription server 2 <the subscription server 3 <the subscription server 4.
In some other embodiments, in the case that the sorting result is a linear sorting result, based on the sorting result, determining the first subscription server from the set of subscription servers and determining the target subscription server of the first subscription server includes: determining other subscription server except a third subscription server as the first subscription server based on the linear sorting result; and selecting a subscription server from the set of subscription servers as the target subscription server of the first subscription server. The third subscription server is a subscription server corresponding to a greatest subscription timestamp, and the subscription timestamp of the selected subscription server is greater than the subscription timestamp of the first subscription server. That is, the storage server first determines a linear sorting result of the subscription servers based on the subscription timestamps of the subscription servers, then determines the third subscription server corresponding to the greatest subscription timestamp based on the linear sorting result, and then determines the first subscription server based on the third subscription server, the first subscription server being a subscription server other than the third subscription server, and then determines the target subscription server from the set of subscription servers based on the first subscription server, wherein the target subscription server is a subscription server whose subscription timestamp is greater than the subscription timestamp of the first subscription server.
As shown in
In the embodiments of the present disclosure, the first subscription server and the target subscription server of the first subscription server can be determined more quickly, and at the same time, a plurality of target subscription servers can be quickly determined for each first subscription server.
In some embodiments, in the case that a number of the target subscription servers is more than one, upon determining the target subscription server of the first subscription server, the method further includes: determining a priority of each of target subscription servers based on the sorting result. The priority is configured to instruct the first subscription server to subscribe to the target message from the target subscription server based on the priority of the target subscription server. Accordingly, instructing the first subscription server to subscribe to the message from the target subscription server includes: sending the priority of each of the target subscription servers to the first subscription server, and instructing the first subscription server to subscribe to the message from the target subscription server based on the priority of the target subscription server.
In the embodiments of the present disclosure, the first subscription server selects a target subscription server in order according to the priorities of the target subscription servers, and subscribes to messages from the selected target subscription server. The higher the priority of the target subscription server, the smaller the subscription timestamp. For example, in conjunction with 301 to 302, in the linear sorting result, when the first subscription server is the subscription server 1, the storage server 1 determines that the target subscription servers of the subscription server 1 are the subscription server 2, the subscription server 3, and the subscription server 4 based on the linear sorting result. Since in the linear sorting result, according to the subscription timestamps, the subscription server 4>the subscription server 3>the subscription server 2, it is determined that a priority of the subscription server 2 is higher than a priority of the subscription server 3, and the priority of the subscription server 3 is higher than a priority of the subscription server. 4.
In some embodiments, in the binary tree sorting result, the priorities of the target subscription servers are determined based on the node position of each of target subscription servers in the binary tree. For example, when the first subscription server is the subscription server 1, among the target subscription servers of the subscription server 1, a node corresponding to subscription server 4 is a parent node of nodes corresponding to the subscription server 2 and the subscription server 3, thus it is determined that the priorities of the subscription server 2 and the subscription server 3 are higher than the priority of the subscription server 4. In some embodiments, the priorities of the subscription server 2 and the subscription server 3 are the same.
In some embodiments, the priorities of the target subscription servers are determined based on the sorting result. In the case that the priority corresponding to each of the target subscription servers is determined and the number of target subscription servers is more than one, the first subscription server subscribes to a message from the target subscription servers in order according to the priorities. The determination of the priorities can avoid the problem of high communication load and consumption of machine resources caused by one subscription server serving as the target subscription server of a plurality of first subscription servers simultaneously.
For example, referring to
It should be noted that, the sorting result acquired by sorting the subscription servers based on the subscription timestamps of the subscription servers is any sorting result that can determine the first subscription server and the target subscription of the first subscription server. The specific form of the sorting result is not limited in the embodiments of the present disclosure does.
In 303, the storage server instructs the first subscription server to subscribe to a message from the target subscription server.
That is, the storage server sends a notification message to the first subscription server. The notification message instructs the first subscription server to subscribe to a target message from the target subscription server. The target subscription server has subscribed to and stored the target message. The target message is a message that the first subscription server needs to subscribe to.
In some embodiments, instructing the first subscription server to subscribe to the message from the target subscription server includes: sending identification information of the target subscription server to the first subscription server, such that the first subscription server subscribes to the message from the target subscription server based on the identification information. That is, the notification message sent by the storage server to the first subscription server includes identification information of the target subscription server, and the identification information is configured to instruct the first subscription server to subscribe to the target message from the target subscription server.
In the embodiments of the present disclosure, the identification information of the target subscription server is information that uniquely identifies the target subscription server. In some embodiments, the identification information includes an IP address of the target subscription server. In conjunction with 301 to 302, upon determining that the subscription server 1 is the first subscription server and determining that the target subscription servers of the subscription server 1 are the subscription server 2, the subscription server 3, and the subscription server 4, the storage server 1 sends IP addresses 192.168.2.3 (an IP address of the subscription server 2), 192.168.2.4 (an IP address of the subscription server 3), and 192.168.2.5 (an identification of the subscription server 4) to the subscription server 1, to instruct the subscription server 1 to subscribe to 5 messages whose time points are 006 to 010 from the subscription servers corresponding to the IP addresses 192.168.2.3, 192.168.2.4, and 192.168.2.5 next time.
In some embodiments, the identification information further includes a priority identifier of the target subscription server.
In some embodiments, the identification information sent by the storage server to the first subscription server includes a priority identifier of each target subscription server. For example, in conjunction with the 303, upon determining the priority of each target subscription server of the subscription server 1, the storage server determines the priority identifier of each target subscription server. For example, the priority identifier of the subscription server 4 is determined to be 3, the priority identifier of the subscription server 3 is determined to be 2, and the priority identifier of the subscription server 2 is determined to be 1. The determination of the priority identifier is not limited in the embodiments of the present disclosure.
In some embodiments, the specific data included in the identification information and the specific form of the identification information are determined based on requirements, which are not limited in the embodiments of the present disclosure.
In some embodiments, the first subscription server corresponds to a specified node. In the case that the target subscription server corresponds to an ancestor node of the specified node or the target subscription server corresponds to a parent node of the specified node, the notification message is configured to instruct the first subscription server to: subscribe to the target message from the target subscription server corresponding to the parent node; subscribe to the target message from the target subscription server corresponding to the ancestor node in the case that the target message is not subscribed from the target subscription server corresponding to the parent node; and subscribe to the target message from the storage server in the case that the target message is not subscribed from the target subscription server corresponding to the ancestor node.
In 304, the first subscription server receives the notification sent by the storage server.
In some embodiments, the first subscription server receives and parses a notification message sent by the storage server, and determines the target subscription server based on the notification to subscribe to a target message from the target subscription server. The target message is a message that the first subscription server needs to subscribe to; and the target subscription server has subscribed to and stored the target message.
For example, in conjunction with the 303, the subscription server 1 receives the identification information sent by the storage server 1, the identification information including an IP address, and based on the IP address, determines that the target subscription servers are the subscription server 2, the subscription server 3, and the subscription server 4. In some embodiments, in the case that the identification information sent by the storage server 1 further includes the priority identifier corresponding to each target subscription server, the subscription server 1 determines the priority of each target subscription server.
In some embodiments, for the process that the first subscription server receives and parses the notification message sent by the storage server, reference may be made to the process that the server receives and parses information, which is not limited in the embodiments of the present disclosure.
In 305, based on the notification, the first subscription server subscribes to the message from the target subscription server.
In the embodiments of the present disclosure, upon receiving the notification message sent by the storage server and determining the target subscription server, the first subscription server sends a subscription request to the target subscription server to subscribe to and store the target message.
For example, in conjunction with 304, the subscription server 1 determines that the target subscription servers are the subscription server 2, the subscription server 3, and the subscription server 4. Upon determining the respective IP addresses of the subscription server 2, the subscription server 3, and the subscription server 4, the subscription server 1 subscribes to the message from any one of the subscription server 2, the subscription server 3, and the subscription server 4 until 5 messages whose time points are 006 to 010 are subscribed, and stores the subscribed messages.
In some embodiments, in the case that the identification information further includes the priority identifier corresponding to each subscription server, the subscription server 1 will further determine the priority of each target subscription server at the same time. In this case, based on the priorities, the subscription server 1 first subscribes to the message from the subscription server 2; in the case that the message is not subscribed from the subscription server 2, subscribes to the message from the subscription server 3; and in the case that the message is not subscribed from the subscription server 3, subscribes to the message directly from the subscription server 4. For the process that the first subscription server subscribes to a message from a plurality of target subscription servers based on the priority identifiers may be made with reference to the method of using the priority, which is not limited in the embodiments of the present disclosure.
In some embodiments, in the case that the first subscription server has subscribed to the target message from the target subscription server, the first subscription server determines feedback information based on the subscribed target message, and sends the feedback information to the storage server. The feedback information includes the subscription timestamp of the first subscription server. For the process that the first subscription server subscribes to a message from the target subscription server, determines feedback information based on the subscribed message, and sends the feedback information to the storage server, reference may be made to 301, which is not repeated herein.
In some embodiments, in the case that the first subscription server has not subscribed to the target message from the target subscription server, the first subscription server sends a subscription request to the storage server to subscribe to the target message from the storage server. For example, in conjunction with 301 to 305, in the case that the subscription server 1 has not subscribed to the 5 messages whose time points are 006 to 010 from the subscription server 2, the subscription server 3, and the subscription server 4, the subscription server 1 sends a subscription request to the storage server 1 or the storage server 2 (the subscription server 1 is not aware of the specific storage locations whose time points are 006 to 010, such that the subscription server randomly sends a subscription request to the storage server 1 or the storage server 2) to subscribe to the 5 messages whose time points are 006 to 010.
It should be noted that, for each message queue stored in each storage server, the storage server determines a set of subscription servers that subscribe to a message in the message queue, and determines a first subscription server from the set of subscription servers and a target subscription server of the first subscription server. For example, referring to
The processes that the storage server 2 respectively determines the first subscription server and the first subscription object information of the first subscription server in the sets of subscription servers subscribing to the message in the message queue 1 and the message queue 2 are the same as the storage server 1, which are not repeated herein.
The embodiments of the present disclosure realize that a subscription server subscribes to a message in a message queue from other subscription servers, such that the number of subscription servers accessing the storage server simultaneously is reduced, and the number of subscription servers accessing the distributed message queue system simultaneously is reduced, thereby effectively dispersing the subscription pressure of a plurality of subscription servers to the distributed message queue system, greatly reducing the network communication load of the distributed message queue system, and reducing the consumption of machine resources.
In some embodiments, upon subscribing to target messages, the subscription server stores the messages from different storage servers in different storage areas by sharding (the storage areas are, for example, shards). For example, the subscription server 1 corresponds to a shard 1 of the storage server 1 and a shard 2 of the storage server 2. The subscription server 1 stores 5 messages whose time points are 001 to 005 in the message queue 1 subscribed from the storage server 1 in the shard 1, stores 5 whose time points are 006 to 010 in the message queue 1 subscribed from the subscription server 2 in the shard 1, stores 5 messages whose time points are 011 to 015 in the message queue 1 subscribed from the subscription server 3 in the shard 1, and stores 5 messages whose time points are 016 to 020 in the message queue 1 subscribed from the subscription server 4 in the shard 1. With reference to
In the same way, the subscription server 1 stores the messages subscribed from the subscription server 2, the subscription server 3, and the subscription server 4 into the shard 2, where the messages originate from the storage server 2. Alternatively, the subscription server 1 stores the messages directly subscribed from the storage server 2 in the shard 2. The time points of the messages are 031 to 060. In some embodiments, the number of the shards is determined based on the number of the storage servers, and one shard corresponds to one storage server.
In some embodiments, when the subscription server 1 subscribes to messages in the message queue 1 originated from the storage server 1, it subscribes to the messages in the message queue 1 from the shard 1 of the subscription server 2, the shard 1 of the subscription server 3, and the shard 1 of the subscription server 4. The messages in the shard 1 are the messages in the message queue 1 subscribed by the subscription server 2, the subscription server 3, and the subscription server 4 from the storage server 1, or the messages subscribed from a shard 1 of other subscription server.
In some embodiments, when the subscription server 1 subscribes to messages in the message queue 1 originated from the storage server 2, it subscribes the messages in the message queue 1 from the shard 2 of the subscription server 2, the shard 2 of the subscription server 3, and the shard 2 of the subscription server 4. The messages in the shard 2 are messages in the message queue 1 subscribed by the subscription server 2, the subscription server 3, and the subscription server 4 from the storage server 2, or the messages subscribed from a shard 2 of other subscription server. The processes that the subscription server 2, the subscription server 3, and the subscription server 4 subscribe to messages from other subscription servers and save the same are the same as the subscription server 1, which are not repeated herein.
In some embodiments, the subscription server stores messages from different storage servers in corresponding shards by sharding, such that the management of messages by the subscription server is facilitated.
In some embodiments, referring to
In some embodiments, when the production server publishes a message, it randomly selects a storage server corresponding to a shard, and writes the published message to the storage server. In addition, the subscription server subscribing to messages includes two cases: in the case that the messages are allowed to be out of order, the subscription server randomly selects a subscription timestamp corresponding to a shard, and starts subscribing to the messages based on the subscription timestamp; and in the case that the messages are required to be in order, the subscription server selects a shard where the message with the smallest subscription timestamp is located for subscription by using a merge algorithm each time, which is not limited in the embodiments of the present disclosure.
Referring to
The acquiring module 601 is configured to acquire a subscription timestamp of each of the subscription servers, wherein the subscription timestamp is configured to indicate a subscription progress of the subscription server.
The determining module 602 is configured to determine a sorting result of the subscription servers based on the subscription timestamps of the subscription servers; and determine a first subscription server and a target subscription server in the set of subscription servers based on the sorting result, wherein the target subscription server has subscribed to and stored a target message, the target message being a message that the first subscription server needs to subscribe to.
The notifying module 603 is configured to send a notification message to the first subscription server, wherein the notification message is configured to instruct the first subscription server to subscribe to the target message from the target subscription server.
According to the embodiments of the present disclosure, a subscription server subscribes to a message in a message queue from other subscription servers, such that a number of subscription servers accessing a storage server simultaneously is reduced, and a number of subscription servers accessing a distributed message queue system simultaneously is reduced, thereby effectively dispersing the subscription pressure of a plurality of subscription servers to the distributed message queue system, greatly reducing the network communication load of the distributed message queue system, and reducing the consumption of machine resources.
In some embodiments, the acquiring module 601 is configured to: receive a subscription request from the subscription server, and determine the subscription timestamp of the subscription server based on the subscription request; or receive feedback information from the subscription server, and determine the subscription timestamp of the subscription server based on the feedback information; wherein the feedback information is sent by the subscription server to the storage server after the subscription server subscribes to a message from the target subscription server, and the feedback information includes the subscription timestamp.
In some embodiments, the determining module 602 is configured to determine a binary tree sorting result of the subscription servers based on the subscription timestamps of the subscription servers, wherein the target subscription server corresponds to an ancestor node of a specified node, or the target subscription server corresponds to a parent node of a specified node, the specified node being corresponding to the first subscription server.
In some embodiments, the notification message is configured to instruct the first subscription server to subscribe to the target message from the target subscription server corresponding to the parent node; subscribe to the target message from the target subscription server corresponding to the ancestor node in the case that the target message is not subscribed from the target subscription server corresponding to the parent node; and subscribe to the target message from the storage server in the case that the target message is not subscribed from the target subscription server corresponding to the ancestor node.
In some embodiments, the determining module 602 is configured to: determine a second subscription server corresponding to a root node of a binary tree based on the binary tree sorting result; determine the first subscription server based on the second subscription server, wherein the first subscription server is a subscription server other than the second subscription server; and determine the target subscription server based on the specified node corresponding to the first subscription server, wherein the target subscription server is a subscription server corresponding to the ancestor node or parent node of the specified node.
In some embodiments, the determining module 602 is configured to: determine a linear sorting result of the subscription servers based on the subscription timestamps of the subscription servers; determine a third subscription server corresponding to a greatest subscription timestamp based on the linear sorting result; determine the first subscription server based on the third subscription server, wherein the first subscription server is a subscription server other than the third subscription server; and determine the target subscription server from the set of subscription servers based on the first subscription server, wherein the target subscription server is a subscription server with a subscription timestamp being greater than the subscription timestamp of the first subscription server.
In some embodiments, in the case that a plurality of target subscription servers are determined, the determining module 602 is further configured to: determine a priority of each of the target subscription servers based on the sorting result; wherein the priority is configured to instruct the first subscription server to subscribe to the target message from the target subscription server based on the priority of the target subscription server.
In some embodiments, the notification message includes identification information of the target subscription server, and the identification information is configured to instruct the first subscription server to subscribe to the target message from the target subscription server.
Referring to
The receiving module 701 is configured to receive a notification message from the storage server, wherein the notification message is configured to instruct the subscription server to subscribe to a message from a target subscription server. The target subscription server is determined, by the storage server, for the subscription server through determining a sorting result of subscription servers in a set of subscription servers based on subscription timestamps of the subscription servers and determining the subscription server from the subscription servers based on the sorting result; the subscription timestamps is configured to indicate a subscription progress of the subscription servers; and the target subscription server has subscribed to and stored a target message, the target message being a message that the subscription server needs to subscribe to.
The subscribing module 702 is configured to subscribe to the target message from the target subscription server based on the notification message.
The embodiments of the present disclosure realize that a subscription server subscribes to a message in a message queue from other subscription servers, such that a number of subscription servers accessing a storage server simultaneously is reduced, and a number of subscription servers accessing a distributed message queue system simultaneously is reduced, thereby effectively dispersing the subscription pressure of a plurality of subscription servers to the distributed message queue system, greatly reducing the network communication load of the distributed message queue system, and reducing the consumption of machine resources.
In some embodiments, the subscribing module 702 is further configured to: determine feedback information based on the target message, and send the feedback information to the storage server in the case that the target message is subscribed from the target subscription server, wherein the feedback message includes the subscription timestamp of the subscription server; and send a subscription request to the storage server to subscribe to the target message from the storage server in the case that the target message is not subscribed from the target subscription server.
In some embodiments, in the case that a plurality of target subscription servers are determined, the notification message includes identification information and a priority of each of the target subscription servers; and the subscribing module 702 is configured to: subscribe to the target message from the target subscription servers based on the identification information and priority of each of the target subscription servers.
In some embodiments, the sorting result is a binary tree sorting result, and the target subscription server corresponds to an ancestor node of a specified node; or the target subscription server corresponds to a parent node of the specified node; and the specified node corresponds to the subscription server.
In some embodiments, the subscribing module 702 is configured to: subscribe to the target message from the target subscription server corresponding to the parent node; subscribe to the target message from the target subscription server corresponding to the ancestor node in the case that the target message is not subscribed from the target subscription server corresponding to the parent node; and subscribe to the target message from the storage server in the case that the target message is not subscribed from the target subscription server corresponding to the ancestor node.
An embodiment of the present disclosure provides a data distribution system. Referring to
The storage server is configured to acquire a subscription timestamp of each subscription server in a set of subscription servers, wherein the subscription timestamp is configured to indicate a subscription progress of the subscription server.
The storage server is further configured to determine a sorting result of the subscription servers in the set of subscription servers based on the subscription timestamps of the subscription servers; and determine a first subscription server and a target subscription server in the set of subscription servers based on the sorting result, wherein the target subscription server has subscribed to and stored a target message, the target message being a message that the first subscription server needs to subscribe to.
The storage server is further configured to send a notification message to the first subscription server.
The first subscription server is configured to, upon receiving the notification message, subscribe to the target message from the target subscription server based on the notification message.
Referring to
a processor 801; and
a memory 802 configured to store one or more instructions executable by the processor 801.
The processor 801, when executing the one or more instructions stored in the memory 802, is caused to perform the data distribution method as defined in the embodiments shown in
In an exemplary embodiment, a storage medium storing one or more instructions therein is also provided, such as the memory 802 configured to store one or more instructions. The instructions, when executed by the processor 801 of the electronic device 800, cause electronic device 800 to perform the method as defined in the embodiments shown in
Referring to
a processor 901; and
a memory 902 configured to store one or more instructions executable by the processor 901.
The processor 901, when executing the one or more instructions stored in the memory 902, is caused to perform the data distribution method as defined in the embodiments shown in
In an exemplary embodiment, a storage medium storing one or more instructions therein is also provided, such as the memory 902 configured to store one or more instructions. The instructions, when executed by the processor 901 of the electronic device 900, cause the electronic device 900 to perform the method as defined in the embodiments shown in
As for the apparatus embodiments, since they are basically similar to the method embodiments, the descriptions are relatively simple, and for related parts, reference may be made to the parts of the descriptions of the method embodiments.
As used herein, the singular forms “a,” “an,” and “the” include both singular and plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a memory including an instruction” includes one or more memories that include plurality of instructions as well as a single memory consisting of a single memory, reference to “a protocol” includes a combination of protocols as well as a single protocol, and the like.
Number | Date | Country | Kind |
---|---|---|---|
201910985286.6 | Oct 2019 | CN | national |
This application is a continuation application of International Application No. PCT/CN2020/118122, filed on Sep. 27, 2020, which claims priority to the Chinese Patent Application No. 201910985286.6, filed on Oct. 16, 2019, the disclosures of which are herein incorporated by reference in their entireties.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2020/118122 | Sep 2020 | US |
Child | 17668730 | US |