Method and System for Concurrent Message Processing

Information

  • Patent Application
  • 20080201712
  • Publication Number
    20080201712
  • Date Filed
    February 15, 2008
    16 years ago
  • Date Published
    August 21, 2008
    16 years ago
Abstract
A method and system are provided for concurrent message processing. The system includes: an input queue capable of receiving multiple messages in a given order; an intermediary for processing the messages; and an output queue for releasing the messages from the intermediary. Means are provided for retrieving a message from an input queue for processing at the intermediary and starting a transaction under which the message is to be processed. The intermediate logic processes the transactions in parallel and a transaction management means ensures that the messages are released to the output queue in the order of the messages in the input queue.
Description
BACKGROUND

1. Field of the Invention


This invention relates to the field of concurrent message processing. In particular, it relates to concurrent message processing of asynchronous messages at an intermediary.


2. Description of the Related Art


A common requirement of asynchronous message queuing systems is for the order of messages sent to any queue to be maintained. This means that a first message sent before a second message must be received by the consumer in that order.


The enforcement of message ordering has many implications for the processing of messages. One of these relates to the processing of messages by an intermediary.


Intermediaries may process messages in various different ways by performing logic between the senders and the receivers of messages. An example intermediary is a message broker that translates a message from the formal messaging protocol of the sender to the formal messaging protocol of the receiver in a telecommunication network where programs communicate by exchanging formally-defined messages.


An example message broker is IBM's WebSphere Message Broker (IBM and WebSphere are trademarks of International Business Machines Corporation) which allows data and information in the form of messages to flow between disparate applications across multiple hardware and software platforms. Rules can be applied to the data flowing through the message broker to route, store, retrieve, and transform the information.


Another example of an intermediary is a mediation framework such as the Service Integration Bus (Sibus) in IBM WebSphere Application Server messaging.


A known solution to the problem of maintaining message order when being processed by an intermediary is to limit what the intermediary is permitted to do. For example, to remove the intermediary's ability to send additional messages or to delete messages in the flow. In another example, the fields in the message on which the intermediary can perform may be limited. In this case, the simplest solution is to mark the messages with a sequence number and to have the client reorder the messages in the correct sequence on receive. This can be done by the API layer so that it is transparent to the receiving application.


Another solution to the problem is to prevent the intermediary from processing more than a single message at a time. However, this obvious solution results in considerable overheads. If a message ordering requirement does not exist, the intermediary processing entity is allowed to process multiple messages simultaneously, resulting in a much higher throughput of messaging.


US 2003/208489 discloses a system and method of running multiple operations in a resource manager in parallel and sorting out conflicts when they occur. This is done inside a single transaction and ensuring that the parallel operations are executed such that the result is the same as if the operations were serially executed. This involves rerunning operations in the correct order when a conflict occurs.


SUMMARY

An aim of this invention is to provide a mechanism that allows multiple messages to be processed simultaneously at an intermediary, whilst releasing messages in the original message order.


A further aim is to allow logic performed at an intermediary to respect message ordering while running messages concurrently without restricting the functions that can be performed by the intermediary.


According to a first aspect of the present invention there is provided a method for concurrent message processing, comprising: retrieving a first message from an input queue for processing at an intermediary; starting a first transaction under which the first message is to be processed; retrieving a second message from an input queue for processing at an intermediary; starting a second transaction under which the second message is to be processed; processing the first and second transactions at the intermediary in parallel; and operating the transactions to ensure that the messages are released to an output queue from the intermediary in the order of the messages on the input queue.


The method may include retrieving further messages, starting a transaction for each message, and processing the transactions at the intermediary in parallel.


Operating the transactions may include ensuring the commits of the transactions occur in the order of the messages on the input queue. Ensuring the commits of the transactions occur in the order of the messages on the input queue may include ensuring the prepare of the transactions occur in the correct order.


In another embodiment, operating the transactions may include creating the transactions in the order of the messages on the input queue and releasing the message to an output queue in the order in which the transactions were created.


Retrieving a message may include retrieving multiple serially batched messages, and starting a transaction may include starting a transaction in which the serially batched messages are processed.


The transaction for a message may be the transaction required by the intermediary for correct semantic operation.


The transactions may use an abort policy and transactions whose outcome is unknown may be rolled back, and the operation of the transactions may be configured to ensure that the messages are released to the output queue in the order of the messages on the input queue when a roll back occurs.


The transactions may use an abort policy and transactions whose outcome is unknown may be rolled back with an indication being provided of the position in the order of messages of a rolled back message.


The method may include retrieving multiple messages from the input queue simultaneously and starting transactions for each of the messages, or each batch of serially batched messages.


According to a second aspect of the present invention there is provided a system for concurrent message processing, comprising: an input queue capable of receiving multiple messages in a given order; an intermediary for processing the messages; an output queue for releasing the messages from the intermediary; means for retrieving a message from an input queue for processing at the intermediary and starting a transaction under which the message is to be processed; intermediate logic for processing transactions in parallel; and a transaction management means to ensure that the messages are released to the output queue in the order of the messages on the input queue.


According to a third aspect of the present invention there is provided a computer program product stored on a computer readable storage medium, comprising computer readable program code means for performing the steps of: retrieving a first message from an input queue for processing at an intermediary; starting a first transaction under which the first message is to be processed; retrieving a second message from an input queue for processing at an intermediary; starting a second transaction under which the second message is to be processed; processing the first and second transactions at the intermediary in parallel; and operating the transactions to ensure that the messages are released to an output queue from the intermediary in the order of the messages on the input queue.





BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the present invention will now be described, by way of examples only, with reference to the accompanying drawings in which:



FIG. 1 is a block diagram of a messaging system as known in the prior art;



FIG. 2 is a block diagram of a messaging system in accordance with the present invention;



FIG. 3 is a sequence diagram showing flow between components of a first embodiment of a method in accordance with the present invention; and



FIG. 4 is sequence diagram showing flow between components of a second embodiment of a method in accordance with the present invention.





DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT

Referring to FIG. 1, a messaging system 100 includes an intermediary 110 for performing logic between the senders 101 and receivers 102 of messages. For example, an intermediary may be a message broker, a mediation framework, or other form of messaging processing mechanism. Other examples of messaging intermediaries include steps in a workflow and message driven beans.


The intermediary 110 has an input queue 103 to which senders 101 put messages, and an output queue 104 from which receivers 102 get messages. One or more senders 101 may put messages to the input queue 103 and the order that the messages arrive at the input queue 103 must be maintained so that a message receiver or consumer 102 can obtain the messages in the correct order from an output queue 104.


Referring to FIG. 2, a messaging system 200 is shown in which an intermediary 210 retrieves messages from an input queue 203 and delivers them to an output queue 204. The system 200 includes means to ensure that the messages processed by the intermediary 210 are released to the output queue 204 in the same order that the messages were placed on the input queue 203 whilst allowing parallel processing of the messages by the intermediary 210.


The intermediary processor 211 that calls the intermediary logic 212 is modified so that the message processing for a single message is performed in a transaction under a thread 230.


This transaction is the same one, if any, as may be required by the intermediary logic 212 for its correct semantic operation. The intermediary logic 212 may be performing actions that need to be made transactionally (for example, updating a database table) and the contents of the message may need to be contained within a single ACID (Atomicity, Consistency, Isolation, and Durability) transaction in order for the intermediary logic 212 to perform correctly. In order to ensure that the message ordering is performed, the same transaction should be used by the thread 230 and the intermediary logic 212.


The intermediary processor 211 then ensures that the transactions result in output messages in the original order of the input messages. This allows concurrent processing of messages and maintains message ordering, without putting messages at risk of being lost.


A transaction management system 220 includes a transaction manager 221, an abort transaction coordinator 223 to ensure that transactions whose outcome is unknown will be rolled back, and, optionally, a transaction coordinator 222.


The input and output queues 203, 204 are provided by a message queuing system where message ordering has been configured to deliver the messages in the same order if a rollback occurs. Upon rollback of a message, the input queue 203 may ensure that the message is still in the same place in the input order without having the transaction coordinator 222 communicate with it.


As an alternative, an indication may be provided of the position in the sequence of messages of a failed message. The message order must to be available in some form. For example, this may be by only allowing the messages to be accessed in the output queue 204 in the message order, or by providing sequence numbers, or by providing mechanisms to access the next and/or previous messages in the output queue.


The message queuing system knows that it is delivering the message to an intermediary 210 so it will deliver multiple messages at a time.



FIG. 2 is a schematic diagram and the components may be coupled within an integral system or separate entities connected via suitable communications means.


The stage at which the ordering of the messages is set within the intermediary processor 211 of the intermediary 210 may vary depending on the nature of the intermediary 210. In a first described embodiment, the transaction management system 220 is responsible for ensuring the commits occur in the order the messages came into the system. This means that commit time ordering is required, so that the order in which output message is released to the output queue is the same as the order of messages on the input queue. Many intermediaries 210, such as message brokers, make use of commit time ordering.


An alternative embodiment might make use of prepare-time ordering; however existing transaction management systems do not distinguish at an external level between the phases of prepare and commit. An enhanced transaction manager would be able to take advantage of this approach.


In a second described embodiment, a transaction management system 220 provides create-time ordering, in which output messages are released to the output queue in the order in which the transactions were created.


Other patterns of implementation are also possible but require additional steps in order to ensure that they function correctly. For example, it would be possible to implement the transactions in a system that utilizes put-time ordering by providing additional logic to ensure that the puts occurred in the same order as the input messages were read.


In the first embodiment, of commit-time ordering, the system 200 includes a logical transaction management system 220 that includes a transaction coordinator 222 that is able to maintain an order for transactions and ensure that the transactions are committed in a given order. The first embodiment carries out the following method.


1. The intermediary processor 211 discovers that a message is ready for processing.


2. The intermediary processor 211 informs the transaction coordinator 222 of the transaction management system of where the message is in the order.


3. The intermediary processor 211 allocates the processing of the message to a thread 230.

    • 1. The thread 230 calls the transaction coordinator 222 to start a new transaction.
    • 2. The thread 230 gets the message from the input queue 203 using the transaction.
    • 3. The thread 230 calls the intermediary processing logic 212.
    • 4. The thread 230 calls the transaction coordinator 222 to commit the transaction.


4. Repeat.


The transaction coordinator 222 is responsible for ensuring the commits occur in the order the messages come into the system 200.


The method described above is the optimal path where the commit works successfully, but there are some other paths that need to be considered:


1. There are transactions ready to be committed, but not yet committed and the intermediary processing logic 212 fails.

    • This is solved by using a presumed abort transaction coordinator 223. When the system is operational again all transactions that were in effect get rolled back prior to new work being performed. This maintains message ordering.


2. A transaction rolls back instead of committing:

    • This can be solved in one of three ways:
    • 1. When a transaction is rolled back, all future (and queued) transactions are rolled back until a transaction handling the original message failed message is next to be committed.
    • 2. When a transaction is rolled back the message queuing system redelivers the message to the intermediary processor 211 which waits until that transaction is ready and automatically puts it to the head of the transaction commit queue.
    • 3. A combination of 1 and 2, where the intermediary processor 211 waits for a period of time and if the message queuing system has not redelivered the message then all future transactions are rolled back until the right message is available.



FIG. 3 is a representation of the method steps of the first embodiment between the components of the input queue 203, the intermediary processor 211, the transaction coordinator 222, the transaction manager 221, the intermediary processing logic 212, and the output queue 204.


A message is available 301 to the intermediary processor 211 from the input queue 203. The intermediary processor 211 registers 302, 303 the message order with the transaction coordinator 222. The intermediary processor 211 then allocates 304 the message to a thread 230. The thread 230 creates a transaction 305 at the transaction coordinator 222 and the transaction coordinator 222 begins the transaction 306, 307 at the transaction manager 221 and confirmation is returned 308 to the thread 230.


The thread 230 gets the message 309 under the transaction and the message is sent 310 to the thread 230. The message is then processed 311 by the intermediary processing logic 212 and put 312 under the transaction to the output queue 204. A response 313 from the output queue 204 is returned to the intermediary processing logic 212 and a response 314 is returned to the thread 230.


The thread 230 then commits 315 the transaction at the transaction coordinator 222. The transaction coordinator 222 ensures that the actual commit 316 by the transaction manager 221 occurs in the same order as the original register call, and that only one commit occurs at a time. A response 317 is sent to the transaction coordinator 222 and back 318 to the thread 230.


The transaction manager 221 will have phases such as end, prepare, and commit. These are all performed under this high level commit operation. The transaction coordinator 222 is responsible for ensuring the transaction manager 221 is only asked to perform the high level operation one at a time. With changes to the transaction manager 221 is would be possible to make use of ordering at the lower level prepare or commit operations. This would make the process faster, but would require changes to the transaction manager 221.


In the second embodiment, the transaction management system 220 provides create-time ordering, in which output messages are released to the output queue in the order in which the transactions were created. In this embodiment, there is no transaction coordinator 222.


Under the approach of the second embodiment, the step of the method of the first embodiment of informing the transaction management system of where the message is in the order is replaced by the step of calling the transaction management system to start a new transaction. The message and the new transaction are then both passed to a new thread for parallel processing, and subsequent committing of the transaction.



FIG. 4 is a representation of the method steps of the second embodiment between the components of the input queue 203, the intermediary processor 211, the transaction manager 221, the intermediary processing logic 212, and the output queue 204.


A message is available 401 to the intermediary processor 211 from the input queue 203. The intermediary processor 211 creates a transaction 402 at the transaction manager 221 with implicit order and a response 403 is returned. The intermediary processor 211 then allocates 404 the message and the transaction 402 to a thread 230.


The thread 230 gets the message 409 under the transaction and the message is sent 410 to the thread 230. The message is then processed 411 by the intermediary processing logic 212 and put 312 under the transaction to the output queue 204. A response 413 from the output queue 204 is returned to the intermediary processing logic 212 and a response 414 is returned to the thread 230.


The thread 230 then commits 415 the transaction at the transaction manager 221. A response 418 is sent back 318 to the thread 230. The transaction manager 221 ensures that the actual commit occurs in the same order as the order the transactions were created in. Ordering of commits is not required when using create time ordering. However, if may be used additionally if desired.


The above description provides for allocation of a single message to a given transaction, and aims to ensure that the ordering in which the messages are placed on the output queue is the same as that on the input queue. There is an overhead cost associated with each transaction that is created/committed and so it is sometimes desirable to spread this overhead across a set of messages rather than to incur it for each message. This spreading across a set of messages in a pattern is often described as ‘batching’.


As an example, messages 1,2,3 might be allocated to batchOne, which is then assigned to a thread for processing, with all three messages being processed under a single transaction. Once the first batch has been allocated to a thread, messages 4,5,6 are then allocated to batchTwo which is assigned to a second thread, and uses a second transaction to process the messages. As with all aspects of the described method, the key aim is to ensure that the messages appear on the output queue in the same order as they were on the input queue (i.e. 1,2,3,4,5,6).


There are various methods of how messages are allocated to batches; however under the approach described above there is the same problem as when there is one message in each transaction, and thus batching of messages is automatically supported by the described method.


The invention can take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment containing both hardware and software elements. In a preferred embodiment, the invention is implemented in software, which includes but is not limited to firmware, resident software, microcode, etc.


The invention can take the form of a computer program product accessible from a computer-usable or computer-readable medium providing program code for use by or in connection with a computer or any instruction execution system. For the purposes of this description, a computer usable or computer readable medium can be any apparatus that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus or device.


The medium can be an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system (or apparatus or device) or a propagation medium. Examples of a computer-readable medium include a semiconductor or solid state memory, magnetic tape, a removable computer diskette, a random access memory (RAM), a read only memory (ROM), a rigid magnetic disk and an optical disk. Current examples of optical disks include compact disk read only memory (CD-ROM), compact disk read/write (CD-R/W), and DVD.


Improvements and modifications can be made to the foregoing without departing from the scope of the present invention.

Claims
  • 1. A method in an information handling system for concurrent message processing, comprising: retrieving a first message from an input queue for processing at an intermediary;starting a first transaction under which the first message is to be processed;retrieving a second message from the input queue for processing at the intermediary;starting a second transaction under which the second message is to be processed;processing the first and second transactions at the intermediary in parallel; andoperating the first and second transactions to ensure that the first and second messages are released to an output queue from the intermediary in an order of the first and second messages in the input queue.
  • 2. A method as claimed in claim 1, including retrieving further messages, starting a transaction for each further message to form transactions, and processing the transactions at the intermediary in parallel.
  • 3. A method as claimed in claim 1, wherein operating processing the first and second transactions includes ensuring an order of commits of the first and second transactions occur in the order of the first and second message in the input queue.
  • 4. A method as claimed in claim 3, wherein ensuring the order of the commits of the first and second transactions occur in the order of the first and second messages in the input queue includes ensuring the the first and second transactions occur in the order of the first and second messages in the input queue.
  • 5. A method as claimed in claim 1, wherein processing the first and second transactions includes creating the first and second transactions in the order of the first and second messages in the input queue and releasing the first and second messages to an output queue in an order in which the first and second transactions were created.
  • 6. A method as claimed in claim 1, wherein retrieving the first message includes retrieving multiple serially batched messages, and starting the first transaction includes starting a transaction in which the serially batched messages are processed.
  • 7. A method as claimed in claim 1, wherein the first transaction is a required transaction required by the intermediary for correct semantic operation.
  • 8. A method as claimed in claim 1, wherein the transactions use an abort policy and unknown transactions whose outcome is unknown are rolled back, and the processing of the first and second transactions is configured to ensure that the first and second messages are released to the output queue in the order of the first and second messages in the input queue when a roll back occurs.
  • 9. A method as claimed in claim 1, wherein the transactions use an abort policy and unknown transactions whose outcome is unknown are rolled back, and an indication is provided of a position in an order of messages of a rolled back message.
  • 10. A method as claimed in claim 1, including concurrently retrieving multiple messages from the input queue and starting transactions for each of the multiple messages, or each batch of serially batched messages.
  • 11. A system for concurrent message processing, comprising: an input queue capable of receiving multiple messages in a given order;an intermediary for processing the messages;an output queue for releasing the messages from the intermediary;means for retrieving a message from the input queues, for processing at the intermediary and for starting a transaction under which the message is to be processed;intermediate logic for processing transactions in parallel; anda transaction management means to ensure that the messages are released to the output queue in an order of the messages in the input queue.
  • 12. A system as claimed in claim 11, wherein the transaction management means includes a transaction coordinator to ensure commits of the transactions occur in the order of the messages in the input queue.
  • 13. A system as claimed in claim 12, wherein the transaction coordinator includes means to ensure the transactions occur in the order of the messages in the input queue.
  • 14. A system as claimed in claim 11, wherein the transaction management means includes means for creating the transactions in the order of the messages in the input queue and releasing the message to the output queue in an order in which the transactions were created.
  • 15. A system as claimed in claim 11, wherein the means for retrieving a message includes means for retrieving multiple serially batched messages, and starting the transaction in which the serially batched messages are processed.
  • 16. A system as claimed in claim 11, wherein the transaction for the message is a required transaction required by the intermediary for correct semantic operation.
  • 17. A system as claimed in claim 11, wherein the transaction management means includes a transaction abort means and unknown transactions whose outcome is unknown are rolled back.
  • 18. A system as claimed in claim 11, wherein the means for retrieving a message from the input queue can concurrently retrieve multiple messages from the input queue and start transactions for each of the multiple messages, or each batch of serially batched messages.
  • 19. A computer program product stored on a computer readable storage medium, comprising computer readable program code means that are operable by an information handling system for performing the steps of: retrieving a first message from an input queue for processing at an intermediary;starting a first transaction under which the first message is to be processed;retrieving a second message from the input queue for processing at the intermediary;starting a second transaction under which the second message is to be processed;processing the first and second transactions at the intermediary in parallel; andoperating the transactions to ensure that the messages are released to an output queue from the intermediary in an order of the messages in the input queue.
Priority Claims (1)
Number Date Country Kind
07102721.3 Feb 2007 GB national