A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.
The present invention is generally related to computer systems and software such as middleware, and is particularly related to supporting a transactional middleware machine environment.
A transactional middleware system, or transaction oriented middleware, includes enterprise application servers that can process various transactions within an organization. With the developments in new technologies such as high performance network and multiprocessor computers, there is a need to further improve the performance of transactional middleware. These are the generally areas that embodiments of the invention are intended to address.
Described herein is a system and method for managing message queues in a middleware machine environment. A server machine in the middleware machine environment comprises a shared memory on a message receiver and a daemon process. Here, the shared memory maintains one or more message queues in the middleware machine environment. The daemon process is capable of creating at least one message queue in the shared memory, when a client requests that the at least one message queue be set up to support sending and receiving messages.
Described herein is a system and method for supporting a transactional middleware system that can take advantage of fast machines with multiple processors, and a high performance network connection in a transactional middleware machine environment. The system can provide message queues for multimode applications using a data structure based on a ring buffer (a circular queue). The system includes a remote ring structure with a first ring structure on a reader and a second ring structure on a writer, wherein each of the first ring structure and the second ring structure has a head pointer and a tail pointer. When the writer operates to write a message to the remote ring, the writer can update the head pointers for both the first ring structure and the second ring structure, and the data in the remote ring structure. When the reader operates to read a message from the remote ring, the reader can update the tail pointers for both the first ring structure and the second ring structure. Additionally, the message can be stored in a heap data structure, while the metadata associated with the message can be stored in the remote ring structure.
In accordance with an embodiment of the invention, the system comprises a combination of high performance hardware, e.g. 64-bit processor technology, high performance large memory, and redundant InfiniBand and Ethernet networking, together with an application server or middleware environment, such as WebLogic Suite, to provide a complete Java EE application server complex which includes a massively parallel in-memory grid, that can be provisioned quickly, and can scale on demand. In accordance with an embodiment, the system can be deployed as a full, half, or quarter rack, or other configuration, that provides an application server grid, storage area network, and InfiniBand (IB) network. The middleware machine software can provide application server, middleware and other functionality such as, for example, WebLogic Server, JRockit or Hotspot JVM, Oracle Linux or Solaris, and Oracle VM. The system can include a plurality of compute nodes, IB switch gateway, and storage nodes or units, communicating with one another via an IB network. When implemented as a rack configuration, unused portions of the rack can be left empty or occupied by fillers.
In accordance with an embodiment of the invention, referred to herein as “Sun Oracle Exalogic” or “Exalogic”, the system is an easy-to-deploy solution for hosting middleware or application server software, such as the Oracle Middleware SW suite, or Weblogic. As described herein, the system is a “grid in a box” that comprises one or more servers, storage units, an IB fabric for storage networking, and all the other components required to host a middleware application. Significant performance can be delivered for all types of middleware applications by leveraging a massively parallel grid architecture using, e.g. Real Application Clusters and Exalogic Open storage. The system delivers improved performance with linear I/O scalability, is simple to use and manage, and delivers mission-critical availability and reliability.
In accordance with an embodiment of the invention, Tuxedo is a set of software modules that enables the construction, execution, and administration of high performance, distributed business applications and has been used as transactional middleware by a number of multi-tier application development tools. Tuxedo is a middleware platform that can be used to manage distributed transaction processing in distributed computing environments. It is a proven platform for unlocking enterprise legacy applications and extending them to a services oriented architecture, while delivering unlimited scalability and standards-based interoperability.
Message Queues for Multinode Applications
In accordance with an embodiment of the invention, messaging software, such as messaging queues, can take advantage of a high performance network, such as an IB network using a remote direct memory access (RDMA) protocol. The RDMA protocol allows a message sender to bypass OS kernels and directly access to the memory without a need to wake up a process on the remote machine.
In accordance with an embodiment of the invention, a message queue can contain only the metadata information that is associated with the message, while the heap data structure contains the physical message. Thus, messages with variable size can be easily accommodated and be stored in the shared memory. As shown in
Also as shown in
In accordance with an embodiment of the invention, a message sender on a local Machine A 101 can directly write a message into heap data structure 110 in a shared memory 106 on the remote Machine B 102. As shown in
Furthermore, the message sender 103 on the local Machine A 101 can send a message to a message receiver 104 regardless the size of the message. Hence, this messaging mechanism can be cost effective, efficient, and requires less overhead for large volume of data.
Additionally, the message sender 103 can wake up a process 112 on the remote Machine B 102 that is associated with the message receiver 104, according to a pre-configured procedure. For example, the message sender can wake up the process when a service request message that can be handled by the process has delivered. In another example, the message sender can wake up a daemon process on the remote Machine B 102 when the queue is full.
In accordance with an embodiment of the invention, before a process on the message receiver, e.g. process 112, goes to sleep, the process can notice the message control structure 105 on the client side, and/or provide a procedure to the message control structure 105 on how to wake itself up. Then, the process on the receiver side can wait for the delivery of the message. For example, a process that is expecting the message can be in a sleeping status until it is wakened up by the message sender.
Also as shown in
In accordance with an embodiment of the invention, in order to support two-way communications between two servers in a middleware machine environment, each server on the middleware machine can be provided with a receiver and a sender. Thus, the communication between these two servers can be performed by different message senders at the different machines, using RDMA protocol such as RDMA write operations.
Message Queue Creation and Management
In accordance with an embodiment of the invention, a daemon process on a server node in the middleware machine environment can be used to create and manage the message queue in the shared memory.
In accordance with an embodiment of the invention, the server node 301 can include a daemon process 306 that is responsible for creating the one or more message queues in the shared memory on the server, when the various clients request the server to set up the message queues for sending and receiving messages. For example, when Client B 303 initiates a connection with the server 301, the daemon process 306 on the server can dynamically create a Queue B 312 for communicating with Client B 303 via a message control structure 322.
In accordance with an embodiment of the invention, this communication scheme between the server and multiple clients can be further extended using proxies. For example, the queue/control structure A 321 on Client A 302 can be extended using one or more proxies, e.g. Proxies I-III 323-325. Using these proxies, the processes associated with the different proxies on Client A can use the queue/control structure A to communicate with the server.
Thus, a great scalability can be achieved in the middleware machine for supporting communication between different servers and clients using the RDMA protocol, since a message initiated from a process on Client A 302 can be sent to the server 301 by allowing the process to write the message directly into the heap data structure 305 on the server 301, without server intervention.
In accordance with an embodiment of the invention, the daemon process 306 on the server 301 can also create and reserve a local message queue, e.g. Queue C 308, for local messaging purpose. In one example, the local server processes can communicate with each other using the local message queue, and the System V IPC protocol can be used instead of the RDMA protocol since the IPC protocol is faster than the RDMA protocol when it is used locally.
As shown in
In accordance with an embodiment of the invention, a client can determine whether a queue or a control structure on the client can be created in a shared memory or private memory. If the client chooses to create the queue or the control structure in a private memory of the client machine that is associated with a particular process, then the system can prevent other processes on the client machine and remote machines to access the control structure on the client. This can be beneficial since some messages can contain sensitive information such as custom financial information.
In accordance with an embodiment of the invention, an interruption can occur on a server process or even the daemon process in a server. The client can continue performing RDMA write operations in the shared memory on the server machine without a need of waiting for the recovery of the server process or the daemon process. This makes the disaster recovery for the system robust and straight-forward. Additionally, the clients can stop writing into the shared memory on the server machine when the queue is full.
Security Model for Protecting a Message Queue
In accordance with an embodiment of the invention, a security model can be used to protect the message queue in the middleware machine environment.
In accordance with an embodiment of the invention, the daemon process 510 can further register the key or the security token 514 with the IB network, and send the security token 514 to the message sender 510 on the client node via a secured network 520. As shown in
In accordance with an embodiment of the invention, after the message sender 501 receives the security token 514, the message sender 501 can access the shared memory 504 in the receiver machine directly. As shown in
The present invention may be conveniently implemented using one or more conventional general purpose or specialized digital computer, computing device, machine, or microprocessor, including one or more processors, memory and/or computer readable storage media programmed according to the teachings of the present disclosure. Appropriate software coding can readily be prepared by skilled programmers based on the teachings of the present disclosure, as will be apparent to those skilled in the software art.
In some embodiments, the present invention includes a computer program product which is a storage medium or computer readable medium (media) having instructions stored thereon/in which can be used to program a computer to perform any of the processes of the present invention. The storage medium can include, but is not limited to, any type of disk including floppy disks, optical discs, DVD, CD-ROMs, microdrive, and magneto-optical disks, ROMs, RAMs, EPROMs, EEPROMs, DRAMs, VRAMs, flash memory devices, magnetic or optical cards, nanosystems (including molecular memory ICs), or any type of media or device suitable for storing instructions and/or data.
The foregoing description of the present invention has been provided for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many modifications and variations will be apparent to the practitioner skilled in the art. The embodiments were chosen and described in order to best explain the principles of the invention and its practical application, thereby enabling others skilled in the art to understand the invention for various embodiments and with various modifications that are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the following claims and their equivalence.
This application claims the benefit of priority on U.S. Provisional Patent Application No. 61/542,119, entitled “SYSTEM AND METHOD FOR PROVIDING MESSAGE QUEUES FOR MULTINODE APPLICATIONS IN A TRANSACTIONAL MIDDLEWARE MACHINE ENVIRONMENT” filed Sep. 30, 2011, which application is herein incorporated by reference. The current application hereby incorporates by reference the material in the following patent applications: U.S. patent application Ser. No. 13/572,491, entitled “SYSTEM AND METHOD FOR PROVIDING MESSAGE QUEUES FOR MULTINODE APPLICATIONS IN A MIDDLEWARE MACHINE ENVIRONMENT,” filed Aug. 10, 2012.
Number | Name | Date | Kind |
---|---|---|---|
5499349 | Nikhil | Mar 1996 | A |
5784615 | Lipe | Jul 1998 | A |
6557056 | Lanteigne et al. | Apr 2003 | B1 |
6766358 | Chesson et al. | Jul 2004 | B1 |
6871265 | Oren | Mar 2005 | B1 |
7953903 | Finkelstein | May 2011 | B1 |
20020062356 | Clarke et al. | May 2002 | A1 |
20020129085 | Kubala | Sep 2002 | A1 |
20020129172 | Baskey | Sep 2002 | A1 |
20050131883 | Buxton et al. | Jun 2005 | A1 |
20050251856 | Araujo | Nov 2005 | A1 |
20060056413 | Ikeda et al. | Mar 2006 | A1 |
20060149840 | Thompson | Jul 2006 | A1 |
20060288228 | Botz | Dec 2006 | A1 |
20070078995 | Benard et al. | Apr 2007 | A1 |
20070101341 | Downing et al. | May 2007 | A1 |
20070156808 | Koegel et al. | Jul 2007 | A1 |
20070162457 | Barcia et al. | Jul 2007 | A1 |
20070299973 | Borgendale et al. | Dec 2007 | A1 |
20080069098 | Shah et al. | Mar 2008 | A1 |
20080126564 | Wilkinson | May 2008 | A1 |
20080244613 | Parthasarathy et al. | Oct 2008 | A1 |
20090119676 | Supalov et al. | May 2009 | A1 |
20100138534 | Mutnuru | Jun 2010 | A1 |
20100153546 | Clubb | Jun 2010 | A1 |
20100250684 | Naish | Sep 2010 | A1 |
20110107406 | Frost | May 2011 | A1 |
20110154443 | Thakur | Jun 2011 | A1 |
20110219208 | Asaad | Sep 2011 | A1 |
20110302569 | Kunze | Dec 2011 | A1 |
20120023296 | Yan | Jan 2012 | A1 |
20120066616 | Woods et al. | Mar 2012 | A1 |
20120110572 | Kodi et al. | May 2012 | A1 |
20120174124 | Ward et al. | Jul 2012 | A1 |
20120179837 | Bender et al. | Jul 2012 | A1 |
20120216216 | Lopez Taboada | Aug 2012 | A1 |
20120221774 | Atkisson et al. | Aug 2012 | A1 |
20120254480 | Tamir | Oct 2012 | A1 |
20120254889 | Demkowicz et al. | Oct 2012 | A1 |
20120331153 | Aho | Dec 2012 | A1 |
20130007377 | Winn et al. | Jan 2013 | A1 |
20130007412 | Krishnamurthy | Jan 2013 | A1 |
20130275631 | Magro | Oct 2013 | A1 |
Number | Date | Country |
---|---|---|
1909485 | Feb 2007 | CN |
101719960 | Jun 2010 | CN |
H0721038 | Jan 1995 | JP |
2005284840 | Oct 2005 | JP |
2010165022 | Jul 2010 | JP |
2013513860 | Apr 2013 | JP |
2011071624 | Jun 2011 | WO |
Entry |
---|
European Patent Office, International Searching Authority, International Search Report and Written Opinion dated Feb. 26, 2013 for International Application No. PCT/US2012/057634, 8 pages. |
Search Report issued by Chinese Patent Office for Chinese patent application No. 201280047474.0, mailed May 30, 2016 (2 pages). |
Wang et al., “Design and Implementation of a Message Queuing Middleware”, Computer Engineering, vol. 31, No. 21, Nov. 2005, pp. 81-83 (3 pages). |
Office Action issued by United States Patent and Trademark Office on U.S. Appl. No. 13/572,491, mailed Jul. 18, 2016 (15 pages). |
Office Action issued by Japanese Patent Office on Japanese patent application No. 2014-533333, mailed Sep. 27, 2016 (7 pages). |
Number | Date | Country | |
---|---|---|---|
20130086199 A1 | Apr 2013 | US |
Number | Date | Country | |
---|---|---|---|
61542119 | Sep 2011 | US |