This application is related to co-pending U.S. patent application Ser. No. 10/993,936, titled “SYNCHRONOUS MODE BROTHER'S KEEPER BUS GUARDIAN FOR A TDMA BASED NETWORK,” filed on Nov. 19, 2004, which is hereby incorporated by reference in its entirety and referred to herein as the “'936 application.
This application is related to co-pending U.S. patent application Ser. No. 10/993,931 filed Nov. 19, 2004 entitled “UNSYNCHRONOUS MODE BROTHER'S KEEPER BUS GUARDIAN FOR A RING NETWORKS”, which is hereby incorporated by reference in its entirety and referred to herein as the “'931 application.
This application is related to co-pending U.S. patent application Ser. No. 11/010,249, filed Dec. 10, 2004, entitled “SELF-CHECKING PAIR ON A BRAIDED RING NETWORK”, hereby incorporated herein by reference, and referred to herein as the “'249 application”.
Distributed, fault-tolerant communication systems are used, for example, in applications where a failure could possibly result in injury or death to one or more persons. Such applications are referred to here as “safety-critical applications.” One example of a safety-critical application is in a system that is used to monitor and manage sensors and actuators included in an airplane or other aerospace vehicle
Often safety critical applications are implemented using a time-triggered, table driven communications architecture, for example, SAFEbus, FlexRay or Time-Triggered Protocol (TTP). To provide tolerance to physical faults, robust communications topologies that allow for point-to-point fault isolation (such as star, mesh, ring, and braided ring configurations) are often deployed. However, such topologies may introduce undesirable overhead for in the form of additional components (for example, in star configurations) or extra wiring complexity (for example, in mesh or braided ring configurations).
In one embodiment, a network comprises a plurality of nodes that are communicatively coupled to one another using bi-directional, half-duplex links. The network has a logical first channel over which data is propagated along the network in a first direction and a logical second channel over which data is propagated along the network in a second direction. For a given period of time, at least one of the plurality of nodes (for example, a single transmitting node or two neighboring nodes acting as a self-checking pair) is scheduled to be a transmitting node that transmits data on both the first channel and the second channel. A first subset of the nodes not scheduled to transmit during the period are scheduled to relay data received from the first channel along the first channel. A second subset of the nodes not scheduled to transmit during the period are scheduled to relay data received from the second channel along the second channel. At least one of the nodes not scheduled to transmit during the period does not relay any data on at least one of that node's outbound links for at least one of the first channel and the second channel.
In one implementation of such an embodiment, if any of the data transmitted by the transmitting node during the period is not validly received from the first channel by any of the first subset of nodes, the data that was not validly received from the first channel by any of the first subset of nodes is forwarded to those nodes along the second channel for receipt thereby.
Another embodiment is a method of propagating data in a network. The network comprises a plurality of nodes that are communicatively coupled to one another using bi-directional, half-duplex links. The network has a logical first channel over which data is propagated along the network in a first direction and a logical second channel over which data is propagated along the network in a second direction. The method comprises, for a given period of time, transmitting data on the first channel and on the second channel from at least one of the plurality of nodes that is scheduled to be a transmitting node. The method further comprises, for the given period of time, relaying along the first channel data validly received from the first channel by a first subset of the nodes that are not scheduled to transmit during the period and relaying along the second channel data validly received from the second channel by a second subset of the nodes that are not scheduled to transmit during the period. At least one of the nodes not scheduled to transmit during the period does not relay any data on at least one of that node's outbound links to at least one of the first channel and the second channel.
In one implementation of such an embodiment, the method further comprises, if any of the data transmitted by the transmitting node during the period is not validly received from the first channel by any of the first subset of nodes, forwarding along the second channel the data transmitted by the transmitting node during the period that was not validly received from first channel by any of the first subset of nodes for receipt thereby and, at those nodes in the first subset of nodes that did not validly receive from the first channel all the data transmitted by the transmitting node during the period, combining data was validly received from the first channel at those nodes with data that was validly received from the second channel at those nodes in order to reassemble all the data transmitted by the transmitting node during the period.
In another embodiment, a program product comprises program instructions embodied on a processor-readable medium for execution by a programmable processor included in a node that is used in a network. The network comprises a plurality of nodes that are communicatively coupled to one another using bidirectional, half-duplex links. The network has a logical first channel over which data is propagated along the network in a first direction and a logical second channel over which data is propagated along the network in a second direction. The program instructions are operable to cause the programmable processor to, when the node is scheduled to be a transmitting node for a given period of time, transmit data on the first channel and on the second channel. The program instructions are further operable to cause the programmable processor to, while the node is not scheduled to be the transmitting node for the period, relay, along the first channel, data validly received from the first channel when the node is scheduled to do so; and not relay any data on at least one of the node's outbound links for the first channel when scheduled to do so.
In one implementation of such an embodiment, the program instructions are further operable to cause the programmable processor to do the following when scheduled to do so while the node is not scheduled to be the transmitting node for the period: receive data from both the first channel and the second channel, and, if the nadir node determines that any of the first subset of nodes did not validly receive from the first channel all of the data transmitted by the transmitting node during the period and the nadir node received from the second channel the data that was not validly received from the first channel by any of the first subset of nodes, forward that data along the second channel.
The details of various embodiments of the claimed invention are set forth in the accompanying drawings and the description below. Other features and advantages will become apparent from the description, the drawings, and the claims.
In the following description, various embodiments of the present invention may be described in terms of various computer architecture elements and processing steps. It should be appreciated that such elements may be realized by any number of hardware or structural components configured to perform specified operations. Further, it should be noted that although various components may be coupled or connected to other components within exemplary system architectures, such connections and couplings can be realized by direct connection between components, or by connection through other components and devices located therebetween. The following detailed description is, therefore, not to be taken in a limiting sense.
Instructions for carrying out the various process tasks, calculations, and generation of signals and other data used in the operation of the systems and methods of the invention can be implemented in software, firmware, or other computer readable instructions. These instructions are typically stored on any appropriate computer readable medium used for storage of computer readable instructions or data structures. Such computer readable media can be any available media that can be accessed by a general purpose or special purpose computer or processor, or any programmable logic device.
Suitable computer readable media may comprise, for example, non-volatile memory devices including semiconductor memory devices such as EPROM, EEPROM, or flash memory devices; magnetic disks such as internal hard disks or removable disks (e.g., floppy disks); magneto-optical disks; CDs, DVDs, or other optical storage disks; nonvolatile ROM, RAM, and other like media. Any of the foregoing may be supplemented by, or incorporated in, specially-designed application-specific integrated circuits (ASICs). When information is transferred or provided over a network or another communications connection (either hardwired, wireless, or a combination of hardwired or wireless) to a computer, the computer properly views the connection as a computer readable medium. Thus, any such connection is properly termed a computer readable medium. Combinations of the above are also included within the scope of computer readable media.
Embodiments of the present invention increase network dependability through the reduction of connectors by implementing half-duplex communication links between the nodes of a braided-ring network instead of separate communication links between nodes for each direction. Fault tolerant communication of messages simultaneously traveling clockwise and counter-clockwise around the half-duplex braided-ring network is achieved by embodiments of the present invention through the implementation of “nadir nodes” described in detail below. Further, embodiments of the present invention provide a high integrity data propagation mechanism that enables simultaneous data propagation on half-duplex links, in a manner that facilitates the immediate detection of a propagation fault and additional mechanisms to re-configure half-duplex communication paths to the supplement the erroneous data flow.
In the particular embodiment shown in
The eight nodes 102 shown in
In addition, as used herein, a “neighbor's neighbor node” (or just “neighbor's neighbor”) for a given node 102 is the neighbor node 102 of the neighbor node 102 of the given node 102. Each node 102 has two neighbor's neighbor nodes 102, one in the clockwise direction (also referred to here as the “clockwise neighbor's neighbor node” or “clockwise neighbor's neighbor”) and one in the counter-clockwise direction (also referred to here as the “counter-clockwise neighbor's neighbor node” or “counter-clockwise neighbor's neighbor”). For example, the two neighbor's neighbor nodes for node A are node C in the clockwise direction and node G in the counter-clockwise direction.
As shown in
Direct links 108 connect a given node 102 to that node's respective clockwise and counter-clockwise neighbor nodes. The links 109 that connect a given node 102 to that node's respective clockwise and counter-clockwise neighbor's neighbors are referred to here as “skip” links 109.
In the embodiment shown in
In one implementation of such an embodiment, each relay node also acts as a “brother's keeper guardian” for its inbound neighbor node on the relevant channel as is described in the '936 application and the '931 application. When a node is the transmitting node's outbound neighbor for the channel, the node verifies that any data received from the transmitting node complies with one or more policies (for example, content-based and/or temporal policies). If the data complies with all relevant policies, the node relays the data as described above. If not, the node either does not relay the data or appends to (or otherwise includes in) the transmitted data an indication that the data did not comply with the one or more policies. Different guardian actions can be taken for violating different policies. Also, if a faulty node that is not scheduled to transmit at a given point in time attempts to transmit on a particular channel, that node's outbound neighbor will detect that the data received on its inbound direct link 108 was transmitted by the faulty node (for example, because the same data will not have been received on the neighbor's inbound skip link 109) and, the neighbor, for example, will not relay any of the data transmitted by the faulty node.
In other embodiments, one or more pairs of nodes are scheduled to act as “self-checking pairs” in a similar manner as is described in the '249 application. In such an embodiment, each such self-checking pair operates in a replica-deterministic fashion at the application layer such that the output of both members of the self-checking pair is bit-for-bit identical. As result, the relay and/or guardian processing described above is modified in such an embodiment to the extent that each of the outbound neighbors of the self-checking pair will (in the absence of any faults) receive the same data transmitted by the self-checking pair on both its inbound direct link 108 and its inbound skip link 109. That is, for each such outbound neighbor of the self-checking pair, the neighbor's inbound direct link 108 couples that neighbor to one member of the self-checking pair and the neighbor's inbound skip link 109 is coupled to the other member of the self-checking pair. As described below, the neighbor of a self-checking pair propagates the data it receives from its inbound direct link 108 if corresponding matching data is received from that node's inbound skip link 109.
As shown in
Because each direct link 108 and skip link 109 operates in a half-duplex mode, each link is capable of propagating data in only one direction at any one time. For example, while node A is sending data along channel 0 in the clockwise direction to node B via direct link 108, node B cannot simultaneously send data to node A along channel 1 in the counter-clockwise direction via the same direct link 108. Similarly, while node A is sending data along channel 0 in the clockwise direction to node C via skip link 109, node C cannot simultaneously send data to node A along channel 1 in the counter-clockwise direction via the same skip link 109.
In the embodiments described here, propagation of data through the first and second channel is controlled in accordance with a network communication schedule. In one embodiment, the network communication schedule is stored in the form of a table such as network communication schedule 200 shown in
In the absence of the present invention, the clockwise propagation of data along channel 0 and the counter-clockwise propagation data along channel 1 would eventually meet at a point on network 100 opposite from the transmitting node 102 (hereinafter referred to as the “nadir point”) and collide as two nodes 102 on either side of the nadir point simultaneously attempt to send data from opposite directions across a unidirectional direct link 108 and/or a unidirectional skip link 109. During normal operation of network 100 (that is, in the absence of a fault), embodiments of the present invention designate one or more specific nodes for each timeslot to terminate the continued propagation of data to additional downstream nodes along a channel. These nodes are referred to in this specification as the “nadir nodes”. “Nadir action” (that is, data propagation termination) is configured on a link by link basis. Based on the network communication schedule 200, some links propagate data while others do not, in a manner that resolves collisions. The idea of the nadir node is that by default they resolve the data path contention hence they are then able to sense both traffic flows in order to monitor message transmissions from both sides (discussed in greater detail below).
For example, in one implementation of network 100 using network communication schedule 200, during time slot 1 when node A is scheduled as the transmitting node, nodes E and F are designated as its associated nadir nodes to perform a respective nadir action. During this time slot, node E, acting as a nadir node for node A, expects to receive clockwise propagating data originating from node A on channel 0 and counter-clockwise propagating data originating from node A on channel 1. Node E, acting as a node A nadir node, performs a nadir action by terminating continuing the propagation of channel 0 data messages to nodes F, G and H. Similarly, node F expects to receive clockwise propagating data originating from node A on channel 0 and counter-clockwise propagating data originating from node A on channel 1. Node F, acting as a node A nadir node performs a nadir action by terminating continuing the propagation of channel 1 data to nodes E, D, C and B. In one embodiment, two nadir nodes are used to prevent the continued propagation of data on both direct links 108 and skip links 109 of network 100. For example, in the counter-clockwise direction, when node E refrains from propagating channel 1 data it receives to node D via direct link 108, node D will still receive channel 1 data via skip link 109 from node F. For that reason node F is also configured as a nadir node associated with node A, which stops the propagation of the channel 1 data via the skip link 109 from node F. Whether a node 102 in network 100 performs a nadir action during any specific timeslot is determined by the network schedule.
Although the example provided by
When a node 102 is not scheduled to be a transmitting node for the current timeslot, the node 102 performs the processing illustrated in
For each channel (referred to here as the “current” channel) that a non-transmitting node is scheduled to listen to for the current timeslot, the node 102 determines if its inbound neighbor for that channel is scheduled to be a simplex transmitting node for that timeslot (block 308). If that is the case, for the current timeslot, the node 102 forwards the data it receives on its inbound direct link 108 on that node's non-nadir outbound links (if any) for the current channel (block 310).
If the node's inbound neighbor is a transmitting node that is a part of a self-checking pair or if the node's inbound neighbor is not a transmitting node, the node waits to receive a bit of data on its inbound direct link 108 for the current channel and a corresponding bit of data on its inbound skip link 109 for the current channel (block 312). When this occurs, the node compares the bit received on the inbound direct link 108 with the corresponding bit received on the inbound skip link 108 (block 314). If the bits match, the node considers that bit to be valid and forwards the received, matching bit on that node's non-nadir outbound links (if any) for the current channel (block 316). For example, if the node is scheduled to have no nadir links for the current timeslot, the node forwards the bit onto the outbound direct link 108 and the outbound skip link 109 for the current channel. If the schedule indicates that, for the current timeslot, the outbound skip link 109 for the current channel is a nadir link but the outbound direct link 108 for the current channel is a non-nadir link, the node forwards the bit on the outbound direct link 108 but not the outbound skip link 109. If the schedule indicates that, for the current timeslot, both the outbound direct link 108 and the outbound skip link 109 for the current channel are nadir links, the node 102 does not relay the bit any further along the current channel. The node then loops back to block 312 to process the next bits.
If the most recently received bits do not match or if the node fails to receive a bit on the inbound direct link 108 or the inbound skip link 109 for the current channel within a predefined period of time, the node immediately “truncates” the message by not relaying any more bits received on the current channel for the remainder of the current timeslot (block 318). Then, the node, if it is not a nadir node, starts listening to the other channel and performs the “backfill” processing described below in connection with blocks 320-332 in order to attempt to reassemble a complete message by combining any valid bits received on the current channel with any valid bits received on the other channel.
The backfill processing shown in blocks 320-332 is performed when a non-transmitting node determines that an error has occurred in the propagation of data along the network 100. In this embodiment, at least one of the non-transmitting nodes is scheduled to not relay any data along either channel 0 or 1 if there is not an error in the network 100. That is, such a node is scheduled to take a nadir action for each of its four links. Such as a node is also referred to here as a “nadir” node. As a result, such a nadir node will be able to receive data from both channels on all four inbound links. The particular node that is performing the processing shown in blocks 320-332 determines if that node is a nadir node for the current timeslot (checked in block 320). If that is the case, when an error occurs, if that nadir node is receiving valid data from one of the channels (referred to here as the “good” channel) and invalid data on the other (referred to here as the “faulty” channel) (checked in block 322), the nadir node forwards along the good channel those valid bits it has received from the good channel but did not successfully receive from the faulty channel (block 324). Each nadir node keeps track of the valid bits it has received on both channels during each timeslot so that it may determine which bits need to forwarded as described in connection with blocks 322-324 and in order to assemble a complete, valid message for the current timeslot.
If the node is not a nadir node for the current timeslot, the node stops listening to the faulty channel and starts listening to the other good channel (block 326). The node receives any bits forwarded by a nadir node (as described above) from the good channel (block 328) and relays the bits further along the good channel if valid (determined, for example, using similar processing as is described above in connection with blocks 312-318) (block 330). The node attempts to reassemble a complete message by combining any valid bits received on the faulty channel with any valid bits received on the good channel after that node started listening to the good channel (block 332). Each such node keeps track of the valid bits it has received on both channels during each timeslot so that it may assemble a complete, valid message for the current timeslot. The node utilizes in-line integrity checks to verify that it has reconstructed the originally transmitted message correctly. In-line integrity checks can include, but are not limited to, verifying length fields and cyclic redundancy checks (CRCs).
One alternate embodiment further utilizes brother's keeper guardianship. The receiving node is scheduled to act as a guardian for its nearest neighbor when the nearest neighbor is the transmitting node. The guardian action verifies that data transmitted by a transmitting node complies with one or more policies. In one example, the guardian action verifies the integrity of a data message transmitted by the transmitting node by confirming that potential message protocol contents is correct in addition to temporal enforcement of sending times. When operating as a guardian, a node only forwards data messages it receives via the direct link from the scheduled sending node. As illustrated in
Although the embodiments described in the specification above discuss braided-ring networks that do not operate using non-store-and-forward techniques, other embodiment that store-and-forward techniques are contemplated as within the scope of the present invention. For example, instead doing the comparison and relaying described above in connection with
Although specific embodiments have been illustrated and described herein, it will be appreciated by those of ordinary skill in the art that any arrangement, which is calculated to achieve the same purpose, may be substituted for the specific embodiment shown. This application is intended to cover any adaptations or variations of the present invention. Therefore, it is manifestly intended that this invention be limited only by the claims and the equivalents thereof.
Number | Name | Date | Kind |
---|---|---|---|
4417334 | Gunderson et al. | Nov 1983 | A |
4428046 | Chari et al. | Jan 1984 | A |
4630254 | Tseng | Dec 1986 | A |
4631718 | Miyao | Dec 1986 | A |
4654480 | Weiss | Mar 1987 | A |
4733391 | Godbold et al. | Mar 1988 | A |
4740958 | Duxbury et al. | Apr 1988 | A |
4856023 | Singh | Aug 1989 | A |
4866606 | Kopetz | Sep 1989 | A |
4905230 | Madge et al. | Feb 1990 | A |
5132962 | Hobgood et al. | Jul 1992 | A |
5161153 | Westmore | Nov 1992 | A |
5235595 | O'Dowd | Aug 1993 | A |
5257266 | Maki | Oct 1993 | A |
5307409 | Driscoll | Apr 1994 | A |
5341232 | Popp | Aug 1994 | A |
5383191 | Hobgood et al. | Jan 1995 | A |
5386424 | Driscoll et al. | Jan 1995 | A |
5394401 | Patrick et al. | Feb 1995 | A |
5463634 | Smith et al. | Oct 1995 | A |
5557778 | Vaillancourt | Sep 1996 | A |
5687356 | Basso et al. | Nov 1997 | A |
5715391 | Jackson et al. | Feb 1998 | A |
5742646 | Woolley et al. | Apr 1998 | A |
5896508 | Lee | Apr 1999 | A |
5903565 | Neuhaus et al. | May 1999 | A |
5920267 | Tattersall et al. | Jul 1999 | A |
5937414 | Souder et al. | Aug 1999 | A |
5940367 | Antonov | Aug 1999 | A |
6052753 | Doerenberg et al. | Apr 2000 | A |
6172984 | Beyda et al. | Jan 2001 | B1 |
6175553 | Luk et al. | Jan 2001 | B1 |
6219528 | Wright et al. | Apr 2001 | B1 |
6226676 | Crump et al. | May 2001 | B1 |
6374078 | Williams et al. | Apr 2002 | B1 |
6414953 | Lamarche et al. | Jul 2002 | B1 |
6513092 | Gorshe | Jan 2003 | B1 |
6594802 | Ricchetti et al. | Jul 2003 | B1 |
6618359 | Chen et al. | Sep 2003 | B1 |
6707913 | Harrison et al. | Mar 2004 | B1 |
6741559 | Smeulderse et al. | May 2004 | B1 |
6760768 | Holden et al. | Jul 2004 | B2 |
6765924 | Wu et al. | Jul 2004 | B1 |
6842617 | Williams et al. | Jan 2005 | B2 |
6925497 | Vetrivelkumaran et al. | Aug 2005 | B1 |
6956461 | Yoon et al. | Oct 2005 | B2 |
7035539 | Gumaste | Apr 2006 | B2 |
7050395 | Chow et al. | May 2006 | B1 |
7085560 | Petermann | Aug 2006 | B2 |
7088921 | Wood | Aug 2006 | B1 |
7269177 | Baker | Sep 2007 | B2 |
7349414 | Sandstrom | Mar 2008 | B2 |
7372859 | Hall et al. | May 2008 | B2 |
7457303 | Blumrich et al. | Nov 2008 | B2 |
7502334 | Hall et al. | Mar 2009 | B2 |
20020027877 | Son et al. | Mar 2002 | A1 |
20020087763 | Wendorff | Jul 2002 | A1 |
20020118636 | Phelps et al. | Aug 2002 | A1 |
20030002435 | Miller | Jan 2003 | A1 |
20030067867 | Weis | Apr 2003 | A1 |
20030128984 | Oberg et al. | Jul 2003 | A1 |
20040073698 | Harter et al. | Apr 2004 | A1 |
20040223515 | Rygielski et al. | Nov 2004 | A1 |
20040258097 | Arnold et al. | Dec 2004 | A1 |
20050002332 | Oh | Jan 2005 | A1 |
20050132105 | Hall et al. | Jun 2005 | A1 |
20050135278 | Hall et al. | Jun 2005 | A1 |
20050152377 | Hall et al. | Jul 2005 | A1 |
20050152379 | Hall et al. | Jul 2005 | A1 |
20050169296 | Katar et al. | Aug 2005 | A1 |
20050198280 | Hall et al. | Sep 2005 | A1 |
20060077981 | Rogers | Apr 2006 | A1 |
20060203851 | Eidson | Sep 2006 | A1 |
20080144526 | Hall et al. | Jun 2008 | A1 |
20080144668 | Hall et al. | Jun 2008 | A1 |
20090086653 | Driscoll et al. | Apr 2009 | A1 |
Number | Date | Country |
---|---|---|
407582 | Apr 2001 | AT |
3238692 | Apr 1984 | DE |
19633744 | Feb 1998 | DE |
20220280 | Nov 2003 | DE |
0405706 | Feb 1990 | EP |
1280024 | Jan 2003 | EP |
1280312 | Jan 2003 | EP |
1365543 | Nov 2003 | EP |
1398710 | Mar 2004 | EP |
1469627 | Oct 2004 | EP |
1672847 | Jun 2006 | EP |
2028062 | Feb 1980 | GB |
1581803 | Dec 1980 | GB |
2175775 | Dec 1986 | GB |
0064122 | Oct 2000 | WO |
0213393 | Feb 2002 | WO |
0235761 | May 2002 | WO |
03030437 | Apr 2003 | WO |
2006063237 | Jun 2006 | WO |
Number | Date | Country | |
---|---|---|---|
20080080551 A1 | Apr 2008 | US |