The present invention relates in general to the architecture of communication network systems.
More particularly, the present invention relates to a node, in particular to an electronic control unit, of a distributed communication system with a number of nodes, in particular with at least one fail-silent node, the nodes being interconnected by a communication medium, in particular by at least one channel and by at least one optional further channel, (with this wording covering single-channel systems up to N-channel systems).
The present invention further relates to a method for monitoring communication between and among a number of nodes, in particular between and among at least one unprotected node and at least one fail-silent node, said communication being based on at least one cyclic time-triggered communication medium access schedule being assigned to at least one communication controller.
Dependable communication networks used for safety-critical automotive applications typically rely on time-triggered communication protocols like
Dependable communication is achieved by providing redundant communication channels and protection against illegal transmissions, for example by means of a bus guardian.
More particularly, safety-critical applications require that a single fault in one of the nodes or in the communication infrastructure may not inhibit communication between other fault-free nodes. They rely on using at least two redundant communication channels and on fail-silent behaviour of faulty nodes.
Fail-silent behaviour of faulty nodes can be achieved by means of supervision units like the bus guardian (cf. “FlexRay Communications System Bus Guardian Specification”, version 2.0, June 2004, FlexRay Consortium; http://www.flexray.com/), which protects a communication channel from illegal transmissions in the time domain.
In general, communication networks for safety-critical applications should be separated from other networks but due to cost reasons sometimes there is a demand for using a single network for safety-critical and non-critical applications.
In addition, due to cost reasons sometimes it is not acceptable to use only fail-silent nodes. This results in mixed networks being composed of standard nodes without any protection and fail-silent nodes. The standard nodes in such networks are connected to only one of the communication channels and therefore a single faulty standard node cannot prevent communication between fail-silent nodes being related to the safety-critical application.
The principal architecture of such standard nodes S1, S2 is shown in
It can be further taken from
The data signals RxD, TxD, TxEN′ being exchanged between the communication controller CC and the transceiver T comprise
The two standard nodes S1, S2 (as shown in detail in
With this approach a single faulty standard node (in
The startup of such distributed communication network systems typically relies on the exchange of specific messages between a subset of the nodes. If this message exchange is affected by messages from a faulty node then the startup may be inhibited. The following description is based on the startup of a FlexRay cluster but the described disadvantages may apply also to other communication protocols.
In FlexRay systems, the cold start is performed by a predefined subset of the nodes in a communication cluster. Each of these so-called cold start nodes can act
After wakeup, a cold start node first listens to the communication channel(s) for a listen period. If the cold start node receives a valid pair of startup frames from another cold start node then the cold start node derives its schedule and clock correction from this cold start node. To allow network startup even in case of a cable failure, communication on one communication channel is sufficient for this.
Only if a cold start node does not detect activity on any communication channel during this listen period, the cold start node assumes that the cluster startup has to be initiated and acts as a leading cold start node by sending startup frames.
Integrating nodes, (i. e. non-cold start) nodes must also first listen to the communication channel(s). They may only start transmitting after they have received valid startup frame pairs from at least two cold start nodes. This shall ensure that the startup is not affected by transmissions from integrating nodes. Faulty integrating nodes could start transmitting at any time, including startup.
Such faulty transmissions during startup may be prevented by a bus guardian, if available, but in a mixed network as shown in
Even though connected only to one communication channel C1 or C2 such a fault could result in frames being received by the cold start nodes during the listen period, thus causing the cold start nodes to assume an already running network. As a result none of the cold start nodes would act as a leading cold start node and thus, the cluster startup would not be initiated.
In the described scenario a single faulty standard node would be able to inhibit the cluster startup completely.
To summarize, mixed networks may contain unprotected nodes related to non-critical applications, as long as these nodes are connected to one communication channel only. A disadvantage of this approach is that without protection by a bus guardian illegal transmissions from such nodes can inhibit the network startup.
Regarding related prior art documents, reference can be made to prior art document JP 02-075046 the purpose of which is to avoid unnecessary communication with inactive nodes by enabling each host to monitor by itself the active states of the nodes.
Prior art document EP 1 355 461 A2 describes the wakeup of FlexRay systems, the startup of FlexRay systems and the protection of FlexRay systems by means of a bus guardian.
Regarding the technological background of the present invention, further reference can be made to
Despite all efforts as described above, the problem remains that bus guardians require a costly data interface to protect the communication medium from timing failures of the communication controller, in particular to protect a communication channel from illegal transmissions in the time domain.
Starting from the disadvantages and shortcomings as described above and taking the prior art as discussed into account, an object of the present invention is to further develop a communication system as described in the technical field as well as a corresponding communication method as described in the technical field in such way that a protection of the communication medium from timing failures of the communication controller, in particular a limited protection of the communication channel from illegal transmissions in the time domain, can be achieved without providing any bus guardian.
The object of the present invention is achieved by a node comprising the features of claim 1 as well as by a method comprising the features of claim 8. Advantageous embodiments and expedient improvements of the present invention are disclosed in the respective dependent claims.
The present invention is principally based on the idea of preventing any transmission of the node during phases with high susceptibility to illegal transmission, in particular during the communication startup of the communication system.
More particularly, the present invention refers to the idea of, based on existing information, providing an additional check for the status of the communication cluster or communication system by a host unit which is independent of the communication controller of the node. As result of this check, transmissions of the node are enabled or are disabled. This check can be performed during startup (so-called startup protection) but also during normal operation or during other critical phases or in other critical situations, like during shutdown of the communication cluster or communication system.
Even more particularly, the present invention is principally based on the idea of an efficient startup protection for communication networks; more particularly, the present invention proposes an efficient means for preventing illegal transmissions from a mixed communication network comprising fail-silent nodes and unprotected standard nodes during startup of this communication network. In this context, the startup is to be protected from faulty nodes without bus guardian.
This may be achieved in that transmissions of the communication node are prevented until a successful communication startup has been detected by the host computer. More particularly, after having initialized the node the host computer
Only after indications for a successful network startup have been met the host computer enables transmissions by the node. This provides redundancy in such a way that the host and the communication controller of a node both must agree on successful communication startup before transmissions from this node will start.
Unlike prior art document 02-075046, the present invention proposes to prevent illegal communication of a faulty communication node; such illegal communication of a faulty communication node might disturb the communication between further faultless nodes in such way that the startup of the whole communication network would be endangered.
The arrangement according to the present invention as well as the method according to the present invention are applicable to nodes which are not related to safety-critical applications and therefore do not require full protection as it would be provided by a bus guardian.
A possible extension of the present invention can be implemented for supervising the synchronization of a node to the FlexRay cluster also during normal operation, i. e. after the startup has been performed. If synchronization of a node to the FlexRay cluster has degraded to the extent that transmissions from this node can no longer be allowed, the communication controller of this node shall enter the normal passive state. In this state, reception is still ongoing but transmission is not allowed. The conditions for this transition from normal active state to normal passive state are configurable.
An example for such a situation would be that no sync[hronization] frames or startup frames are received by all nodes. In that case all nodes should preferably enter the normal passive state, and one of the cold start nodes should preferably initiate a cold start. A single faulty communication controller which would not enter the normal passive state and would continue transmitting in this situation could prevent the network from performing the startup.
By observing the information about the number of received sync[hronization] frames as well as startup frames and by monitoring the states of the communication controller, the host can advantageously detect if a communication controller does not enter the normal passive state although it should. In this situation, the host can advantageously prevent transmissions from this faulty communication controller.
The present invention further relates to a distributed fault-tolerant and/or time-triggered communication system with at least one node as described above, said node being in particular required for communication startup.
The present invention further relates to a computer program product
According to a preferred embodiment of the present invention, the computer program product can be stored on at least one R[ead]O[nly]M[emory] module, on at least one R[andom]A[ccess]M[emory] module or on at least one flash memory module.
The present invention finally relates to the use of at least one node as described above and/or of at least one distributed communication system as described above and/or of the method as described above and/or of at least one computer program product as described above for ensuring error containment in the time domain of the node, in particular for protecting at least one dual-channel environment from illegal transmission.
The present invention may be implemented in the technical field of semiconductor-connectivity-automotive bus systems, for instance on a C[ontroller]A[rea]N[etwork] platform or on a Flexray platform and/or on the basis of an automotive M[edium]A[ccess]C[ontrol] protocol and/or with reference to chip data transfer; more particularly, the present invention may be implemented in low-cost microcontrollers with integrated FlexRay communication controller for automotive communication systems providing network startup protection as differentiating feature.
As already discussed above, there are several options to embody as well as to improve the teaching of the present invention in an advantageous manner. To this aim, reference is made to the claims respectively dependent on claim 1, on claim 8 and on claim 14; further improvements, features and advantages of the present invention are explained below in more detail with reference to a preferred embodiment by way of example and to the accompanying drawings where
The same reference numerals are used for corresponding parts in
The present invention as illustrated in
By the present invention, the availability of the communication network 400 being composed of a mix of fail-silent nodes 200 and of unprotected extended standard nodes 100 is improved. Other than protection by a bus guardian as in the prior art, the method of the present invention can be applied with standard transceiver circuits not requiring an additional control input for enabling transmission or for disabling transmission.
The principal architecture of such proposed extended standard nodes 100 with startup protection is shown in
It can be further taken from
The data signals RxD, TxD, TxEN being exchanged between the communication controller 120 and the transceiver 110 comprise
As can be taken from
By means of
In addition, in the extended standard node 100 the host 130
In other words, the host 130 monitors whether the communication controller 120 tries to transmit, for example during startup, and the host 130 controls propagation of the transmit enable signal TXE1 from the communication controller 120 to the transceiver 110.
Accordingly, the transmit enable signal TXE1 is controlled by the communication controller 120, not by the host 130 but by means of the additional output signal TXE2 and of the AND gate 140 the host 130 controls the propagation of the transmit enable signal TXE1 from the communication controller 120 to the transceiver 110.
Furthermore, in the extended standard node 100, the host 130 uses the status information SI provided by the communication controller 120 in order to decide if the startup of the FlexRay cluster 400 has been finished, i. e. is completed and if the transmission of the local communication controller 120 can be enabled.
The actual transmission enable signal TxEN is sent from the AND gate 140 to the transceiver 110 as result
The two extended standard nodes 100 (as shown in detail in
After the init (=step [i] in
In order to possibly disable transmission again after step [v] continuous supervision of the status information SI from the communication controller 120 can be provided, thus allowing to enable and to disable transmission at any time, in order to provide protection also during normal operation (in addition to startup).
After the init (=step [a] in
The process as described in
Finally, this status information SI can be provided from the communication controller 120 to the host 130 with different levels of independence:
[1] The communication controller 120 reports to the host 130 a communication controller-internal state indicating that the startup has been finished, i. e. has been completed.
This approach relies on some functionality inside the communication controller 120, even in case of a fault.
[2] The communication controller 120 provides to the host 130 the number of cold start nodes 200 from which valid startup frame pairs have been received, and the host 130 checks if valid startup frame pairs from at least the minimum number of cold start nodes 200 have been received.
The communication protocol defines the minimum number of cold start nodes 200 from which startup frame pairs must have been received before a node 100, 200 is allowed to transmit.
[3] For each received frame the communication controller 120 provides to the host 130 the frame header at least containing a frame ID[entification number], a cycle ID[entification number], and an indication for startup frames.
By means of this information, which can be protected by at least one C[yclic]R[edundancy]C[heck] sum, the host 130 can independently check if valid startup frame pairs from at least the minimum number of cold start nodes 200 have been received.
In this context, the host 130 requires this CRC checksum in order to check if the received frame header is valid; otherwise a single bit error, for instance at the communication medium or inside the communication controller 120, could for example change a non-startup frame into a startup frame, thus making the independent check at the host 130 more or less worthless.
The CRC checksum is generated and added to the header by the sending node and cannot be generated by the receiving node. The C[yclic]R[edundancy]C[heck] is to be calculated for all header information provided to the host 130, or at least to the subset of header information to be protected.
By means of the CRC checksum, the communication controller 120 and the host 130 at the receiving node can perform independent validity checks.
With this latter embodiment [3], the maximum independence between a faulty communication controller 120 and the host 130 can be achieved.
[4] Combinations of [1] to [3], for example the host 130 determines the number of received startup frame pairs from different cold start nodes 200 and uses this information to validate the state reported by the communication controller 120.
In all cases [1], [2], [3], [4], the host 130 enables transmission by activating the additional output signal TXE2 between the host 130 and the AND gate 140 only if a condition is met indicating that a node 100 may start transmitting without disturbing the startup.
This condition must be chosen such that in the fault-free case the host 130 enables transmission not later than at the beginning of the first communication cycle, which is used by the communication controller 120 for transmission.
To summarize, the present invention protects the network 400 from illegal transmissions which can disturb protocol mechanisms like communication startup, performed by other nodes 100, 200. These nodes 100, 200 required for communication startup can be fail-silent (=reference numeral 200) but do not necessarily have to be (=reference numeral 100).
Number | Date | Country | Kind |
---|---|---|---|
06117479.3 | Jul 2006 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB2007/052694 | 7/9/2007 | WO | 00 | 1/7/2009 |