The present invention concerns tactical and other highly mobile communications networks. Such networks are distinguished by their ability to self-organize and heal connections, as radio nodes enter and leave each others' direct communications ranges without impacting the performance of other elements of the network. This invention is also concerned with voice teleconferencing over digital networks.
Classic voice radio communications, while very useful, suffer from the limitations inherent in analog radio. An analog radio is usually designed for a single type of communications, such as voice, with no flexibility on data communications, and more importantly, no flexibility within the analog protocol. Radio users sharing a channel are inherently “conferencing,” in modern terms.
Such conferences are extremely limited compared to our experience with online and other digital conferencing experiences. Analog radio conferences are usually very susceptible to radio band noise. The only way to carry on multiple, separate conferences is by changing radio channels. Changing channels is a hardware decision, which makes the notion of flexible conferencing, with multiple users each participating in multiple, independent conference groups, impractical. In addition, classic analog voice radio communications are only functional when each radio can hear every other radio in a conference. There are no redundancy, no repeaters, etc. An analog radio conference also has little or very weak security.
Thus, radio voice conferencing can be greatly enhanced by building a voice conferencing system on a modern digital network radio. This is in practice what many existing conferencing systems have done, both in traditional Public Switched Telephone Network (PSTN) telecommunications providers and, more recently, online and other forms of Internet Protocol (IP) conferencing. The tradition of these various protocols is to present centralized servers for client rendezvous, both for directory services and for the conferencing aggregation itself.
For highly mobile radio communications, however, the analog solution has still often been the choice over a digital system, for a simple reason: it is decentralized. There is no need for a central server in an analog radio conference, so if any participant in a conference loses signal, that affects only that participant. The conference itself is maintained. But for a typical centrally managed digital voice conference, losing that central resource nullifies the whole conference.
It is therefore an object of the invention to combine the advantages of analog and digital conferencing in order to overcome the disadvantages of both.
To achieve the above and other objects, an improvement to the art is described herein. This presents a digital voice conferencing system as a group of peer network nodes. The need for digital mixing of conference audio can be managed by any node in the network. New protocols allow the mixing node to be selected and re-selected, as nodes dynamically enter or leave the network. While not critical to its operation, in the preferred implementation, the conferencing system (dubbed TRoIP—Tactical Radio over Internet Protocol) runs over a mesh-based network. The mesh improves the radio's performance, and consequently the network's performance, versus a non-mesh implementation, allowing a conference to continue as long as each radio can hear at least one other radio, and there is some path from each radio to the other, either directly, through other radios, or through repeaters. This method also encapsulates the conference in any security applied at the network layer.
A self-configuring voice conferencing network capability is described that may be superimposed over wired networks, wireless networks, or combinations thereof. The voice conferencing network removes the requirement of using a fixed centralized VoIP registrar, such as a SIP server, or conferencing server. In particular, such a system is well suited to the realities of a highly mobile IP mesh network. As described herein, a local network cluster of voice enabled nodes will form a voice conference with one another over the mobile network at the direction of a participating node that has the role of conference server/mixing node, or Mix-Master.
When a conference is started, every node in the conference will participate in an election to select a Mix-Master node. In this process, each participating node in the network transmits an election bid message packet containing its election score, VoIP callback address and addressing information. The election message is sent to the multicast address configured for said node's conference or Call Group. Each node will then receive all other nodes' election bid messages from the IP multicast address. Each node will evaluate all bids received during the election cycle, along with its own bid. The single node in the cluster with the highest election score is assigned the role of Mix-Master for the network. All other nodes in the cluster will initiate a VoIP call to the conference server, thus eliminating the traditional need for a centralized server. The election process is repeated by each node in the Call Group continually.
Call groups are pre-configured independent voice conferences on all voice enabled nodes. Each voice enabled node is configured with the unique multicast addresses assigned to each of the call groups of which the node is a member. The node determines which call groups to join via a user interface (UI). If two or more nodes become isolated from the other participating nodes in a Call Group, those isolated nodes will form a new instance of said Call Group, and the election process is repeated among said nodes.
Part of this system is a Push-to-Talk interface. This incorporates the client software mixer and a hardware module. The hardware module provides the digital interface to the user, ADC for microphone and and DAC for speaker/headset. The interface controls microphone bias, reads the PTT button, and optionally implements a secondary audio interface. The secondary interface includes audio in and out connections and a switch, to allow the user to choose between the network radio and a second communications device, such as a cellphone or analog radio, using the same headset.
Every participating node in a given Call Group will mix network audio to the PTT interface/headset, based on the rules set up for the specific Call Group. This can involve directing different conversations to different channels (left/right) on the headset, and directing microphone audio back to the selected Call Group. An elected Call Group Mix-Master node will run in a different mode, the precise nature of which depends on other details of the configuration, such as point-to-point versus broadcast settings.
The system configuration tool allows any number of voice-enabled nodes to be preconfigured or reconfigured for any given call group at any time a connection is present. Each node may be assigned to multiple call groups. Multiple and different Call Groups can be sent to the left or right side headphone, mixed as necessary with each other and audio user interface output, such as alarm, alert, and navigation tones locally generated on the network node in response to network events of various types.
The invention is contemplated for use in remote, topographically evolving or challenging environments, disaster, military and diverse environments that require reliable secure data and analog communications between groups of people with limited or no infrastructure to support traditional communication methodologies.
For a better understanding of the invention, reference is made to the following description taken in conjunction with the accompanying drawings and the appended claims. The objects, features, and advantages of the present invention will be apparent to those skilled in the art in light of the following detailed description of the invention in which:
a-2f illustrate a number of possible configurations of a TRoIP node, with the flexible mixer blocks in various different configuration;
A preferred embodiment of the present invention will be disclosed with reference to the drawings, in which like reference numerals refer to like elements or steps throughout.
Given a digital computer networking system, the preferred embodiment (dubbed Tactical Radio over Internet Protocol, or TRoIP) implements a flexible audio conferencing system. This conferencing system works across any peer-to-peer network, no central server or master node is required. The design is also able to deal with active mesh networks, which in a highly mobile environment may be adding and dropping nodes on a regular basis.
A main component of the TRoIP mixing system is the notion of logical Signal Groups. Each Group represents an independent conference. Any number of conferences may coexist on the same network, bound only by the limits on network bandwidth and use policies. An individual listener can participate in multiple conferences, the TRoIP configuration designates which conferences are mixed to either speaker in the stereo headset.
The PTT interface incorporates a push-to-talk button, which through the system directs the user's speech input to the primary conference channel. In the case of the stereo headset, this will address the primary conference on either the left or right side of the headset. A double-click of the PTT button 106, 116 will direct subsequent PTT input to the primary conference on the other side of the headset, eg, switch left to right or right to left.
The invention also supports an external auxiliary device 120, which is connected to the radio and PTT interface through an analog or digital interconnect 118. The PTT module for use with an auxiliary device 120 will have a secondary push button 116, which is used to direct speech from the audio I/O device 110, 112 to the auxiliary device, rather than the radio. The auxiliary device 120 may be a computer, tablet, or smartphone, used for configuration of the TRoIP system, and optionally, as a secondary means of communication. This allows a single headset to be shared between the network node and the secondary device.
While not visible to the observer, most of the network devices 102 in any TRoIP network will be client-only devices. At least one, however, will be designated as the Mix-Master 114. Each device has the ability to be elected by its peers in the network to be the Mix-Master by means of an incorporated computer system embedded in a microchip. This function will be discussed in detail, as well as the election process that allows regular addition and loss of network nodes without any significant disruption to communications within the active network.
a-2f illustrate the various possible modes of the audio mixing module within the invention. Note well that all mixing stages are implemented in software. As such, the number of channels for any given mixer or signal tee can be essentially any width, as dictated by the specifics for the individual node in question: the left/right configuration, the number of Signal Groups selected, etc.
As the system is designed to work on any peer-to-peer network, there is no concept of a permanent master node. And yet, for an audio conferencing system, all client audio input must be mixed at some central point, then distributed to every relevant client. The invention does this by including the full mixer logic in every node, then selecting one node as the Mix-Master, by means of an election process occurring between the various nodes. This is done by an election process that will be described in more detail below. A system will hold such an election when there is no Mix-Master, such as when a mesh network is initially established, split in two, and again when two independent networks are merged, ensuring that there is only one version of each channel/group available on the network.
The generic logic 202 of the mixer is illustrated in
A client-only node 204 is illustrated in
The microphone switch 306 can direct microphone audio to either the left or the right channel, the choice being determined logically by the user's prior channel selection, in the case of multiple conferences. The audio is passed first to a tee 310, which sends the microphone audio to the selected channel's earpiece mixer, and also to a volume control 312 and on to the flexible mixer stage 324, 326. As mentioned, this mixing stage is processing network audio of some kind, depending on the specific mode in use on any particular node. Audio leaving each flexible mixer stage 324, 326 enters the Local Audio Mixer 300 at another volume control 318, and goes on to the respective earpiece mixers 316.
A final input to each earpiece mixer is a tone generator 314. This tone generator 314 is driven by system level events, such as alerts and other sorts of audio interface queues to the listener.
The earpiece mixer outputs 316 from both left and right channels are merged into a stereo stream in the L/R Multiplexer 320, and sent to the operating system's audio output driver. In the preferred embodiment, as before, this is an ALSA driver, but the same functionality would exist in any other operating system.
For audio sourced out of the Local Audio Mixer 312, it is necessary to compand back to μ-Law for routing over the network 406. And this is put on the network using RTP as the transport 408, though as before, other efficient media transport protocols would work here as well.
Audio from each tee is cross routed to an equal number of per-channel Mix-Master Mixers 504. Each of these mixers will independently feed the input of another TRoIP node, so each Mixer can use different configuration data to determine which signals actually get mixed here. Each Mixer 504 is routed to a μ-Law encoder 410, and sent to its destination unit via an RTP encapsulation 412.
Thus, for an N-channel TRoIP conference there will be N−1 RTP network inputs fed to N μ-Law decoders and on to N Mix-Master tees, N Mix-Master Mixers, and N−1 μ-Law encoders feeding N RTP network outputs. The final input to the mixer is via the Local Audio Mixer output 312, and the final output from the Mix-Master Mixer is sent to the Local Audio Mixer input 318.
The PTT interface is managed via two push buttons 702, 704. The actual push-to-talk button 704 will indicate to the radio system that the user is transmitting. In cooperating with the radio unit software, a double-keying of the PTT button will change the routing of the microphone, in the case in which the user is a member of more than one conference group. The system's tone generator (314, see
An option on the PTT interface is a single volume control 720. This control works in conjunction with the local node mixer software. It will adjust the gain of the microphone when the talk button is keyed. Otherwise, it will adjust the relative audio level of the current default conference.
The criteria for each nodes' bids will be based on the specific nature of the underlying network. In a static peer to peer network, this might simply be first come, first served. On a highly mobile mesh network, data from the mesh (proximity and quality of node-to-node links, etc) can inform the bidding process.
The local node listens for peer election bids 812, eventually receiving some 814. These are added to the bid tally, and checked for a new high score 816. If the current high score has changed, the corresponding node is marked as the election leader 820, and the conference server change is made 850.
With no change in high score 822, the system checks if the election period is over. If not, more bids are analyzed 828. If so, the local node checks to see if the current leader is the incumbent Mix-Master 830. If so, the election is over, with no change in Mix-Master 832.
If the leader is not the incumbent 834, we check if the incumbent has placed bids 836, indicating that the network can hear the current Mix-Master. If so, then the incumbent has simply lost the election 838, and the system calls for a change in the conference server 850.
If the incumbent hasn't bid, we check to see if the incumbent has been present for at least a threshold count of cycles 842. If the incumbent is missing too many cycles 844, the conference server is changed 850. Otherwise 846, the election cycle is reset 846.
The actual Conference Server Change 850 flowchart is described in
While a preferred embodiment has been set forth above, those skilled in the art who have reviewed the present disclosure will readily appreciate that other embodiments can be realized within the scope of the invention. For example, the invention can be used with any suitable network and network protocol. Therefore, the present invention should be construed as limited only by the appended claims.
The present application claims the benefit of U.S. Provisional Patent Application No. 61/734,734, filed Dec. 7, 2012, whose disclosure is hereby incorporated by reference in its entirety into the present disclosure.
Number | Date | Country | |
---|---|---|---|
61734734 | Dec 2012 | US |