The invention is related to the field of modular and distributed robots, and in particular to a generalized distributed consensus control framework used by modular robots.
A modular robot is a new class of robots that is composed of many independent modules. Each module can communicate locally with other modules that are physically connected to it. By applying appropriate control, modular robots are capable of changing their configurations to become different structures or shapes, so they are sometimes referred as (self) reconfigurable robots. There are mainly three types of hardware design for modular robots. The first type is the “chain-based” modular robot where modules are normally connected in a chain and perform tasks such as locomotion by controlling their actuators. Another common style is the “lattice-based” modular robot, where overall shape change is achieved by modules changing their local connectivity. More recently, several groups have proposed strut-based modular robot in which shape formation is achieved by modules' self-deformation.
Several groups have demonstrated centralized and decentralized control in modular robots. However, there are only a few that focus on self-adaptation tasks based on sensory feedbacks. In chain-based robots, robot locomotion conforms to the environment via a hand-designed gait table and distributed force feedback. However, there is no theoretical guarantee for the control laws they propose adaptive locomotion strategy for chain-based robots is based on CPG.
In a lattice-based system, distributed algorithms can be used for locomotion over obstacles. One major limitation of lattice-based systems in self-adaptive tasks is that shape change can only be achieved through module movement, which is slow in the hardware implementation.
According to one aspect of the invention, there is provided a modular robot having a plurality of agents for performing movements. Each of these agents includes a computation component for performing computations needed in performing selective movements of the modular robot structure. A communication component is coupled to the computation module. The communication component allows each agent to communicate with its immediate physically-connected neighbor. An actuation component performs actuations associated with movements of the modular robot. A sensing component measures positional information that allows the agent to determine its respective environment. Once a defined shape or a desired task has been specified, each of the agents and their respective component coordinate their respective movements until the defined shape is reached.
According to another aspect of the invention, there is provided method of performing movements of a modular robot. The method includes providing a plurality of agents and managing computations associated with each agent needed in performing selective movements of the modular robot. Also, the method includes managing communications between the agents and performing actuations associated with movements of the modular robot. Moreover, the method includes measuring positional information that allows the agent to determine its respective environment and defining a shape so that each of the agents coordinate their respective movements until the defined shape or desired task is reached.
The invention proposes a generalized distributed consensus control framework. This generalization allows many new application areas in modular robotics by extending such a control scheme in several directions. New types of sensors can be use, e.g., pressure and light sensors. In these cases, a module agent has an indirect relationship between its sensor and actuator. The invention allows a variety of modular robot tasks to be formulated and solved as self-adaptation processes based on environmental feedback, including structure adaptation, sensory adaption, gripper manipulation, and locomotion.
One can use the same underlying distributed control principle to derive control laws for these tasks. The control laws are robust and simple to implement. One can show that the control laws are provably correct: convergence can be guaranteed to the tasks being considered. The proposed control framework is implemented on five different hardware robot prototypes and show that it is robust toward sensing/actuation noise and exogenous perturbations. This framework can potentially be applied to many distributed robotics applications beyond modular robotics.
The robot model and the capabilities assumed in the framework is described. The primary focus is on modular robotic systems in which the whole robot is composed of many independent and autonomous modules. However, this decentralized control framework is applicable to many other distributed robotic systems as long as the assumptions described herein are satisfied.
In the model, each module is an independent agent that has computation, communication, and actuation capabilities. One can refer to an autonomous module as an agent. These agents can be reconfigured into different robotic systems. Agents are assumed have been connected into a fixed configuration and they need to coordinate with each other to complete a desired task. The assumptions are now described that each agent is assumed to satisfy, and one can use three different modular robots (an adaptive column, a modular gripper, and a modular tetrahedral robot) that are built as examples.
Each agent 4 is equipped with one or more sensors 6 suited to different robotics applications. Sensors 6 are used to measure the current state of the agent 4. In the pressure-adaptive structure 2 as shown in
Each agent is equipped with an actuator. Several types of actuators are considered in the framework. In the pressure-adaptive structure 2 and tetrahedral robots 8, each agent 2, 10 is equipped with a linear actuator 14. In the modular grippe 16r, a rotary servo 18 is mounted on each agent 17 as shown in
Moreover, each agent is capable of performing simple computations such as addition and multiplication. Each agent is able to communicate with its immediate neighbors that are physically connected to it. Most of the current modular robots have these stated capabilities.
The task is described as inter-agent sensor constraints. An agent's task is complete when it has satisfied sensory state constraints between it and its neighbors. A consensus is formed when all agents have satisfied their constraints with their neighbors. In the framework, a task can be composed of one or more processes for reaching consensus.
A brief review of the standard distributed consensus algorithm is provided. A more general form of the algorithm and sufficient conditions is presented for agents to reach consensus. This generalized framework allows us to extend the control law to a wide range of applications.
Distributed consensus is a process by which a group of networked agents come to a state of agreement by communicating only with neighbors. At each time step, each agent updates its new state according to the difference between its own state and its neighbors' states. This process can be formally written as:
where a indicates agent i, and x (t) and x (t+1) are actuation states of agent i at time step t and t+1, respectively. Nj indicates the set of all one-hop neighbors of agent i is a small constant, and is sometimes called damping factor. There are two main assumptions buried in Eq. 1: First, each agent is capable of directly observing or computing its state and its neighbors' states. Second, each agent is capable of freely driving itself to a new state x (t+1).
In many cases, the mapping between sensor space and agent's actuation state is not precisely known. For example, in the modular gripper 16, as shown in
where θi is agent αi's sensor reading and θj indicates sensor reading of αj's neighbor, a3. g(θi,θj) is a sensory feedback function that agent a computes based on θi and θj. One can denote T(•) as a function that maps the agent's actuation changes to sensor changes. Also, one can show that g(•) can be any function satisfying the following conditions:
g(θi,θj)=0θi=j Eq. 3
sign(T(g(θi,θj)))=sign(θj−θi) Eq. 4
g(−θi,−θj)=−g(θi,θj) Eq. 5
Intuitively, Eq. 3 means that g only “thinks” the system is solved when it actually is; Eq. 4 means that when not solved, each sensory feedback g at least points the agent in the correct direction to satisfy the local constraint with a neighboring agent; and Eq. 5 means that g is anti-symmetric.
In addition, one needs to ensure that g/xj(t)−xi(t)−Δi,j* holds for all ai and for all t where Δi,j* is the desired state difference between agents that achieves θj=θi. This will ensure agents' states from fluctuation while reaching the consensus state, and it is usually done by selecting an appropriate α constant and choosing g as a function that is proportional to distance from the desired state.
This formulation is capable of being applied to a large class of distributed control tasks provided that one can create local agent rules that satisfy the conditions. In the next section, three different generalizations and their applications are described.
When solving modular robot tasks, there are still two main challenges one needs to address to apply this framework. First, one needs to represent a new task in terms of inter-agent sensor constraints or consensus. Second, a design is needed to have an appropriate sensor feedback function g and prove that the conditions outlined in Eqs. 3-5 are satisfied.
Solutions are provided to these challenges using six different example applications: (A) a self-balancing table that autonomously adapts to uneven terrain; (B) a terrain-adaptive bridge that always maintains bridge surface level irrespective of underlying terrain; (C a self-adaptive 3D Relief Display that can render a variety of shapes on arbitrary terrains (D) a pressure-adaptive column in which case each agent's sensor and actuator has an indirect relationship; (E) a modular gripper in which case each agent's actuator has a long range effect; (F) a modular tetrahedral robot which extends the agents' task space from forming a single consensus to a sequence of consensuses.
The inventive algorithm for shape formation has several important features: (1) The algorithm involves simple, local behavior by each agent, which scales as one can add more supporting groups to the flexible sheet; (2) the algorithm is guaranteed to converge to the target shape; (3) if the terrain changes, the robot automatically adjusts to maintain the desired shape.
These features lead to many potential applications.
In the inventive framework, one can achieve a modular robotic bridge that can adapt to different terrains. One can construct a terrain-adaptive bridge with Open Dynamics Engine 156, as shown in
3D Relief Display is an application where a modular robot forms arbitrary shapes as a novel form of 3D media and visualization. A proposed flexible surface 158 can act as a “relief” display, since the distributed algorithm can easily achieve complex shapes, as shown in
Note that this approach can be used in combination with traditional rearrangement reconfiguration, e.g. a modular robot can locomote quickly on smooth terrain using a track-like configuration and then configure to form a bridge over rough terrain. One can also expect that this distributed control approach can be extended to dynamic shape descriptions and other types of sensing (e.g. pressure), opening up many application possibilities which is described hereinafter.
The invention can be used to design and implement a self-balancing table robot 169 to test how well the approach works in real world scenarios, as shown in
Each agent 172 controls a Hitec standard servo 178 which can perform a rotation of 90° in either a clockwise or counterclockwise direction. A two-axis (x and y) tilt sensor 176 is mounted on the table surface 174. Each of the pivots can receive from this sensor 176, instead of having their own tilt sensors. Both sensors 176, 177 serve to act as agents for the surface group modules 176, 177.
For simplicity of implementation, the distributed shape formation algorithm is run on a laptop computer (2 GHZ CPU) that simulates purely distributed control. Although the distributed control is simulated, the hardware implements the sensing and distributed actuation so that one can directly test the algorithm in the face of real-world noise. After each agent computes the new angle of its servo, the control signal is sent to the hardware robot via serial port. It takes approximately 50 milliseconds for all agents to finish one iteration. This hardware prototype robot is used by the invention.
In a first experiment, it is examined how quickly and accurately the robot 192 responds to consistent, rapid environmental changes. In this experiment, the robot's 192 four supporting groups 190 are fixed to a rigid board. One can repeatedly change the orientation 194-200 of the board 204 to examine the robot's 192 response, as shown in
Agents 206-210 are programmed to maintain a surface level surface; i.e. tilt angles in x axis and y axis, θx and θy, equal to zero at all times. Therefore, |θx|+|θy| is an error measure of how far the table surface is from a level state.
In a second experiment, it is examined how the robot responds to different rough terrains. As shown in
Note as the table surface is a rigid object and cannot be stretched, the horizontal between two pivots might change in the process. Nevertheless, the algorithm still behaves correctly even if one can treat the horizontal distance as a fixed constant over the process.
Robustness experiments are performed by observing the robot's reaction when one of the agents fails. It was tested under different task difficulties and in different positions in the group. Two situations were tested which an agent fails to respond: (1) the agent's servo is disabled and becomes a passive link, so it freely takes on any angle with no resistance to movement; and (2) the agent's servo remains stuck at the zero degree position at all times. It is discovered that the first case does not affect the effectiveness of the algorithm, while the second case affects a few scenarios. It implicitly means the algorithm is robust to hardware failure of the first case.
It is observed that when the middle agent fails, it leads to a more unstable state of the robot. This is primarily because it is responsible for two times more rotation than either top or bottom module. On possible solution is to have more modules in each leg which allows a greater flexibility to compensate for individual failure, as well as increase the range over which the leg can compress and uncompress.
The distributed algorithm can provably form arbitrary shapes, and the pivot actions remain local even when the number of surface groups increases. Here one can evaluate the scalability of our system by observing how convergence time is affected by a large number of surface groups and different shape complexities. One can implement a simulation of a 64×64 flexible sheet in MATLAB, which includes 4096 pivots/supporting legs (16384 agents) for tasks of forming a pre-defined 3D shape. One can assume surface groups are formed by elastic materials. In simulation, one can add Gaussian noise to both servo actuation and sensor readings.
The robot is programmed to render six 3D models: a statue, teapot, knot, bunny, donut, and face. 3D depth information is used to transform these models into tilt angles for shape specification. The simulation starts by placing the robot on a randomly generated terrain. One can define the initial state as 100% error to the desired shape and 0% error when the desired shape is perfectly achieved. Table I shows the mean and standard deviation for the robot to reach 10% of error. In our previous experiments, the self-balancing table (12 agents) achieves 10% of error around 40 iterations. One can see from
The invention presents a decentralized control framework that allows a chain-style modular robot to achieve various environmentally-adaptive shapes. The control algorithm is shown to provably converge: it leads the robot to form the desired shapes regardless of its initial conditions and environmental changes. Through the experiments discussed above, one can demonstrate that the proposed algorithm is effective in real world applications.
Another potential application for modular robotics is a reconfigurable structure: a structure that can reconfigure itself to achieve functional requirements irrespective of external environment changes. Examples include forming the supporting structure for a building that absorbs uniform force, and a modular seat back that adapts to apply uniform pressure on the user. Motivated by this application area, one can construct a pressure-adaptive column with a modular robot 20 as shown in
As shown in
The algorithmic overview of the self-adapting process is shown in
In step 42, each agent 22 runs a local control law to change the length of its linear actuator 26 based on sensory feedback from its neighbors. This control law can be written as:
Here, the feedback function g is simply g(θi,θj)=θj−θi. g satisfies conditions Eq. 3-Eq. 5, since: (1) when θj=θi, g(θi,θj)=θj−θi=0, (2) when sensory θi is smaller than θj, g(θi,θj)>0 such that agent a increases its length to increase its pressure state θi. Therefore, T(g(θi,θj)) is moving in the same direction as θj−θi, (3) g function is anti-symmetric. Therefore, the control law (Eq. 6) will allow the robot to converge to the desired state.
Another application of the invention is modeling of a modular gripper. The gripper is capable of reconfiguring itself to grasp an object using distributed sensing and actuation. The control law design follows a similar procedure as in the described above for Eq. 6. However, the analysis of the convergence property is somewhat different due to the fact that each agent's actuation affects more than its own sensor state.
As shown in
The illustration of the algorithmic procedure is shown as
x
i(t+1)=xi(t)+(θR
Agents 48 iterate between step 58 and step 64 until all agents 48 have reached the desired state, as shown in step 66 of
The control law one can show in Eq. 7 satisfies condition 1, since sensory feedback g(•)=θR
Most of the controllers designed for grasping tasks have used a centralized architecture. The decentralized and modular robot approach that is proposed here allows the whole system to adapt to local perturbations more efficiently. In addition, given any initial contacting module, the gripper is able to form a grasping configuration that conforms to the shape of the object. This control scheme is also applicable to different kinds of gripper configurations.
A single consensus state between agents has been presented. However, one can show how agents can achieve more complicated tasks by forming a sequence of consensus states. A modular tetrahedral robot is presented that is capable of performing locomotion towards a light source with a sequence of such tasks. This approach can be potentially applied to many other modular robot locomotion tasks.
As shown in
The detailed steps are as follows,
where xik is ai's supporting actuator. This control law will allow the activated surface to lean forward until the tetrahedron rolls over to put all three activated agents in contact with the ground. In
The consensus state is formed when all activated agents have ground contact and ∥θj−θi∥≦ε for all agents ai and their neighbors aj. After agents have achieved consensus, they reset to the default configuration (step 86), and a new surface is triggered, as shown in
The verification of sufficient conditions for reaching consensus with the control law Eq. 8 is similar to that of Eq. 6. The generalization of a single consensus formation to a sequence of consensuses allows this framework to extend from solving static shape/structure adaptations to dynamic tasks such as locomotion. Utilizing agents' sensor consensus provides a way for modular robots to adapt to different environmental conditions. In the tetrahedral robot example, the cycle time of locomotion is determined by the pressure states of the agents. When the environmental conditions allow agents to reach consensus state sooner, for example, when the robot is on a steeper slope, the locomotion cycle time will adapt to become shorter.
There are many potential applications that can be generalized from this framework. Here illustrate some of them are illustrated: (1) Light-adaptive modular panel: One can change the pressure sensors that are mounted on the robot to light sensors. Each agent is programmed to achieve the same light absorption as its neighbors. A similar concept can be applied in many environmental sensory adaptation tasks. (2) Adaptive prosthetic structure: Existing prosthetic devices for children require manual reconfiguration to adapt to limb growth. If force (pressure) sensors are mounted on the device, it is possible to construct a self-reconfigurable prosthetic device. (3) A similar concept can be applied to a support structure for plants. The structure is capable of self-adaptation based on the growth of the plant and lighting conditions. (4) In the dynamic task domain, robotic systems that locomote by shape/structure deformation can potentially apply our framework for locomotion tasks, for example, an amoebic modular robot and a cubic modular robot.
The experimental results of applying this framework in three different real robots are presented. The results show that our decentralized control approach is able to cope with real world sensing and actuation noise to achieve self-adaptation tasks. In the pressure-adaptive column experiments, one can show that agents are capable of converging to an equal pressure state irrespective of different initializations when an unknown object is placed on it. In the modular gripper experiments, it is shown that the control law is capable of leading agents to grasp around a balloon while applying equal pressure on it. Furthermore, agents are capable of achieving the desired state regardless of initial contact locations. They can also maintain the desired state when facing exogenous perturbations. In the modular tetrahedral robot experiments, it is show that the robot is capable of moving toward a light source through a sequence of consensus formation processes.
In this experiment, one can examine the control law's convergence property with different initial conditions. Each agent is equipped with a pressure sensor (force sensing resistor) with sensory readings ranging from 0 to 900. Agents are programmed to achieve equal pressure with their neighbors. The weight of the unknown object is roughly 1.5 pound. The robot starts in three different configurations, such that the number of initial contacting agents is different, ranging from one to three.
One can define ε=maxiθi−miniθi, the difference between maximal and minimal sensory reading among agents, as a measure of distance from reaching consensus. One can see from
An empirical evaluation is presented of this control framework when applied to a modular gripper. Agents are programmed to apply equal pressure on a balloon. One can test Eq. 7's convergence properties under different initial conditions and different numbers of agents. One can also assess its adaptability towards repetitive perturbations.
The agents 94 are connected to form a “cross” configuration as shown in
One can use k to denote the first activated (contacted) agent's index. One can denote θi(t) as the pressure sensor reading of agent i at time t. After the first contact between the object and the robot, the object is held in place. This will lead all other agents to approach agent ak's sensor reading θk (t) while reaching the consensus state. Therefore, one can define the percentage from achieving the task, ε, as a ratio of the current distance for all agents to reach the first contacted agent's sensor reading θk(t) to the initial distance. This can be formally written as:
ε=Σi∥θi(t)−θk(t)∥/Σi∥θi(0)−θk(0)∥.
One can further evaluate the algorithm's scalability towards the number of agents. As shown in
After all agents achieve the desired state, one can start applying an external force on the gripper.
The sequential consensus formation process as described herein is implemented on a tetrahedral robot 110. As shown in
As shown in
The invention presents a generalized distributed consensus framework for self-adaptation tasks in modular robotics. Three example applications in hardware are presented using this framework, including (1) a pressure-adaptive column; (2) an adaptive modular gripper; (3) a modular tetrahedral robot. The proposed control laws are provably correct and robust toward different initial conditions and constant perturbations. These applications represent a small set of what is achievable within this framework.
Although the present invention has been shown and described with respect to several preferred embodiments thereof, various changes, omissions and additions to the form and detail thereof, may be made therein, without departing from the spirit and scope of the invention.
This application claims priority from provisional application Ser. No. 60/983,755 filed Oct. 30, 2007, which is incorporated herein by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
60983755 | Oct 2007 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/US08/81759 | Oct 2008 | US |
Child | 12769107 | US |