SYSTEMS AND METHODS FOR CONTROLLING AN UNDERACTUATED MECHANICAL SYSTEM WITH MULTIPLE DEGREES OF FREEDOM

TECHNICAL FIELD

The present disclosure relates generally to control systems for mechanical manipulators, and more particularly to a system and a method for controlling an underactuated mechanical system having multiple degrees of freedom and different kinds of actuators.

BACKGROUND

A mechanical system, e.g., a robotic manipulator, is configured to track a reference trajectory for performing a task. The task, for example, corresponds to moving an object to a target position, or an assembly operation. To control the mechanical system to track the reference trajectory, a model of dynamics of the mechanical system is utilized. The model of dynamics is a mathematical model that includes equations which define dynamics of the mechanical system. Some approaches use a model of inverse dynamics of the mechanical system to control the mechanical system as the model of inverse dynamics enables controlling of the mechanical system. The model of inverse dynamics expresses joint torques as a function of joint positions, velocities and accelerations. It is desired to formulate an accurate model of the inverse dynamics for controlling the mechanical system.

Underactuated robots (UR) are an important class of mechanical systems. UR are mechanical systems characterized by fewer control inputs than degrees of freedom (DOF). Such systems are ubiquitous in robotics: examples are manipulators with passive joints, autonomous bicycles and motorcycles, bipedal robots, and most aerospace and marine vehicles. Several Reinforcement Learning algorithms have been applied to the UR control problem. These algorithms aim at automatically learning a control law. However, such methods typically require a very large number of interactions with the system and do not provide any performance guarantees. Even control strategies which combine model learning with classic model-based control methods rely on the inverse dynamics model, which relates torques to the robot trajectories. However, learning the inverse dynamics model is a challenging and largely unexplored task for control of UR systems.

SUMMARY

Some example embodiments are based on the realization that for effective control of mechanical systems such as robots, an accurate model of the dynamics of such a mechanical system is required. The model of dynamics is a mathematical model including equations which define dynamics of the mechanical system. The model of dynamics may be a model of forward dynamics that connects control commands and current states of the different actuators to transitioned states of the different actuators achieved by executing the control commands. Alternately, the model of dynamics may be a model of inverse dynamics that maps the current states and the transitioned states of the different actuators to corresponding torques for the different actuators. For efficient control of such systems, one approach is to use the model of inverse dynamics of the mechanical system since the model of inverse dynamics enables controlling of the mechanical system. The model of inverse dynamics expresses joint torques as a function of joint positions, velocities and accelerations. It is an object of some embodiments to control a mechanical system using a model of inverse dynamics. In this regard, it is also an objective of some embodiments to derive an accurate inverse dynamics model of the mechanical system. The mechanical system may be a robotic system having different actuators and multiple degrees of freedom to track a reference trajectory for performing a task. For example, the robotic system may be a manipulator configured to track the reference trajectory for performing a task of moving an object to a target location. The robotic manipulator includes joints and links. Each joint is actuated by an actuator such as an electric motor. A motion given by the actuator makes the link attached to the joint move.

Generally, an accurate physical model of the inverse dynamics of such mechanical systems is difficult and time-consuming to generate. Conventional model-based approaches which derive parametric models directly from first principles of physics are often limited in performance by both the presence of parametric uncertainty and the inability to describe certain complex dynamics typical of real systems, such as motor friction or joint elasticity. Accordingly, it is advantageous to utilize machine learning-based approaches for deriving inverse dynamics models of such systems. Some machine learning-based approaches in this regard are mainly based on deep neural networks (NN) and Gaussian Process Regression (GPR). Some embodiments realized that in this context, both gray-box and black-box approaches may be suitable. Within gray-box techniques, a model-based component encoding the known dynamics may be combined with a data-driven one, which can compensate for modeling errors and unknown dynamical effects.

However, it is a realization of some embodiments that the performance of these methods strongly depends on the effectiveness of the model-based component, so they still require deriving sufficiently accurate physical models, which might be particularly time-consuming and complex if some parameters are unknown or not known precisely. In contrast, pure black-box methods learn inverse dynamics models directly from experimental data, without requiring deep knowledge of the underlying physical system. Despite their ability to approximate even complex non-linear dynamics, pure black-box methods typically suffer from low data efficiency and poor generalization properties: learned models require a large number of training samples and generalize only within a neighborhood of the training trajectories.

Some embodiments realized that to overcome the aforementioned limitations of black-box techniques in the context of NN and for the GPR framework, a promising class of solutions is represented by Physics Informed Learning (PIL), which proposes to embed insights from physics as a prior in black-box models. Instead of learning the inverse dynamics in an unstructured manner, which makes the problem unnecessarily complex, some embodiments embed physical properties in the model to improve generalization and data efficiency. Accordingly, some embodiments provide a PIL model for inverse dynamics identification of mechanical systems based on GPR. A standard approach for applying GPR to the inverse dynamics identification involves modeling each joint torque directly with a distinct Gaussian Process (GP), assuming the GPs independent of one another given the current joint position, velocity, and acceleration. However, some embodiments realized that such single-output approaches ignore the correlations between the different joint torques imposed by the Lagrangian equations, which in turn limits generalization and data efficiency.

Inspired by these realizations, some embodiments provide a multi-output GPR estimator based on a novel kernel function referred to as a Lagrangian Inspired Polynomial (LIP) kernel, which exploits Lagrangian mechanics to model the correlations between the different joint torques, instead of modelling each joint torque with a distinct GP and assuming the GPs independent of one another given the current joint position, velocity, and acceleration. Some embodiments exploit the fact that the dynamics equations are linear with respect to the Lagrangian, to obtain the GPs of the torques by applying a set of linear operators to the GPs of the potential and kinetic energy of the mechanical system. Some embodiments recognize that the kinetic and potential energy are polynomial functions in a suitable input space and derive a polynomial kernel that encodes this property.
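As a non-limiting illustration of the linearity of the dynamics equations with respect to the Lagrangian, the following minimal sketch evaluates the Euler-Lagrange expression for a single-joint pendulum by finite differences. The pendulum, its parameters (M, LEN, G), and the function names are assumptions introduced only for this example and are not part of the disclosed embodiments. The sketch shows that the torque obtained from the Lagrangian equals the torque obtained from the kinetic energy minus the torque obtained from the potential energy, which is the property that allows linear operators to be applied to the energy GPs.

```python
import math

def euler_lagrange_torque(lagrangian, q, qd, qdd, h=1e-4):
    """Torque of a 1-DOF system, tau = d/dt(dL/dqd) - dL/dq.

    For a time-invariant Lagrangian L(q, qd) the chain rule gives
    tau = (d2L/dqd2) * qdd + (d2L/dq dqd) * qd - dL/dq.
    The partial derivatives are approximated by central differences.
    """
    L = lagrangian
    dL_dq = (L(q + h, qd) - L(q - h, qd)) / (2 * h)
    d2L_dqd2 = (L(q, qd + h) - 2 * L(q, qd) + L(q, qd - h)) / h**2
    d2L_dq_dqd = (L(q + h, qd + h) - L(q + h, qd - h)
                  - L(q - h, qd + h) + L(q - h, qd - h)) / (4 * h**2)
    return d2L_dqd2 * qdd + d2L_dq_dqd * qd - dL_dq

# Hypothetical pendulum parameters: mass, length, gravity.
M, LEN, G = 1.0, 0.5, 9.81

def kinetic(q, qd):
    return 0.5 * M * LEN**2 * qd**2

def potential(q, qd):
    # q = 0 is the hanging position; qd is unused but keeps one signature.
    return -M * G * LEN * math.cos(q)

def lagrangian(q, qd):
    return kinetic(q, qd) - potential(q, qd)

q, qd, qdd = 0.3, 1.2, -0.7
tau = euler_lagrange_torque(lagrangian, q, qd, qdd)

# Closed-form pendulum torque, used only as a reference value.
tau_closed_form = M * LEN**2 * qdd + M * G * LEN * math.sin(q)

# Linearity: the operator applied to L = T - V equals the operator
# applied to T minus the operator applied to V.
tau_T = euler_lagrange_torque(kinetic, q, qd, qdd)
tau_V = euler_lagrange_torque(potential, q, qd, qdd)
```

The finite-difference operator stands in for the linear operators mentioned above; an actual embodiment would apply the corresponding derivatives analytically to the kernel functions rather than numerically to sample functions.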

Accordingly, some example embodiments derive the LIP estimator as a black-box multi-output GPR model which encodes the symmetries typical of Lagrangian systems. The LIP model estimates the kinetic and potential energy in a principled way, allowing its integration with energy-based control strategies. Since the LIP estimator encodes physical properties in the model, the LIP estimator outperforms state-of-the-art black-box GP estimators as well as NN-based solutions, obtaining better data efficiency and generalization performance.

Some embodiments are also based on the realization that training the model of inverse dynamics is difficult because of a lack of structure in the model of inverse dynamics, which fails to describe the physical principles that govern dynamics of the multiple degrees of freedom of the mechanical system. For example, the robotic manipulator may have joints that define the multiple degrees of freedom, and dynamics of the joints are correlated to each other. Therefore, there exists a complex and potentially non-linear correlation between torques for the joints needed to track the reference trajectory. Such a correlation is challenging to learn through training. Also, for some special types of mechanical systems such as underactuated robots, which have fewer actuators than degrees of freedom, learning inverse dynamics models for control is particularly challenging because underactuation further exacerbates the aforementioned problems. For example, torques of the underactuated dimensions are constant signals equal to zero, leading to an ill-posed estimation problem.

Towards this end, some embodiments are directed towards physics-informed model-based solutions for controlling such systems. Some embodiments are based on the recognition that Gaussian Process Regression (GPR) can be used to learn the correlation between torques of the joints. For example, several Gaussian process-based solutions use GPR to model n torque components, one for each degree of freedom, with n independent Gaussian Processes (GPs), and include a model-based component in a mean function or a covariance. As a result, a covariance matrix for such a GPR model is either diagonal or block diagonal, so the correlation between torques of the different actuators is not captured.

Some embodiments are based on the realization that such a deficiency is caused, at least in part, by an attempt to model the torque itself as a Gaussian process. However, some embodiments are based on the realization that the Gaussian process can be designed to model kinetic and potential energy of the mechanical system. In contrast with modeling individual torques, modeling the energy captures mutual effects of the torques of the different actuators on each other, which in turn allows learning the correlation among the torques of the different actuators. As a result, the covariance matrix capturing correlations between the torques of the different actuators is a full matrix with non-zero elements inside and outside of the diagonal.

Towards this end, some embodiments provide an inverse dynamics model that models energy of the mechanical system with a GPR having a full prior and posterior covariance matrix that captures the correlations between the torques of the different actuators. The inverse dynamics model is trained with machine learning to map the dynamic states of the different actuators and joints to corresponding torques for the different actuators. According to an embodiment, dynamic states of the different actuators and joints are processed with the inverse dynamics model to produce values of the torques for the different actuators of the mechanical system and estimate values of the potential and kinetic energy of the mechanical system. Further, the mechanical system is controlled based on the values of the torques and the values of the potential and kinetic energy of the mechanical system. For instance, control commands are determined based on the values of the torques for the different actuators and the values of the potential and kinetic energy of the mechanical system. Further, the determined control commands are applied to the different actuators to control the mechanical system.

Some example embodiments are directed towards model and control of underactuated robots with energy-based techniques. In this regard, some example embodiments provide techniques for modelling and synthesis of energy-based controllers and control methods for UR systems. Some embodiments provide a Lagrangian Inspired Polynomial (LIP) estimator as a black-box estimator based on Gaussian Process Regression. The LIP estimator relies on a multidimensional, multi-output kernel that embeds the structure of the Euler-Lagrange equation. According to some embodiments, the LIP estimator learns various components of the inverse dynamics map, as well as the kinetic and potential energies of the UR system. Some embodiments utilize the LIP estimator to estimate values of kinetic and potential energies of the UR system, as well as the inertial, Coriolis, and gravity components directly from the overall torque measures. Some embodiments further utilize these properties to derive an energy-based controller for the stabilization and control of complex robots such as UR. The energy-based controller performs a partial feedback linearization on the actuated system and a regulation of the energy to steer the non-actuated system to a trajectory passing through the unstable equilibrium. Once the system is sufficiently close to the target, the control is switched to a Linear Quadratic Regulator (LQR) controller. The LIP model is suitable to implement this kind of controller since it returns the inertia matrix, the Coriolis and gravity torques, energy estimates, and the linearization of the system dynamics required by the LQR.
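As a non-limiting illustration of the switching between the energy-based controller and the LQR controller, the following minimal sketch selects a control mode from the distance between the current state and the target state. The gain vector K, the scalar gain k_e, the threshold switch_radius, and the particular energy-regulation term are hypothetical choices made for this example; the energy term is a commonly used illustrative form and is not asserted to be the control law of the disclosed embodiments. The partial feedback linearization of the actuated joints is omitted, and angle wrapping is ignored for brevity.

```python
import math

def lqr_feedback(K, x, x_target):
    """Single-input state feedback u = -K (x - x_target)."""
    return -sum(k * (xi - ti) for k, xi, ti in zip(K, x, x_target))

def energy_regulation(k_e, energy, energy_target, qd_passive):
    """Illustrative energy-regulation term (assumption): the input is
    proportional to the energy error and to the passive joint velocity."""
    return k_e * (energy_target - energy) * qd_passive

def hybrid_control(x, x_target, energy, energy_target, qd_passive,
                   K, k_e, switch_radius):
    """Return (mode, u): LQR near the target, energy regulation elsewhere."""
    dist = math.sqrt(sum((xi - ti) ** 2 for xi, ti in zip(x, x_target)))
    if dist < switch_radius:
        return "lqr", lqr_feedback(K, x, x_target)
    return "energy", energy_regulation(k_e, energy, energy_target, qd_passive)

# Hypothetical state [q_passive, qd_passive] with the target at the origin.
K = [20.0, 5.0]
far_mode, far_u = hybrid_control([2.5, 1.0], [0.0, 0.0], 1.0, 4.0, 1.0,
                                 K, k_e=0.5, switch_radius=0.3)
near_mode, near_u = hybrid_control([0.1, 0.0], [0.0, 0.0], 3.9, 4.0, 0.0,
                                   K, k_e=0.5, switch_radius=0.3)
```

In such a sketch, the energy values passed to the controller would be the estimates returned by the learned model, and the gain K would be computed from the linearization of the system dynamics that the model also provides.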

In order to realize the aforementioned objectives and advantages, various embodiments of this disclosure provide feedback controllers, feedback control methods and systems for controlling a mechanical system such as an underactuated robot in accordance with an inverse dynamics model of the system that is learned through machine learning techniques involving GPR.

According to some embodiments, a feedback controller for controlling a mechanical system to perform a task or to track a reference trajectory for performing a task is provided. The mechanical system has multiple degrees of freedom and comprises a plurality of actuators. The feedback controller comprises a memory configured to store an energy-based inverse dynamics model and computer program instructions and a processor configured to execute the instructions for controlling the mechanical system. The energy-based inverse dynamics model is trained with machine learning to map dynamic states of the mechanical system to corresponding torques for the plurality of actuators. In this regard, the energy-based inverse dynamics model is configured to model potential and kinetic energy of the mechanical system as Gaussian Processes of the dynamic states and derive Gaussian Processes for the torques from the Gaussian Processes of the dynamic states of the mechanical system based on physics of relationship between the torques and the potential and kinetic energy of the mechanical system. The processor executes the instructions to collect a feedback signal of an operation of the mechanical system, the feedback signal including current states of dynamics of the mechanical system indicative of a position, a velocity, and an acceleration of each joint of the plurality of joints of the mechanical system. The processor is further configured to process the current states of dynamics with the energy-based inverse dynamics model to produce values of the torques for the plurality of actuators and values of the potential and kinetic energy of the mechanical system. The processor controls the mechanical system based on the produced values of the torques for the plurality of actuators of the mechanical system and the values of the potential and kinetic energy of the mechanical system.

According to some other embodiments, a method for controlling a mechanical system to track a reference trajectory for performing a task is provided. The mechanical system has multiple degrees of freedom and comprises a plurality of actuators. The method utilizes an energy-based inverse dynamics model trained with machine learning to map dynamic states of the mechanical system to corresponding torques for the plurality of actuators. In this regard, the energy-based inverse dynamics model is configured to model potential and kinetic energy of the mechanical system as Gaussian Processes of the dynamic states and derive Gaussian Processes for the torques from the Gaussian Processes of the dynamic states of the mechanical system based on physics of relationship between the torques and the potential and kinetic energy of the mechanical system. The method comprises collecting a feedback signal of an operation of the mechanical system, the feedback signal including current states of dynamics of the mechanical system indicative of a position, a velocity, and an acceleration of each joint of the plurality of joints of the mechanical system. The method further comprises processing the current states of dynamics with the energy-based inverse dynamics model to produce values of the torques for the plurality of actuators and values of the potential and kinetic energy of the mechanical system. The method further comprises controlling the mechanical system based on the produced values of the torques for the plurality of actuators of the mechanical system and the values of the potential and kinetic energy of the mechanical system.

BRIEF DESCRIPTION OF THE DRAWINGS

The presently disclosed embodiments will be further explained with reference to the following drawings. The drawings shown are not necessarily to scale, with emphasis instead generally being placed upon illustrating the principles of the presently disclosed embodiments.

FIG. 1A illustrates a block diagram of a controller for controlling a mechanical system, according to some embodiments;

FIG. 1B illustrates a method for controlling the mechanical system of FIG. 1A, according to some embodiments;

FIG. 2A illustrates a robotic manipulator, according to some embodiments;

FIG. 2B illustrates one example of a full covariance matrix for an underactuated mechanical system, according to some embodiments;

FIG. 3 illustrates a block diagram of a method for formulating an inverse dynamics model, according to some embodiments;

FIG. 4 illustrates a block diagram showing training of hyperparameters of a Lagrangian polynomial kernel, according to an embodiment;

FIG. 5A illustrates a flow diagram of a method for estimation of kinetic energy of the mechanical system, according to some embodiments;

FIG. 5B illustrates a flow diagram of a method for estimation of potential energy of the mechanical system, according to some embodiments;

FIG. 6 illustrates a block diagram of a system for controlling an underactuated mechanical system, according to some embodiments;

FIG. 7 illustrates a flow diagram of a method for anomaly detection based on the estimated kinetic energy and the potential energy of the underactuated mechanical system of FIG. 6, according to some embodiments;

FIG. 8 illustrates a framework for generating a motion plan that consumes a minimum amount of energy for performing a task, according to some embodiments; and

FIG. 9 is a schematic illustrating a computing device for implementing various hardware components, according to some embodiments.

While the above-identified drawings set forth presently disclosed embodiments, other embodiments are also contemplated, as noted in the discussion. This disclosure presents illustrative embodiments by way of representation and not limitation. Numerous other modifications and embodiments can be devised by those skilled in the art which fall within the scope and spirit of the principles of the presently disclosed embodiments.

DETAILED DESCRIPTION

The following description provides exemplary embodiments only, and is not intended to limit the scope, applicability, or configuration of the disclosure. Rather, the following description of the exemplary embodiments will provide those skilled in the art with an enabling description for implementing one or more exemplary embodiments. Contemplated are various changes that may be made in the function and arrangement of elements without departing from the spirit and scope of the subject matter disclosed as set forth in the appended claims.

Also, individual embodiments may be described as a process which is depicted as a flowchart, a flow diagram, a data flow diagram, a structure diagram, or a block diagram. Although a flowchart may describe the operations as a sequential process, many of the operations can be performed in parallel or concurrently. In addition, the order of the operations may be re-arranged. A process may be terminated when its operations are completed but may have additional steps not discussed or included in a figure. Furthermore, not all operations in any particularly described process may occur in all embodiments. A process may correspond to a method, a function, a procedure, a subroutine, a subprogram, etc. When a process corresponds to a function, the function's termination can correspond to a return of the function to the calling function or the main function.

FIG. 1A illustrates a block diagram 100 of a controller 101 for controlling a mechanical system 109, according to some embodiments of the present disclosure. The controller 101 is communicatively coupled to the mechanical system 109. The controller 101 includes a processor 103 and a memory 105. The processor 103 may be a single core processor, a multi-core processor, a computing cluster, or any number of other configurations. The memory 105 may include random access memory (RAM), read only memory (ROM), flash memory, or any other suitable memory systems. Additionally, in some embodiments, the memory 105 may be implemented using a hard drive, an optical drive, a thumb drive, an array of drives, or any combinations thereof.

According to some embodiments, the controller 101 controls the mechanical system 109 to track a reference trajectory for performing a task. The mechanical system 109 has multiple degrees of freedom (DoF) and comprises a plurality of joints that are movable. In this regard, the mechanical system 109 comprises a plurality of actuators and one or more joints of the system 109 are movable by one or more of the actuators. According to some embodiments, the mechanical system 109 may be an underactuated system such as an underactuated robot where at least one joint of the underactuated system is not directly movable by any actuator. “Underactuated” in the present disclosure may correspond to a jointed mechanism in which not all of the joints of the mechanism are actuated, i.e., some of the joints are passive or unactuated. An underactuated robot means a robot in which the number of degrees of freedom of motion that can be directly controlled by the mounted actuators is less than the total number of degrees of freedom of motion of the robot. According to some embodiments, the mechanical system 109 may have all joints directly movable by at least one actuator. However, in some special applications, the mechanical system 109 may be converted into an underactuated system. For example, when at least one actuator associated with at least one joint does not move the corresponding joint, the system 109 may operate as an underactuated system. Such scenarios may arise when one or more actuators are deliberately not fired but the corresponding joint is intended to be moved for performing a task. Alternately or additionally, such scenarios may also arise when one or more actuators malfunction due to operational reasons or due to breakdown in the underlying control system or communication system of such an actuator.

The controller 101 may also be coupled to suitable interfaces to collect a feedback signal for an operation of the mechanical system 109. The feedback signal may include current state of dynamics of the mechanical system 109. The current state of dynamics of the system 109 may be indicative of a position, a velocity, and an acceleration of each joint of the plurality of joints of the mechanical system 109. In this regard, the controller 101 may interface with suitable sensing circuitry that provides measures of the current state of dynamics of the system 109. In some embodiments, the controller 101 may compute the current state of dynamics of the system 109 from one or more observations pertaining to the operation of the mechanical system 109.

The energy-based inverse dynamics model 107 defines a mathematical model of the inverse dynamics of the mechanical system 109. The model 107 maps the current states and the transitioned states of the different actuators to corresponding torques for the different actuators. The model 107 expresses joint torques as a function of joint positions, velocities and accelerations. Some embodiments utilize machine learning-based approaches for deriving the energy-based inverse dynamics model 107 of the system 109. The model 107 embeds physical relationships between the torques of the actuators to improve generalization and data efficiency. In this regard, some embodiments provide the model 107 as a physics-informed learned model for inverse dynamics identification of the mechanical system 109 based on Gaussian Process Regression (GPR). To model the correlations between the different joint torques, some embodiments realize the energy-based inverse dynamics model 107 as a multi-output GPR estimator based on a novel kernel function referred to as a Lagrangian Inspired Polynomial (LIP) kernel, which exploits Lagrangian mechanics to model the correlations between the different joint torques, instead of modelling each joint torque with a distinct Gaussian Process (GP) and assuming the GPs independent of one another given the current joint position, velocity, and acceleration. Details of the structure and training of the model 107 are described later in the disclosure.

The controller 101 processes the feedback signal to produce values of torques for the different actuators of the system 109. In some embodiments, the produced values of the torques may be translated into control commands specifying values of currents, voltages or other physical parameters of the actuators. The control commands may be applied to the different actuators. The control commands cause the different actuators to change their states to track the reference trajectory. The changed states of the different actuators may be input to a feedback controller 111. Based on the changed states of the different actuators, the feedback controller 111 provides a correction to the values of the torques to compensate for errors in the positions and the velocities of the joints to accurately track the reference trajectory.

FIG. 1B illustrates a method 150 for controlling the mechanical system 109 of FIG. 1A by the controller 101, according to some embodiments. FIG. 1B will be described with reference to FIG. 1A. The current states of the dynamics of the mechanical system are obtained 152 by the controller 101. The current state of dynamics of the system 109 may be indicative of a position, a velocity, and an acceleration of each joint of the plurality of joints of the mechanical system 109. The current state of the dynamics of the system 109 is processed 154 with the trained energy-based inverse dynamics model 107. For example, the states of the different actuators and/or joints may be applied as input to the inverse dynamics model 107 to produce values of the torques for the actuators of the mechanical system 109 and values of the potential and kinetic energy of the mechanical system 109. Towards this end, the energy-based inverse dynamics model 107 may be trained with machine learning to map dynamic states of the mechanical system 109 to corresponding torques for the plurality of actuators. Thus, the energy-based inverse dynamics model 107 models the potential and kinetic energy of the mechanical system 109 as Gaussian Processes of the dynamic states. The model 107 derives Gaussian Processes for the torques from the Gaussian Processes of the dynamic states of the mechanical system 109 based on physics of relationship between the torques and the potential and kinetic energy of the mechanical system 109.

Using the values of the torques and the values of the kinetic energy and potential energy of the system 109, control commands may be generated to control 156 the mechanical system 109. For example, the determined control commands may be applied to the different actuators. The control commands cause the different actuators to change their states to track the reference trajectory. The changed states of the different actuators may be input to the feedback controller 111. Based on the changed states of the different actuators, the feedback controller 111 may be configured to provide a correction to the values of the torques to compensate for errors in the positions and the velocities of the joints to accurately track the reference trajectory.
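As a non-limiting illustration of combining the torques produced by the inverse dynamics model with a feedback correction, the following minimal sketch adds a proportional-derivative (PD) term to a feedforward torque for each joint. The PD form of the correction, the gains kp and kd, and the actuated mask that zeroes the command of unactuated joints are assumptions made for this example; the disclosure does not prescribe a particular form for the correction provided by the feedback controller 111.

```python
def torque_command(tau_ff, q, qd, q_ref, qd_ref, kp, kd, actuated):
    """Per-joint command: feedforward torque plus a PD correction.

    tau_ff   : torques predicted by the inverse dynamics model
    q, qd    : measured joint positions and velocities
    q_ref,
    qd_ref   : reference joint positions and velocities
    kp, kd   : hypothetical proportional and derivative gains
    actuated : True for joints that have an actuator; the command of an
               unactuated joint is forced to zero
    """
    command = []
    for i in range(len(tau_ff)):
        correction = kp[i] * (q_ref[i] - q[i]) + kd[i] * (qd_ref[i] - qd[i])
        command.append(tau_ff[i] + correction if actuated[i] else 0.0)
    return command

# Two joints, the second one unactuated (hypothetical values).
cmd = torque_command(
    tau_ff=[1.0, 0.0],
    q=[0.0, 0.0], qd=[0.0, 0.0],
    q_ref=[0.1, 0.2], qd_ref=[0.0, 0.0],
    kp=[10.0, 10.0], kd=[1.0, 1.0],
    actuated=[True, False],
)
```

With zero tracking error the command reduces to the feedforward torque, so the correction acts only when the joints deviate from the reference trajectory.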

In an embodiment, the mechanical system 109 corresponds to a robotic manipulator having different actuators and multiple degrees of freedom to track a reference trajectory for performing a task. An example robotic manipulator is described next with reference to FIG. 2A. For example, the robotic manipulator 200 is configured to track a reference trajectory 205 for performing a task of moving an object 207 to a target location 209. In another example, the robotic manipulator 200 is configured to track a reference trajectory for performing an assembly operation. The robotic manipulator 200 includes joints such as joints 201a, 201b, and 201c. The robotic manipulator 200 further includes links, such as a link 203a, a link 203b, and a link 203c. Each joint is actuated by an actuator such as an electric motor. An action given by the actuator makes the link attached to the joint move. The movement can be rotational or translational according to a type of the joint. The type of the joint may be a revolute joint or a prismatic joint. The actuator takes as input a desired position of the joint and outputs an action that can be current, torque or a quantity that can be transformed to torque which causes the joint to move to the desired position.

Further, the different actuators are equipped with position sensors (such as encoders) that can measure current positions of the joints. In some embodiments, states of the different actuators are defined as positions of the joints. In some other embodiments, the states of the different actuators are defined as a combination of the positions of the joints, velocities of the joints, and/or accelerations of the joints.

According to some alternate embodiments, the manipulator 200 may be an underactuated manipulator where at least one joint of the manipulator 200 is only indirectly movable by the actuators of the manipulator. Such a joint may be referred to as an unactuated joint. For example, the unactuated joint may not be associated with a corresponding actuator(s). Alternately, the unactuated joint may have a corresponding actuator that has malfunctioned due to operational reasons. The absence of an actuator for one or more joints or presence of an unactuated joint in the robotic manipulator operationally makes it an underactuated mechanical system that is characterized by fewer control inputs than degrees of freedom. Such systems are ubiquitous in robotics: examples are manipulators with passive joints, autonomous bicycles and motorcycles, bipedal robots, and most of the aerospace and marine vehicles.

For such underactuated robotic systems, learning inverse dynamics models for control is particularly challenging. For example, torques of the underactuated dimensions are constant signals equal to zero, leading to an ill-posed estimation problem. It is a realization of several embodiments that Gaussian Process Regression (GPR) may be used to learn the correlation between torques of the joints. In this regard, some embodiments realize that the Gaussian process can be designed to model energy of the mechanical system. In contrast with modeling individual torques, modeling the energy captures mutual effects of the torques of the different actuators on each other, which in turn allows learning the correlation among the torques of the different actuators. As a result, the covariance matrix capturing correlations between the torques of the different actuators is a full matrix with non-zero elements inside and outside of the diagonal.

FIG. 2B illustrates one example of a full covariance matrix 211, according to some embodiments of the present disclosure. Diagonal elements 213, i.e., a₁, b₂, and c₃, and non-diagonal elements 215, i.e., a₂, a₃, b₁, b₃, c₁, and c₂, of the full covariance matrix 211 are non-zero elements. Since the elements of the full covariance matrix 211 are non-zero elements, the full covariance matrix 211 captures the correlations between the torques of the different actuators.

FIG. 3 illustrates a flow diagram of a method 300 for formulating the energy-based inverse dynamics model 107 of FIG. 1A, according to some embodiments. In general, GPR can use different types of kernels to learn the inverse dynamics of the mechanical system 109 by modeling its energy. Some embodiments use physics-informed energy kernels that define the energy of the mechanical system 109. The energy kernels increase the accuracy of the inverse dynamics model 107. For example, the energy kernels define the energy using a function of kinetic energy and a function of potential energy. Such kernels advantageously account for the law of conservation of energy of the mechanical system 109.

To that end, at block 301 of the method 300, a Lagrangian function that describes the mechanical system 109 in terms of energies of the mechanical system 109, i.e., kinetic energy and potential energy, is defined. For instance, the Lagrangian function, such as ℒ(q, {dot over (q)}), is defined as the difference between the kinetic energy and the potential energy of the mechanical system 109:

$ℒ (q, \dot{q}) = 𝒯 (q, \dot{q}) - 𝒱 (q)$

- where 𝒯(q, {dot over (q)}) and 𝒱(q) are, respectively, the kinetic energy and the potential energy of the mechanical system 109 with n degrees of freedom.

At block 303 of the method 300, the kinetic energy 𝒯(q, {dot over (q)}) and the potential energy 𝒱(q) are defined as two independent zero-mean Gaussian Processes (GPs) with covariances determined by kernel functions k^𝒯(x, x′) and k^𝒱(x, x′):

$𝒯 ~ GP (0, k^{𝒯} (x, x^{'})), 𝒱 ~ GP (0, k^{𝒱} (x, x^{'})),$

- where x=[q, {dot over (q)}, {umlaut over (q)}] with q being the joint positions, {dot over (q)} being the joint velocities and {umlaut over (q)} being the joint accelerations. A GP prior on 𝒯(q, {dot over (q)}) and 𝒱(q) cannot be used directly in GPR to compute posterior distributions since the kinetic energy and the potential energy are not measured. However, starting from the GP prior on the kinetic energy and the potential energy, some embodiments derive a GP prior for the torques of the inverse dynamics by relying on Lagrangian mechanics. Lagrangian mechanics states that the inverse dynamics equations

$B (q) \ddot{q} + c (q, \dot{q}) + g (q) + \tilde{τ} = τ,$

- also named Lagrange's equations, where B(q) is an inertia matrix, c(q, {dot over (q)}) and g(q) account, respectively, for contributions of fictitious forces and gravity, and {tilde over (τ)} are torques due to friction and unknown dynamical effects, are solutions of a set of differential equations of the Lagrangian function ℒ(q, {dot over (q)}). The differential equation of the i-th row is

$\frac{d}{dt} (\frac{\partial ℒ}{\partial {\dot{q}}^{i}}) - \frac{\partial ℒ}{\partial q^{i}} = τ^{i},$

- where qⁱ, {dot over (q)}ⁱ and τⁱ are, respectively, the i-th components of q, {dot over (q)} and τ. The Lagrangian equations can be rewritten without explicit differentiation with respect to time by applying the chain rule, which leads to the following linear partial differential equation of ℒ:

$\sum_{j = 1}^{n} (\frac{\partial^{2} ℒ}{\partial {\dot{q}}^{i} \partial {\dot{q}}^{j}} {\ddot{q}}^{j} + \frac{\partial^{2} ℒ}{\partial {\dot{q}}^{i} \partial q^{j}} {\dot{q}}^{j}) - \frac{\partial ℒ}{\partial q^{i}} = τ^{i} .$

At block 305 of the method 300, a Lagrangian operator 𝒢_i that maps the Lagrangian function ℒ of the mechanical system 109 to the torques τⁱ of the different actuators is defined by a set of partial differential equations as

$𝒢_{i} ℒ = \sum_{j = 1}^{n} (\frac{\partial^{2} ℒ}{\partial {\dot{q}}^{i} \partial {\dot{q}}^{j}} {\ddot{q}}^{j} + \frac{\partial^{2} ℒ}{\partial {\dot{q}}^{i} \partial q^{j}} {\dot{q}}^{j}) - \frac{\partial ℒ}{\partial q^{i}} = τ^{i}, \text{ and } τ = 𝒢 ℒ = {[\begin{matrix} 𝒢_{1} ℒ & \dots & 𝒢_{n} ℒ \end{matrix}]}^{T} .$
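As an illustrative check of how the operator acts (a worked example under stated assumptions, not part of the disclosed embodiments), consider a single pendulum with n=1. The point mass m, the length l, the gravitational acceleration g₀, and the convention that q is measured from the downward vertical are all assumptions introduced here. The Lagrangian and the resulting torque are

$ℒ (q, \dot{q}) = \frac{1}{2} m l^{2} {\dot{q}}^{2} + m g_{0} l \cos (q), \quad 𝒢_{1} ℒ = \frac{\partial^{2} ℒ}{\partial \dot{q} \partial \dot{q}} \ddot{q} + \frac{\partial^{2} ℒ}{\partial \dot{q} \partial q} \dot{q} - \frac{\partial ℒ}{\partial q} = m l^{2} \ddot{q} + 0 + m g_{0} l \sin (q) = τ^{1},$

which is the familiar pendulum equation of motion. The operator is linear in the Lagrangian, which is the property that later allows a GP prior on the energies to be mapped to a GP prior on the torques.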

Some embodiments model ℒ as a GP since 𝒯 and 𝒱 are two independent GPs. This is because the sum or difference of two independent zero-mean GPs is a zero-mean GP whose kernel is the sum of the two kernels, namely,

$ℒ ~ GP (0, k^{ℒ} (x, x^{'})), k^{ℒ} (x, x^{'}) = k^{𝒯} (x, x^{'}) + k^{𝒱} (x, x^{'}) .$

Further, applying the closure property of GPs under linear operators to τ=𝒢ℒ(x), the inverse dynamics is modeled as a zero-mean GP, τ˜GP(0, k^τ(x, x′)), with covariance function k^τ(x, x′) named the Lagrangian polynomial kernel.

To this end, at block 307 of the method 300, the Lagrangian polynomial kernel that defines the inverse dynamics model 107 is computed based on the Lagrangian operator 𝒢, the kernel function k^𝒯 of the kinetic energy and the kernel function k^𝒱 of the potential energy as

$k^{τ} (x, x^{'}) = [\begin{matrix} 𝒢_{1} 𝒢_{1}^{'} k^{ℒ} (x, x^{'}) & \dots & 𝒢_{1} 𝒢_{n}^{'} k^{ℒ} (x, x^{'}) \\ ⋮ & ⋱ & ⋮ \\ 𝒢_{n} 𝒢_{1}^{'} k^{ℒ} (x, x^{'}) & \dots & 𝒢_{n} 𝒢_{n}^{'} k^{ℒ} (x, x^{'}) \end{matrix}] ,$

- where 𝒢_j′ denotes the operator 𝒢_j applied with respect to the second argument x′.

Therefore, some embodiments of the present disclosure model the inverse dynamics function as an unknown multi-input multi-output function f(x): R³ⁿ→Rⁿ.

Some embodiments are based on the realization that the kernel functions k^𝒱 and k^𝒯 used to define the priors on the potential energy and on the kinetic energy can be formulated as polynomial functions in a space defined by a trigonometric transformation of the state of the mechanical system 109. Such a formulation gives a physics-informed structure to the kernel functions with a compact number of parameters to be learned.

Therefore, at block 309, the kernel function k^𝒯 of the kinetic energy and the kernel function k^𝒱 of the potential energy are characterized as polynomial functions. The kernel functions k^𝒯 and k^𝒱 are defined based on two propositions that characterize 𝒯 and 𝒱 as polynomial functions in a space defined by a trigonometric transformation of the state of the mechanical system 109. The state of the mechanical system 109 may include the positions and velocities of the joints of the mechanical system 109. The trigonometric transformation is defined as follows.

Let qⁱ and {dot over (q)}ⁱ be vectors including the positions and the velocities of the joints up to index i, respectively:

$q^{i} = {[q^{1}, \dots, q^{i}]}^{T} \in ℝ^{i}, {\dot{q}}^{i} = {[{\dot{q}}^{1}, \dots, {\dot{q}}^{i}]}^{T} \in ℝ^{i} .$

N_r and N_p are, respectively, the numbers of revolute and prismatic joints, with N_r+N_p=n. Sets I_r={r₁, . . . , r_N_r} and I_p={p₁, . . . , p_N_p} include the revolute and prismatic joint indexes, respectively. Further, the following vectors are defined:

$\begin{matrix} q_{c} = {[\cos (q^{r_{1}}), \dots, \cos (q^{r_{N_{r}}})]}^{T} \in ℝ^{N_{r}}, \\ q_{s} = {[\sin (q^{r_{1}}), \dots, \sin (q^{r_{N_{r}}})]}^{T} \in ℝ^{N_{r}}, \\ q_{p} = {[q^{p_{1}}, \dots, q^{p_{N_{p}}}]}^{T} \in ℝ^{N_{p}} . \end{matrix}$

q_c^b, q_s^b and q_p^b denote the b-th elements of q_c, q_s and q_p, respectively.

Next, let I_rⁱ (resp. I_pⁱ) be the subset of I_r (resp. I_p) composed of the indexes lower than or equal to i, and let the vectors q_cⁱ, q_sⁱ (resp. q_pⁱ) be defined as the restrictions of q_c, q_s (resp. q_p) to I_rⁱ (resp. I_pⁱ). For the sake of clarity, consider the following example. Let index i be such that r_j≤i<r_(j+1) for some 1≤j<N_r. Then I_rⁱ={r₁, . . . , r_j}, q_cⁱ=[cos(q^(r₁)), . . . , cos(q^(r_j))]^T∈ℝ^j and q_sⁱ=[sin(q^(r₁)), . . . , sin(q^(r_j))]^T∈ℝ^j. To conclude, let q_cs_b be the vector concatenating the b-th elements of q_c and q_s, that is, q_cs_b=[q_c^b, q_s^b]^T.
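The trigonometric transformation described above can be sketched in a few lines of Python. This is a minimal illustration, not the disclosed implementation: the function names `trig_transform` and `restrict`, and the encoding of joint types as the strings "r" and "p", are hypothetical choices made here.

```python
import math

def trig_transform(q, joint_types):
    """Split joint positions into (q_c, q_s, q_p).

    q           -- list of joint positions, one entry per joint
    joint_types -- list of "r" (revolute) or "p" (prismatic), same length as q
    """
    if len(q) != len(joint_types):
        raise ValueError("q and joint_types must have the same length")
    # Cosines and sines are taken only for the revolute joints.
    q_c = [math.cos(qb) for qb, t in zip(q, joint_types) if t == "r"]
    q_s = [math.sin(qb) for qb, t in zip(q, joint_types) if t == "r"]
    # Prismatic joint positions are kept unchanged.
    q_p = [qb for qb, t in zip(q, joint_types) if t == "p"]
    return q_c, q_s, q_p

def restrict(q, joint_types, i):
    """Return (q_c^i, q_s^i, q_p^i), built from joints 1..i only."""
    return trig_transform(q[:i], joint_types[:i])

# Example: a 3-joint arm with joints (revolute, prismatic, revolute).
q = [0.0, 0.25, math.pi / 2]
types = ["r", "p", "r"]
q_c, q_s, q_p = trig_transform(q, types)
```

Here `restrict(q, types, 2)` keeps only the first revolute joint and the prismatic joint, which mirrors the restriction to I_rⁱ and I_pⁱ for i=2.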

According to an embodiment, the kernel function k^𝒱 of the potential energy is a polynomial function in a space defined by a trigonometric transformation of the state of the mechanical system 109. The following proposition establishes that the potential energy is polynomial with respect to the set of variables (q_c, q_s, q_p), which are functions of the joint positions vector q. This set of variables is the trigonometric transformation of the state of the mechanical system 109, that is, the space over which the potential energy is a polynomial function.

Proposition-1: Consider a manipulator with n+1 links and n joints. The total potential energy 𝒱(q) belongs to the space ℙ_(n)(q_c₍₁₎, q_s₍₁₎, q_p₍₁₎), namely it is a polynomial function in (q_c, q_s, q_p) of degree not greater than n, such that each element of q_c, q_s and q_p appears with degree not greater than 1. Moreover, for any monomial of the aforementioned polynomial, the sum of the degrees of q_c^b and q_s^b is lower than or equal to 1, namely, deg(q_c^b)+deg(q_s^b)≤1.

To comply with the constraints on the maximum degree of each term, the kernel function k^𝒱(x, x′) is defined as a product of N_r+N_p inhomogeneous polynomial kernels, where N_r kernels have degree p=1 and each of them is defined on the 2-dimensional input space given by q_cs_b, b∈I_r, and N_p kernels have degree p=1 and each of them is defined on the 1-dimensional input q_p^b, b∈I_p. The resulting kernel function for the potential energy is then given by

$\begin{matrix} k^{𝒱} (x, x^{'}) = \prod_{b \in I_{r}} k_{pk}^{(1)} (q_{{cs}_{b}}, q_{{cs}_{b}}^{'}) \prod_{b \in I_{p}} k_{pk}^{(1)} (q_{p}^{b}, q_{p}^{' b}) . & (A) \end{matrix}$

Each of the n kernels accounts for the contribution of a distinct joint, and the potential energy kernel k^𝒱(x, x′) spans ℙ_(n)(q_c₍₁₎, q_s₍₁₎, q_p₍₁₎). Further, since all the N_r kernels defined on q_cs_b, b∈I_r, have p=1, the constraint deg(q_c^b)+deg(q_s^b)≤1 is satisfied.
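The product structure of eq. (A) can be illustrated with a short Python sketch. The exact parameterization of the inhomogeneous polynomial kernel is not spelled out in this passage, so the form (u·v + offset)^p with unit weights and offset 1 is an assumption made here, as are the function names.

```python
import math

def poly_kernel(u, v, degree, offset=1.0):
    """Inhomogeneous polynomial kernel (u.v + offset)**degree.
    Unit weights and offset=1.0 are assumptions for illustration."""
    return (sum(a * b for a, b in zip(u, v)) + offset) ** degree

def potential_kernel(q, q_prime, joint_types):
    """Eq. (A): product over all joints of degree-1 inhomogeneous kernels."""
    k = 1.0
    for qb, qb_prime, t in zip(q, q_prime, joint_types):
        if t == "r":
            # Revolute joint: 2-D input q_cs_b = [cos(q^b), sin(q^b)].
            u = [math.cos(qb), math.sin(qb)]
            v = [math.cos(qb_prime), math.sin(qb_prime)]
        else:
            # Prismatic joint: 1-D input q_p^b.
            u, v = [qb], [qb_prime]
        k *= poly_kernel(u, v, degree=1)
    return k

# Example: one revolute and one prismatic joint, evaluated at two inputs.
k_example = potential_kernel([0.3, 0.5], [1.1, -0.2], ["r", "p"])
```

Because every factor has degree 1, no monomial in the expanded kernel contains cos(q^b)·sin(q^b) or a squared joint variable, which is the constraint stated in Proposition-1.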

Further, some embodiments are based on the observation that total kinetic energy is a sum of the kinetic energies relative to each link, that is,

$𝒯 (q, \dot{q}) = \sum_{i = 1}^{n} 𝒯_{i} (q, \dot{q}),$

- where 𝒯_i(q, {dot over (q)}) is the kinetic energy of link i. The kinetic energy of link i is a polynomial function with respect to the set of variables (q_cⁱ, q_sⁱ, q_pⁱ, {dot over (q)}ⁱ), which are functions of the joint position and velocity vectors qⁱ and {dot over (q)}ⁱ. Therefore, the following proposition is established for the kinetic energy.

Proposition-2: Consider a manipulator with n+1 links and n joints. The kinetic energy 𝒯_i(q, {dot over (q)}) of link i belongs to ℙ_(2i+2)(q_c₍₂₎ⁱ, q_s₍₂₎ⁱ, q_p₍₂₎ⁱ, {dot over (q)}₍₂₎ⁱ), namely it is a polynomial function in (q_cⁱ, q_sⁱ, q_pⁱ, {dot over (q)}ⁱ) of degree not greater than 2i+2, such that:

- (i) each element of q_cⁱ, q_sⁱ, q_pⁱ and {dot over (q)}ⁱ appears with degree not greater than 2;
- (ii) each monomial contains a term of the type {dot over (q)}ⁱ{dot over (q)}^j for 1≤i≤n and i≤j≤n; and
- (iii) in any monomial, the sum of the degrees of q_c^b and q_s^b is lower than or equal to 2, namely deg(q_c^b)+deg(q_s^b)≤2.

To comply with the constraints and properties stated in the above proposition, a kernel function given by the product of i inhomogeneous polynomial kernels and 1 homogeneous polynomial kernel is adopted, where

- |I_rⁱ| inhomogeneous kernels have p=2 and each of them is defined on the 2-dimensional input space given by q_cs_b, b∈I_rⁱ;
- |I_pⁱ| inhomogeneous kernels have p=2 and each of them is defined on the 1-dimensional input q_p^b, b∈I_pⁱ;
- 1 homogeneous kernel has p=2 and is defined on the i-dimensional input {dot over (q)}ⁱ.

Resulting kernel function for the kinetic energy is then given by

$\begin{matrix} k_{i}^{𝒯} (x, x^{'}) = k_{hpk}^{(2)} ({\dot{q}}^{i}, {\dot{q}}^{' i}) \prod_{b \in I_{r}^{i}} k_{pk}^{(2)} (q_{{cs}_{b}}, q_{{cs}_{b}}^{'}) \cdot \prod_{b \in I_{p}^{i}} k_{pk}^{(2)} (q_{p}^{b}, q_{p}^{' b}) & (B) \end{matrix}$

- where |I_rⁱ|+|I_pⁱ|=i. As all the kernels have p=2, properties (i) and (iii) of Proposition-2 are satisfied. Further, using a homogeneous kernel defined on {dot over (q)}ⁱ with p=2 guarantees the validity of property (ii). Finally, the kernel function of the kinetic energy is defined as

$k^{𝒯} (x, x^{'}) = \sum_{i = 1}^{n} k_{i}^{𝒯} (x, x^{'}) .$
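The per-link structure of eq. (B) and the sum over links can be sketched as follows. As with the potential energy kernel, the kernel forms (u·v + offset)^p and (u·v)^p with unit weights are assumptions made for illustration, and the function names are hypothetical.

```python
import math

def poly_kernel(u, v, degree, offset=1.0):
    """Inhomogeneous polynomial kernel (u.v + offset)**degree (assumed form)."""
    return (sum(a * b for a, b in zip(u, v)) + offset) ** degree

def hom_poly_kernel(u, v, degree):
    """Homogeneous polynomial kernel (u.v)**degree (assumed form)."""
    return sum(a * b for a, b in zip(u, v)) ** degree

def link_kinetic_kernel(i, q, dq, q_prime, dq_prime, joint_types):
    """Eq. (B) for link i (1-based): only joints 1..i contribute."""
    # Homogeneous degree-2 kernel on the velocities of joints 1..i.
    k = hom_poly_kernel(dq[:i], dq_prime[:i], degree=2)
    for b in range(i):
        if joint_types[b] == "r":
            u = [math.cos(q[b]), math.sin(q[b])]
            v = [math.cos(q_prime[b]), math.sin(q_prime[b])]
        else:
            u, v = [q[b]], [q_prime[b]]
        # Inhomogeneous degree-2 kernel on the position of joint b.
        k *= poly_kernel(u, v, degree=2)
    return k

def kinetic_kernel(q, dq, q_prime, dq_prime, joint_types):
    """Total kinetic energy kernel: sum of the per-link kernels."""
    n = len(q)
    return sum(
        link_kinetic_kernel(i, q, dq, q_prime, dq_prime, joint_types)
        for i in range(1, n + 1)
    )
```

The homogeneous velocity factor makes the kernel vanish whenever either velocity vector is zero, consistent with property (ii) that every monomial contains a product of two joint velocities.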

At block 311 of the method 300, the Lagrangian polynomial kernel, k^τ, that defines the inverse dynamics model 107 is computed based on the Lagrangian operator 𝒢, the kernel function of the potential energy (eqn. A) and the kernel function of the kinetic energy (eqn. B) characterized as the polynomial functions

$k^{τ} (x, x^{'}) = [\begin{matrix} 𝒢_{1} 𝒢_{1}^{'} k^{ℒ} (x, x^{'}) & \dots & 𝒢_{1} 𝒢_{n}^{'} k^{ℒ} (x, x^{'}) \\ ⋮ & ⋱ & ⋮ \\ 𝒢_{n} 𝒢_{1}^{'} k^{ℒ} (x, x^{'}) & \dots & 𝒢_{n} 𝒢_{n}^{'} k^{ℒ} (x, x^{'}) \end{matrix}] .$

- where

$k^{ℒ} (x, x^{'}) = k^{𝒯} (x, x^{'}) + k^{𝒱} (x, x^{'}) = \sum_{i = 1}^{n} k_{hpk}^{(2)} ({\dot{q}}^{i}, {\dot{q}}^{' i}) \prod_{b \in I_{r}^{i}} k_{pk}^{(2)} (q_{{cs}_{b}}, q_{{cs}_{b}}^{'}) \prod_{b \in I_{p}^{i}} k_{pk}^{(2)} (q_{p}^{b}, q_{p}^{' b}) + \prod_{b \in I_{r}} k_{pk}^{(1)} (q_{{cs}_{b}}, q_{{cs}_{b}}^{'}) \prod_{b \in I_{p}} k_{pk}^{(1)} (q_{p}^{b}, q_{p}^{' b})$

- adopting for k^𝒯 and k^𝒱 the polynomial kernel functions defined above, and where 𝒢_i are the elements of the Lagrangian operator 𝒢. Proposition-1 characterizes the potential energy of the whole mechanical system 109, while Proposition-2 focuses on the kinetic energy of each link. In general, characterizing the energies for each link and designing a tailored kernel for each link to be combined, as done for the kinetic energy, allows for higher flexibility in terms of regularization and for more accurate predictions.

FIG. 4 illustrates training of the one or more hyperparameters, indicated with the symbol θ, of the Lagrangian polynomial kernel, according to an embodiment. Training input data are used to train the one or more hyperparameters of the Lagrangian polynomial kernel that defines the inverse dynamics model 107. The training input data include a number of degrees of freedom n_DoF 401, a type of each joint (i.e., revolute joint or prismatic joint) 403, data q, {dot over (q)}, {umlaut over (q)} 405 that are the joint positions, joint velocities and joint accelerations, respectively (i.e., states of the system), and data τ 406 that are the torques. The number of degrees of freedom n_DoF 401 and the type of the joint 403 are inputs that a user specifies, and are the only prior knowledge required about the mechanical system 109. The data q, {dot over (q)}, {umlaut over (q)} 405 and the data τ 406 are obtained by executing one or more trajectories on the mechanical system 109. A trajectory may be a sequence of positions if the mechanical system 109 is controlled in position, a sequence of velocities if the mechanical system 109 is controlled in velocity, or a sequence of torques if the mechanical system 109 is controlled in torque. In some embodiments, for example, these trajectories may be sums of sinusoids in the joint positions q.

In an embodiment, the input of the energy-based inverse dynamics model 107 during training is the data q, {dot over (q)}, {umlaut over (q)} 405 and the labels are the torque data 406. The one or more hyperparameters can be trained based on any machine learning algorithm. In some embodiments, the one or more hyperparameters are learned with a machine learning algorithm using maximization of marginal likelihood. The one or more hyperparameters are the parameters that define the kernel function of the Gaussian process. For example, in a standard squared exponential kernel the one or more hyperparameters may be scaling factors and length scales of the squared exponential function. In some embodiments, the one or more hyperparameters may be coefficients of the polynomial functions that define the Lagrangian kernel k^ℒ(x, x′).

According to some embodiments, the inverse dynamics model 107 is a multi-input-multi-output (MIMO) torque estimator model that produces the torques for the different actuators based on the positions of the joints of the mechanical system 109, and velocity and acceleration of the joints providing multiple degrees of freedom. In other words, the positions of the joints, and velocity and acceleration of the multiple degrees of freedom are applied as input to the MIMO torque estimator, and the MIMO torque estimator outputs the torques for the different actuators.

Additionally, or alternatively, in some embodiments, the inverse dynamics model 107 is used to estimate the kinetic energy and the potential energy from torque measurements. For instance, the processor 103 may be configured to process the states of the different actuators with the trained inverse dynamics model 107 to estimate the kinetic energy of the mechanical system 109 and the potential energy of the mechanical system 109. According to some embodiments, after the training of the inverse dynamics model 107, the estimate 409 of the kinetic energy and the estimate 410 of the potential energy may be computed without requiring further training. That is, the training of the model 107 does not require labeled data for the kinetic and/or potential energy. Indeed, the hyperparameters of the torque kernel k_θ^τ are the hyperparameters of the Lagrangian kernel k^ℒ(x, x′), that is, the sum of the potential energy kernel and the kinetic energy kernel, as described previously in this disclosure. According to some embodiments, the potential energy can be computed by combining the quantity 407 obtained during the estimation of the inverse dynamics of the system, the potential kernel k^𝒱 in 408, and the linear operator that maps the Lagrangian function to torques determined at block 305 of FIG. 3. The quantity 407 and the potential kernel in 408 are evaluated at the data points 405, 406, and therefore the data 405, 406 are also necessary to estimate the potential energy. According to some embodiments, the kinetic energy can be computed by combining the quantity 407 obtained during the estimation of the inverse dynamics of the system, the kinetic kernel k^𝒯 in 408, and the linear operator that maps the Lagrangian function to torques determined at block 305 of FIG. 3. The quantity 407 and the kinetic kernel in 408 are evaluated at the data points 405, 406, and therefore the data 405, 406 are also necessary to estimate the kinetic energy. The estimation of the kinetic energy is described in detail later with reference to FIG. 5A and the estimation of the potential energy with reference to FIG. 5B, and mathematically in the remainder of this disclosure.

Inverse Dynamics Model of Underactuated Systems Under the Rigid Body Assumption

Consider a mechanical system with n DOF and let q∈ℝⁿ be the vector of generalized coordinates. The system may have m control inputs (where m<n), each of which actuates a single DOF. The vector q may be partitioned as q^T=[q₁^T, q₂^T], where q₁∈ℝ^m and q₂∈ℝ^(n−m) refer respectively to the actuated and the non-actuated DOFs. Under the rigid body assumption, the inverse dynamics of the underactuated system can be derived from the Euler-Lagrange equations as:

$\begin{matrix} \underbrace{[\begin{matrix} M_{1 1} (q) & M_{1 2} (q) \\ M_{2 1} (q) & M_{2 2} (q) \end{matrix}] [\begin{matrix} {\ddot{q}}_{1} \\ {\ddot{q}}_{2} \end{matrix}]}_{m (q, \ddot{q})} + \underbrace{[\begin{matrix} c_{1} (q, \dot{q}) \\ c_{2} (q, \dot{q}) \end{matrix}]}_{c (q, \dot{q})} + \underbrace{[\begin{matrix} g_{1} (q) \\ g_{2} (q) \end{matrix}]}_{g (q)} = \underbrace{[\begin{matrix} τ_{1} \\ 0 \end{matrix}]}_{τ} & (1) \end{matrix}$

$where$

$M (q) = [\begin{matrix} M_{1 1} (q) & M_{1 2} (q) \\ M_{2 1} (q) & M_{2 2} (q) \end{matrix}]$

- is the symmetric, positive definite inertia matrix, m(q, {umlaut over (q)}) represents the inertial torque, c(q, {dot over (q)}) accounts for the Coriolis and centripetal torques, g(q) represents the gravity contribution, while τ₁∈ℝ^m is the vector of generalized torques produced by the m actuators.

The inverse dynamics identification problem consists of estimating the map in eq. (1) that relates {tilde over (x)}=(q, {dot over (q)}, {umlaut over (q)}) and the torques τ from a set of noisy measures. Black-box solutions treat the inverse dynamics as an unknown function and, generally, rely on universal approximators to estimate the function from experimental data. According to some embodiments, GPR, which is a framework for Bayesian inference widely used in machine learning and robotics, may be adopted in this regard.

The standard approach when using GPR consists of considering the different torque components independently and solving n independent regression problems, one for each generalized coordinate. However, with underactuated systems, torques of the under-actuated dimensions are constant signals equal to zero, leading to an ill-posed estimation problem. This black-box setup prevents the possibility of deriving any inverse dynamics model useful for model-based control strategies.

Model-Based Balancing Control of Underactuated Systems

A particular class of robots described by eq. (1) is known as balancing systems. Common examples of balancing systems are the Cartpole, the Furuta Pendulum, the Acrobot, and the Pendubot. Within such systems, the typical control challenge requires swinging up and balancing the robot in the unstable equilibrium point, hereafter denoted by x_★=[q_★^T, {dot over (q)}_★^T]^T with {dot over (q)}_★=0.

Energy-Based Swing-Up Controller

The first step consists of a partial feedback linearization. From eq. (1) the dynamics of the actuated and non-actuated subsystems are isolated respectively as

$\begin{matrix} M_{1 1} {\ddot{q}}_{1} + M_{1 2} {\ddot{q}}_{2} + c_{1} + g_{1} = τ_{1} & (2) \end{matrix}$

$\begin{matrix} M_{2 1} {\ddot{q}}_{1} + M_{2 2} {\ddot{q}}_{2} + c_{2} + g_{2} = 0 & (3) \end{matrix}$

Eq. (3) may be solved for {umlaut over (q)}₂ as {umlaut over (q)}₂=−M₂₂⁻¹(M₂₁{umlaut over (q)}₁+c₂+g₂), since M₂₂ is invertible (it is a principal submatrix of the positive definite matrix M). Substituting the resulting expression into eq. (2) leads to

$\begin{matrix} {\bar{M}}_{1 1} {\ddot{q}}_{1} + {\bar{c}}_{1} + {\bar{g}}_{1} = τ_{1} & (4) \end{matrix}$

$with$

${\bar{M}}_{1 1} = M_{1 1} - M_{1 2} M_{2 2}^{- 1} M_{2 1},$

${\bar{c}}_{1} = c_{1} - M_{1 2} M_{2 2}^{- 1} c_{2}$

$and$

${\bar{g}}_{1} = g_{1} - M_{1 2} M_{2 2}^{- 1} g_{2} .$

A feedback linearizing controller for eq. (4) can be defined as

$\begin{matrix} τ_{1} = {\bar{M}}_{1 1} u + {\bar{c}}_{1} + {\bar{g}}_{1} & (5) \end{matrix}$

- where u is a design parameter. Choosing τ₁ as in eq. (5) leads to the following closed-loop system:

$\begin{matrix} {\ddot{q}}_{1} = u & (6) \end{matrix}$

$\begin{matrix} M_{2 2} {\ddot{q}}_{2} + c_{2} + g_{2} = - M_{2 1} u & (7) \end{matrix}$
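The partial feedback linearization of eqs. (2) to (7) can be checked numerically with a small Python sketch. It assumes the simplest underactuated case, n=2 and m=1, so that every block of M, c and g is a scalar; the numerical values and function names are hypothetical and chosen only for illustration.

```python
def pfl_torque(M, c, g, u):
    """Eq. (5): actuated torque that enforces ddq1 = u (scalar blocks)."""
    (M11, M12), (M21, M22) = M
    c1, c2 = c
    g1, g2 = g
    M11_bar = M11 - M12 * M21 / M22   # reduced inertia
    c1_bar = c1 - M12 * c2 / M22      # reduced Coriolis term
    g1_bar = g1 - M12 * g2 / M22      # reduced gravity term
    return M11_bar * u + c1_bar + g1_bar

def accelerations(M, c, g, tau1):
    """Solve eq. (1) for (ddq1, ddq2) with torque vector [tau1, 0]."""
    (M11, M12), (M21, M22) = M
    r1 = tau1 - c[0] - g[0]
    r2 = -c[1] - g[1]
    det = M11 * M22 - M12 * M21
    # Cramer's rule for the 2x2 system M [ddq1, ddq2]^T = [r1, r2]^T.
    return (M22 * r1 - M12 * r2) / det, (M11 * r2 - M21 * r1) / det

# Hypothetical values of M, c, g at some state, and a desired acceleration u.
M = ((2.0, 0.5), (0.5, 1.0))
c = (0.1, -0.2)
g = (0.0, 3.0)
u = 1.5
tau1 = pfl_torque(M, c, g, u)
ddq1, ddq2 = accelerations(M, c, g, tau1)
```

With these values, `ddq1` matches `u` as in eq. (6), and `ddq2` satisfies eq. (7), so the unactuated joint is driven only through the coupling term −M₂₁u.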

A linear second-order dynamics for the actuated subsystem may be obtained. Selecting u according to

$\begin{matrix} u = - k_{1} q_{1} - k_{2} {\dot{q}}_{1} + \bar{u}, & (8) \end{matrix}$

$k_{1}, k_{2} > 0$

- makes the linear subsystem in eq. (6) asymptotically stable for ū=0. The remaining design problem is the choice of ū, which can be used to stabilize the non-actuated dynamics in eq. (7). According to some embodiments, ū may be designed based on energy concepts, penalizing the mismatch with respect to the system energy at the desired configuration. This leads to control laws of the form

$\begin{matrix} \bar{u} = f_{e} (𝒯, 𝒱) & (9) \end{matrix}$

- where

$𝒯 (q, \dot{q}) = \frac{1}{2} {\dot{q}}^{T} M (q) \dot{q}$

is the kinetic energy, while 𝒱(q) denotes the potential energy. The specific choice of f_e depends on the system of interest.

It may be noted that the controller presented in this section is not stabilizing the system to a fixed point but only to a manifold. For this reason, in applications such as the swing up of balancing robots, the control must switch to another controller achieving local asymptotic stability to the equilibrium.

Linear Quadratic Regulator

To stabilize the system at the equilibrium x_★, a Linear Quadratic Regulator (LQR) may be used. First, a state space description of the system with the dynamics expressed in eq. (1) is provided. Let the system state x be such that x^T=[q^T, {dot over (q)}^T]. The state evolution can be derived from eq. (1) as

$\begin{matrix} \dot{x} = [\begin{matrix} \dot{q} \\ - M^{- 1} (q) [c (q, \dot{q}) + g (q)] \end{matrix}] + [\begin{matrix} 0_{n} \\ M^{- 1} (q) \end{matrix}] τ = f (x) + g (x, τ) & (10) \end{matrix}$

Then the non-linear system in eq. (10) may be linearized around x_★. Moreover, let τ_★ be the reference input at the equilibrium. Applying a first-order Taylor expansion, the system dynamics around (x_★, τ_★) can be approximated as

$\begin{matrix} \dot{x} = A (x - x_{★}) + B (τ - τ_{★}) & (11) \end{matrix}$

Recalling that at the equilibrium {dot over (q)}_★=0 matrices A and B are

$\begin{matrix} A = \frac{\partial f^{T}}{\partial x} ❘_{x_{★}, τ_{★}} = [\begin{matrix} 0 & I \\ - M^{- 1} (q) \frac{\partial g (q)}{\partial q} & - M^{- 1} (q) C (q, \dot{q}) \end{matrix}] ❘_{x_{★}, τ_{★}} = [\begin{matrix} 0 & I \\ - M^{- 1} (q_{★}) \frac{\partial g (q_{★})}{\partial q} & 0 \end{matrix}] & (12) \end{matrix}$

$and$

$\begin{matrix} B = \frac{\partial g^{T}}{\partial τ} ❘_{x_{★}, τ_{★}} = [\begin{matrix} 0 \\ M^{- 1} (q_{★}) \end{matrix}] & (13) \end{matrix}$

- where C(q, {dot over (q)})∈ℝ^(n×n) is the skew-symmetric Coriolis matrix, such that c(q, {dot over (q)})=C(q, {dot over (q)}){dot over (q)}.

Then, the infinite-horizon optimal control problem on the linearized system is solved, namely,

$\underset{τ}{\arg \min} \int_{0}^{\infty} {(x - x_{★})}^{T} Q (x - x_{★}) + {(τ - τ_{★})}^{T} R (τ - τ_{★}) dt$

- which leads to the control input τ=τ_★−K(x−x_★), with K being the optimal control gain, which can be computed by solving the continuous-time algebraic Riccati equation.
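To make the Riccati step concrete, the following Python sketch solves the continuous-time algebraic Riccati equation in closed form for a deliberately simplified case. It assumes a single-DOF linearization with A=[[0, 1], [a, 0]] and B=[0, b]^T, where a stands for −M⁻¹∂g/∂q and b for M⁻¹ at the equilibrium, together with Q=diag(q1, q2) and scalar R=r. These restrictions and all names are assumptions made here; a multi-DOF system would normally rely on a numerical Riccati solver.

```python
import math

def lqr_gain_2state(a, b, q1, q2, r):
    """LQR gain for x_dot = [[0, 1], [a, 0]] x + [0, b]^T u.

    Solves A^T P + P A - P B R^-1 B^T P + Q = 0 entry by entry for
    P = [[p11, p12], [p12, p22]] and returns K = R^-1 B^T P = (k1, k2).
    Requires b != 0, q1 >= 0, q2 >= 0 and r > 0.
    """
    s = b * b / r
    # (1,1) entry: 2*a*p12 - s*p12**2 + q1 = 0, stabilizing (positive) root.
    p12 = (a + math.sqrt(a * a + s * q1)) / s
    # (2,2) entry: 2*p12 - s*p22**2 + q2 = 0, positive root.
    p22 = math.sqrt((2.0 * p12 + q2) / s)
    # (1,2) entry: p11 + a*p22 - s*p12*p22 = 0.
    p11 = s * p12 * p22 - a * p22
    k1 = b * p12 / r
    k2 = b * p22 / r
    return (k1, k2), ((p11, p12), (p12, p22))

# Hypothetical inverted-pendulum-like values: a > 0 makes the open loop unstable.
(k1, k2), P = lqr_gain_2state(a=9.81, b=1.0, q1=10.0, q2=1.0, r=1.0)
```

The closed-loop characteristic polynomial is s² + b·k2·s + (b·k1 − a), so the equilibrium is stabilized whenever b·k2 > 0 and b·k1 > a, which the stabilizing root of the Riccati equation guarantees.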

Lagrangian Inspired Polynomial Estimator for Modeling and Control of Under-Actuated Systems

The LIP estimator is based on GPR, which is a framework for Bayesian inference widely used in machine learning and robotics applications. Generally, GPR solutions for inverse dynamics identification model each torque component τⁱ({tilde over (x)}) as a Gaussian Process (GP) by assuming that the components τⁱ({tilde over (x)}) are independent given {tilde over (x)}, and then apply standard GPR inference. As discussed in the previous section, this approach is not effective for underactuated systems. The LIP estimator follows an alternative strategy and defines the kinetic and potential energies as two independent GPs. Then, it derives a multi-output kernel of the torques by exploiting the Euler-Lagrange (EL) equations. In this way, the inverse dynamics problem is well defined also in the underactuated setup.

Some embodiments model 𝒯 and 𝒱 as independent GPs, namely 𝒯˜GP(0, k^𝒯(⋅,⋅)) and 𝒱˜GP(0, k^𝒱(⋅,⋅)), where k^𝒯 and k^𝒱 are the kernel functions that define the covariances of 𝒯 and 𝒱. For instance, let {tilde over (x)} and {tilde over (x)}′ be two input locations, then the covariance between the values of 𝒯 at {tilde over (x)} and {tilde over (x)}′ is E[𝒯({tilde over (x)})𝒯({tilde over (x)}′)]=k^𝒯({tilde over (x)}, {tilde over (x)}′). For convenience, K_XX′^𝒯, that is, the matrix that collects k^𝒯 evaluated at X={{tilde over (x)}₁, . . . , {tilde over (x)}_N}, X′={{tilde over (x)}′₁, . . . , {tilde over (x)}′_M}, is given as:

$K_{{XX}^{'}}^{𝒯} = [\begin{matrix} k^{𝒯} ({\tilde{x}}_{1}, {\tilde{x}}_{1}^{'}) & \dots & k^{𝒯} ({\tilde{x}}_{1}, {\tilde{x}}_{M}^{'}) \\ ⋮ & ⋮ & ⋮ \\ k^{𝒯} ({\tilde{x}}_{N}, {\tilde{x}}_{1}^{'}) & \dots & k^{𝒯} ({\tilde{x}}_{N}, {\tilde{x}}_{M}^{'}) \end{matrix}] .$

Similarly,

$K_{{XX}^{'}}^{𝒱} = [\begin{matrix} k^{𝒱} ({\tilde{x}}_{1}, {\tilde{x}}_{1}^{'}) & \dots & k^{𝒱} ({\tilde{x}}_{1}, {\tilde{x}}_{M}^{'}) \\ ⋮ & ⋮ & ⋮ \\ k^{𝒱} ({\tilde{x}}_{N}, {\tilde{x}}_{1}^{'}) & \dots & k^{𝒱} ({\tilde{x}}_{N}, {\tilde{x}}_{M}^{'}) \end{matrix}]$

The LIP estimator defines k^𝒯 and k^𝒱 relying on a polynomial formulation as described previously with reference to FIG. 3.

It may be noted that, (i) since 𝒯 and 𝒱 are defined as zero-mean GPs, by the properties of GPs the Lagrangian function ℒ=𝒯−𝒱 is also a zero-mean GP with kernel k^ℒ(⋅,⋅)=k^𝒯(⋅,⋅)+k^𝒱(⋅,⋅), namely ℒ˜GP(0, k^ℒ(⋅,⋅)). Furthermore, (ii) under rigid body assumptions, each τⁱ({tilde over (x)}) is described by a linear differential equation of ℒ, namely,

$τ^{i} = \frac{d}{dt} (\frac{\partial ℒ}{\partial {\dot{q}}^{i}}) - \frac{\partial ℒ}{\partial q^{i}} .$

Expanding the explicit derivative with respect to time gives:

$τ^{i} = \sum_{j = 1}^{n} (\frac{\partial^{2} ℒ}{\partial {\dot{q}}^{i} \partial {\dot{q}}^{j}} {\ddot{q}}^{j} + \frac{\partial^{2} ℒ}{\partial {\dot{q}}^{i} \partial q^{j}} {\dot{q}}^{j}) - \frac{\partial ℒ}{\partial q^{i}} = : 𝒢_{i} ℒ,$

- where 𝒢_i is the linear operator that maps ℒ to τⁱ through the above linear differential equation. Finally, (iii) GPs are closed with respect to linear operators, namely, if f is a zero-mean GP with kernel k^f(⋅,⋅) and g({tilde over (x)})=𝒢f({tilde over (x)}), then g({tilde over (x)}) is also a zero-mean GP with kernel k^g({tilde over (x)}, {tilde over (x)}′)=𝒢𝒢′k^f({tilde over (x)}, {tilde over (x)}′). The last expression means that k^g({tilde over (x)}, {tilde over (x)}′) is obtained by applying the operator 𝒢 twice to k^f({tilde over (x)}, {tilde over (x)}′), first with respect to the input {tilde over (x)} and then with respect to {tilde over (x)}′.

Based on (i), (ii), and (iii), some embodiments realize that torques are a zero-mean GP, with covariance defined by a multi-output kernel k^τ({tilde over (x)}, {tilde over (x)}′)∈ℝ^(n×n) that encodes the EL equations, and is expressed as

$\begin{matrix} k^{τ} (\tilde{x}, {\tilde{x}}^{'}) = [\begin{matrix} 𝒢_{1} 𝒢_{1}^{'} k^{ℒ} (\tilde{x}, {\tilde{x}}^{'}) & \dots & 𝒢_{1} 𝒢_{n}^{'} k^{ℒ} (\tilde{x}, {\tilde{x}}^{'}) \\ ⋮ & ⋱ & ⋮ \\ 𝒢_{n} 𝒢_{1}^{'} k^{ℒ} (\tilde{x}, {\tilde{x}}^{'}) & \dots & 𝒢_{n} 𝒢_{n}^{'} k^{ℒ} (\tilde{x}, {\tilde{x}}^{'}) \end{matrix}] & (14) \end{matrix}$

To derive (14) the multi-output version of property (iii) may be applied. This is also described previously with reference to FIG. 3.

Once k^τ is defined, torque estimates may be computed following standard GPR. Let X be a set of N training input locations, and y=[y₁^T, . . . , y_N^T]^T the respective torque measurements, with y_i∈ℝⁿ equal to the torque measurements at input {tilde over (x)}_i. The LIP torque estimate at a general input location {tilde over (x)} is

$\begin{matrix} \hat{τ} (\tilde{x}) = K_{\tilde{x} X}^{τ} {(K_{XX}^{τ} + Σ_{e})}^{- 1} y & (15) \end{matrix}$

- where Σ_e is a regularization parameter that accounts for the additive Gaussian noise modeled by GPR. For each dimension, independent and identically distributed noise may be assumed, thus obtaining a block diagonal matrix with equal diagonal blocks, namely Σ_e=diag(Σ_e₁, . . . , Σ_e_N), with Σ_e₁=Σ_e₂= . . . =Σ_e_N=diag(σ_e₁², . . . , σ_e_n²), where σ_e_i² is the variance of the τⁱ measurements.
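The estimate of eq. (15) can be illustrated end to end on a toy 1-DOF revolute joint using only the Python standard library. This sketch makes strong simplifying assumptions that are not part of the disclosure: instead of the full polynomial kernel, the Lagrangian is given the reduced feature basis φ(x)=[q̇², cos(q), sin(q)], so that k^ℒ=φ(x)·φ(x′) and applying the operator to both arguments gives k^τ=(𝒢φ(x))·(𝒢φ(x′)). The training data are noise-free and generated from a hypothetical pendulum with τ=0.5q̈+2sin(q); all names are illustrative.

```python
import math

def g_features(x):
    """Lagrangian operator applied to each feature of the assumed basis
    phi(x) = [dq**2, cos(q), sin(q)] for one revolute joint."""
    q, dq, ddq = x
    return [2.0 * ddq, math.sin(q), -math.cos(q)]

def k_tau(x, x_prime):
    """Torque kernel obtained by applying the operator to both arguments."""
    return sum(a * b for a, b in zip(g_features(x), g_features(x_prime)))

def solve(A, b):
    """Solve A z = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for k in range(col, n + 1):
                M[r][k] -= f * M[col][k]
    z = [0.0] * n
    for i in reversed(range(n)):
        z[i] = (M[i][n] - sum(M[i][j] * z[j] for j in range(i + 1, n))) / M[i][i]
    return z

def lip_fit(X, y, noise_var):
    """Compute alpha = (K_XX + noise_var * I)^-1 y."""
    K = [[k_tau(xi, xj) + (noise_var if i == j else 0.0)
          for j, xj in enumerate(X)] for i, xi in enumerate(X)]
    return solve(K, y)

def lip_predict(x, X, alpha):
    """Eq. (15): torque estimate K_xX alpha at a new input x."""
    return sum(k_tau(x, xi) * a for xi, a in zip(X, alpha))

def true_torque(x):
    """Hypothetical pendulum used only to generate training data."""
    q, dq, ddq = x
    return 0.5 * ddq + 2.0 * math.sin(q)

# Training inputs x = (q, dq, ddq) and the corresponding torques.
X = [(-1.0, 0.3, 0.5), (-0.4, -0.2, -1.0), (0.2, 1.0, 2.0),
     (0.9, -0.7, -0.3), (1.5, 0.1, 1.2), (2.3, 0.4, -2.0)]
y = [true_torque(x) for x in X]
alpha = lip_fit(X, y, noise_var=1e-6)
tau_hat = lip_predict((0.7, -0.3, 1.1), X, alpha)
```

In this toy setting the prediction at the unseen input closely matches the generating torque, because the generating Lagrangian lies in the span of the assumed features. With the full kernels of eqs. (A) and (B), the same two steps (solve for the weights once, then take kernel-weighted sums) apply, only with an n×n kernel block per pair of inputs.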

LIP for Control

Next, the estimation of the kinetic and potential energies, of the inertial, Coriolis, and gravity vectors, as well as of ∂g/∂q, required by the control laws presented above, such as the energy-based law of eq. (9), is described.

Kinetic and Potential Energies

FIG. 5A illustrates a block diagram of a method 500A for estimation of the kinetic energy, according to some embodiments. FIG. 5B illustrates a block diagram of a method 500B for estimation of the potential energy, according to some embodiments.

The kinetic energy 𝒯 and the potential energy 𝒱 are required to implement the energy-based control law described by eq. (9). The LIP model provides a principled way to estimate them from the torque measurements y. Indeed, within the LIP framework, 𝒯, 𝒱, and τ are jointly Gaussian distributed, since the prior of τ is derived by applying the linear operator 𝒢 to the kinetic and potential GPs 𝒯 and 𝒱. The covariances between 𝒯 and τ and between 𝒱 and τ at general input locations {tilde over (x)} and {tilde over (x)}′ are

$\begin{matrix} Cov [𝒯 (\tilde{x}), τ ({\tilde{x}}^{'})] = Cov [𝒯 (\tilde{x}), 𝒢 ℒ ({\tilde{x}}^{'})] =: k^{𝒯 τ} (\tilde{x}, {\tilde{x}}^{'}), & (16 a) \end{matrix}$

$\begin{matrix} Cov [𝒱 (\tilde{x}), τ ({\tilde{x}}^{'})] = Cov [𝒱 (\tilde{x}), 𝒢 ℒ ({\tilde{x}}^{'})] =: k^{𝒱 τ} (\tilde{x}, {\tilde{x}}^{'}), & (16 b) \end{matrix}$

Recalling that 𝒯 and 𝒱 are modelled as independent GPs, and in view of the properties of GPs under linear operators,

$\begin{matrix} k^{𝒯 τ} (\tilde{x}, {\tilde{x}}^{'}) = Cov [𝒯 (\tilde{x}), 𝒢 𝒯 ({\tilde{x}}^{'})] = {[𝒢^{'} k^{𝒯} (\tilde{x}, {\tilde{x}}^{'})]}^{T} = [𝒢_{1}^{'} k^{𝒯} (\tilde{x}, {\tilde{x}}^{'}), \dots, 𝒢_{n}^{'} k^{𝒯} (\tilde{x}, {\tilde{x}}^{'})] & (17 a) \end{matrix}$

$\begin{matrix} k^{𝒱 τ} (\tilde{x}, {\tilde{x}}^{'}) = Cov [𝒱 (\tilde{x}), 𝒢 𝒱 ({\tilde{x}}^{'})] = {[𝒢^{'} k^{𝒱} (\tilde{x}, {\tilde{x}}^{'})]}^{T} = [𝒢_{1}^{'} k^{𝒱} (\tilde{x}, {\tilde{x}}^{'}), \dots, 𝒢_{n}^{'} k^{𝒱} (\tilde{x}, {\tilde{x}}^{'})] & (17 b) \end{matrix}$

- may be obtained.

The Gaussian properties make the posterior distributions of 𝒯 and 𝒱 given y known analytically. At any general input location x, these posteriors are Gaussian distributions and are therefore exactly defined by their mean and variance. The means are computed as

$\begin{matrix} E [𝒯 (x) | 𝒟] = K_{xX}^{𝒯 τ} {(K_{XX}^{τ} + \Sigma_{e})}^{- 1} y, & (18 a) \end{matrix}$

$\begin{matrix} E [𝒱 (x) | 𝒟] = K_{xX}^{𝒱 τ} {(K_{XX}^{τ} + \Sigma_{e})}^{- 1} y, & (18 b) \end{matrix}$

- for 𝒯 and 𝒱, respectively. The variances are computed as

$\begin{matrix} 𝕍 [𝒯 (x)] = k^{𝒯} (x, x) - K_{xX}^{𝒯 τ} {(K_{XX}^{τ} + \Sigma_{e})}^{- 1} {(K_{xX}^{𝒯 τ})}^{T}, & (19 a) \end{matrix}$

$\begin{matrix} 𝕍 [𝒱 (x)] = k^{𝒱} (x, x) - K_{xX}^{𝒱 τ} {(K_{XX}^{τ} + \Sigma_{e})}^{- 1} {(K_{xX}^{𝒱 τ})}^{T}, & (19 b) \end{matrix}$

- for 𝒯 and 𝒱, respectively.

From the posterior distributions of 𝒯 and 𝒱 given y, an estimate of the energies at an arbitrary input location {tilde over (x)} may be obtained as

$\begin{matrix} \hat{𝒯} (\tilde{x}) = K_{\tilde{x} X}^{𝒯 τ} {(K_{XX}^{τ} + \Sigma_{e})}^{- 1} y, & (20 a) \end{matrix}$

$\begin{matrix} \hat{𝒱} (\tilde{x}) = K_{\tilde{x} X}^{𝒱 τ} {(K_{XX}^{τ} + \Sigma_{e})}^{- 1} y, & (20 b) \end{matrix}$

- where the covariance matrices $K_{\tilde{x} X}^{𝒯 τ} \in ℝ^{1 \times n N}$ and $K_{\tilde{x} X}^{𝒱 τ} \in ℝ^{1 \times n N}$ are obtained as

$K_{\tilde{x} X}^{𝒯 τ} = [k^{𝒯 τ} (\tilde{x}, {\tilde{x}}_{1}), \dots, k^{𝒯 τ} (\tilde{x}, {\tilde{x}}_{N})]$

and

$K_{\tilde{x} X}^{𝒱 τ} = [k^{𝒱 τ} (\tilde{x}, {\tilde{x}}_{1}), \dots, k^{𝒱 τ} (\tilde{x}, {\tilde{x}}_{N})] .$
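The estimation of a latent energy from torque measurements may be illustrated with a deliberately simplified Python sketch. It assumes a single joint at rest, so that the measured torque reduces to τ=∂𝒱/∂q, and it places a squared-exponential prior on 𝒱; both are assumptions made only for this illustration and are not the kernels of the LIP model. The function names and the length-scale `ELL` are hypothetical. The cross-covariance and the torque covariance are obtained by differentiating the prior kernel, in the same way that the linear operator is applied in eq. (17b).

```python
import numpy as np

ELL = 1.0  # assumed length-scale of the illustrative prior kernel


def k_V(q, qp):
    """Prior covariance Cov[V(q), V(q')] (squared exponential, assumed)."""
    d = q[:, None] - qp[None, :]
    return np.exp(-0.5 * d**2 / ELL**2)


def k_V_tau(q, qp):
    """Cross-covariance Cov[V(q), dV/dq'(q')] = d k_V / d q'."""
    d = q[:, None] - qp[None, :]
    return k_V(q, qp) * d / ELL**2


def k_tau(q, qp):
    """Torque covariance Cov[dV/dq(q), dV/dq'(q')] = d^2 k_V / (dq dq')."""
    d = q[:, None] - qp[None, :]
    return k_V(q, qp) * (1.0 / ELL**2 - d**2 / ELL**4)


def energy_posterior(q_test, q_train, y, noise_var):
    """Posterior mean and variance of V at q_test, as in eqs. (18b), (19b)."""
    A = k_tau(q_train, q_train) + noise_var * np.eye(len(q_train))
    L = np.linalg.cholesky(A)
    K_x = k_V_tau(q_test, q_train)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
    mean = K_x @ alpha
    W = np.linalg.solve(L, K_x.T)  # so that sum(W**2) = K_x A^{-1} K_x^T
    var = np.diag(k_V(q_test, q_test)) - np.sum(W**2, axis=0)
    return mean, var


# Toy data: V(q) = -cos(q), hence the static torque is sin(q).
q_train = np.linspace(-2.0, 2.0, 12)
y = np.sin(q_train)
mean, var = energy_posterior(np.array([0.0, 1.0]), q_train, y, 1e-6)
print("estimated V(1) - V(0):", mean[1] - mean[0])
print("true      V(1) - V(0):", np.cos(0.0) - np.cos(1.0))
print("posterior variances:", var)
```

Because torques depend only on derivatives of the energy, the data in this toy setting determine 𝒱 up to an additive constant; the sketch therefore compares energy differences rather than absolute values.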

Referring to FIG. 5A, at 501 of the method 500A, the covariance between the kinetic energy 𝒯 and the torques τ is computed as per eq. 17a above. At 503, a probabilistic posterior distribution of the kinetic energy is computed as per eq. 18a and 19a above. At block 505, the kinetic energy is estimated from the probabilistic posterior distribution of the kinetic energy based on the covariance between the kinetic energy 𝒯 and the torques τ, as

$E [𝒯 (x) | 𝒟] = K_{xX}^{𝒯 τ} {(K_{XX}^{τ} + \Sigma_{e})}^{- 1} y .$

Referring to FIG. 5B, at 507 of the method 500B, the covariance between the potential energy 𝒱 and the torques τ is computed as per eq. 17b above. At 509, a probabilistic posterior distribution of the potential energy is computed as per eq. 18b and 19b above. At block 511, the potential energy is estimated from the probabilistic posterior distribution of the potential energy based on the covariance between the potential energy 𝒱 and the torques τ, as

$E [𝒱 (x) | 𝒟] = K_{xX}^{𝒱 τ} {(K_{XX}^{τ} + \Sigma_{e})}^{- 1} y .$

Torque Components

The energy-based control law in eq. (5) requires estimating m, c and g, while the LQR described previously requires the inverse of the inertia matrix M as well as the term ∂g/∂q. These quantities are derived similarly to how the energies are computed as described in the previous section.

First, the inertia matrix is estimated component wise. The element in position ij of M is

$M^{ij} (q) = \frac{\partial^{2} 𝒯 (q, \dot{q})}{\partial {\dot{q}}^{i} \partial {\dot{q}}^{j}} =: ℳ_{ij} 𝒯 (\tilde{x}),$

- where we introduced the linear operator ℳ_ij. The covariance between M^ij and τ at general input locations {tilde over (x)} and {tilde over (x)}′ can be computed as

$Cov [M^{ij} (\tilde{x}), τ ({\tilde{x}}^{'})] = Cov [ℳ_{ij} 𝒯 (\tilde{x}), 𝒢 𝒯 ({\tilde{x}}^{'})] = ℳ_{ij} k^{𝒯 τ} (\tilde{x}, {\tilde{x}}^{'}) =: k^{M^{ij} τ} (\tilde{x}, {\tilde{x}}^{'}) .$

Accordingly, M^ij may be estimated at any input location {tilde over (x)} as

${\hat{M}}^{ij} (\tilde{x}) = K_{\tilde{x} X}^{M^{ij} τ} {(K_{XX}^{τ} + \Sigma_{e})}^{- 1} y$

with

$K_{\tilde{x} X}^{M^{ij} τ} = [k^{M^{ij} τ} (\tilde{x}, {\tilde{x}}_{1}), \dots, k^{M^{ij} τ} (\tilde{x}, {\tilde{x}}_{N})] .$

Then, from the estimate of the inertia matrix an estimate of the inertial torque component may be derived as {circumflex over (m)}({tilde over (x)})={circumflex over (M)}({tilde over (x)}){umlaut over (q)}.

Next, the gravity contribution g may be estimated. Recall that the i-th component of the vector g is defined as

$g^{i} (q) = \frac{\partial 𝒱 (q)}{\partial q^{i}} .$

The covariance between gⁱ and τ at general input locations {tilde over (x)} and {tilde over (x)}′ is

$Cov [g^{i} (\tilde{x}), τ ({\tilde{x}}^{'})] = Cov [\frac{\partial 𝒱 (\tilde{x})}{\partial q^{i}}, 𝒢 𝒱 ({\tilde{x}}^{'})] = \frac{\partial k^{𝒱 τ} (\tilde{x}, {\tilde{x}}^{'})}{\partial q^{i}} =: k^{g^{i} τ} (\tilde{x}, {\tilde{x}}^{'}) .$

Accordingly, gⁱ can be estimated at any input location {tilde over (x)} as

${\hat{g}}^{i} (\tilde{x}) = K_{\tilde{x} X}^{g^{i} τ} {(K_{XX}^{τ} + \Sigma_{e})}^{- 1} y,$

with

$K_{\tilde{x} X}^{g^{i} τ} = [k^{g^{i} τ} (\tilde{x}, {\tilde{x}}_{1}), \dots, k^{g^{i} τ} (\tilde{x}, {\tilde{x}}_{N})] .$

Given the estimates of m and g, it is possible to obtain an estimate of the Coriolis and centripetal contribution c as ĉ({tilde over (x)})={circumflex over (τ)}({tilde over (x)})−{circumflex over (m)}({tilde over (x)})−ĝ({tilde over (x)}).
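As a small worked illustration of the decomposition above, the following Python sketch computes the inertial torque as the product of the estimated inertia matrix and the joint accelerations, and obtains the Coriolis and centripetal contribution as the residual of the torque estimate. The function name `torque_components` and the numeric values are hypothetical and serve only to make the arithmetic concrete.

```python
import numpy as np


def torque_components(tau_hat, M_hat, g_hat, qdd):
    """Split an estimated torque into inertial and Coriolis/centripetal parts.

    m_hat = M_hat @ qdd             (inertial contribution)
    c_hat = tau_hat - m_hat - g_hat (residual contribution)
    """
    m_hat = M_hat @ qdd
    c_hat = tau_hat - m_hat - g_hat
    return m_hat, c_hat


# Hypothetical two-joint values chosen only for illustration.
M_hat = np.array([[2.0, 0.5],
                  [0.5, 1.0]])
qdd = np.array([1.0, -2.0])
tau_hat = np.array([3.0, 0.0])
g_hat = np.array([0.5, 0.25])

m_hat, c_hat = torque_components(tau_hat, M_hat, g_hat, qdd)
print(m_hat)  # [ 1.  -1.5]
print(c_hat)  # [1.5  1.25]
```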

Finally, the matrix

$\frac{\partial g}{\partial q} \in ℝ^{n \times n}$

required in the computation of matrix A in eq. (12), is estimated following the same procedure adopted for the inertia matrix. Its element in position ij is given by

${[\frac{\partial g (q)}{\partial q}]}_{ij} = \frac{\partial^{2} 𝒱 (q)}{\partial q^{i} \partial q^{j}} =: G^{ij} (\tilde{x}) .$

Then, the covariance between G^ij({tilde over (x)}) and the torque vector τ at general input locations {tilde over (x)} and {tilde over (x)}′ is

$Cov [G^{ij} (\tilde{x}), τ ({\tilde{x}}^{'})] = Cov [\frac{\partial^{2} 𝒱 (\tilde{x})}{\partial q^{i} \partial q^{j}}, 𝒢 𝒱 ({\tilde{x}}^{'})] = \frac{\partial^{2} k^{𝒱 τ} (\tilde{x}, {\tilde{x}}^{'})}{\partial q^{i} \partial q^{j}} =: k^{G^{ij} τ} (\tilde{x}, {\tilde{x}}^{'}) .$

Finally, the estimate at a general input location {tilde over (x)} is computed as

${\hat{G}}^{ij} (\tilde{x}) = K_{\tilde{x} X}^{G^{ij} τ} {(K_{XX}^{τ} + \Sigma_{e})}^{- 1} y,$

with

$K_{\tilde{x} X}^{G^{ij} τ} = [k^{G^{ij} τ} (\tilde{x}, {\tilde{x}}_{1}), \dots, k^{G^{ij} τ} (\tilde{x}, {\tilde{x}}_{N})] .$

FIG. 6 illustrates a block diagram of a system 600 for controlling an underactuated mechanical system 602, according to some embodiments. The system 600 comprises an energy-based controller 604 that generates control commands 601 to control the underactuated mechanical system 602 to track a trajectory 603. In this regard, the controller 604 interfaces with a trained energy-based inverse dynamics model 606 such as the model 107 of FIGS. 1A and 4. The model 606 takes the trajectory 603 and current states 609 as input to produce values of energies 605 and dynamic components 607 of the underactuated mechanical system 602. According to some embodiments, the model 606 outputs values of the kinetic and potential energies of the system 602 and one or more dynamic components such as the inertia, the inertial torques, the Coriolis and centripetal torques, the gravity contributions, etc.

Some embodiments are based on the realization that the estimated kinetic energy and the estimated potential energy can be used to detect an anomaly of the underactuated mechanical system 602 during operation of the mechanical system 602 (e.g., while performing the task). The anomaly detection based on the estimated kinetic energy and the estimated potential energy of the system 602 is explained next.

FIG. 7 illustrates a flow diagram of a method 700 for an anomaly detection based on the estimated kinetic energy and the estimated potential energy of the underactuated mechanical system 602, according to some embodiments. At block 701, the kinetic energy and the potential energy of the underactuated mechanical system 602 are estimated, for example, in accordance with the framework described above with reference to FIG. 5A and FIG. 5B.

Further, the method 700 comprises comparing 703 the estimated kinetic energy and the estimated potential energy with a respective threshold. Towards this end, the estimated kinetic energy may be compared with a first threshold and the estimated potential energy may be compared with a second threshold. Based on such comparison, the anomaly is detected 705. For instance, at block 703, it may be checked if the estimated kinetic energy and the estimated potential energy are greater than the respective threshold. If the estimated kinetic energy and the estimated potential energy are greater than the first threshold and the second threshold, respectively, then, at block 705, it is inferred that the anomaly is detected. If the estimated kinetic energy and the estimated potential energy are not greater than the first threshold and the second threshold, respectively, then, at block 707, it is inferred that no anomaly is detected.

Alternatively, in some embodiments, it may be checked if the estimated kinetic energy and the estimated potential energy are less than the first threshold and the second threshold, respectively. If the estimated kinetic energy and the estimated potential energy are less than the first threshold and the second threshold, respectively, then it is inferred that the anomaly is detected. If the estimated kinetic energy and the estimated potential energy are not less than the first threshold and the second threshold, respectively, then it is inferred that no anomaly is detected. If the estimated kinetic energy is greater than the first threshold and the estimated potential energy is less than the second threshold, then a fault detection is inferred as a special case of anomaly detection depending only on potential energy faults. If the estimated potential energy is greater than the second threshold and the estimated kinetic energy is less than the first threshold, then a fault detection is inferred as a special case of anomaly detection depending only on kinetic energy faults.
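The threshold comparisons described above with reference to FIG. 7 may be sketched, purely by way of example, as the following Python function. The function name `detect_anomaly` and the `mode` argument are hypothetical; the `"greater"` mode corresponds to the comparison at blocks 703 to 707, and the `"less"` mode corresponds to the alternative embodiments in which an anomaly is inferred when both estimates fall below their thresholds.

```python
def detect_anomaly(kinetic, potential, first_threshold, second_threshold,
                   mode="greater"):
    """Return True when an anomaly is inferred from the energy estimates.

    kinetic and potential are the estimated energies; first_threshold is
    compared with the kinetic energy and second_threshold with the
    potential energy.
    """
    if mode == "greater":
        # Anomaly when both estimates exceed their respective thresholds.
        return kinetic > first_threshold and potential > second_threshold
    if mode == "less":
        # Anomaly when both estimates are below their respective thresholds.
        return kinetic < first_threshold and potential < second_threshold
    raise ValueError("mode must be 'greater' or 'less'")


# Hypothetical values: thresholds of 4.0 (kinetic) and 2.0 (potential).
print(detect_anomaly(5.0, 3.0, 4.0, 2.0))               # True
print(detect_anomaly(5.0, 1.0, 4.0, 2.0))               # False
print(detect_anomaly(1.0, 1.0, 4.0, 2.0, mode="less"))  # True
```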

Additionally, in some embodiments, the estimated potential energy and the estimated kinetic energy may be used to adjust the control commands for the underactuated mechanical system 602. For instance, based on the estimated potential energy and the estimated kinetic energy, a passivity controller may be designed to control the underactuated mechanical system 602. Passivity controllers are effective for controlling mechanical systems but require precise models of the energies to define the Hamiltonian or the Lagrangian of the system. Some embodiments can estimate accurate models of the energies that result in accurate passivity controllers.

Additionally, in an embodiment, a processor may be configured to determine, based on the estimated kinetic energy and the estimated potential energy, a motion plan that consumes a minimum amount of energy for performing the task. The motion plan may include a trajectory for performing the task. Such an embodiment is described below with reference to FIG. 8.

FIG. 8 illustrates a motion plan 809 that consumes a minimum amount of energy for performing the task by the robotic manipulator 800, according to some embodiments. The motion plan 809 is a trajectory to be tracked by the robotic manipulator 800 for performing the task. It is an object of some embodiments to configure the robotic manipulator 800 to perform a task of inserting an object 801 in a hole 803. A motion plan may be determined for the robotic manipulator 800 to perform the task. Different motion plans, such as a motion plan 805 and a motion plan 807, may be determined for performing the task. However, it is desired to determine a motion plan that consumes the minimum amount of energy for performing the task. The processor 103 is configured to determine, based on the estimated kinetic energy and the estimated potential energy, the motion plan 809 that consumes the minimum amount of energy for performing the task. Further, the processor 103 is configured to control the robotic manipulator 800 according to the motion plan 809 to perform the task by consuming the minimum amount of energy. For example, based on the motion plan 809, the processor 103 is configured to determine control commands for the different actuators of the robotic manipulator 800. The control commands are applied to the different actuators of the robotic manipulator 800. The control commands cause the robotic manipulator 800 to track the motion plan 809 for performing the task.
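One possible way of ranking candidate motion plans by their estimated energies is sketched below in Python. The disclosure does not prescribe a particular energy-consumption measure; as an assumption made only for this illustration, the sketch scores each plan by the sum of the positive increments of the estimated total energy 𝒯+𝒱 along the trajectory, that is, the energy that would have to be injected into the system. The function names and the plan labels are hypothetical.

```python
import numpy as np


def injected_energy(T_hat, V_hat):
    """Sum of positive increments of the estimated total energy T + V.

    T_hat and V_hat are sequences of estimated kinetic and potential
    energies sampled along one candidate trajectory (assumed cost measure).
    """
    E = np.asarray(T_hat, dtype=float) + np.asarray(V_hat, dtype=float)
    dE = np.diff(E)
    return float(np.sum(np.clip(dE, 0.0, None)))


def select_min_energy_plan(plans):
    """Return the label of the cheapest plan and the cost of every plan.

    plans maps a label to a pair (T_hat, V_hat) of energy sequences.
    """
    costs = {name: injected_energy(T, V) for name, (T, V) in plans.items()}
    best = min(costs, key=costs.get)
    return best, costs


# Hypothetical energy estimates along two candidate trajectories.
plans = {
    "plan_a": ([0.0, 1.0, 0.0], [0.0, 1.0, 2.0]),
    "plan_b": ([0.0, 0.5, 0.0], [0.0, 0.5, 1.0]),
}
best, costs = select_min_energy_plan(plans)
print(best, costs)
```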

FIG. 9 is a schematic illustrating a computing device 900 for implementing the controller 101 and methods of the present disclosure. The computing device 900 includes a power source 901, a processor 903, a memory 905, and a storage device 907, all connected to a bus 909. Further, a high-speed interface 911, a low-speed interface 913, high-speed expansion ports 915, and low-speed connection ports 917 can be connected to the bus 909. In addition, a low-speed expansion port 919 is in connection with the bus 909. Further, an input interface 921 can be connected via the bus 909 to an external receiver 923 and an output interface 925. A receiver 927 can be connected to an external transmitter 929 and a transmitter 931 via the bus 909. Also connected to the bus 909 can be an external memory 933, external sensors 935, machine(s) 937, and an environment 939. Further, one or more external input/output devices 941 can be connected to the bus 909. A network interface controller (NIC) 943 can be adapted to connect through the bus 909 to a network 945, wherein data, among other things, can be rendered on a third-party display device, third-party imaging device, and/or third-party printing device outside of the computing device 900.

The memory 905 can store instructions that are executable by the computing device 900 and any data that can be utilized by the methods and systems of the present disclosure. The memory 905 can include random access memory (RAM), read only memory (ROM), flash memory, or any other suitable memory systems. The memory 905 can be a volatile memory unit or units, and/or a non-volatile memory unit or units. The memory 905 may also be another form of computer-readable medium, such as a magnetic or optical disk.

The storage device 907 can be adapted to store supplementary data and/or software modules used by the computing device 900. The storage device 907 can include a hard drive, an optical drive, a thumb-drive, an array of drives, or any combinations thereof. Further, the storage device 907 can contain a computer-readable medium, such as a floppy disk device, a hard disk device, an optical disk device, a tape device, a flash memory or other similar solid-state memory device, or an array of devices, including devices in a storage area network or other configurations. Instructions can be stored in an information carrier. The instructions, when executed by one or more processing devices (for example, the processor 903), perform one or more methods, such as those described above.

The computing device 900 can be linked through the bus 909, optionally, to a display interface or user interface (HMI) 947 adapted to connect the computing device 900 to a display device 949 and a keyboard 951, wherein the display device 949 can include a computer monitor, camera, television, projector, or mobile device, among others. In some implementations, the computing device 900 may include a printer interface to connect to a printing device, wherein the printing device can include a liquid inkjet printer, solid ink printer, large-scale commercial printer, thermal printer, UV printer, or dye-sublimation printer, among others.

The high-speed interface 911 manages bandwidth-intensive operations for the computing device 900, while the low-speed interface 913 manages lower bandwidth-intensive operations. Such an allocation of functions is only an example. In some implementations, the high-speed interface 911 can be coupled to the memory 905, the user interface (HMI) 947, and to the keyboard 951 and the display 949 (e.g., through a graphics processor or accelerator), and to the high-speed expansion ports 915, which may accept various expansion cards via the bus 909. In an implementation, the low-speed interface 913 is coupled to the storage device 907 and the low-speed expansion ports 917, via the bus 909. The low-speed expansion ports 917, which may include various communication ports (e.g., USB, Bluetooth, Ethernet, wireless Ethernet) may be coupled to the one or more input/output devices 941. The computing device 900 may be connected to a server 953 and a rack server 955. The computing device 900 may be implemented in several different forms. For example, the computing device 900 may be implemented as part of the rack server 955.

In accordance with several embodiments, the controller 101 is configured to control the mechanical system 109 using the inverse dynamics model 107. The inverse dynamics model 107 models the energy of the mechanical system 109. Modeling the energy captures mutual effects of the torques of the different actuators on each other. Thereby, the inverse dynamics model 107 enables accurate controlling of the mechanical system 109. Additionally, the formulation of the inverse dynamics model 107 requires minimum physical information about the mechanical system 109. To that end, the formulation of the inverse dynamics model 107 is computationally inexpensive. Additionally, or alternatively, the inverse dynamics model 107 can be used to estimate the kinetic energy of the mechanical system 109 and the potential energy of the mechanical system 109.

The above description provides exemplary embodiments only, and is not intended to limit the scope, applicability, or configuration of the disclosure. Rather, the above description of the exemplary embodiments will provide those skilled in the art with an enabling description for implementing one or more exemplary embodiments. Contemplated are various changes that may be made in the function and arrangement of elements without departing from the spirit and scope of the subject matter disclosed as set forth in the appended claims.

Specific details are given in the above description to provide a thorough understanding of the embodiments. However, it can be understood by one of ordinary skill in the art that the embodiments may be practiced without these specific details. For example, systems, processes, and other elements in the subject matter disclosed may be shown as components in block diagram form in order not to obscure the embodiments in unnecessary detail. In other instances, well-known processes, structures, and techniques may be shown without unnecessary detail in order to avoid obscuring the embodiments. Further, like reference numbers and designations in the various drawings indicate like elements. Also, individual embodiments may be described as a process which is depicted as a flowchart, a flow diagram, a data flow diagram, a structure diagram, or a block diagram. Although a flowchart may describe the operations as a sequential process, many of the operations can be performed in parallel or concurrently. In addition, the order of the operations may be re-arranged. A process may be terminated when its operations are completed but may have additional steps not discussed or included in a figure. Furthermore, not all operations in any particularly described process may occur in all embodiments. A process may correspond to a method, a function, a procedure, a subroutine, a subprogram, etc. When a process corresponds to a function, the function's termination can correspond to a return of the function to the calling function or the main function.

Furthermore, embodiments of the subject matter disclosed may be implemented, at least in part, either manually or automatically. Manual or automatic implementations may be executed, or at least assisted, through the use of machines, hardware, software, firmware, middleware, microcode, hardware description languages, or any combination thereof. When implemented in software, firmware, middleware or microcode, the program code or code segments to perform the necessary tasks may be stored in a machine-readable medium. A processor(s) may perform the necessary tasks. Various methods or processes outlined herein may be coded as software that is executable on one or more processors that employ any one of a variety of operating systems or platforms. Additionally, such software may be written using any of a number of suitable programming languages and/or programming or scripting tools, and also may be compiled as executable machine language code or intermediate code that is executed on a framework or virtual machine. Typically, the functionality of the program modules may be combined or distributed as desired in various embodiments.

Embodiments of the present disclosure may be embodied as a method, of which an example has been provided. The acts performed as part of the method may be ordered in any suitable way. Accordingly, embodiments may be constructed in which acts are performed in an order different than illustrated, which may include performing some acts concurrently, even though shown as sequential acts in illustrative embodiments. Further, use of ordinal terms such as “first,” “second,” in the claims to modify a claim element does not by itself connote any priority, precedence, or order of one claim element over another or the temporal order in which acts of a method are performed, but are used merely as labels to distinguish one claim element having a certain name from another element having a same name (but for use of the ordinal term) to distinguish the claim elements. Although the present disclosure has been described with reference to certain preferred embodiments, it is to be understood that various other adaptations and modifications can be made within the spirit and scope of the present disclosure. Therefore, it is the aspect of the appended claims to cover all such variations and modifications as come within the true spirit and scope of the present disclosure.

	Number	Date	Country
Parent	18222540	Jul 2023	US
Child	18772131		US
