ORIGINAL RESEARCH article
Sec. Humanoid Robotics
Benchmarking Whole‐Body Controllers on the TALOS Humanoid Robot
- 1LAAS-CNRS, University of Toulouse, Toulouse, France
- 2Airbus, Toulouse, France
- 3Artificial and Natural Intelligence, Toulouse Institute, Toulouse, France
This article presents a comparison of three control schemes applied on the commercially available TALOS humanoid robot. The aim is to highlight the advantages and drawbacks of each model applied on three locomotion problems: walking on flat and non-flat terrain and climbing stairs. The different models are based on position control (first and second models) or torque control (third model). The first one uses a hierarchical quadratic program at velocity level. The second one uses a weighted quadratic program named Task Space Inverse Dynamic (TSID) at acceleration level. Finally, the last one also uses TSID but at torque level. The controller performances are compared in simulation, using Gazebo, on the accuracy of their tracking, their energy consumption, and their computational time execution.
Bipedal locomotion of humanoid robots is considered a difficult problem because of the complexity of robot dynamics, the numerous constraints of the motion, and the unknown environment. The design choice made when designing a robot may have a strong impact on the control laws that are really working on the system and the real performances. A recent example is the Digit robot which has very impressive capabilities by choosing a careful tradeoff between the chosen actuation technology and robot weight distribution Robotics (2022). The robot is very robust to impact and allows torque control but is slightly limited by the payload it can carry (10 kg). Realizing torque control on electric-based bipedal system is challenging. If it was successfully realized on the TORO robot Englsberger et al. (2014) for standing whole body control and walking, it is notoriously more difficult to achieve than position control. A striking example is given by the iCub robot with which impressive Tai chi motions have been realized Pucci et al. (2016) but where walking in torque control mode is still difficult to achieve Romualdi et al. (2019). The goal of this study is to report a similar evaluation with the commercially available TALOS robot from PAL-Robotics.
1.2 Motion Execution Pipeline
Three stages are usually considered to execute a motion on a humanoid robot: the contact sequence generation, trajectory planning, and whole-body control.
Most of trajectory planning methods use centroidal dynamics to generate consistent behaviors for a legged robot. In this study, we use preplanned trajectories provided either by a standard walking pattern generator or by a multicontact planner Fernbach et al. (2020) The latter is used for a platform which can be easily rebuilt for benchmarking walking on uneven terrain. This planner provides a centroidal trajectory that is dynamically balanced on uneven terrain and does not assume that the robot behaves completely like a Linear Inverted Pendulum (LIPM). Because the centroidal dynamics is planned and the setup limits the number of contacts to one or two, it is still possible to apply the concept of Divergent Component of Motion (DCM) Takenaka et al. (2009); Englsberger et al. (2015) for control. The newly generated reference DCM is used for admittance control on the Center of Mass (CoM) as given by Caron et al. (2019); Romualdi et al. (2019).
Then, to track the reference trajectories, a whole-body controller is needed. Whole-body controllers are based on the task function approach Samson et al. (1991); Escande et al. (2014) from which a quadratic program is formulated. Complex motions combine several nonlinear tasks and constraints. In this study, two types of QP formulations are compared, a hierarchical QP which imposes a strict hierarchy between the tasks Henze et al. (2016); Herzog et al. (2014) and a weighted QP which sets weights to prioritize the tasks Koolen et al. (2016); Cisneros et al. (2018).
In the recent literature, there is a growing number of implementations of torque-based whole-body control algorithms Koolen et al. (2016); Herzog et al. (2014); Lee and Goswami (2012); Englsberger et al. (2015). Indeed, due to the intrinsic compliance of the torque control formulation, it is more suitable for interactions with humans and for multicontact problems where external interactions and several contact points are needed. However, the transition from the simulations to the real experiments is harder due to inaccuracies on the actuation chain model Ramuzat et al. (2020). Such inaccuracies do not appear when using position control.
Following the existing benchmarking of humanoid robot control architectures Romualdi et al. (2019); Stasse et al. (2018), this study contributes by benchmarking the TALOS humanoid robot. It is performed by comparing three whole-body control schemes on the TALOS robot in simulation. Two are using position control associated with DCM and CoM admittance controls and one is using torque control. The first one is based on a lexicographic QP using inverse kinematics (denoted IK in this article), while the second and the third one use a weighted QP (WQP) with inverse dynamics and an angular momentum (AM) task (denoted respectively TSID position and TSID torque). They are evaluated in Gazebo simulations on three locomotion problems: walking on flat, uneven terrains, and stairs (Figure 1), on the criterion of trajectory tracking, energy consumption, passivity, and computational cost. As a first consequence of our torque control scheme, we achieve the highest walking velocity for the robot TALOS in simulation: 0.6 m/s. We believe that the motion on an uneven terrain with the platforms is novel and offers an interesting new benchmark. Finally, we also provide an evaluation of the passivity gait measure that we believe is interesting to measure the efficiency of a balance strategy in terms of energy.
We organize the article as follows: Section 2 recalls the centroidal dynamics equations, DCM control, and AM task. Section 3 details the three task-space whole-body control schemes compared in this study. Section 4 presents the energy criterion used. Section 5 describes the planning methodologies used to obtain the reference trajectories for the simulations. Then, Section 6 presents these simulations results, and Section 7 describes the experiments achieved on TALOS and their limitations.
2 Centroidal Dynamics
The under-actuated part of the robot whole-body dynamics is called centroidal dynamics. It uses the Newton–Euler equations of motion which couple the variations of the centroidal momentum with contact forces Orin et al. (2013):
2.1 Divergent Component of Motion
with z, ξ respectively the zero moment point (ZMP) and DCM and
In terms of ZMP, the obtained control law is as given by Caron et al. (2019):
with zref, ξref the respective ZMP and DCM reference values, given by the planning. Finally, this desired ZMP is used into a CoM admittance control as found in the study by Caron et al. (2019):
The two position control schemes presented in this study use this stabilization formulation. In Figure 2, Eq. 3 is implemented in the DCM Ctrl blue block and Eq. 4 in the CoM Admittance Ctrl one. See Table 1 for the gains value used in the simulations.
FIGURE 2. Position control schemes: IK and TSID. The OR block is used to activate only one controller at a time.
TABLE 1. Task gains of the control schemes, tilted platforms, and stair simulations use the same gains.
2.2 Centroidal Momentum Tasks
The objective is to consider the angular momentum part of the Euler equation generated by the contact transition (Kajita et al. (2003b). Using equation Eq. 1, the centroidal dynamics is, therefore, defined by
The tasks dynamics are given by the following equations:
3 Whole-Body Controller
3.1 Lexicographic Quadratic Programming
The first controller used is a lexicographic QP task-based inverse kinematics described by Mansard et al. (2009). In this controller, the task errors e to be reduced in the cost function are implemented as velocity-based tracking laws in the Lie group SE(3). Having the robot configuration vector q and the joint velocity
The following dynamics is imposed on these errors:
with ⊖ the difference operator of the Lie group.
Inverse Kinematics QP: IK—This control scheme is based on a DCM controller Eq. 3, a CoM admittance controller, Eq. 4 and a lexicographic QP solving the inverse kinematics of the robot (Figure 2). The authors have implemented this scheme in an open-source package GEPETTO Team LAAS-CNRS (2021c) based on the QP in the study by Mansard et al. (2009) adding the DCM and CoM admittance controllers.
The tasks used during the simulations are (the priority 0 is the highest one):
• Feet tracking (priority 0)
• CoM height tracking (priority I)
• CoM lateral–sagittal tracking (priority II)
• Waist orientation (priority III)
• Posture regularization in half-sitting (priority IV)
The respective task gains are defined in Table 1.
3.2 Task Space Inverse Dynamics
TSID Del Prete (2021) is a WQP which sums the task functions in a general cost function using weights to define their priorities (as opposed to the IK controller, it is not a strict hierarchy; it has only two strict layers: the constraint and the cost). In this controller, the task errors e to be reduced are implemented as acceleration-based tracking laws in the task space. Having the robot configuration vector q and the joint acceleration
The following dynamics is imposed on these errors:
Inverse Dynamics WQP: TSID Position—This control scheme is based on a DCM controller Eq. 3, a CoM admittance controller Eq. 4 and a WQP solving the inverse dynamics of the robot, Figure 2. Compared to the previous controller, this one implements an AM task, which regulates the angular momentum to 0, using the formulation of Eq. 5. The authors have implemented this controller using TSID Del Prete (2021) library in the same package than the controller TSID Torque with the DCM and CoM admittance controllers.
The tasks considered during the simulations are as follows:
• Feet tracking (priority 0)
• Feet contacts (priority 0)
• CoM height tracking (priority I, weight 103)
• CoM lateral-sagittal tracking (priority I, weight 103)
• Waist orientation (priority I, weight 1)
• Posture regularization in half-sitting (priority I, weight 0.1)
• AM velocity-acceleration regularization (priority I, weight 2 × 10–2)
The respective task gains are defined in Table 1. The weights and gains have been chosen through trials and errors with an a priori heuristic.
Inverse Dynamics WQP: TSID Torque—This control scheme is based on a WQP solving the inverse dynamics of the robot (with an AM regularization task, using the formulation of Eq. 5), as shown in Figure 3. From the desired acceleration computed by the QP, TSID retrieves the associated torque by using the robot equation of the dynamics. The authors have implemented this controller using the TSID Del Prete (2021) library in the open-source package GEPETTO Team LAAS-CNRS (2020).
The tasks considered in the simulations are the same as TSID position, with different gains (Table 1).
3.3 Remark on the State Feedback
For the position control, it is needed to integrate the result of the QP (one time for IK and two times for TSID position, Figure 2) to obtain the desired command. To avoid instabilities, the control loop of both QPs use these integrated values in the next iteration instead of the measured ones. The measured position and velocity of the robot are only used to compute the CoM, DCM, and ZMP for the admittance control in the position schemes. On the contrary, the torque control scheme uses the measured values at each iteration of the QP (Figure 3) and, in particular, the position and velocity of the robot base (or free-flyer).
4 Energetic Comparison Criterion
4.1 Energy Cost
Based on the study by Torricelli et al. (2015), a relevant criterion to compare the energy consumption of the control schemes is the cost of transport. It can be computed as the energetic cost of transport Cet using the whole mechanical work of the actuation system Em or as the mechanical cost of transport Cmt using only the positive one Em+.
with m the mass of the system, g the gravity constant, D the distance traveled by the system, and τi, ωi the respective torque and velocity of each robot joint for all (N) joints and ϱi(t) = τi(t)ωi(t).
4.2 Passivity Gait Measure
Another interesting energetic criterion is the ability to minimize joint torques to increase the passivity of the walk Torricelli et al. (2015). The passivity gait measure (PGM) Mummolo and Kim (2012) quantifies the passivity of a biped walking motion:
where RMS is the root mean square along the period of time T, τsa stands for the torque on the stance ankle joint, and τtot for the torque on all robot joints.
5 Locomotion Planning
5.1 Walking Pattern Generator
The trajectories used in the straight walk simulations have been computed using the algorithm described in the study by Kajita et al. (2003a); Stasse et al. (2008); GEPETTO Team LAAS-CNRS (2021a). This algorithm provides desired trajectories for the ZMP z*, the CoM c*, and the feet
5.2 Multicontact Locomotion Planning
The trajectories used in the tilted platform and stair simulations have been computed using the open-source framework multicontact locomotion planning GEPETTO Team LAAS-CNRS (2021b). Given the initial and final poses of the robot, the framework computes a reachability plan and a contact sequence as given by Tonneau et al. (2020). Then, it optimizes the centroidal dynamics (Section 2) using two convex relaxations based on trust regions Ponton et al. (2018). Similar to the pattern generator method, it takes into account the momentum generated by the swing leg owing to iterations between a kinematic whole-body formulation and the centroidal dynamic optimization. In contrast, when solving Eq. 1, it does not assume that
6 Simulation Results
The simulations realized in this study have been made using Gazebo. A video illustrating the simulations is available at the following link: https://peertube.laas.fr/videos/watch/4b5d3a5b-2355-47a0-8197-f41ed4f885c6. The chosen simulations are walking on flat or uneven terrains and climbing stairs. Based on the study by Torricelli et al. (2015), they cover different aspects of locomotion skills for a stationary environment with and without unexpected disturbances.
6.1 Straight Walk of 20-cm Steps
In the simulation, the robot executes six steps forward at 0.2 m/s and a final step (traveled distance of 1.2 m). The time distribution is 0.9 s for single-support phase and 0.115 s for double-support phase (leading to steps of approx. 0.20 m). The controllers have also been successfully tested on a faster walk with single/double support time of 0.711/0.089 s. Figure 4 presents a comparison of the three control schemes on their estimated ZMP on the sagittal (x-axis, top curves on the figure) and lateral (y-axis, bottom curves) planes only because the desired height of the CoM is constant. Figure 5 shows the forces applied on the ground along the z-axis on the left foot. The tracking of the CoM and feet is accurately followed by the three controllers (tracking error lesser than 1 cm).
The two position controllers achieve similar results, tracking correctly the ZMP reference of Eq. 3, with an average error of 2 cm (Table 2). Noticeably, the torque control presents a ZMP which is close to the position control results in Figure 4, even though there is no explicit control on the ZMP or the DCM. In the Tables presenting the error on the ZMP, for the torque scheme, the estimated ZMP is compared to the desired ZMP (from the planning). In particular, in the lateral plane, the error is quite low, 1 cm in average.
Figure 5 illustrates the ground impact problem in the position control compared to the better foot landing observed in torque control. Indeed, each time the left foot comes into contact with the ground (1.5, 3.5 s, … ), the IK and TSID position schemes show peaks in the foot force (
6.2 Straight Walk of 60-cm Steps in Torque Control
In the study by Mesesan et al. (2019) the humanoid robot TORO successfully performed a walk on flat terrain with a step length of 55 cm (single/double support time of 1.1/0.4 s). In the following simulation, the torque controller is pushed to its limits to show its capability to achieve a similar result. The robot TALOS executes six steps forward of 0.6 m/s and a final one to go back to the initial position. The time distribution used is of 0.9 s for single-support phase and 0.115 s for double-support phase (leading to steps of approx. 60 cm).
Figure 6 presents the results obtained on the tracking of the feet and the CoM (Table 3); the ZMP and DCM estimations. The feet track well the desired trajectories along the y-axis (maximum error of 6 mm); however, along the x-axis, they show some delay (maximum error of 6 cm). Thus, it induces greater tracking errors on the x-axis for the CoM (peaks of 5 cm along the x-axis and 1.5 cm along the y-axis).
One can notice that the DCM and ZMP along the x-axis are more stable, whereas along the y-axis, they present large oscillations (which are caused by the feet impacts on the ground when landing).
In Figure 7, the AM behavior is shown along the three axes. The AM task minimizes the momentum to zero. The x and y momentum components are the most solicited, leading to inclination of the torso forward and backward and to important moves of the arms to compensate the delay of the CoM and succeed the 60-cm steps. The authors observed that without this AM task, the walk cannot be achieved.
6.3 Walk on the Tilted Platforms: Uneven Terrain
In this third simulation, the robot walks on tilted platforms which represent uneven terrain (Figure 1). This walk is achieved using the multicontact locomotion planning trajectories (Section 5.2). The framework ensures the stability of the controllers on non-flat terrain when the feet are tilted.
Figure 8 illustrates the tracking performance of the controllers. The ones in position present the largest oscillations as TSID torque is the most stable (Table 4). Both the IK and torque control show oscillations at t ≈ 18 s; it corresponds to the worst case where the robot has its two feet tilted to keep its balance on two opposite platforms, leading to small slippages of the feet (this behavior can be observed in the linked video). These oscillations are larger in the case of the IK scheme. Similar oscillations on the contact forces in this part of the motion have also been observed, which are smaller in the case of the torque control. Increasing the gains on the feet only generates more instability, but raising the ones on the DCM and admittance control reduces the oscillations (at the cost of a more rigid behavior).
Finally, the same result on the feet forces is obtained in this simulation with respect to the 20-cm steps one. Due to the high gains on the DCM, to avoid the slippage of the robot, the IK control presents bigger peaks of force.
6.4 Climbing Stairs
In the last simulation, the robot is climbing six stairs of 10 cm height and 30 cm long (Figure 1). The trajectories are planned with the multicontact locomotion planning. Figure 9 shows the ZMP evolution of each controller, where the result is similar to the uneven terrain simulation. The TSID torque scheme behaves significantly better than the others, with the ZMP matching the one planned (errors lesser than 1 cm, Table 5). Noticeably, the IK scheme presents higher oscillations at the end of the move in the lateral plane. The robot ends displaced on the right compared to the desired trajectories due to slippages of the feet when it finishes to climb a stair (shown in the linked video).
6.5 Energy Cost and Passivity Gait Measure
The results obtained for the cost of transport of the four simulations are presented in Table 6 depending on the control scheme. The results obtained for iCub by Romualdi et al. (2019) are also presented for comparison (computed using Eq. 10), as the human ones. The lower the energy consumption is, the better, and similarly, getting closer to the human cost of transport is an improvement.
Compared to the results obtained on iCub, the control in torque has a similar cost for the 20-cm step simulation. However, the cost of the position controllers presented in this article is higher because of their higher gains. The human efficiency is closer to the torque control, walking with a Cet around 0.2 J/kg/m Collins et al. (2005). Noticeably, the energy costs in torque for the tilted platforms and stair trajectories are still less important than the simpler walk in position; the Cmt never exceeds 1, even for the 60-cm walk. Overall, the controller TSID position consumes less energy than the IK.
The passivity gait measure comparison of the different simulations is reported in Table 7 for three gait stages: single support (single S. corresponding to the stance ankle), double support (double S.), and flying foot (flying F. where the foot has no contact with the ground). The human results are given as an indicator Mummolo and Kim (2012); the robot behavior is expected to be similar during double support and flying foot phase where the ankle should be passive.
The results of the position control schemes show a behavior which is the opposite of the human one. The passivity of the ankle is higher during the stance phase because of the control of the ZMP which minimizes ankle torque. In addition, it is weaker during the double support and flying phases due to the high PID gains of the low-level position control.
The control scheme in torque shows much more passive behavior (except on the stance foot) with a completely passive foot during the flying phase. During the double-support phase, the ankle is almost passive (PGM∼0.9), which is close to the human result. These results are better than the one expected in the study by Mummolo and Kim (2012), where the torque controlled robot has higher control on its stance ankle (PGM = 0.2).
Finally, on the uneven terrain, the double-support phase corresponds to the worst case where the robot has its two feet tilted to keep its balance on two opposite platforms. This leads to greater actuation than on flat floor (decreasing the passivity). Similarly, the stance phase corresponds to the left support phase on the final platform (highest slope), also leading to bigger actuation of the ankle.
6.6 Execution Time of the Control Schemes
The computational time obtained during the execution of one control loop of the three schemes is presented in Table 8, according to the simulations.
The computational time of the IK is better due to computational efficiency of the null space projectors of the tasks. Exploiting this specific structure allows it to keep its control frequency higher than 1 kHz in average with four hierarchy levels. In TSID, this method can only be used once because it comprises two strict layers: the constraints and cost.
7 Experiments Realized on the Real Robot Using the Controllers
In this section are presented the results we succeeded to achieve on the real robot TALOS and the difficulties we encountered. These experiments are intermediate steps toward transferring the whole simulated results on the real robot. We detail the blocking points preventing us to successfully achieve these complete experiments.
7.1 TALOS Robot
Our robot TALOS is a humanoid robot of 1.75 m tall and about 100 kg comprising 32 joints and an under-actuated part called floating base (38 degrees-of-freedom in total). It provides the possibility to control the actuators in position control and torque control modes. It is performed owing to torque sensors on all the actuators, but the head and wrists. Humanoid robots often have flexible or compliant components. For instance, the actuator stiffness of the robot WALKMAN Negrello et al. (2017) can be directly tuned, creating an intended flexibility. Another example of the humanoid robot with compliant material is HRP-2 Nakaoka et al. (2007). It includes a bush rubber in the ankle to smooth impacts. In the robot TALOS, a non-intended flexibility on the hip link has been observed and impacts meaningfully the control of its legs and, therefore, its balance and locomotion. Indeed, this flexibility (not modeled in the simulator) leads to errors in the landing positions of the feet on the real robot. However, the deflection is not directly measurable by the encoders and cannot be directly modified.
7.2 Position Control
7.2.1 Static Stabilization
Using the whole-body admittance control and the stabilizer described as the IK scheme in Section 3, the team achieved good results for balancing during quasi-static moves and standing position. Indeed, the admittance control at the CoM allows a quick reaction when applying external perturbations, such as pushing the robot. Figure 10 presents the reactive balancing of the TALOS humanoid robot when it is pushed from the front and from the side while standing on one foot. A video about this experiment is available at the following link: https://peertube.laas.fr/videos/watch/2dec7dba-cc57-4df4-8f10-a7d387404301.
In the video are shown push-recovery experiments while the robot is standing on both feet and with one foot raised. One can notice that the robot is more stable with both feet on the ground; nonetheless, the IK scheme allows a good stabilization at the CoM level. The stabilizer correctly achieves the balance of the robot: it controls the DCM such that the CoM does not diverge and applies correct contact wrenches to avoid falling (no slipping, not too much forces on one foot which imbalance the robot). It is important to underline that the admittance control is only implemented on the CoM; thus, the robot is stiff on its upper parts while more compliant on its lower parts (in particular, the hips and ankles). This is why in the video, pushing the robot arm produces motions on the whole robot and, in particular, its CoM.
The robot can achieve tasks with its upper body while external perturbations occur and keep its balance. It can also stabilize itself when non-dynamic trajectories are asked to the legs or with no contact with the ground (for instance, execute a swing on its foot). The difficulties appear with dynamic tasks involving the creation of contacts with the ground, typically during walking.
7.2.2 Dynamic Stabilization
The dynamic stabilization of the robot is an ongoing work. The actual implementation of the stabilizer should allow the robot to achieve this goal; however, this is compromised by the flexibility in the hip of the robot TALOS. By tuning the gains of the admittance controller, the team manages to achieve once a straight walk of 20 cm using a WPG reference trajectories. The video of this success is available at the following link: https://peertube.laas.fr/videos/watch/b56d80ed-7c6c-46a7-8750-fdb7ea6d1636.
Later on, we successfully achieved a repeatable on-spot walking which is quite stable (Figure 11). The video of this on-spot walking is available at the following link: https://peertube.laas.fr/videos/watch/1a920902-c75f-4fb0-a638-33bb9b48d649. One can notice that the left wrist of the robot is tilted; indeed, its absolute and relative encoders did not send the same value. Thus, when controlling its position, the wrist had an abnormal behavior as its returned position was not the good one. We had to deactivate its control for the experiment and then fix the offset of the relative encoder.
In both videos, the impacts on the ground are large and lead to instabilities, in particular slippage (which can also be caused by the flexibility in the hip). The robot has to move its upper body to compensate for them and because of that, at the end of the 20-cm walk, the robot almost falls. These impacts are partly due to the wrong positions of the feet when making contacts with the ground. The flexibility in each hip of the robot cannot be measured by the encoders; then, it creates an error between the positions given by the encoders and the real ones. These displacements at the hips are small, but when transferred at the feet positions, it can lead to errors of up to 5 cm. Thus, the controller is assuming a false position of the feet, and the robot enters in contact with the ground at a wrong position (even at the wrong moment, sooner if the displacement is in the direction of the walk or later in the opposite case). This creates large impacts and slippage, which prevents us to achieve successful walking; this is why compensating this flexibility is necessary. In the next subsection is presented the experiment realized to compensate it with a fixed value.
An additional way to cope with the stabilization problem would be to reschedule the footsteps and their location according to the landing time.
7.2.3 Fixed Compensation of the Flexibility
We first try to compensate the flexibility by using a feedforward on the commanded position of the hip taking into account the torsional stiffness and measured torque. However, because of the noises on the torque sensors, we had to filter it which leads the compensation to be applied with delay. We also tried to activate this compensation only on single-support phases and not on double-support ones to avoid accumulation of internal efforts (on double support, the robot will try to correct its hip position while having its feet in contact with the ground and thus not moving, leading to this accumulation of energy). Even with such modifications, the results were not enough to successfully perform repeatable walk.
Thus, we then tried to impose a fix compensation of the flexibility without taking into account the measured torque. With a leg of 1 m weighting 20 kg, we fixed the compensation on the hip to
7.3 Torque Control
7.3.1 PAL Robotics Low-Level Controller
To achieve torque control on the real robot, it is needed to transform the joint torque commands to motor currents. We decided to use the PAL robotics constructor low-level controller which computes new commands respecting the robot actuator dynamics. This low-level controller is a proprietary black-box, which uses a ros-control hardware interface to communicate with the robot. To interface our control scheme (based on the SoT with the WPG), we had to create a new version of the roscontrol-sot package. Indeed, our control scheme needs no more to communicate directly with the robot but with the PAL robotics controller, which implements different functions and formulations. One of the major difficulties is that the proprietary code source is not available; we only had access to its C++ headers and some basic tutorials. Developing this interface to keep all the functionalities implemented in the roscontrol-sot package (for instance, to keep the recording of the logs and creating all the necessary signals needed by the SoT in the dynamic-graph structure), took us months of work (including the following remark).
Moreover, as the robot has a modified operating system called Ferrum (equivalent to Ubuntu), we created a Docker Merkel (2014) container to have exactly the same environment as the one on the robot to test our codes. Installing the SoT packages on this environment was not trivial as some packages had conflicting dependencies with the PAL robotics packages. Finally, we succeeded to test in this Docker container our interface and our torque controller using the PAL simulator available on Ferrum. An additional difficulty is that the simulator renders the behavior at a rate five times slower than the reality. Then, a small and slow oscillation in the simulator is, in fact, a high-frequency one in reality and can lead to dangerous behaviors.
One has to note that in the study by Dantec et al. (2021), the MPC is not embedded on the robot and is interfaced with the PAL robotics low-level controller via a ROS topic. This simpler choice was made because it is a stand-alone package (no SoT or dynamic-graph framework) and does not send commands at high frequency (200 Hz). ROS topics may induce latency and do not allow to send high-frequency commands, leading to real-time issues.
7.3.2 Experiment Results on a Posture Task
Once we achieve satisfying results on the PAL robotics simulator, we tested the classical formulation of our torque controller using inverse dynamics on the real robot on a simple postural task. The task weights and gains used are presented in Tables 9, 10, as the “Fail” experiment.
After few repetitions of a sinusoidal motion on the robot arm, the system diverged brutally and blocked two of its harmonic drives: the waist and the right shoulder (we pushed the emergency button, but the robot had the time to reach the harmonic drive blocks). Figure 12 presents the result failure. After investigation, it seems that the gains tuned in simulation (which simulates the actuation chains) were too high for the real robot. Thus, tuning the gains even on a proper simulator with the model of the actuators is not enough to ensure the safety of the solution. We know that some tuning is always necessary on the real robot, but we wrongly assumed that the solution would remain quite stable. Thus, to provide a safe and reliable interaction with the environment and possibly humans, we have looked for a way to ensure system stability. The video of the failed experiment is available at this link: https://peertube.laas.fr/videos/watch/31fa2562-ba13-4043-a996-c2b8d5b21f4a. Unfortunately, it was not possible to repair the robot at the laboratory because the right shoulder and the torso were preventing the back cover of the robot to be removed (which needed to be removed to access the shoulder harmonic drive). Thus, the robot had to be sent back to PAL robotics for repair.
By lowering the gain value on the posture task, as presented in Table 10, we succeeded to have a stable and compliant behavior of the robot. A video demonstrating this compliant behavior is available at the following link: https://peertube.laas.fr/videos/watch/e9d8948d-08d5-4de9-8f42-2986fbbf0242 and depicted by Figure 13. At the end of the video, the robot falls because the contacts on the feet have been disturbed (the feet moved), breaking the constraint of the QP.
This small success encouraged us to add the CoM task for further tests. Unfortunately, this task does not work on the real robot. Instead of correcting the CoM error, the QP seems to make it divergent. It is a behavior that does not appear in the simulator where the experiment is carried out. After investigations, this problem may arise due to imprecise calibration or identification of the robot. The difficulty of performing this procedure once the robot is assembled is to excite the parameters to be identified. For instance, part of the torso is particularly difficult to manipulate to observe the variables to be identified. It was the starting point of another research work outside the scope of this study.
The contribution of this article is the benchmarking of three whole-body control implementations on the commercially available humanoid robot TALOS. Two of them are position-based (with DCM and CoM admittance control): a lexicographic QP using inverse kinematics and a WQP using TSID with an AM task. The last one is a WQP using TSID in torque with an AM task. They are evaluated in Gazebo on flat, uneven terrains, and climbing stairs on the criterion of trajectory tracking, energy consumption, passivity, and computational cost.
In general, both position control schemes present the same results with less energy consumption and higher passivity for the TSID position controller. A better tuning of the tasks gains may improve its results on the ZMP tracking.
On the other hand, the TSID torque controller shows better results in terms of smoothness of the trajectory tracking, energy consumption, and passivity of the walk without impacts and can achieve a 60-cm walk with steps of 1 s in simulation. This confirms the high capabilities of a torque control scheme coupled with angular momentum regularization (see, for instance, Atlas in DARPA robotics challenge Koolen et al. (2016)). In average, the TSID controllers reach 1 kHz of the control loop necessary for real-time control; nonetheless, the IK scheme has the best computational time.
For our future studies, we plan to control the hip flexibility of TALOS so that we can evaluate the three controllers on the real robot. Moreover, it would be interesting to compare the controllers on different robotic platforms.
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.
NR is the first author of this study and performed most of the new software development as well as the experimental results. OS and SB performed the scientific and technical management for this project.
This study is funded by the ROB4FAM research agreement.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
TALOS is a robot created from the ACTANTHROPE ERC-Grant.
Caron, S., Kheddar, A., and Tempier, O. (2019). “Stair Climbing Stabilization of the Hrp-4 Humanoid Robot Using Whole-Body Admittance Control,” in Int. Conf. On Robotics and Automation (ICRA) (IEEE). doi:10.1109/icra.2019.8794348
Carpentier, J., Saurel, G., Buondonno, G., Mirabel, J., Lamiraux, F., Stasse, O., et al. (2019). “The Pinocchio C++ Library – a Fast and Flexible Implementation of Rigid Body Dynamics Algorithms and Their Analytical Derivatives,” in Int. Symp. On System Integrations (Paris, France: IEEE). doi:10.1109/SII.2019.8700380
Cisneros, R., Benallegue, M., Benallegue, A., Morisawa, M., Audren, H., Gergondet, P., et al. (2018). “Robust Humanoid Control Using a Qp Solver with Integral Gains,” in Int. Conf. On Intelligent Robots and Systems (IROS) (Madrid, Spain: IEEE). doi:10.1109/IROS.2018.8593417
Dantec, E., Budhiraja, R., Roig, A., Lembono, T., Saurel, G., Stasse, O., et al. (2021). “Whole Body Model Predictive Control with a Memory of Motion: Experiments on a Torque-Controlled Talos,” in Int. Conf. On Robotics and Automation (ICRA) (IEEE). doi:10.1109/icra48506.2021.9560742
Del Prete, A., Mansard, N., Ponce, O., Stasse, O., and Nori, F. (2016). Implementing Torque Control with High-Ratio Gear Boxes and without Joint-Torque Sensors. Int. J. Humanoid Robotics 13, 1. doi:10.1142/s0219843615500449
[Dataset] Del Prete, A. (2021). Tsid. Available at: https://github.com/stack-of-tasks/tsid.
[Dataset] GEPETTO Team LAAS-CNRS (2021a). Jrl-Walkgen. Available at: https://github.com/stack-of-tasks/jrl-walkgen.
[Dataset] GEPETTO Team LAAS-CNRS (2021b). Multicontact-Locomotion-Planning. Available at: https://github.com/loco-3d/multicontact-locomotion-planning.
[Dataset] GEPETTO Team LAAS-CNRS (2021c). Sot-Talos-Balance. Available at: https://github.com/loco-3d/sot-talos-balance.
[Dataset] GEPETTO Team LAAS-CNRS (2020). Talos-Torque-Control. Available at: https://github.com/stack-of-tasks/talos-torque-control.
[Dataset] Robotics, A. (2022). Meet Digit. Available at: https://www.agilityrobotics.com/meet-digit.
Englsberger, J., Werner, A., Ott, C., Henze, B., Roa, M. A., Garofalo, G., et al. (2014). “Overview of the Torque-Controlled Humanoid Robot Toro,” in Int. Conf. On Humanoid Robotics (ICHR) (IEEE). doi:10.1109/humanoids.2014.7041473
Fernbach, P., Tonneau, S., Stasse, O., Carpentier, J., and Taïx, M. (2020). “C-croc: Continuous and Convex Resolution of Centroidal Dynamic Trajectories for Legged Robots in Multicontact Scenarios,” in IEEE Transactions on Robotics (IEEE). doi:10.1109/tro.2020.2964787
Henze, B., Dietrich, A., and Ott, C. (2016). An Approach to Combine Balancing with Hierarchical Whole-Body Control for Legged Humanoid Robots. IEEE Robot. Autom. Lett. 1, 700–707. doi:10.1109/lra.2015.2512933
Herzog, A., Rotella, N., Mason, S., Grimminger, F., Schaal, S., and Righetti, L. (2014). Momentum Control with Hierarchical Inverse Dynamics on a Torque-Controlled Humanoid. Autonomous Robots 40, 3. doi:10.1007/s10514-015-9476-6
Kajita, S., Kanehiro, F., Kaneko, K., Fujiwara, K., Harada, K., Yokoi, K., et al. (2003a). “Biped Walking Pattern Generation by Using Preview Control of Zero-Moment point,” in Int. Conf. On Robotics and Automation (ICRA) (IEEE).
Kajita, S., Kanehiro, F., Kaneko, K., Fujiwara, K., Harada, K., Yokoi, K., et al. (2003b). “Resolved Momentum Control: Humanoid Motion Planning Based on the Linear and Angular Momentum,” in Int. Conf. On Intelligent Robots and Systems (IROS) (IEEE).
Kajita, S., Morisawa, M., Miura, K., Nakaoka, S., Harada, K., Kaneko, K., et al. (2010). “Biped Walking Stabilization Based on Linear Inverted Pendulum Tracking,” in Int. Conf. On Intelligent Robots and Systems (IROS) (IEEE). doi:10.1109/iros.2010.5651082
Koolen, T., Bertrand, S., Thomas, G., de Boer, T., Wu, T., Smith, J., et al. (2016). Design of a Momentum-Based Control Framework and Application to the Humanoid Robot Atlas. Int. J. Humanoid Robotics 13, 1. doi:10.1142/s0219843616500079
Mansard, N., Stasse, O., Evrard, P., and Kheddar, A. (2009). “A Versatile Generalized Inverted Kinematics Implementation for Collaborative Working Humanoid Robots: The Stack of Tasks,” in International Conference on Advanced Robotics (ICAR) (IEEE).
Mesesan, G., Englsberger, J., Garofalo, G., Ott, C., and Albu-Schäffer, A. (2019). “Dynamic Walking on Compliant and Uneven Terrain Using Dcm and Passivity-Based Whole-Body Control,” in Int. Conf. On Humanoid Robotics (ICHR) (IEEE). doi:10.1109/humanoids43949.2019.9035053
Nakaoka, S., Hattori, S., Kanehiro, F., Kajita, S., and Hirukawa, H. (2007). “Constraint-based Dynamics Simulator for Humanoid Robots with Shock Absorbing Mechanisms,” in IEEE Intelligent Conference on Intelligent Robots and Systems (IROS) (IEEE), 3641–3647. doi:10.1109/iros.2007.4399415
Negrello, F., Catalano, M. G., Garabini, M., Poggiani, M., Caldwell, D. G., Tsagarakis, N. G., et al. (2017). “Design and Characterization of a Novel High-Compliance spring for Robots with Soft Joints,” in IEEE International Conference on Advanced Intelligent Mechatronics, AIM 2017 (Munich, Germany: IEEE). doi:10.1109/aim.2017.8014029
Ramuzat, N., Forget, F., Bonnet, V., Gautier, M., Boria, S., and Stasse, O. (2020). “Actuator Model, Identification and Differential Dynamic Programming for a Talos Humanoid Robot,” in European Control Conference (ECC) (IEEE). doi:10.23919/ecc51009.2020.9143817
Romualdi, G., Dafarra, S., Hu, Y., Ramadoss, P., Chavez, F. A., Traversaro, S., et al. (2019). A Benchmarking of Dcm Based Architectures for Position, Velocity and Torque Controlled Humanoid Robots. Int. J. Humanoid Robotics 17, 1. doi:10.1142/S0219843619500348
Stasse, O., Giraud-Esclasse, K., Brousse, E., Naveau, M., Régnier, R., Avrin, G., et al. (2018). Benchmarking the Hrp-2 Humanoid Robot during Locomotion. frontiers Robotics AI 5, 122. doi:10.3389/frobt.2018.00122
Stasse, O., Verrelst, B., Wieber, P.-B., Vanderborght, B., Evrard, P., Kheddar, A., et al. (2008). Modular Architecture for Humanoid Walking Pattern Prototyping and Experiments. Adv. Robotics 22, 589–611. doi:10.1163/156855308x305236
Sugihara, T. (2009). “Standing Stabilizability and Stepping Maneuver in Planar Bipedalism Based on the Best Com-Zmp Regulator,” in Int. Conf. On Robotics and Automation (ICRA) (IEEE). doi:10.1109/robot.2009.5152284
Takenaka, T., Matsumoto, T., and Yoshiike, T. (2009). “Real Time Motion Generation and Control for Biped Robot-1st Report: Walking Gait Pattern Generation-,” in Int. Conf. On Intelligent Robots and Systems (IROS) (IEEE).
Tonneau, S., Song, D., Fernbach, P., Mansard, N., Taïx, M., and Del Prete, A. (2020). “SL1M: Sparse L1-Norm Minimization for Contact Planning on Uneven Terrain,” in Int. Conf. On Robotics and Automation (ICRA) (IEEE). doi:10.1109/icra40945.2020.9197371
Torricelli, D., Gonzalez-Vargas, J., Veneman, J., Mombaur, K., Tsagarakis, N., del Ama, A., et al. (2015). Benchmarking Bipedal Locomotion: A Unified Scheme for Humanoids, Wearable Robots, and Humans. IEEE Robotics Automation Magazine.
Keywords: humanoid robot, benchmarking, whole body control, balance, torque control
Citation: Ramuzat N, Stasse O and Boria S (2022) Benchmarking Whole‐Body Controllers on the TALOS Humanoid Robot. Front. Robot. AI 9:826491. doi: 10.3389/frobt.2022.826491
Received: 30 November 2021; Accepted: 31 January 2022;
Published: 04 March 2022.
Edited by:Fumio Kanehiro, National Institute of Advanced Industrial Science and Technology (AIST), Japan
Reviewed by:Wael Suleiman, Université de Sherbrooke, Canada
Jianwen Luo, The Chinese University of Hong Kong, China
Copyright © 2022 Ramuzat, Stasse and Boria. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: O. Stasse, firstname.lastname@example.org