ORIGINAL RESEARCH article
Human-Inspired Eigenmovement Concept Provides Coupling-Free Sensorimotor Control in Humanoid Robot
- 1Institute of Higher Nervous Activity and Neurophysiology, Russian Academy of Science, Moscow, Russia
- 2Department of Neurology, University Clinics of Freiburg, Freiburg, Germany
- 3Russian National Research Medical University, Moscow, Russia
- 4Institute of Computer Science, Academy of Science of the Czech Republic, Prague, Czechia
Control of a multi-body system in both robots and humans may face the problem of destabilizing dynamic coupling effects arising between linked body segments. The state of the art solutions in robotics are full state feedback controllers. For human hip-ankle coordination, a more parsimonious and theoretically stable alternative to the robotics solution has been suggested in terms of the Eigenmovement (EM) control. Eigenmovements are kinematic synergies designed to describe the multi DoF system, and its control, with a set of independent, and hence coupling-free, scalar equations. This paper investigates whether the EM alternative shows “real-world robustness” against noisy and inaccurate sensors, mechanical non-linearities such as dead zones, and human-like feedback time delays when controlling hip-ankle movements of a balancing humanoid robot. The EM concept and the EM controller are introduced, the robot's dynamics are identified using a biomechanical approach, and robot tests are performed in a human posture control laboratory. The tests show that the EM controller provides stable control of the robot with proactive (“voluntary”) movements and reactive balancing of stance during support surface tilts and translations. Although a preliminary robot-human comparison reveals similarities and differences, we conclude (i) the Eigenmovement concept is a valid candidate when different concepts of human sensorimotor control are considered, and (ii) that human-inspired robot experiments may help to decide in future the choice among the candidates and to improve the design of humanoid robots and robotic rehabilitation devices.
Most human skeletal movements involve several interconnected body segments. Starting from buttressing segments such as the feet when standing, a chain of segments interleaves to the end effector such as the hand in reaching. The joint rotations in such a reaching-while-standing movement occur in a coordinated way, with two aims dictated by physics standing out. A kinematic aim is to maintain the center of mass (COM) of all body segments supported by the ankle joints above the base of support, which is the area under and between the feet, in order to maintain balance against external forces acting on the body such as gravity. A kinetic aim of movement coordination is to minimize effects of inter-segmental coupling torques. Dysfunction in the matching of timing and torque magnitudes across the chain of segments results in inappropriate compensation for body segment masses and inertia and neural time delays, imposing clinically for the kinematic chain as balance problems (Massion, 1992; Mergner, 2012) and for the kinetic chain as irregular and oscillating movements, a pathological symptom called ataxia that is typically found in cerebellar patients (Bastian, 1997). So far neuroscientists devoted considerable attention to the neural mechanisms underlying human kinematic coordination (Massion, 1992), but paid less attention to the neural mechanisms underlying kinetic coordination, on which this paper focuses.
In the technical domain, where industrial robotic devices are often fixed to the ground, kinematic coordination plays a minor role and the kinetic problem in controlling a chain of serially connected links with coupled dynamics such as a robotic arm can easily be solved. The solution is traditionally done by a full state approach, meaning that feedback and feed forward controls of all joints are computed together in a coordinated way that takes into account a full dynamic model of the arm and solves the inverse dynamics problem. In humanoid robots controlling position of an unstable body posture, the situation is more complex and inter-link force compensation is often performed using servo controllers, one for each joint. If feedback time delays are too large to be fully accounted for by predictive algorithms and if damping of the dynamic coupling effects is insufficient, destabilization of the control may result (Ott et al., 2014, 2016). Also, problems of control stability may arise in humanoid robots with several degrees of freedom (DoF) when the body dynamics are not fully known. Measuring acceleration in each link or a distal link using inertial sensors may help to solve the problem. Also, learning algorithms can be used to produce the needed coordination patterns. Usually reinforcement learning is employed in this context where the desired output is known in terms of performance, but not yet in terms of the needed controller outputs.
In the biological domain, neuroscientists studied for example the electromyographic effects from externally evoked coupling forces in the arm muscles. They observed typical response patterns in muscle activity (Lacquaniti and Soechting, 1986) belonging to the long-latency reflexes, which take into account the current arm configuration (Kurtzer et al., 2009) with a response amplitude scaling that involves the cerebellum (Kurtzer et al., 2013). The underlying neural control mechanisms are still unknown. Theoretically, at least, one could conceive that humans use a neural representation of a full state control. A more parsimonious solution has been suggested in terms of the Eigenmovement (EM) concept (Alexandrov et al., 2001a,b, 2005; Alexandrov and Frolov, 2011). It allows designing the control of the kinematics of the chain in the form of independent SISO (single input, single output) controllers.
Since the EM principle often produces coaction of joint torques in the context of predetermined kinematic synergies, there exists a clear overlap with the important concepts of motor primitives and modular control. Many aspects of motor modularity are discussed by d'Avella et al. (2015) and Flash and Bizzi (2016) including its theoretical and experimental substantiations and robotic applications. According to the modularity concept any movement can be decomposed as a superposition of motor primitives or synergies used as building blocks in a modular control architecture. Each module imposes as a specific pattern of motor activity in terms of kinematic, kinetic or EMG synergy. The specificity of an EM mechanism in this context is that it solves the problem of the dynamic coupling as the basis for controlling each kinematic synergy independently from the others. Although the independent control was demonstrated experimentally so far mostly in relatively simple movements we conceive that the EM concept can be extended to the wide class of multi-joint movements (see Discussion, also for robotic implementations).
Development of the EM concept started from biomechanically describing human hip-knee-ankle coordination during trunk bending as movements along eigenvectors of the motion equation (Alexandrov et al., 2001a,b). After showing that the contribution from the knee joints to this coordination tends to be negligible, the approach was restricted to hip-ankle coordination, and it was shown that the concept is applicable to independently controlled feed-forward and feedback situations (Alexandrov and Frolov, 2011) and that postural reactions to external perturbations can be formalized using a PD (proportional, derivative) control with time delays in the feedback loops (Alexandrov et al., 2005). In their studies, the authors considered the possibility that humans use EM controllers in some neurally implemented form. However, similar as with other concepts of human sensorimotor control, the evidence is indirect and still rather limited as long as analogies are drawn mainly from mathematical calculations or computer simulations, while biological constrains such as neural feedback time delays have experimentally not been considered in face of “real world” challenges such as noisy and inaccurate sensors and non-linearities from computational and mechanical “dead zones” and friction, backlashes, etc. Such limitations may constrain also the potential use of the EM concept in humanoid robots and robotic rehabilitation devices.
With these reservations in mind, this paper investigates whether the EM concept is able to control a humanoid robot that shows human anthropometrics and is equipped with human-inspired sensors and actuators. The robot used, PostuRob II (Figure 1), served already before in experiments that tested a human-derived control concept as robotic implementation (Hettich et al., 2014). Similarly, it is currently used in modified form in another neurorobotics study for the overarching goal to experimentally evaluate the “real-world robustness” of human-inspired control concepts and to obtain back from the robot experiments inspirations for the human sensorimotor research.
Figure 1. Postural humanoid robot PostuRob II standing freely on a 6 DOFs motion platform. (A) weights for human-like weight distribution; (B), artificial vestibular system; (C), pneumatic system; (D), hip joint (joint angle and torque sensors); (E), artificial pneumatic “muscles”; (F), ankle joint (joint angle and torque sensors). (See Appendix 1 in Supplementary Material for details)
The following sections describe first the hip-ankle biomechanics model and the EM controller and its operational capabilities. The subsequent sections describe identification of the robot's specific transfer characteristics by estimating its inertial, gravitational and geometric parameters and the properties of the transformation from joint torque commands at the controller output to the actual torques that were experimentally observed at the joints (which in autonomous systems may be achieved by learning). Then, experimental results from testing the robot in a human posture control laboratory are described, including preliminary comparisons with human data, followed by Discussion. Details of the mathematical concepts, the robot, and experimental procedures are given in Appendices.
Materials and Methods
Eigenmovement (EM) Concept and Biomechanical Hip-Ankle Model
Humanoid sagittal movements around hip and ankle joints in the vicinity of vertical body position (Figure 2) are described by
where q is the vector of hip and ankle joint angles, B0 and G0 are the inertial and gravity matrices, and τcon is the vector of joint control torques. The coefficients of B0 and G0 are calculated via the length-mass parameters of Posturob II, as described in Appendix 1 in supplementary Material.
Figure 2. Two-link biomechanical model of the humanoid robot. The trunk is represented by red dotted line and the legs are represented by a green solid line. L1, L2, c1, and c2 are lengths and locations of centers of mass for legs and trunk respectively. On the left body position is expressed in terms of joint coordinates (ankle and hip angles, q1 and q2), on the right in terms of space coordinates trunk in space (TS) and leg in space (LS) relative to the gravitational vertical.
Each EM is in the linear approach the movement along one eigenvector wi that, by definition, satisfies equation
where λi is the corresponding eigenvalue (Alexandrov et al., 2001a). The vector ξ of the time courses of the two EMs is obtained by transforming the vector q by inversion of equation.
where the two columns of matrix W are the eigenvectors wi, (i = 1,2). According to Equations (2) and (3), the dynamic equation (1) takes the following form in terms of EMs.
where Λ is a diagonal matrix with eigenvalues λi, and
The two columns ui (i = 1,2) of matrix U in Equation (5) are the vectors, whose components define the contributions of ankle and hip joint torques to the EM dynamics.
As shown previously (Kuo, 1995; Welch and Ting, 2008; Frolov et al., 2000, 2006), the joint torques τcon, which generates the desired body movement, can be implemented as a PD-controller with time delay Δt in the form
where qd and q define the time course of the desired and actual changes in joint angles, while S and V represent “stiffness” and “viscosity” matrices whose elements define the gain coefficients in the feedback loop.
In EMs, Equation (6) takes the form:
Independent control of each of the two EM means that matrices Seig and Veig are diagonal, so that the vector Equation (1) in terms of joint angles and torques splits into two scalar equations in terms of EMs (Alexandrov et al., 2001a, 2005; Alexandrov and Frolov, 2011), each equivalent to a PD-control of a single-link inverted pendulum:
where Sieig and Vieig are the diagonal elements of matrices Seig and Veig, respectively (λ = J/mgh; J, moment of inertia relative to pendulum axis of rotation; m, pendulum mass and h its altitude; g, gravitational acceleration).
The inverse transformation of Equation (8) gives the stiffness and viscosity matrices S and V in Equation (6) in terms of joint angles:
When matrices Seig and Veig are diagonal, then matrices S and V are symmetrical, but not diagonal (Alexandrov et al., 2005). Therefore, the PD-control in terms of joint angles needs to take into account not only the kinematics and dynamics of a given joint, but also those of all other joints. The number of feedback control parameters in this kind of control, referred to as “full-state feedback control” (Barin, 1989; Park et al., 2004), is equal to the number of elements in the stiffness and viscosity matrices S and V. In the EM approach, notably, the number of feedback control parameters is reduced to the number of diagonal coefficients in the matrices Seig and Veig.
The EM PD-controller is shown schematically in Figure 3. The time delays outside the controller represent time delays Δt1 and Δt2 between controller commands τC and actual torque τ applied to the robot segments, which are mainly induced by the generation of the torques in the robot. These “actuation” delays were intentionally equalized in the present study inside the PD-controller by adding delays of Δ and Δ to the respective joints (Figure 3) such that the total time delays ΔtA and ΔtH of the transformation for ankle and hip joint torques respectively were equal and amounted to ΔtA = Δt1 + Δ = ΔtH = Δt2 + Δ = Δt.
Figure 3. Scheme of EM control. Shown are the EM controller, the actuation, the plant and the sensors. The PD-controller transforms desired and sensory variables q into EM kinematic variables ξ (boxes W−1) and then into EM dynamic variables ηcon (boxes Seig + 1 and Veig) which are in turn transformed into output joint torques τC (box U) with controller time delays (boxes ΔtC). Torque τC becomes effective at the robot segment after an actuation time delay (boxes Δt). Note, that in boxes Seig + 1 the unit is added to Seig in order to take into account the first “gravity” term in the right side of Equation (7). Matrices Seig, Veig, W and U are defined above.
For the robot experiments (see below), the controller has been implemented as a program in Simulink, which allows controlling the robot in real time. As to the controller inputs, the joint angles q give the desired body position in terms of joint angles with respect to each other and the support surface. This is by itself not sufficient to balance in the general case in which the support surface is not a stable reference. The Posturob platform integrates a human inspired vestibular system, mechanically fixed to the upper body and providing the trunk orientation in space (Mergner et al., 2009). Using the vestibular information, the control can be generalized to the condition of support surface tilt in space. In particular, in experiments with the robot standing on moving platform, the information of leg-in-space angle was calculated with help of the vestibular sensor and used as input signal q1 for the leg segment control, and the joint angle signal from the hip joint sensor was used for the hip control. With stable platform, angle and vestibular sensor signals were combined for each joint to improve the signal to noise ratio.
Theoretical Analysis of Control Stability
The independent control for each EM allows the analysis of whole body control stability by two separate analyses of each EM's stability. The stability of each EM is defined by the roots μ of the secular equation of Equation (9):
When in equation (9) Δt > 0, Equation (11) has an infinite number of complex roots μ = α + iω, where α and ω are the real and imaginary parts of the root and i is the imaginary unit (Alexandrov et al., 2005). The solution of equation (9) is stable if the real part α of all the roots of equation (11) is negative. The maximum value of the real part of all the roots of the secular equation is called Lyapunov index. Thus, the solution of equation (9) is stable if its Lyapunov index α < 0. The Lyapunov index defines the characteristic time Δtchr = |α|−1 of the complex system response to the external perturbation.
The minimization of the Lyapunov index for each EM was used as a criterion for optimizing the PD-controller parameters. The optimum parameters were obtained according to a method based on calculations of the ranges in the space of Seig and Veig in which the Lyapunov index does not exceed given values α (Appendix 2 in Supplementary Material). The main results are shown in Figure 4. It shows the minimal Lyapunov index αmin which can be achieved for a given λ and Δt. The values of Seig and Veig which provide αmin were treated as optimal. For each delay Δt there exists some critical value λcrit at which αmin becomes zero. If λ < λcrit, no range space of stability for the given feedback loop delay exists, meaning that the PD-controller does not provide stable control of this dynamic system. Thus, stable PD-control is impossible if feedback loop delay Δt > Δtmax for a given λ or if λ < λcrit for a given Δt.
Figure 4. Effects of feedback delay time on PD-control stability. The negative Lyapunov index αmin (ordinate), is plotted as a function of the inversed index of inertia λ (abscissa) for different delay times Δt as indicated.
The humanoid robot PostuRob II (Figure 1) comprises trunk, legs and feet segments interconnected by the hip and ankle joints. Signals from mechatronic vestibular and joint angle sensors are real-time inputs to a PC. The implemented control system controls artificial pneumatic “muscles” (FESTO AG &Co.KG, Esslingen, Germany; Typ MAS20), which generate a desired torque in the hip and ankle joints. The EM control model was executed as a compiled Simulink model (Real-Time Windows Target, The Math Works Inc., Natick, USA). In the presented experiments, the robot was standing freely on firm support, a 6 DoF motion platform (Mergner et al., 2003), and performed active sinusoidal trunk and leg movements in the sagittal plane with different frequencies and amplitudes, reactive postural responses to external disturbances such as support surface rotation or translation in the sagittal plane. During the experiments, performed in a human posture control laboratory, sensory signals of joint angles and joint angular velocities as well as desired joint angle signals and actual joint torques were recorded with an acquisition rate of 200 Hz (further details in Appendix 1 in Supplementary Material).
In the first series, the robot's characteristics were evaluated in terms of (a) its inertial and gravity matrices B0 and G0, (b) the “actuation” time delays Δt1 and Δt2 in the transformation of joint torque commands to effective torques at the robot's joints, (c) the optimal PD-controller parameters, and (d) the dynamic response characteristics of the robot and (e) Experimental transfer functions and dynamic response of the controlled system. For a, preliminary (theoretically optimal) parameters of the PD-controller were calculated based on the model of the two-linked rigid rods described above, and for b the time delays Δt1 and Δt2 for the transformation of torque commands to the effective joint torques were preliminarily estimated to be 100 ms. This allowed for stable PostuRob II movements within the tested conditions and, at a later step (comparing the theoretical with the measured transfer functions), to finally calculate optimal parameters of the PD-controller on the basis of the experimentally obtained matrices B0 and G0.
The commanded joint angles in this approach, for the ankle joint and for the hip joint in Equation (1), had synchronous sinusoidal time courses with seven different frequencies f: 0.05, 0.1, 0.2, 0.4, 0.6, 0.8 and 1.0 Hz. Five pairs of amplitudes A1 and A2 of desired signals and were tested for each frequency: (A1, A2) = (0, 10°), (−2.5°, 7.5°), (−5°, 5°), (−4.5°, 1.5°), and (3°, 0). With these amplitudes, the zero moment point position remained inside the support area defined by the area comprised by the feet.
Evaluation of the Robot'S Inertial and Gravity Matrices
The evaluation was performed by integration of Equation (1) with the help of the experimentally recorded joint torques and a given set of elements for matrices B0 and G0 (compare Kuo, 1995; Alexandrov and Frolov, 2011). The elements of these matrices, which minimize the error between the joint angles obtained by the numerical integration and the experimental joint angles, were taken for estimating the robot's characteristics. Their search was performed by the gradient descent method given in the MATLAB software. Parameters presented in Appendix 1 in Supplementary Material were taken as the initial ones. The search terminated on the step when the tolerance function decreased less than by 10−4 as compared with the previous step.
Evaluation of Time Delays in Transformation from Torque Commands to Effective Joint Torques
The transformation from the joint torque commands at the PD-controller output to the experimentally obtained joint torques is realized in PostuRob II independently for the hip joint and the ankle joint. Crosstalk between the joint torques is negligible and the transformation is performed in each joint separately with different time delays (boxes Δt1, Δt2 in Figure 2). The matrix Fττ describes the transfer function of the desired torque commands to the effective torques at the robot segments by
where ω = 2πf, f is the frequency of the sinusoidal robot movement, i is the imaginary unit, and Δt1, Δt2 are the delays in the transformation of torque commands to the torques in the ankle and hip joints, respectively. The delays Δt1 and Δt2 that provided the best fit with the experimental transfer function were then taken for the subsequent estimations.
The experiments with Posturob II were performed on a motion platform in a human posture control laboratory. They comprised in addition to voluntary movement tests also tests of balancing biped stance during external disturbances (details in Appendix 1 in Supplementary Material). We refrained from adjusting the above control parameters to specific experimental conditions even when this was associated with particular technical insufficiencies such as an increased static friction. Proactive lean movement and reactive postural lean responses were tested. Both could be performed either in space coordinates using the artificial vestibular sensor (change in trunk-space angle, TS, or leg-space angle, LS) or in proprioceptive coordinates (change in trunk-leg angle, TL, or leg-foot angle, LF).
Evaluation of the Robot'S Inertial and Gravity Matrices
The elements of matrices B0 and G0 obtained experimentally amounted to B011 = 65.01 N m s2 rad−1, B012 = B021 = 10.09 N m s2 rad−1, G011 = 460.01 N m rad−1, G012 = G021 = G022 = 103.02 N m rad−1 (these values replaced in the following the initial values given in Appendix 1 in supplementary Material). The experimental transfer function was calculated according to Appendix 3 in Supplementary Material for the case that the two components of the signal X(t) are the joint angles and the two components of the signal Y(t) are the corresponding joint torques recorded during the above 35 described cyclic movements of PostuRob II. The theoretical transfer function was calculated according to equation
using the inertial and gravity matrices B0 and G0 obtained experimentally. The root-mean-square error of the mismatch between and amounted to 9.9%.
Evaluation of Delays in Torque Actuators
Figure 5 shows the experimentally obtained transfer function and the theoretical transfer function for the transformation of the torque commands to the joint torques. The function was calculated according to Equation (12) and according to Appendix 3 in Supplementary Material. The two components of the signal X(t) are the control signals and at the PD-controller output (Figure 3) for the ankle and hip joints respectively, and the two components of the signal Y(t) are the corresponding experimental recordings of the joint torques. The “actuation delays” that provided the minimum root-mean-square error between the and amounted to Δt1 = 0.091 s (ankle) and Δt2 = 0.053 s (hip). The minimum root-mean-square error amounted to 5.1%. Note that the off-diagonal elements of the experimental transformation matrix are small compared to the diagonal elements, indicating very small crosstalk torques between the joints.
Figure 5. The gain (A) and phase (B) characteristics of theoretical transfer function (curves) calculated according to (13) with joint torque delays τ1 = 0.091 s (ankle) and τ2 = 0.053 s (hip) and experimental (markers) for the transformation of the controller output joint torques to the actual joint torques. Experimental data points: ° , □ , ▾ , ♢ .
Readjustments of the Parameters for the EM PD-Controller
The solutions of Equations (2) and (5) for the experimentally obtained matrices B0 and G0 give the following eigenvalues λi, eigenvectors wi, and vectors ui (i = 1, 2) defined in Appendix 1 in Supplementary Material:
The actuation delays Δt1 = 0.091 s and Δt2 = 0.053 s were intentionally equalized by adding delays of = 0.009 s and = 0.047 s to the respective joint inside the PD-controller (Figure 3). As a result, the total delays for both ankle and hip joint torques amounted to Δt = 0.1 s. The two markers in Figure 4 located on the dashed curve for Δt = 0.1 s and the two eigenvalues λ1 and λ2 of PostuRob II indicate that the Lyapunov indexes in the experimental movements amounted to αmin1 = −5.6 s−1 for the first EM and αmin2 = −4.1 s−1 for the second EM.
The following optimal values of stiffness and viscosity in the PD-controller were calculated according to Appendix 2 in Supplementary Material for the obtained values of λ1, λ2, and Δt = 0.1 s:
To obtain the estimate of the transfer function Fqq from desired to actual kinematics, cyclic movements of PostuRob II were recorded using the optimal parameters obtained so far. These recordings were used to calculate an experimental transfer function from desired to actual kinematics (Appendix 3 in Supplementary Material) and were compared to a theoretical transfer function (Appendix 4 in Supplementary Material).
Figure 6 shows the gain (Figure 6A) and phase (Figure 6B) characteristics of the transfer function Fqq from the desired kinematics to the actual kinematics in terms of joint angles. The experimentally obtained values calculated according to Appendix 3 in Supplementary Material are compared with the theoretical ones calculated according to Appendix 4 in Supplementary Material.
Figure 6. The gain (A) and phase (B) characteristics of the theoretical (curves) and experimental (data points) transfer functions of the desired kinematics to the actual kinematics in terms of joint angles q. Experimental points: ° - , □ , ▾ , ♢ .
The off-diagonal elements and of the transfer function Fqq are theoretically and experimentally non-zero. However, the off-diagonal elements are small as compared with the diagonal elements and . In general, the experimental data points in Figure 6 qualitatively correspond to the theoretical results (curves), this despite some data scatter.
Proactive Movements of the Robot
Proactive TS and LS movements were performed in addition to the robot experiments also in model simulations. A first overview was obtained with desired sinusoidal TS movements in space coordinates. Figure 7 shows the “voluntary” signals (desired trunk-space angle, TS!) in comparison with the executed movements signal (TS) for 0.2 and 1.2 Hz sine frequency (A, simulation data; B, robot data). At 0.2 Hz, only a very small coupling effect of TS on LS is visible in the simulations (A1), whereas a small in-phase reaction occurs in the robot (B1). At 1.2 Hz, the evoked LS excursions are increased and shifted toward counter-phase already in the simulation (A2) and more so in the robot (B2).
Figure 7. Model simulation (A) and robot experiments (B) of internally generated (“voluntary”) sinusoidal rotations of desired trunk-space angle (TS!). Dynamic effects on leg-in-space (LS) would ideally be absent (see text). Note that the support surface is here not rotating, so that desired ankle angle is equal to leg in space, LS, while (i.e., using a notation that is more intuitive in the following).
Noting that the separation between dynamic and static effects and the use of space coordinates Pcomplicate the interpretation of the robots behavior, movement commands with smoothed ramp-like waveform (raised cosine velocity function, see Appendix 1 in Supplementary Material) and proprioceptive coordinates were used in the simulations and robot experiments (Figure 8). Stable stance was obtained in both scenarios. In the model simulations a desired trunk-leg angle signal of 4° (TL! = 4°) leads to a slight TL overshoot and a weak dynamic LF counter excursion (A1). With a desired leg-foot angle (LF!) as command a slight LF overshoot and a clearly larger transient dynamic TL counter responses occurred (A2). In the corresponding robot experiments, the resulting TL movement also shows a dynamic response, mainly attributed to static friction effects (B1). LF showed no considerable dynamic effect, but a static excursion in TL lean direction. With a desired LF command, the resulting LF lean movement showed a dynamic overshoot (B2). The effect on TL consisted of very small dynamic counter-effects and a relatively large static excursion in the direction of LF of approximately 5.5°. The large TL response is mainly due to the relatively large weight of the trunk. It should be noticed that the used controller (PD) does not guarantee a null static error in the general case. This is more evident in robot experiments (B2) than in simulations (A2) where the control parameters can be perfectly tuned to the system. Taken together, dynamic coupling in simulations and the robot experiments were not completely abolished, but strongly reduced (e.g., A2), while gravitational torque effects where prominent, this mainly when the control operated in proprioceptive coordinates (Figure 8) and less so in space coordinates (Figure 7).
Figure 8. Model simulation (A) and robot experiments (B) of internally generated (“voluntary”) rotations with ‘raised cosine velocity profile’ (amplitude, 4°; dominant frequency, 0.2 Hz) of TL (desired, TL!) in (A1,B1) and of LF (desired, LF!) in (A2,B2).
Reactive Responses to External Disturbances
Using the pseudo-random ternary sequence (PRTS) stimulus allows to analyze externally evoked LS and TS sway responses over a broad spectrum of frequencies (Hettich et al., 2014; adopted from Peterka, 2002). The method allows analyzing the data in the frequency domain in terms of frequency response functions (FRFs) and coherence functions (see Appendix 1 in Supplementary Material). Examples of the time series of the stimulus and responses in the sagittal plane are given in Figure 9A for support surface tilt with vestibular input (control of leg segment operated in coordinates of gravitational space) and in Figure 9B for support surface translation (control operated in joint coordinates). Note that the robot successfully maintains balance with relatively small angular leg and trunk excursions in the two experiments shown and in other balancing experiments performed (see also Film in Supplementary Materials). The FRF results for the tilt experiments with pp 2° and pp 8° are given in Figure 10. They show that the robot keeps the orientation of the legs in space and the trunk in space upright. The robot was able to maintain balance also without vestibular input (control of leg segment operated with respect to the feet) when the support surface tilt amplitude was reduced to 3° or smaller (Figures 11, 12), a performance that qualitatively is similar to that of vestibular loss human subjects (see Discussion).
Figure 9. Time series of stimulus and sway responses of the robot to support surface tilt (A, with vestibular function; leg-in-space tilt, LS, and trunk-in-space tilt, TS, in space coordinates) and support surface translation (B, without vestibular function; leg-in-space tilt, LS, and trunk-in-space tilt, TS, in platform coordinates) using the PRTS stimulus waveform.
Figure 10. (A–C) Sway responses to support surface tilt of the robot with vestibular input presented in terms of frequency response functions (FRF) of LS to tilt (A), TS to tilt (B), and the ratio curves of TS gain to LS gain and difference curves between TS phase and LS phase (C) (PRTS stimulus of pp 2° and pp 8°).
Figure 11. Time series of stimulus and sway responses of the robot without vestibular function to support surface tilt. (A) PRTS tilt stimuli of peak-peak amplitudes of 1°, 2°, and 3°. (B) Responses of leg in space, LS, and trunk in space, TS. Limiting tilt peak-peak amplitudes to 3° prevented falling.
Figure 12. Balancing of stance on tilting support surface of the robot without vestibular function (control is exclusively proprioceptive). Shown are frequency response functions (FRFs) and coherence functions for leg in space (LS; A) and trunk in space (TS; B). Superimposed are human data for comparison (mean of 5 vestibular-loss subjects, eyes closed).
Theoretically, the EM concept is a relatively simple and efficient method to cope in a multi-DoF system with the coupling forces between mechanically linked segments. This study tested whether the EM concept is able to cope with coupling forces also in the control of a real-world technical device such as a humanoid robot, in which the control faces non-ideal properties such as noisy, inaccurate and non-linear sensors, friction and backlash, etc. These real world conditions may challenge control robustness in face of human-like feedback time delays. The robot experiments demonstrate that the EM control method copes considerably well with the real-world properties in a humanoid robot with human-like anthropometrics and equipped with human-inspired sensors and actuators. Therefore, we consider the EM control method a valid candidate that should be considered when making inferences on which method humans may use for their sensorimotor control. In the following, we first briefly address general issues of the EM concept before considering our experimental findings and consider alternatives to the EM control method.
Considering the EM method in this study was not meant to reduce the number of degrees of freedom of the system or to solve a redundancy problem, because the number of DOFs is the same when controlling joints or EM spaces. Rather, the benefit of using the EM concept is a control simplification in that multiple EMs can simultaneously be controlled dynamically independently of each other. For example, with a trunk bending forward the robot may simultaneously move the leg segment backwards in order to balance the COM over its feet as base of support (corresponding to a kinematic synergy), but alternatively may maintain the leg segment vertical (coping with forward shift of the COM through corresponding ankle torque). Generally, it should be noted that EMs are not synergies observable in the space of joint angles, but a step in the design of a control system that can produce arbitrary poses and trajectories (within the limit of the robot's dynamic responses). Independent PD-control of separate EMs allows clearly longer time delays in the feedback loop than the independent PD control of separate joints. For example, the Lyapunov index showed that the independent PD-controls of separate EMs in PostuRob II is stable up to a time delay of Δt = 200 ms, while the limit is Δt = 150 ms using independent controls of separate joints (see Appendix 5 in Supplementary Material). Conceivably, in a more complex control system such as the human one, additional or other mechanisms may contribute to control stability in face of long time delays.
The definition of the EMs implies a linearization of the system. With the control of upright body posture, a natural choice for the linearization point is the vertical position. Principally, however, the system has previously been shown to work also for a wider range of movements, exploiting successive linearization points as described for arm movements (Frolov et al., 2013). It remains to be shown how complexity increases when the EM method has to deal continuously with large operative spaces.
Despite the fact that the design of Posturob II takes into account the human anthropometry, time delays in the control loop and some human-inspired sensors and actuators it ignores several known constituents of the human posture control system such as load-related proprioceptive sensors (Dietz, 1998), foot deformation (Wright et al., 2012), some minor role of the knee joints (Alexandrov et al., 2001b), and more. Even though ignoring these parts of the human posture control, the EM approach provided stable maintenance of posture and movements in the humanoid robot. Similarly, previous feedback control models, using in the absence of visual information only joint angle proprioceptive and vestibular sensory inputs, were sufficient to quantitatively describe human responses to moderate support surface tilt stimuli in the sagittal plane, as also shown with other control models such as the independent channel (IC) model (SIP biomechanics Peterka, 2002) or the disturbance estimation and compensation (DEC) model (DIP biomechanics; (Hettich et al., 2014); this study includes a direct comparison between data of humans and of Posturob II using the human-derived model). The model of Park et al. (2004); see also (Kuo and Zajac, 1993), one of several currently available models of posture control, used proprioceptive linear full state feedback control to describe human responses to backwards translation and found only moderate improvement when increasing complexity from a 2-segment to a 4-segment model.
The main result of the present study is the experimental demonstration that the EM method copes well with a PD-control of a “real-world” mechanical anthropomorphic robot. The feedback loop parameters for the independent control in each EM were calculated from the robot anthropometrics, including human-like feedback time delays. Other characteristics of the robot as a “real-world” system, which typically are not exactly known such as friction, nonlinearities, noise, backlashes, inaccuracies, etc., were ignored. These unaccounted factors led to clear deviations of measured results from model predicted results, but influenced relatively little the overall characteristics of the movements and did not contradict the hypothesis that the EM concept can provide in principle stable performance of the robot. This applies to both proactive movements and reactive balancing of stance during unforeseen external disturbances in the sagittal plane (such as horizontal translation of the support surface and, when vestibular information was included into the control, support surface tilt).
Another aspect to be considered in the present approach was that the EM implementation aimed at an optimal stability of the control (see Section Theoretical Analysis of Control Stability). This does not imply that thereby the robot's postural responses automatically become similar to human subjects. In other words, optimizing the EM control for Posturob II does not mean that the robot's postural responses become human-like because of its human-inspired sensors and actuators alone. The present approach differs from that in a previous study, which also used Posturob II (Hettich et al., 2014). There, human-like postural responses of the robot were obtained by fitting the control parameters of a posture control model, which later controlled the robot, to the human responses. Still, we considered it as interesting to compare in the present context the robot data with human data to visualize differences in the sway response behavior. To this end, we superimposed on the robot's frequency response functions shown in Figure 12 the results obtained with the same stimulus and set up obtained from a group of vestibular loss human subjects with eyes closed. The reason for this choice was to consider an especially simple control that uses mainly proprioceptive sensors (in humans possibly including force cues; Mergner et al., 2009). Outstanding differences between human and robot data are larger human sway responses in the mid-frequency range of the PRTS stimulus, a slightly different phase behavior, and smaller coherence in the low to mid frequency range, possibly indicating higher sensory or motor noise (or more general, effects not fully taken into account by the model). Future studies may use parameter identification methods in order to fit the EM concept to human data.
Possible Implications for Robotics
Kinematic synergies are widely used in humanoid robotics, typically with the purpose to simplify movement control. Kinetic synergies, i.e., predefined coordination between joint torques, although considered in numerous human studies (Prattichizzo et al., 2010; Shim et al., 2010), are rather sparsely implemented in explicit form in robots. As to the EM, they are formally kinematic synergies, yet the variables η are kinetic synergies defined by the coefficients of the matrix U. Formally, EM are defined to be coupling-free and to cover the whole space of joint motion. They are not producing a simplification of the control in terms of DOFs, yet, being dynamically independent, they simplify the structure of the control problem to the case of the control of several independent SISO systems. This can be advantageous for robotic control both in simplifying the design of the control system and to improve robustness with respect to modifications in control parameters, since modifications to one of the SISO controllers does not affect the others. This can be important in cases when the control parameters need to be dynamically adjusted, e.g., to improve balance performance during different tasks or in different scenarios.
However, the simplification of the control design by using SISO controllers comes at the cost of designing the transformation matrices U and W. This transformation requires a reliable model of the body dynamics, which may be problematic with certain tasks such as with full body control of a humanoid that involves a large number of DOFs. A further limitation is given by the system linearization required to define the Eigen movements, since this implies the necessity to linearize the system in different points of the control space when trying to cover a wide range of motions and/or when involving several distributed DOFs in complex behaviors such as walking. Overall, considering advantages versus limitations of the EM concept applied to robotics, we conceive that whole body control tasks like balancing upright stance would profit from it, because the limited range of motion makes errors due to linearization negligible. We would expect profits also for applications in which the dynamic requirements of the task and the presence of time delays make the coupling forces highly relevant, such as in the control of fast arm reaching (Frolov et al., 2013). In case that the number of DOFs involved in a task is large so as to make full state solutions non trivial, use of the EM approach with accurate definition of W and U still may be practicable, this especially if the control task is reduced to a subset of variables by means of integration of task specific constraints.
A major conclusion from the present experiments refers to the robot experiments as an experimental tool when studying the human sensorimotor control system. Using a humanoid robot for comparing different bio-inspired control concepts with each other on the same robot will help to define criteria for presumed human-likeness of control algorithms—with potential benefits also for use in humanoid robotics and user acceptance in robotic neurorehabilitation. In doing so, the robot experiments provide a valuable “real world” test that complements model simulations, especially in addressing the problem of control stability in face of human time delays. Finally, an experience from this study is that “learning by doing” in the robot experiments provides inspirations also for the research of the human control system.
All subjects gave their informed written consent to the study that was approved by the Ethics Committee of the Freiburg University Clinics, in accordance with the Declaration of Helsinki.
All authors performed the experiments and collected the data. GH, VL, and TM performed the computer simulations and analyzed the robot and human data. All authors contributed to the interpretation of the data and contributed to writing the manuscript. All authors approved the final version of the manuscript for submission.
The work was supported by the Russian Science Foundation, grant 16-15-00219, for the author AF and by the European Commission FP7 Grant 600698 H2R and 610454 EMBalance for the authors GH, TM, and VL.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/article/10.3389/fnbot.2017.00022/full#supplementary-material
Alexandrov, A. V., Frolov, A. A., and Massion, J. (2001a). Biomechanical analysis of movement strategies in human forward trunk bending. I. Modeling. Biol. Cybern. 84, 425–434. doi: 10.1007/PL00007986
Alexandrov, A. V., Frolov, A. A., and Massion, J. (2001b). Biomechanical analysis of movement strategies in human forward trunk bending. II. Experimental study. Biol. Cybern. 84, 435–443. doi: 10.1007/PL00007987
d'Avella, A., Giese, M., Ivanenko, Y. P., Schack, T., and Flash, T. (2015). Editorial: modularity in motor control: from muscle synergies to cognitive action representation. Front. Comput. Neurosci. 9:126. doi: 10.3389/fncom.2015.00126
Frolov, A. A., Biryukova, E. V., Bobrov, P. D., Mokienko, O. A., Platonov, A. K., Pryanichnikov, V. E., et al. (2013). Principles of neurorehabilitation based on the brain-computer interface and biologically adequate control of the exoskeleton. Hum. Physiol. 39, 196–208. doi: 10.1134/S0362119713020035
Frolov, A. A., Prokopenko, R. A., Dufosse, M., and Ouezdou, F. B. (2006). Adjustment of the human arm viscoelastic properties to the direction of reaching. Biol. Cybern. 94, 97–109. doi: 10.1007/s00422-005-0018-8
Hettich, G., Asslander, L., Gollhofer, A., and Mergner, T. (2014). Human hip-ankle coordination emerging from multisensory feedback control. Hum. Mov. Sci. 37, 123–146. doi: 10.1016/j.humov.2014.07.004
Kurtzer, I., Pruszynski, J. A., and Scott, S. H. (2009). Long-latency responses during reaching account for the mechanical interaction between the shoulder and elbow joints. J. Neurophysiol. 102, 3004–3015. doi: 10.1152/jn.00453.2009
Kurtzer, I., Trautman, P., Rasquinha, R. J., Bhanpuri, N. H., Scott, S. H., and Bastian, A. J. (2013). Cerebellar damage diminishes long-latency responses to multijoint perturbations. J. Neurophysiol. 109, 2228–2241. doi: 10.1152/jn.00145.2012
Lacquaniti, F., and Soechting, J. F. (1986). EMG responses to load perturbations of the upper limb: effect of dynamic coupling between shoulder and elbow motion. Exp. Brain Res. 61, 482–496. doi: 10.1007/BF00237573
Mergner, T. (2012). “Postural control by disturbance estimation and compensation through long-loop responses,” in Routledge Handbook of Motor Control and Motor Learning, eds A. Gollhofer, W. Taube, and J. B Nielsen (London: Routledge), 50–70.
Ott, C., Dietrich, A., and Roa, M. A. (2014). “Torque-based multi-task and balancing control for humanoid robots,” in International Conference on Ubiquitous Robots and Ambient Intelligence (URAI) (Kuala Lumpur), 143–144.
Ott, C., Henze, B., Hettich, G., Seyde, T. N., Roa, M. A., Lippi, V., et al. (2016). Good posture, good balance: comparison of bioinspired and model-based approaches for posture control of humanoid robots. IEEE Robot. Autom. Magazine 23, 22–33. doi: 10.1109/MRA.2015.2507098
Shim, J. K., Hooke, A. W., Kim, Y. S., Park, J., Karol, S., and Kim, Y. H. (2010). Handwriting: hand-pen contact force synergies in circle drawing tasks. J. Biomech. 43, 2249–2253. doi: 10.1016/j.jbiomech.2010.04.033
Welch, T. D., and Ting, L. H. (2008). A feedback model reproduces muscle activity during human postural responses to support-surface translations. J. Neurophysiol. 99, 1032–1038. doi: 10.1152/jn.01110.2007
Keywords: human sensorimotor system, neuromechanics, biorobotics, motor control, eigenmovements
Citation: Alexandrov AV, Lippi V, Mergner T, Frolov AA, Hettich G and Husek D (2017) Human-Inspired Eigenmovement Concept Provides Coupling-Free Sensorimotor Control in Humanoid Robot. Front. Neurorobot. 11:22. doi: 10.3389/fnbot.2017.00022
Received: 31 January 2017; Accepted: 04 April 2017;
Published: 25 April 2017.
Edited by:Massimo Sartori, University of Göttingen, Germany
Reviewed by:Yury Ivanenko, Fondazione Santa Lucia (IRCCS), Italy
Malte Schilling, Bielefeld University, Germany
Copyright © 2017 Alexandrov, Lippi, Mergner, Frolov, Hettich and Husek. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Thomas Mergner, firstname.lastname@example.org