Scirobotics Add6864
Scirobotics Add6864
only shown to be so in quasi-static control in a low-deflection actuators, and workspaces. The first demonstrates curvatures up
regime (approximately 18° of curvature). It is important to note to 110° (robot #1) and the second up to 180° (robot #2) (Fig. 1).
that simple linearized models are likely to work at these low deflec- This capability is enabled by the introduction of the static
tions because the full nonlinearity of the dynamics may only be ex- Koopman pregain, which maps held inputs to converged robot con-
plored at high deflections. Some studies (34–36) show imperfect yet figurations. After being learned from data, we used it as a pregain
functional controllers using purely state-space linear MPC, suggest- term in the LQR implementation. The static Koopman pregain
ing the quasi-linearity of these systems. greatly increases the accuracy of static pointing tasks and improves
In addition to MPC, the combination of the Koopman operator the stability of dynamic tasks.
and the linear quadratic regulator (K-LQR) optimal control scheme We show that our approach requires minimal training and low
has shown promise in rigid robot applications (38, 39) and the computational cost, both for determining the model and for con-
control of fluid dynamics problems (40). Mamakoukas et al. (41) trolling the robot. Collecting our training data takes less than 5
show promise in a 1–degree-of-freedom soft robotic fish application min, and the computation of the model takes less than a second,
using a similar Koopman structure. as opposed to the long training times required by many neural
Even with these many advances in the field, existing soft arm net–based approaches. Our approach estimates both the static and
control implementations (34–36, 41, 42) have yet to be demonstrat- dynamic control Koopman operators, enabling the use of low-
ed in the inertial, nonlinear regime. To compare with other works, latency, efficient optimal control methods; this enables real-time
we introduce the following definitions of the “inertial regime” and tracking of fast-moving reference positions, even if field-deployed
“nonlinear dynamics.” We define the inertial regime for soft arms to on a low-power microcontroller.
be when the inertial force experienced by the tip Ftip is of the order
of its weight Ftip = matip ≍ mg, meaning atip ≍ g. Here, m is the mass
Table 1. Comparison with existing soft, continuum arms shows advances in speed, acceleration, and deflection during closed-loop control. This work
demonstrates a 10× increase in reference tracking tip speed (Speed) and a 4× improvement in tip deflection angle (Deflection) and advances closed-loop control
of soft robot arms into the inertial regime atip > g = 9.81 m/s2. The acceleration (Accel) of the soft arm’s tip atip was computed using the centripetal acceleration of
soft arms for which circular reference tracking data are available. The distance from the base to the tip of each arm is also given (Length). Note that closed-loop
deflection data do not include the large-deflection open-loop tests present in some works. LQR, linear quadratic regulator; RNN, recurrent neural network; ROM,
(analytical) reduced order model; PCC, piecewise constant curvature; MPC, model predictive control; LSTM, long short-term memory; TRPO, trust region policy
optimization; GPR, Gaussian process regression; TO, trajectory optimization; FFC, feedforward compensator; SM, sliding mode; AF, analytical feedback; R1, robot
#1; R2, robot #2.
Robot Length (m) Speed (m/s) Accel (m/s2) Deflection (°) Model Control method
Dynamic and static Koopman operator optimal control Koopman operators involves training on data that are augmented
The successful real-time control of a soft arm in the inertial and by a chosen basis of observables. The data are collected through a
nonlinear regime requires both a model that captures these dynam- series of training experiments, performed by commanding step
ics and a control methodology that adapts to the motion of the robot inputs with randomly distributed magnitudes. These training data
in real time. We achieved this by building a controller that leverages are partitioned into dynamic and static components, which are used
both the dynamic and the static Koopman operators of the soft arm to train the two separate Koopman operators (see Materials and
system. The Koopman operators describe the evolution in time of Methods). Both the training and model computation processes
functions defined on the robot configurations and inputs. These are fast, requiring only 5 min (approximately 18,000 samples at
functions are called observables, and the approximation of the 60-Hz collection rate) for training data collection, and the matrix
Closed-loop circle tracking in the inertial, nonlinear regime sample robotic tasks. We challenged our soft continuum arm in two
With our control architecture in place, we first sought to character- ways: first, to catch a ball swinging through the air as we demon-
ize the performance across a range of deflections and soft arm strate in Fig. 5, and second, to receive an object from an operator
speeds in a planar circular reference tracking (smooth changes in and to throw it into a reference bin as shown in Fig. 6. Both tests
reference position). We commanded the soft arm’s tip to trace are shown in movie S3. The catching component of this demonstra-
out circular paths in the X-Y plane with three radii (100, 180, and tion is similar to the ball catching performed by a two-link arm with
220 mm) and six frequencies (0.1, 0.3, 0.5, 0.7, 0.9, and 1.1 Hz), as a soft joint (45) but here completed with a fully soft continuum
shown in Fig. 3. The same controller was used for all references, as robot arm.
described in Materials and Methods.
These results show that the soft arm tracks the reference with
consistent performance throughout the full range of deflections DISCUSSION
and speeds tested (Fig. 3, left, and movie S2). The fastest and We present a data-driven framework for the modeling and control
highest deflection circle tracking result demonstrates a tip speed of inertial and nonlinear soft robots. We used KOT to enable the
of 1.5 m/s, a speed-to-length ratio of 3.23 s−1, and a tip acceleration application of linear control methods to this highly nonlinear, iner-
of 11.6 m/s2 in closed-loop control. This is approximately an order tial system. We introduce a K-LQR with static Koopman pregain
of magnitude faster and 40 times higher acceleration than any soft capable of accurately controlling two different soft robots that
continuum arm of which we are aware (see Table 1). The system was exhibit high deflections and inertial motions. Advancing the state
trained exclusively on step inputs, and as such, the model had no a of the art, the proposed method allows the construction and deploy-
priori knowledge of the control objective, nor had it been trained on ment of both a model and an optimal controller from less than 5
circular behaviors. min of training data—to the best of the authors’ knowledge, the
Here, S is the possibly nonlinear state transition function S: M → (24–26, 45). This is true without restriction on the dynamics or
M and x + is the time-shifted state. In our application, M = ℝn. observables.
This nonlinearity is often critical to modeling a system in state We wanted to exploit this linearity to enable the design of an ef-
space, but it complicates the design of control algorithms. We ficient optimal control scheme. This requires extending the
instead turned to an operator-theoretic perspective of dynamics Koopman framework to systems of the form x + = S(x, u), where u
of observables (24). Observables are complex-valued functions ∈ ℝp is a p-dimensional vector of user-specified inputs. In full gen-
defined on the state space f : M → ℂ. We will restrict ourselves to erality, the Koopman operator for systems with input acts on ob-
real-valued observables f : M → ℝ. The set of all possible observables servables of the form f : M � U ! C, where U is the space of all
forms a vector space that is usually infinite dimensional. The control sequences indexed by time uð�Þ : N ! Rp . We redefine
Koopman operator K is defined by Kf :¼ f � S. the state transition function to include inputs S : M × ℝp → M
This operator describes the evolution of observables under the and introduce the left shift operator T : U 7 ! U, which simply
action of the dynamics (1). Although the underlying state space chooses the next input in a sequence ðTuÞðkÞ ¼ uðk þ 1Þ. When
system is nonlinear, the Koopman operator K is always linear the observables are defined on both the states and inputs, their
Xþ :¼ ½xþ þ
1 . . .xK � ð8Þ
Xþ þ þ
lift :¼ ½zðx1 Þ. . .zðxK Þ� ð11Þ
The desired matrices A and B satisfy the equation
Xþ
lift ¼ AX lift þ BU ð12Þ
To approximate A and B, we recasted this equation as a minimi-
Fig. 7. Convergence of the Koopman model and control system. (A) The
zation problem
dynamic Koopman model requires the addition of five time delay observables
and only 1 min of training data to reach minimum prediction error. To determine min kXþ
lift AX lift BUkF ð13Þ
this error, the single-step prediction error of the dynamic Koopman model was A;B
collected for all points as the soft arm moved on a circular path in the X-Y plane
which has the solution
(inset), and the root mean square average was taken. For comparison, a Koopman
model using monomials of the state up to order 4 gave no improvement over the �� ��y
X lift
state-only model. This reconstruction was performed on a model trained on zero ½A B� ¼ Xþ
lift ð14Þ
sinusoidal trajectories. (B) In the closed-loop control, the combination static/
U
dynamic Koopman controller required only 5 min of training data and two time where † is the Moore-Penrose pseudoinverse. Because we prescribe
delays to reach minimum prediction error; accordingly, we used this controller our first n observables to be the states x ∈ M, we can compute the
design for every experiment. Each controller was commanded to move the soft
output matrix using a partial identity matrix
arm’s tip to a sequence of points in the workspace of the soft arm, and the
average root mean square error for all these points was calculated. Using zero
� �
I n�n 0n�m n
time delays resulted in the soft arm being unable to stabilize at any reference po- C¼ ð15Þ
0m n�n 0m n�m n
sition, so that line is not shown.
The action of the matrices A and B on the lifted state via Eq. 5
approximates the action of the Koopman operator K in Eq. 3.
Under certain assumptions, this representation of the Koopman op-
erator converges to the true Koopman operator (30). True
convergence requires infinite data samples, which are uniformly A core assumption of this component of our model is that when
distributed in state space and a collection of observables, which held for enough time, all transient dynamics dissipate, and the robot
span an invariant subspace of the Koopman operator’s underlying achieves a static pose. Therefore, the set of admissible step inputs
function space. We discuss our method of generating training data ustatic corresponds to a set of input-mediated fixed points xstatic.
in the “Training and observables” section. We sought a mapping from the data matrix of step inputs, Ustatic,
to the data matrix of stationary states, Xstatic. Ideally, this mapping
K-LQR would be linear to enable us to use fast, optimal control. The
To date, similar investigations have used MPC to control their soft Koopman framework usually requires the domain and range to be
robotic systems (34–36). Using predictions of the dynamics and a the same, but this requirement can be relaxed if we consider the
tunable prediction horizon, this architecture calculates input se- static Koopman operator (44). The static Koopman operator con-
quences that move the system toward a desired reference position. trasts with the dynamic Koopman operator, which describes the
This enables the use of explicit input and state constraints, but the evolution of observables f : M → ℝ under the action of the
real-time constrained optimizations involved in this method mapping T: M → M. If we define observables on the inputs as g:
demand a high computational overhead. ℝp → ℝ, the static Koopman operator Kstat is defined
In our inertial soft arm controller, explicit constraints are less as Kstat f ðxstat Þ ¼ gðustat Þ.
important than keeping computational cost and latency low. For We desired to approximate the action of the static Koopman op-
unconstrained linear optimal control problems with quadratic erator with a finite dimensional matrix G. To do so, we first con-
cost, the LQR provides an analytical solution that does not structed the data matrix Ustatic with unique step inputs as the
require predictions of the dynamics in real time (46). For our con- columns of the matrix. By feeding these inputs to the system and
troller, we began with the application of LQR to the dynamic allowing transient dynamics to dissipate, we are left with a unique
components, which were used to train dynamic and static Koopman To this end, the first arm was designed to have four actuators
operators (see the “Dynamic and static Koopman operator optimal (two antagonistic pairs) longitudinally aligned with the main
control” section). body to produce planar actuation. This design is behaviorally
Choosing observables is difficult in practice. We chose to imple- similar to others present in the literature (15, 16, 47). When fabri-
ment DMDc with time delay observables (also known as Hankel cated with appropriate pretension, this construction allows for ap-
DMDc) because of their provable convergence as the number of proximately 110° of curvature when fully actuated. With a length of
time delays goes to infinity under certain assumptions on the dy- 45 cm and a maximum diameter (main body diameter plus the di-
namics (31, 32, 43). In reality, adding more time delays gives a di- ameter of the fully inflated muscles) of 6.25 cm, the slenderness
minishing return in prediction accuracy (see Fig. 7A). A single time ratio of this device was 7.2 (the ratio of length to maximum
delay with hundreds of monomials was used in (34–36, 42), but we diameter).
found that time delay–only observables offer better results, with im- The second arm was designed with three actuators, all of which
provements in reconstruction with up to 10 observables (see were affixed to the body such that a torsional deflection would be
Fig. 7A). To create our observables, we used the current measure- induced when inflated. This produces a helical actuation that is
ment of the X-Y-Z positions of the motion trackers xk and appended markedly different from that of the first embodiment. With a
two time-delayed versions of the same states zk = [xk xk − 1 xk − 2]T. length of 53 cm and a maximum diameter of 3.8 cm, this device
Each time delay looks 1/60 s into the past. This proved to be suffi- exhibited a slenderness ratio of 13.9. The muscles were affixed
cient for closed-loop control. For reconstruction, more time delays with pretensions such that, when fully actuated, this device is
give further increases to the model’s accuracy, as shown in Fig. 7A. capable of achieving approximately 180° of curvature.
The synergy of step inputs and time delays allows the discovery For objective (i), with an angle of curvature of at least 110° for
of system eigenvalues in the important 1- to 5-Hz range (the span of both arms, the nonlinearity metric was well achieved (see Fig. 2).
pressures continuously. These three-port valves were chosen for 19. X. Wang, N. Rojas, A data-efficient model-based learning framework for the closed-loop
control of continuum robots. arXiv:2204.10454 [cs.RO] (22 April 2022).
three reasons: their fast response times (<10 ms), accurate response
20. T. G. Thuruthel, B. Shih, C. Laschi, M. T. Tolley, Soft robot perception using embedded soft
(0.75% full-scale absolute accuracy and 0.4% full-scale repeatability sensors and recurrent neural networks. Sci. Robot. 4, eaav1488 (2019).
error), and the ability to accept forced exhaust through their third 21. R. L. Truby, C. Della Santina, D. Rus, Distributed proprioception of 3D configuration in soft,
port. However, this accuracy requires a lower flow rate, which pre- sensorized robots via deep learning. IEEE Robot. Autom. Lett. 5, 3299–3306 (2020).
cluded the use in the much larger main body (primarily because of 22. S. Neppalli, B. Jones, W. McMahan, V. Chitrakaran, I. Walker, M. Pritts, M. Csencsits, C. Rahn,
persistent leaks). Additional information on the general control cir- M. Grissom, Octarm—A soft robotic manipulator, in IEEE/RSJ International Conference on
Intelligent Robots and Systems (IEEE, 2007), p. 2569.
cuitry configuration can be found in the Supplementary Materials.
23. D. Braganza, D. M. Dawson, I. D. Walker, N. Nath, A neural network controller for continuum
robots. IEEE Transact. Robot. 23, 1270–1277 (2007).
24. B. O. Koopman, Hamiltonian systems and transformation in hilbert space. Proc. Natl. Acad.
Supplementary Materials Sci. U.S.A. 17, 315–318 (1931).
This PDF file includes: 25. I. Mezić, Spectral properties of dynamical systems, model reduction and decompositions.
Discussion Nonlinear Dyn. 41, 309–325 (2005).
Figs. S1 to S5 26. I. Mezić, A. Banaszuk, Comparison of systems with complex behavior. Phys. D Nonlinear
Phenom. 197, 101–133 (2004).
Other Supplementary Material for this 27. J. L. Proctor, S. L. Brunton, J. N. Kutz, Dynamic mode decomposition with control. SIAM
manuscript includes the following: J. Appl. Dyn. Syst. 15, 142–161 (2016).
Movies S1 to S5 28. J. L. Proctor, S. L. Brunton, J. N. Kutz, Generalizing Koopman theory to allow for inputs and
control. SIAM J. Appl. Dyn. Syst. 17, 909–930 (2018).
29. L. Shi, K. Karydis, ACD-EDMD: Analytical construction for dictionaries of lifting functions in
REFERENCES AND NOTES Koopman operator-based nonlinear robotic systems. IEEE Robot. Autom. Lett. 7,
9. T. George Thuruthel, Y. Ansari, E. Falotico, C. Laschi, Control strategies for soft robotic 36. D. Bruder, X. Fu, R. B. Gillespie, C. D. Remy, R. Vasudevan, Koopman-based control of a soft
manipulators: A survey. Soft Robot. 5, 149–163 (2018). continuum manipulator under variable loading conditions. arXiv:2002.01407 [cs.RO] (4
February 2020).
10. J. Wang, A. Chortos, Control strategies for soft robot systems. Adv. Intell. Syst. 4,
2100165 (2022). 37. J. Chen, Y. Dang, J. Han, Offset-free model predictive control of a soft manipulator using
the Koopman operator. Mechatronics 86, 102871 (2022).
11. L. Shi, Z. Liu, K. Karydis, Koopman operators for modeling and control of soft robotics.
arXiv:2301.09708 [cs.RO] (7 February 2023). 38. H. Yin, M. Welle, D. Kragic, Policy learning with embedded Koopman optimal control. Proc.
Mach. Learn. Res. 144, 1–14 (2018).
12. O. Yasa, Y. Toshimitsu, M. Y. Michelis, L. S. Jones, M. Filippi, T. Buchner, R. K. Katzschmann,
An overview of soft robotics. Annu. Rev. Control Robot. Auton. Syst. 6, 1–29 (2023). 39. I. Abraham, T. D. Murphey, Active learning of dynamics for data-driven control using
Koopman operators. IEEE Transact. Robot. 35, 1071–1083 (2019).
13. R. K. Katzschmann, C. Della Santina, Y. Toshimitsu, A. Bicchi, D. Rus, Dynamic motion control
of multi-segment soft robots using piecewise constant curvature matched with an aug- 40. A. Gibson, “Application of Koopman linear quadratic regulator to the control of a spherical
mented rigid body model, in 2019 2nd IEEE International Conference on Soft Robotics (Ro- microbubble,” thesis, University of Colorado, Colorado Springs (2022).
boSoft) (IEEE, 2019), pp. 454–461. 41. G. Mamakoukas, M. L. Castano, X. Tan, T. D. Murphey, Derivative-based Koopman operators
14. R. K. Katzschmann, M. Thieffry, O. Goury, A. Kruszewski, T.-M. Guerra, C. Duriez, D. Rus, for real-time control of robotic systems. IEEE Transact. Robot. 37, 2173–2192 (2021).
Dynamically closed-loop controlled soft robotic arm using a reduced order finite element 42. D. Bruder, X. Fu, G. Brent, D. Remy, R. Vasudevan, Data-driven control of soft robots using
model with state observer, in 2019 2nd IEEE International Conference on Soft Robotics (Ro- Koopman operator theory. IEEE Transact. Robot. 37, 948–961 (2021).
boSoft) (IEEE, 2019), pp. 717–724. 43. I. Mezić, On numerical approximations of the Koopman operator. Mathematics 10,
15. C. Della Santina, R. K. Katzschmann, A. Bicchi, D. Rus, Model-based dynamic feedback 1180 (2022).
control of a planar soft robot: Trajectory tracking and interaction with the environment. Int. 44. I. Mezić, Koopman operator, geometry, and learning of dynamical systems. Not. Am. Math.
J. Robot. Res. 39, 490–513 (2020). Soc. 68, 1087–1105 (2021).
16. C. Della Santina, R. L. Truby, D. Rus, Data-driven disturbance observers for estimating ex- 45. Y. Huang, M. Hofer, R. D’Andrea, Offset-free model predictive control: A ball catching ap-
ternal forces on soft robots. IEEE Robot. Autom. Lett. 5, 5717–5724 (2020). plication with a spherical soft robotic arm, in 2021 International Conference on Intelligent
17. T. G. Thuruthel, E. Falotico, F. Renda, C. Laschi, Model-based reinforcement learning for Robots and Systems (IROS) (IEEE/RSJ, 2021), pp. 563–570.
closed-loop dynamic control of soft robotic manipulators. IEEE Transact. Robot. 35, 46. B. D. Anderson, J. B. Moore, Optimal Control: Linear Quadratic Methods (Courier Corpora-
124–134 (2019). tion, 2007).
18. A. Centurelli, L. Arleo, A. Rizzo, S. Tolu, C. Laschi, E. Falotico, Closed-loop dynamic control of 47. M. M. Coad, L. H. Blumenschein, S. Cutler, J. A. Reyna Zepeda, N. D. Naclerio, H. El-Hussieny,
a soft manipulator using deep reinforcement learning. IEEE Robot. Autom. Lett. 7, U. Mehmood, J.-H. Ryu, E. W. Hawkes, A. M. Okamura, Vine robots: Design, teleoperation,
4741–4748 (2022). and deployment for navigation and exploration. IEEE Robot. Autom. Mag. 27,
120–132 (2020).
48. N. D. Naclerio, E. W. Hawkes, Simple, low-hysteresis, foldable, fabric pneumatic artificial control algorithm, prepared the figures, and wrote the manuscript. E.K. designed the
muscle. IEEE Robot. Autom. Lett. 5, 3406–3413 (2020). electronics, performed the experiments, and wrote the manuscript. A.B.C. ran the experiments,
49. L. H. Blumenschein, N. S. Usevitch, B. H. Do, E. W. Hawkes, A. M. Okamura, Helical actuation performed the analysis, and prepared the figures. P.C.C. performed experiments. I.M. advised
on a soft inflated robot body, in 2018 IEEE International Conference on Soft Robotics (Ro- the design of the modeling and control algorithm, the paper, and the experiments. E.W.H.
boSoft) (IEEE, 2018), pp. 245–252. advised the design of the robot, wrote the manuscript, and advised the paper and experiments.
50. A. Kazemipour, O. Fischer, Y. Toshimitsu, K. W. Wong, R. K. Katzschmann, A robust adaptive Competing interests: D.A.H. is the founder and CEO of Vine Medical Inc., which has no financial
approach to dynamic control of soft continuum manipulators. arXiv:2109.11388 [cs.RO] (26 interest in the subject work, nor did it support the investigation in any way. The other authors
February 2021). declare that they have no competing interests. Data and materials availability: All data
needed to support the conclusions of this paper can be found in the Supplementary Materials
Acknowledgments or at https://doi.org/10.5281/zenodo.8184777.
Funding: This work was supported in part by the NSF grant no. 1935327 and the ARO-MURI
W911NF1710306: From Data-Driven Operator Theoretic Schemes to Prediction, Inference, and Submitted 28 June 2022
Control of Systems. Part of the work performed by E.K. was funded via the Fulbright Foundation. Accepted 2 August 2023
Author contributions: D.A.H. designed the robots and test apparatus, wrote the manuscript, Published 30 August 2023
prepared the movies, and performed the experiments. M.J.B. designed the modeling and 10.1126/scirobotics.add6864
Science Robotics (ISSN ) is published by the American Association for the Advancement of Science. 1200 New York Avenue NW,
Washington, DC 20005. The title Science Robotics is a registered trademark of AAAS.
Copyright © 2023 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim
to original U.S. Government Works