SciML for robotics

Abstract

In this project we developed a range of scientific machine learning (SciML) methods for data-driven surrogate modeling and control of robotics platforms with embedded HW devices. Specifically, we employ differentiable programming that allow us to systematically combine physics-based models with black-box function approximators to obtain physics-informed dynamical system models of complex robotics platforms. The differentiability of such models allows us to use them in differentiable predictive control (DPC) method developed by our team. In DPC, we leverage the system dynamics surrogates as generative models to train neural control policies in closed-loop simulations. This offline model-based policy optimization approach is inspired by traditional model predictive control (MPC) and allows for systematic handling of state and input constraints for guaranteed performance and safety. For the implementation of these methods, we leverage our Pytorch-based SciML library Neuromancer developed by our team at PNNL. For more details, see the associated project.

Differentiable predictive control deployment on embedded HW devices

Method: We are concerned with the problem of deploying complex nonlinear control policies on HW devices with limited computational and memory resources. To address this issue, we present DPC as a deep learning-based alternative to the explicit MPC. In DPC, we trained a neural state-space model from time-series measurements of the real system and used it to train the neural control policy via stochastic gradient descent by differentiating the MPC loss function through the closed-loop system dynamics model. The DPC method learns model-based control policies with state and input constraints, while supporting time-varying references and constraints.

Embedded HW deployment: In this project, we collaborated with a team at the Slovak University of Technology, in Bratislava, who deployed the trained SciML models and DPC policies on embedded HW platform. Specifically, we demonstrated sim2real transfer and deploy the trained policy on a Raspberry-Pi platform. We experimentally demonstrate that it is possible to train constrained control policies purely based on the measurements of the unknown nonlinear system. We compare the control performance of the DPC method against explicit MPC and report efficiency gains in online computational demands, memory requirements, policy complexity, and construction time. For more details, see our paper.

DPC method: From real system measurements to surrogate neural state space model (SSM) of the system dynamics to offline model-based control policy to deployment on embedded HW device.

Team:

Ján Drgoňa
Karol Kiš (STU, Bratislava)
Aaron Tuor
Draguna Vrabie
Martin Klauco (STU, Bratislava)

Differentiable programming for modeling autonomous underwater vehicles

Method: Conventional physics-based modeling is a time-consuming bottleneck in control design for complex nonlinear systems like autonomous underwater vehicles (AUVs). In contrast, purely data-driven models require a large number of observations and lack operational guarantees for safety-critical systems. SciML models leveraging prior knowledge of the system dynamics have potential to provide reliable models in a typical data-limited scenario for high value complex systems, thereby avoiding months of expensive expert modeling time. In this work, present control-oriented parametric SciML models with varying levels of domain-awareness that exploit known system structure and prior physics knowledge to create constrained deep neural dynamical system models. We employ universal differential equations (UDEs) to construct data-driven control-oriented blackbox and graybox representations of the AUV dynamics. For more details, see our paper.

Conceptual methodology of the differentiable programming-based method for end-to-end modeling and control of autonomous underwater vehicles.

Real-world data: In this project, we collaborated with University of Washington’s Applied Physics Laboratory (UW APL) team who collected experimental data using their IVER AUV platform. The collected datasets have been used to train and validate the SciML surrogate models developed by our team.

Field data collection by the UW APL team using the IVER AUV platform.

Team:

Soumya Vasisht
Wenceslao Shaw Cortez
Aaron Tuor
James Koch
Draguna Vrabie
Peter Brodsky (UW APL)
Greg Okopal (UW APL)
Joseph Sammartino (UW APL)
Ján Drgoňa

Differentiable predictive control of a quadcopter

In this project we develop DPC method with safety guarantees to train neural policies to control a 17 dimensional quadcopter system. The quadrotor has 3 simulations, one differentiable model in PyTorch to allow for DPC policy optimization, one in CasADi to allow for MPC, and one in Mujoco to allow for a validation environment.

Closed-loop simulation profiles of the DPC policy evaluated under different tasks using Mujoco environment.

This is an active project by John Viljoen. Stay tuned for the updates.

Team:

John Viljoen (UC Berkeley - Summer internship)
Wenceslao Shaw Cortez
Draguna Vrabie
Ján Drgoňa

Acknowledgements

This research was supported by the Mathematics for Artificial Reasoning in Science (MARS) initiative via the Laboratory Directed Research and Development (LDRD) investments at Pacific Northwest National Laboratory (PNNL). PNNL is a multi-program national laboratory operated for the U.S. Department of Energy (DOE) by Battelle Memorial Institute under Contract No. DE-AC05-76RL0-1830.

Conventional physics-based modeling is a time-consuming bottleneck in control design for complex nonlinear systems like autonomous underwater vehicles (AUVs). In contrast, purely data-driven models require a large number of observations and lack operational guarantees for safety-critical systems. Data-driven models leveraging available partially characterized dynamics have potential to provide reliable systems models in a typical data-limited scenario for high value complex systems, thereby avoiding months of expensive expert modeling time. In this work we explore this middle-ground between expert-modeled and pure data-driven modeling. We present control-oriented parametric models with varying levels of domain-awareness that exploit known system structure and prior physics knowledge to create constrained deep neural dynamical system models. We employ universal differential equations to construct data-driven blackbox and graybox representations of the AUV dynamics. In addition, we explore a hybrid formulation that explicitly models the residual error related to imperfect graybox models. We compare the prediction performance of the learned models for different distributions of initial conditions and control inputs to assess their suitability for control.

We present differentiable predictive control (DPC) as a deep learning-based alternative to the explicit model predictive control (MPC) for unknown nonlinear systems. In the DPC framework, a neural state-space model is learned from time-series measurements of the system dynamics. The neural control policy is then optimized via stochastic gradient descent approach by differentiating the MPC loss function through the closed-loop system dynamics model. The proposed DPC method learns model-based control policies with state and input constraints, while supporting time-varying references and constraints. In embedded implementation using a Raspberry-Pi platform, we experimentally demonstrate that it is possible to train constrained control policies purely based on the measurements of the unknown nonlinear system. We compare the control performance of the DPC method against explicit MPC and report efficiency gains in online computational demands, memory requirements, policy complexity, and construction time. In particular, we show that our method scales linearly compared to exponential scalability of the explicit MPC solved via multiparametric programming.

Abstract

Differentiable predictive control deployment on embedded HW devices

Differentiable programming for modeling autonomous underwater vehicles

Differentiable predictive control of a quadcopter

Acknowledgements

References

2023

2022