The requirements for real-world manipulation tasks are diverse and often conflicting; some tasks require precise motion while others require force compliance; some tasks require avoidance of certain regions while others require convergence to certain states. Satisfying these varied requirements with a fixed state-action representation and control strategy is challenging, impeding the development of a universal robotic foundation model. In this work, we propose Meta-Control, the first LLM-enabled automatic control synthesis approach that creates customized state representations and control strategies tailored to specific tasks. Our core insight is that a meta-control system can be built to automate the thought process that human experts use to design control systems. Specifically, human experts heavily use a model-based, hierarchical (from abstract to concrete) thought model, then compose various dynamic models and controllers together to form a control system. Meta-Control mimics the thought model and harness LLM's extensive control knowledge with Socrates' "art of midwifery" to automate the thought process. Meta-Control stands out for its fully model-based nature, allowing rigorous analysis, generalizability, robustness, efficient parameter tuning, and reliable real-time execution.
Real-world manipulation tasks have inherently different and even opposite requirements. Using an inapproriate representation or control strategy may lead to failure or dangerous behaviors during manipulation. Therefore, a method to customize representation and control strategy is needed.
Failure: Wipe the whiteboard with a Cartesian trajectory planner. Failed because force constraints can not be specified.
Failure: Open the cabinet with a joint space planner. Failed because planned swing path is not accurate.
Failure: Balance a cart pole with a MPC. Failed because the feedback frequency is too low.
Meta-Control: Gripepr planner + force pose hybrid controller. Maintain contact force on the whiteboard.
Meta-Control: Cartesian space planner + stiffness controller. Executable even with inaccurate swing path.
Meta-Control: High frequency LQR for the cart + hybrid pose force tracking.
Meta-Control can satisfy diverse task requirements that may happen in open world manipulation tasks, such as
Meta-Control synthesized control system is fully model-based, enabling generalization to different embodiments. For instance, a control system synthesized on Kinova Gen3 can easily directly generalize to a Franka Panda robot.
Meta-Control synthesized control system can easily generalize to scenarios of different states thanks to the model-based nature.
initial pole angle = +0.1 rad
door width = 0.3 m
object arrangement 1
initial pole angle = -0.5 rad
door width = 0.6 m
object arrangement 2
For tasks involving unknown dynamics, Meta-Control can exploit dynamics prior internalized by LLM. For example, in the balance cart pole task. The dynamics of the whole system is unknown. But Meta-Control can give an analytical approximation of the system on the task level. The synthesized control system for this task is shown below. For simplicity, we describe them by text. Specifically, the task level system is modeled as the linearized dynamics in the form of $\dot z = A z + B v$ around the upright position of the pole, where $A$ and $B$ are directly given by the LLM. Exploiting dynamics priors enables Meta-Control to synthesize high-performance controllers rigorously.
Meta-Control can efficiently tune the parameters of the chosen models and controllers to achieve the desired performance. For example, Meta-Control chooses an LQR controller for the balance task, where the Q and R matrices are critical to the performance. Before parameter tuning, the model fails to balance the pole. But with only two rounds of execution, Meta-Control finds the proper parameter that successfully balance the pole.
Meta-Control synthesized controller is fully model-based. Therefore, we can give rigourous analysis and guarantees for the synthesized control system.
Overview of Meta-Control pipeline. Meta-Control harness LLM's control knowledge to synthesize skills with a 3-level pipeline: strategy level, data flow level, and parameter level. The user only needs to provide the skill description. Meta-Control first designs the system with dynamic model and controller templates, then connects all modules by correctly understanding the semantic meaning of the interfaces, and finally evaluates the synthesized control system and optimizes the parameters based on execution results through few-shot interactions.
Prompts of the Composer:
Design Composer |
Implementation Composer |
Parameter Tuning Composer
Prompts of the control system synthesis:
Design Models and Controllers |
Design Reflection |
Design Summary |
Task Controller Implementation |
Tracking Controller Implementation |
Implementation Reflection |
Parameter Tuning |
Dynamic Model Templates |
Controller Templates |
Input Port Samples
Prompts of the tasks:
Reach the Goal with Collision Avoidance |
Wipe the Whiteboard |
Open the Door |
Balance the Cart Pole |
@article{wei2024meta,
title={Meta-Control: Automatic Model-based Control System Synthesis for Heterogeneous Robot Skills},
author={Wei, Tianhao and Ma, Liqian and Chen, Rui and Zhao, Weiye and Liu, Changliu},
journal={arXiv preprint},
year={2024}
}