title

abstract

layout

series

publisher

issn

id

month

tex_title

firstpage

lastpage

page

order

cycles

bibtex_author

author

date

address

container-title

volume

genre

issued

pdf

extras

Nonasymptotic regret analysis of adaptive linear quadratic control with model misspecification

The strategy of pre-training a large model on a diverse dataset, then fine-tuning for a particular application has yielded impressive results in computer vision, natural language processing, and robotic control. This strategy has vast potential in adaptive control, where it is necessary to rapidly adapt to changing conditions with limited data. Toward concretely understanding the benefit of pre-training for adaptive control, we study the adaptive linear quadratic control problem in the setting where the learner has prior knowledge of a collection of basis matrices for the dynamics. This basis is misspecified in the sense that it cannot perfectly represent the dynamics of the underlying data generating process. We propose an algorithm that uses this prior knowledge, and prove upper bounds on the expected regret after $T$ interactions with the system. In the regime where $T$ is small, the upper bounds are dominated by a term scales with either $\texttt{poly}(\log T)$ or $\sqrt{T}$, depending on the prior knowledge available to the learner. When $T$ is large, the regret is dominated by a term that grows with $\delta T$, where $\delta$ quantifies the level of misspecification. This linear term arises due to the inability to perfectly estimate the underlying dynamics using the misspecified basis, and is therefore unavoidable unless the basis matrices are also adapted online. However, it only dominates for large $T$, after the sublinear terms arising due to the error in estimating the weights for the basis matrices become negligible. We provide simulations that validate our analysis. Our simulations also show that offline data from a collection of related systems can be used as part of a pre-training stage to estimate a misspecified dynamics basis, which is in turn used by our adaptive controller.

inproceedings

Proceedings of Machine Learning Research

PMLR

2640-3498

lee24a

0

Nonasymptotic regret analysis of adaptive linear quadratic control with model misspecification

980

992

980-992

980

false

Lee, Bruce and Rantzer, Anders and Matni, Nikolai

given	family
Bruce	Lee

given	family
Anders	Rantzer

given	family
Nikolai	Matni

2024-06-11

Proceedings of the 6th Annual Learning for Dynamics & Control Conference

242

inproceedings

date-parts

2024

6

11

https://proceedings.mlr.press/v242/lee24a/lee24a.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

2024-06-11-lee24a.md

2024-06-11-lee24a.md

Files

2024-06-11-lee24a.md

Latest commit

History

2024-06-11-lee24a.md

File metadata and controls