1 Introduction

2 Problem Statement

modeling the errors of the simulator in a principled way and trading off evaluation effort and information gain
one simulation -> arbitrary number
cost (being optimized) being partly explained, \(J(\theta)=J_{sim}(\theta)+J_{err}(\theta)\)
parameter vector, additional binary, \(\delta\)
sim: only covariances between, captured by \(k_{sim}\)
- both physics: \(k_{err}\), covariate strongly
synthetic example
- blue: partly, red: directly
noise of measurement
quantify the goal, ES, low entropy
- \(\delta=0\) only provides information about part of the cost, \(J_{sim}\)
trade off, do not require tuning, lead to more experiments on the physical...
entropy, consistent unit of measurement for both information sources
best gain per unit of effort
- "whether the simulator is reliable enough to lead to additional information"
switch, not 2-stage, the quality is not known in advance

cart-pole, Quanser Linear Inverted Pendulum
Simulink model, manufactor, simulator
static state-feedback controller
cost function, control
LQR, two parameters, prior
systematically, lower prior
GP model, hyperparam, convergence
MF-ES and ES
- 10 times
- physical experiments, sim experiments (good illustration! no word games!)

发表于 2021-11-05 00:08 minor_second 阅读(32) 评论(0) 编辑收藏举报