Autonomous vehicles use a variety of sensors and machine-learned models to
predict the behavior of surrounding road users. Most machine-learned models in
the literature rely on quantitative error metrics such as the root mean square
error (RMSE), both as training objectives and for reporting model capabilities.
This focus on quantitative error metrics tends to ignore the more important
behavioral aspect of the models, raising the question of whether these models
really predict human-like behavior. Thus, we propose to analyze the output …

