Do they live in a simulation? Training models for dynamic environments

Generation of synthetic data for machine learning


Hallison Paz

Generation of synthetic data for machine learning

1. Why does synthetic data matter?

2. How to generate synthetic data and train a model with it

3. Do they live in a simulation? Training models for dynamic environments

Recap

Two weeks ago ...

  • We need a lot of data to train machine learning models
  • Creating large scale datasets is hard and presents both technical and ethical challenges
  • Synthetic data can help us overcome these challenges
  • But we must pay the cost of the Reality Gap
    • Domain randomization
    • Domain adaptation

Last week ...

  • There are many tools available to generate synthetic datasets
  • It's important to have a strategy to generate good data
    • We can iterate dataset versions
  • Even a small team can generate large scale synthetic datasets
    • There are already tutorials to help with this task

Today's agenda

  • Why simulations?
  • Techniques to train models for dynamic environments
  • Challenges of simulations for machine learning
  • Tools to generate simulations and capture data

Why Simulations?

Train agents for dynamic and complex environments

Tasks | Self-driving cars


Johnson-Roberson et al., 2017

Tasks | Unmanned Aircraft Systems

  • Aerial photography
  • Search and rescue
  • Package delivery...
Jeremy Castagno, Yu Yao and Ella Atkins. Realtime Rooftop Landing Site Identification and Selection in Urban City Simulation. 2019

Tasks | Human-like manipulation

Tasks | Perform alongside humans

BMW Factory Digital Twin

Behavior | Digital Humans

  • How to train an agent to interact with humans?
    • Simulating humans


Anderson, C.; Du, X.; Vasudevan, R.; & Johnson-Roberson, M. Stochastic Sampling Simulation for Pedestrian Trajectory Prediction. In 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

Behavior | Crowds


Behavior | Crowds


Computer Graphics | Stunts

DeepMotion: Physically Simulated AI Agents | Can They Replace Stuntmen?

Computer Graphics | Animation


New possibilities | Digital Humans...or dogs

What if we perform tasks in the virtual world?

Why learn on simulations

We want to...

  • Train intelligent agents to perform tasks in the real world
  • Simulate behaviors consistent with the real world

Why learn on simulations

We also want to...

  • Get better, more accurate results
    • More data; dataset diversity...
  • Have a cheaper and easier process
    • Automatic annotations
  • Train Faster
    • "Time is an illusion": we can go faster than real-time
  • Train Safer
    • Some scenarios are dangerous to experiment in real life

Techniques

Reinforcement Learning

A model of the idea of trial and error, used to train agents to make decisions in complex environments

Reinforcement Learning

  • Markov Decision Process (MDP): a 5-tuple $\langle S, A, R, P, \rho_0 \rangle$, where
    • $S$ is the set of all valid states
    • $A$ is the set of all valid actions
    • $R : S \times A \times S \to \mathbb{R}$ is the reward function, with $r_t = R(s_t, a_t, s_{t+1})$
    • $P : S \times A \to \mathcal{P}(S)$ is the transition probability function, with $P(s' \mid s, a)$ being the probability of transitioning into state $s'$ if you start in state $s$ and take action $a$
    • $\rho_0$ is the starting state distribution.
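To make the tuple concrete, here is a minimal toy MDP in plain Python; the states, actions, rewards, and transition probabilities below are invented purely for illustration.

```python
import random

# A toy MDP <S, A, R, P, rho_0> with two states and two actions (illustrative only)
S = ["s0", "s1"]                       # set of all valid states
A = ["stay", "move"]                   # set of all valid actions

def R(s, a, s_next):                   # reward function: r_t = R(s_t, a_t, s_{t+1})
    return 1.0 if s_next == "s1" else 0.0

P = {                                  # transition probabilities P(s' | s, a)
    ("s0", "stay"): {"s0": 0.9, "s1": 0.1},
    ("s0", "move"): {"s0": 0.2, "s1": 0.8},
    ("s1", "stay"): {"s0": 0.1, "s1": 0.9},
    ("s1", "move"): {"s0": 0.7, "s1": 0.3},
}

def rho_0():                           # starting state distribution (deterministic here)
    return "s0"

def step(s, a):                        # sample s' ~ P(. | s, a) and compute the reward
    probs = P[(s, a)]
    s_next = random.choices(list(probs), weights=list(probs.values()))[0]
    return s_next, R(s, a, s_next)
```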

Reinforcement Learning

  • Policy: rule used to decide what actions to take
    • Deterministic: $a_t = \mu_{\theta}(s_t)$
    • Stochastic: $a_t \sim \pi_{\theta}(\cdot \mid s_t)$
  • Deep Reinforcement Learning: $\theta$ are the parameters of the network
  • Trajectory: sequence of states and actions
    • $\tau = (s_0, a_0, s_1, a_1, \dots)$
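As a rough PyTorch sketch (state and action dimensions, network sizes, and the Gaussian parameterization are placeholder choices): a deterministic policy maps a state directly to an action, while a stochastic policy defines a distribution we sample from.

```python
import torch
import torch.nn as nn

state_dim, action_dim = 4, 2   # placeholder dimensions

# Deterministic policy: a_t = mu_theta(s_t)
mu = nn.Sequential(nn.Linear(state_dim, 64), nn.Tanh(), nn.Linear(64, action_dim))

# Stochastic (Gaussian) policy: a_t ~ pi_theta(. | s_t)
mean_net = nn.Sequential(nn.Linear(state_dim, 64), nn.Tanh(), nn.Linear(64, action_dim))
log_std = nn.Parameter(torch.zeros(action_dim))

def sample_action(s):
    dist = torch.distributions.Normal(mean_net(s), log_std.exp())
    a = dist.sample()
    return a, dist.log_prob(a).sum(-1)   # log-probability, used later in policy-gradient updates

s = torch.randn(state_dim)
a_det = mu(s)                   # deterministic action
a_sto, logp = sample_action(s)  # stochastic action and its log-probability
```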

Reinforcement Learning

  • Reward or return

$$r_t = R(s_t, a_t, s_{t+1})$$

  • Cumulative reward (return) over a trajectory, with discount factor $\gamma \in (0, 1)$

$$R(\tau) = \sum_{t=0}^{\infty} \gamma^t r_t$$

or, over a finite horizon $T$,

$$R(\tau) = \sum_{t=0}^{T} r_t$$
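As a quick illustration of both definitions (the reward values below are made up):

```python
def discounted_return(rewards, gamma=0.99):
    # R(tau) = sum_t gamma^t * r_t (infinite-horizon form, truncated at len(rewards))
    return sum(gamma**t * r for t, r in enumerate(rewards))

def undiscounted_return(rewards):
    # R(tau) = sum_{t=0}^{T} r_t (finite-horizon form)
    return sum(rewards)

rewards = [0.0, 0.0, 1.0, 1.0]        # hypothetical rewards along one trajectory
print(discounted_return(rewards))     # ≈ 1.95
print(undiscounted_return(rewards))   # 2.0
```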

Reinforcement Learning

  • The goal in RL is to select a policy $\pi$ which maximizes the expected return

$$\pi^* = \arg\max_{\pi} J(\pi)$$

$$J(\pi) = \int_{\tau} P(\tau \mid \pi)\, R(\tau) = \underset{\tau \sim \pi}{E}\left[ R(\tau) \right]$$

$$P(\tau \mid \pi) = \rho_0(s_0) \prod_{t=0}^{T-1} P(s_{t+1} \mid s_t, a_t)\, \pi(a_t \mid s_t)$$
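In practice the integral over trajectories is approximated by sampling: roll out the policy many times and average the returns. A minimal Monte Carlo sketch, reusing the toy MDP helpers (`rho_0`, `step`, `A`) and `discounted_return` from the earlier sketches:

```python
import random

def rollout(policy, horizon=100):
    # Sample one trajectory tau = (s_0, a_0, s_1, a_1, ...) by following the policy
    s, rewards = rho_0(), []
    for _ in range(horizon):
        a = policy(s)
        s, r = step(s, a)
        rewards.append(r)
    return rewards

def estimate_J(policy, n_trajectories=1000):
    # Monte Carlo estimate of J(pi) = E_{tau ~ pi}[R(tau)]
    returns = [discounted_return(rollout(policy)) for _ in range(n_trajectories)]
    return sum(returns) / len(returns)

# e.g., evaluate a uniformly random policy on the toy MDP
print(estimate_J(lambda s: random.choice(A)))
```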

Reinforcement Learning

Imitation Learning

IDEA: learn from an expert demonstration, rather than a carefully designed reward function.

  • Provides prior knowledge to the system, rather than relying on trial and error.

Imitation Learning

  • Behavioral Cloning
    • Directly replicating the desired behavior
  • Inverse Reinforcement Learning / Inverse Optimal Control
    • Learning the hidden objectives of the desired behavior
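Behavioral cloning reduces to supervised learning on expert state-action pairs. A minimal PyTorch sketch, with random tensors standing in for real expert demonstrations:

```python
import torch
import torch.nn as nn

# Hypothetical expert demonstrations: observed states and the actions the expert took
expert_states = torch.randn(1000, 4)
expert_actions = torch.randn(1000, 2)

policy = nn.Sequential(nn.Linear(4, 64), nn.ReLU(), nn.Linear(64, 2))
optimizer = torch.optim.Adam(policy.parameters(), lr=1e-3)

for epoch in range(100):
    pred = policy(expert_states)
    loss = nn.functional.mse_loss(pred, expert_actions)  # imitate the expert's actions
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```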

Challenges

Challenges

  • We know the goal, we know some methods, but...
  • We still need to deal with the Reality Gap
    • Now it's a dynamic environment (more parameters)
    • We have physical constraints
  • Agents usually must act in the real world
    • Sometimes models learn to "cheat" the simulator
  • How to reduce the effort to generate coherent simulations?

Challenges

  • Domain randomization might require careful, task-specific selection of parameters
Chebotar, Y.; Handa, A.; Makoviychuk, V.; Macklin, M.; Issac, J.; Ratliff, N.; & Fox, D. Closing the Sim-to-Real Loop: Adapting Simulation Randomization with Real World Experience. ICRA, 2019.
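For intuition, a tiny sketch of what selecting randomization parameters looks like; the parameter names and ranges below are invented and would need careful, task-specific tuning:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical randomization ranges: too narrow overfits the simulator,
# too wide produces unrealistic scenarios
RANDOMIZATION_RANGES = {
    "friction":          (0.5, 1.2),
    "mass_scale":        (0.8, 1.2),
    "light_intensity":   (0.3, 2.0),
    "camera_jitter_deg": (0.0, 5.0),
}

def sample_sim_params():
    # Draw one randomized parameter set for the next simulated episode
    return {name: rng.uniform(lo, hi) for name, (lo, hi) in RANDOMIZATION_RANGES.items()}

print(sample_sim_params())
```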

Closing the Sim-to-Real Loop

  • An overly wide randomization distribution might be disadvantageous
    • Unrealistic scenarios
    • Stress on physical components
  • Randomization might be biased by the expertise of the practitioner
Chebotar, Y.; Handa, A.; Makoviychuk, V.; Macklin, M.; Issac, J.; Ratliff, N.; & Fox, D. Closing the Sim-to-Real Loop: Adapting Simulation Randomization with Real World Experience. ICRA, 2019.

Closing the Sim-to-Real Loop

  • Start with some initial distribution of the simulation parameters
  • Learn in simulation
  • Use real world roll-outs of learned policies to gradually change the simulation randomization
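A toy, single-parameter sketch of that loop (not the authors' algorithm, which adapts a full randomization distribution from policy roll-outs with a more principled objective): the simulator has one unknown physical parameter, and we shift a Gaussian over it until simulated roll-outs match real ones.

```python
import numpy as np

rng = np.random.default_rng(0)
TRUE_FRICTION = 0.7                      # unknown real-world value (hypothetical)

def simulated_rollout(friction):
    # Stand-in for running the learned policy in a simulator with this parameter
    return 10.0 * friction + rng.normal(0.0, 0.1)

def real_rollout():
    # Stand-in for a real-world roll-out of the same policy
    return 10.0 * TRUE_FRICTION + rng.normal(0.0, 0.1)

mean, std = 0.3, 0.2                     # initial randomization distribution over friction
for _ in range(50):
    samples = rng.normal(mean, std, size=64)                  # sample simulation parameters
    sim_outcomes = np.array([simulated_rollout(f) for f in samples])
    real_outcome = np.mean([real_rollout() for _ in range(4)])

    # Re-weight parameter samples by how closely their simulated roll-outs match
    # the real roll-outs, then refit the Gaussian (a crude CEM-style update)
    weights = np.exp(-(sim_outcomes - real_outcome) ** 2)
    weights /= weights.sum()
    mean = float(np.sum(weights * samples))
    std = max(float(np.sqrt(np.sum(weights * (samples - mean) ** 2))), 0.02)

print(mean)   # drifts toward TRUE_FRICTION over the iterations
```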

Challenges

  • Domain adaptation might be useful for general tasks

GraspGAN

  • Pixel-level domain adaptation
  • Pseudo-real images correct some of the sim-to-real gap
  • However, multi-pixel features or structures may be arbitrarily modified or removed
Konstantinos Bousmalis, Alex Irpan, Paul Wohlhart, Yunfei Bai, Matthew Kelcey, Mrinal Kalakrishnan, Laura Downs, Julian Ibarz, Peter Pastor Sampedro, Kurt Konolige, Sergey Levine, & Vincent Vanhoucke (2018). Using Simulation and Domain Adaptation to Improve Efficiency of Deep Robotic Grasping.
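Schematically, pixel-level adaptation translates simulated images into pseudo-real ones and trains the task model on them, so the simulator's annotations carry over unchanged. The sketch below is illustrative only (tiny stand-in networks, random tensors), not the GraspGAN architecture:

```python
import torch
import torch.nn as nn

# Stand-ins (hypothetical): a pixel-level generator and a downstream grasping model
generator = nn.Sequential(nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
                          nn.Conv2d(16, 3, 3, padding=1))     # sim image -> pseudo-real image
grasp_model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 64 * 64, 1))

sim_images = torch.rand(8, 3, 64, 64)   # placeholder simulated images
sim_labels = torch.rand(8, 1)           # annotations come for free from the simulator

# Train the task model on pseudo-real images; labels transfer directly because
# the adaptation happens only at the pixel level
pseudo_real = generator(sim_images)
loss = nn.functional.binary_cross_entropy_with_logits(grasp_model(pseudo_real), sim_labels)
loss.backward()
```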

RetinaGAN

  • RetinaGAN involves a CycleGAN
  • Strong object semantic awareness through an object detection consistency loss
  • Tests object detection feature understanding for robotics applications
Daniel Ho, Kanishka Rao, Zhuo Xu, Eric Jang, Mohi Khansari, & Yunfei Bai. (2020). RetinaGAN: An Object-aware Approach to Sim-to-Real Transfer.
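The object-detection consistency loss can be sketched as: run a frozen perception model on a simulated image and on its translated version, and penalize differences between the two predictions. Everything below is a toy stand-in (not the RetinaGAN code, which uses a pretrained object detector inside a full CycleGAN):

```python
import torch
import torch.nn as nn

# Hypothetical stand-ins for the generator and the perception model
generator = nn.Sequential(nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
                          nn.Conv2d(16, 3, 3, padding=1))      # sim -> pseudo-real
detector = nn.Sequential(nn.Conv2d(3, 8, 3, padding=1), nn.ReLU(),
                         nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                         nn.Linear(8, 5))                       # toy "detection" outputs
for p in detector.parameters():
    p.requires_grad_(False)     # the perception model stays frozen during adaptation

sim_batch = torch.rand(4, 3, 64, 64)    # placeholder simulated images
translated = generator(sim_batch)

# Object-awareness constraint: detections should not change under translation
consistency_loss = nn.functional.l1_loss(detector(translated), detector(sim_batch))
total_loss = consistency_loss           # + adversarial and cycle losses in the full method
total_loss.backward()
```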

RetinaGAN


RetinaGAN

  • Different task and method
  • Training on data from three separate RetinaGAN models
    • different random seeds and consistency loss weights

Tools

Tools

Conclusion

Conclusion

  • Simulations are important resources for training intelligent agents
  • Reinforcement Learning is a powerful technique for this task
  • Dynamic environments present new possibilities, but also new challenges
  • Simulation platforms keep evolving to support this paradigm - even adding differentiability!

Wow! What's next?

We made it!

Generation of synthetic data for machine learning

1. Why does synthetic data matter?

2. How to generate synthetic data and train a model with it

3. Do they live in a simulation? Training models for dynamic environments

THANK YOU!

hallpaz@impa.br

_footer: [Deep Dribble](https://blog.deepmotion.com/2018/08/07/deepdribble-simulating-basketball-with-ai/): Simulating Basketball with AI

Simulating many years of robotic interaction is quite feasible with modern parallel computing, physics simulation, and rendering technology. Moreover, the resulting data comes with automatically-generated annotations, which is particularly important for tasks where success is hard to infer automatically. The challenge with simulated training is that even the best available simulators do not perfectly capture reality. Models trained purely on synthetic data fail to generalize to the real world, as there is a discrepancy between simulated and real environments, in terms of both visual and physical properties. In fact, the more we increase the fidelity of our simulations, the more effort we have to expend in order to build them, both in terms of implementing complex physical phenomena and in terms of creating the content (e.g., objects, backgrounds) to populate these simulations. This difficulty is compounded by the fact that powerful optimization methods based on deep learning are exceptionally proficient at exploiting simulator flaws: the more powerful the machine learning algorithm, the more likely it is to discover how to "cheat" the simulator to succeed in ways that are infeasible in the real world. The question then becomes: how can a robot utilize simulation to enable it to perform useful tasks in the real world?