Evaluation

"We built a training and evaluation benchmark of 1158 and 250 real RGB images, respectively, at a resolution of 960x720"
Benchmark training set
- Random subsets of the objects of interest
- Cluttered background
- Different lighting conditions
Each object is shown in various poses and appears equally
All three models have been trained using distributed asynchronous stochastic gradient descent with a learning rate of 0.0001 for 850K iterations.

How to generate synthetic data and train a model with it?

Generation of synthetic data for machine learning

Hallison Paz

Generation of synthetic data for machine learning

1. Why does synthetic data matter?

2. How to generate synthetic data and train a model with it

3. Do they live in a simulation? Training models for dynamic environments

Recap

Last week, we saw ...

Today's agenda

Tools

Tools

Blender

BlenderProc

Omniverse

Omniverse

Unity

Unity

Strategy

An annotation saved is an annotation earned: Using fully synthetic training for object detection

Last week...

Pipeline

Pose

Deterministic schedule for poses

Occlusion layer

Post processing

Evaluation

Synthetic vs Real

Random vs Curriculum strategy

Relative size of background objects

Higher number of foreground objects -> better

"note that we only set an upper limit to the number of foreground objects drawn in one image"

Real vs Synthetic Background

Other parameters

How to generate the synthetic dataset

Unity Tutorials

Dataset evaluation

Can we do better?

Procedural generation?

How to train a model

Demo on Pytorch

Conclusion

Conclusion

Wow! What's next?

Next week...

Generation of synthetic data for machine learning

1. Why does synthetic data matter?

2. How to generate synthetic data and train a model with it

3. Do they live in a simulation? Training models for dynamic environments

THANK YOU!