"We built a training and evaluation benchmark of 1158 and 250 real RGB images, respectively, at a resolution of 960x720"
Benchmark training set
Each object is shown in various poses and appears equally
All three models have been trained using distributed asynchronous stochastic gradient descent with a learning rate of 0.0001 for 850K iterations.
direita, imagem de cenário 3d com eixos