8th International Workshop on Recovering 6D Object Pose (R6D)

Organized at ICCV 2023, October 3 (PM), Paris

6D object pose estimation

News

Program

October 3 (PM), 2023, UTC+2 time zone

13:30  Opening: Tomáš Hodaň (Meta)
13:45  Invited talk 1: Jonathan Tremblay (Nvidia): A Robot Can See – A Pose Estimation Journey
14:15  Invited talk 2: Shubham Tulsiani (Carnegie Mellon University): Generalizable Sparse-view 6D Pose Estimation
14:45  Results of BOP 2023: Tomáš Hodaň (Meta), Martin Sundermeyer (Google), Yann Labbé (Meta)
15:15  Coffee break at workshop posters
15:45  Invited talk 3: Fabian Manhardt (Google): Learning to Estimate the 6D Pose From Unlabelled Data
16:15  Invited talk 4: Yu Xiang (The University of Texas at Dallas): Connecting 6D Object Pose Estimation with Robot Manipulation
16:45  Oral presentations of BOP winners and workshop papers
17:30  Poster session (posters of BOP winners, workshop papers and invited conference papers)
18:00  End of workshop

Introduction

The workshop covers topics related to estimating the 6D object pose (3D translation and 3D rotation) from RGB/RGB-D images, an important problem for applications such as robotic manipulation, augmented reality and autonomous driving. The introduction of RGB-D sensors, the advent of deep learning, and novel data generation pipelines have led to substantial improvements in object pose estimation. Yet challenges remain, such as robustness against occlusion and clutter, scalability to multiple objects, effective synthetic-to-real domain transfer, fast and reliable object learning/modeling, and handling non-rigid objects and object categories. Addressing these challenges is necessary for achieving reliable solutions that can be deployed in real-world settings.
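For readers new to the problem, a 6D pose is typically represented as a rigid transformation from the object (model) coordinate frame to the camera frame. The following is a minimal illustrative sketch in plain Python/NumPy (not tied to any particular method or dataset) of how a rotation R and a translation t compose into a 4x4 pose matrix and how that matrix maps model points into the camera frame.

import numpy as np

def pose_matrix(R, t):
    # Compose a 3x3 rotation and a 3-vector translation into a 4x4 rigid transform.
    T = np.eye(4)
    T[:3, :3] = R
    T[:3, 3] = t
    return T

def transform_points(T, pts):
    # Apply a 4x4 pose to an (N, 3) array of 3D model points.
    pts_h = np.hstack([pts, np.ones((pts.shape[0], 1))])  # homogeneous coordinates
    return (T @ pts_h.T).T[:, :3]

# Example: rotate 90 degrees about Z and translate by 0.5 along the camera X axis;
# the model point (1, 0, 0) is mapped to (0.5, 1, 0) in the camera frame.
Rz = np.array([[0, -1, 0], [1, 0, 0], [0, 0, 1]], dtype=float)
T = pose_matrix(Rz, np.array([0.5, 0.0, 0.0]))
print(transform_points(T, np.array([[1.0, 0.0, 0.0]])))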

In conjunction with the workshop, we organize the BOP Challenge 2023, the fifth in a series of public competitions with the goal of capturing the status quo in the field of object pose estimation. The 2023 challenge introduces new tasks of detection, segmentation and pose estimation of objects unseen during training. By introducing these tasks, we wish to encourage development of practical methods that can learn novel objects on the fly just from provided 3D models, which is an important capability for industrial setups.

The workshop features invited talks by experts in the field, presentation of the BOP Challenge 2023 results, and oral/poster presentations of accepted workshop papers and of papers invited from the main conference. The workshop is expected to be attended by people working on related topics in both academia and industry.

Previous workshop editions: 1st edition (ICCV 2015), 2nd edition (ECCV 2016), 3rd edition (ICCV 2017), 4th edition (ECCV 2018), 5th edition (ICCV 2019), 6th edition (ECCV 2020), 7th edition (ECCV 2022).

BOP Challenge 2023

To measure the progress in the field of object pose estimation, we created the BOP benchmark and have been organizing challenges on the benchmark datasets in conjunction with the R6D workshops since 2017. This year is no exception. The BOP benchmark is far from being solved, with the pose estimation accuracy improving significantly every challenge — the state of the art moved from 56.9 AR (Average Recall) in 2019, to 69.8 AR in 2020, and to new heights of 83.7 AR in 2022. Out of 49 pose estimation methods evaluated since 2019, the top 18 are from 2022. More details can be found in the BOP challenge 2022 paper.
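For reference, the AR score reported above follows the evaluation methodology used in BOP since 2019: it is the average of the recall rates computed with three pose-error functions (VSD, MSSD and MSPD), each averaged over a range of error thresholds:

AR = (AR_VSD + AR_MSSD + AR_MSPD) / 3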

Besides the three tasks from 2022 (object detection, segmentation and pose estimation of objects seen during training), the 2023 challenge introduces new tasks of detection, segmentation and pose estimation of objects unseen during training. In the new tasks, methods need to learn new objects during a short object onboarding stage (max 5 min per object) and then recognize the objects in images of diverse environments. Such methods are of high practical relevance as, unlike most existing methods, they do not require expensive training for every new object, a requirement that severely limits scalability. This year, methods are provided with 3D mesh models during the onboarding stage (in future editions, we plan to introduce an even more challenging variant where only a few reference images of each object will be provided).
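For illustration, pose estimates for the BOP pose estimation tasks are submitted as a simple CSV file with one line per estimate, following the format documented on the BOP website. The helper below is an illustrative sketch of that format, not part of the official bop_toolkit; field names and units are assumptions based on the documented format and should be checked against the website.

import numpy as np

def save_bop_results(path, estimates):
    # One CSV line per pose estimate: R is a row-major flattened 3x3 rotation matrix,
    # t is a 3D translation in millimeters, score is the estimate's confidence, and
    # time is the per-image processing time in seconds.
    lines = ["scene_id,im_id,obj_id,score,R,t,time"]
    for est in estimates:
        R = " ".join(f"{v:.6f}" for v in np.asarray(est["R"]).flatten())
        t = " ".join(f"{v:.6f}" for v in np.asarray(est["t"]).flatten())
        lines.append(
            f"{est['scene_id']},{est['im_id']},{est['obj_id']},"
            f"{est['score']:.4f},{R},{t},{est['time']:.3f}")
    with open(path, "w") as f:
        f.write("\n".join(lines) + "\n")

Detection and segmentation results are submitted in a COCO-style JSON format instead; see the BOP website for the authoritative specification of all tasks.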

An implicit goal of BOP is to identify the best synthetic-to-real domain transfer techniques. The ability of methods to train effectively on synthetic images is crucial, as collecting ground-truth object poses for real images is prohibitively expensive. In 2020, to foster progress, we joined the development of BlenderProc, an open-source synthesis pipeline, and used it to generate photorealistic training images for the benchmark datasets. Methods trained on these images achieved major performance gains on real test images. However, we can still observe a performance drop due to the domain gap between the synthetic training and real test images. We therefore encourage participants to build on top of BlenderProc and publish their solutions.
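As a starting point, the sketch below follows the spirit of the BlenderProc quickstart example (exact API calls may differ between BlenderProc versions, and scripts are run via the blenderproc CLI, e.g. "blenderproc run render_example.py" where the file name is just a placeholder). A BOP-style pipeline would additionally load the benchmark's 3D object models, randomize materials, lighting and camera poses, and save annotations in the BOP format.

import blenderproc as bproc
import numpy as np

bproc.init()

# A simple object to render (a real pipeline would load BOP object models instead).
obj = bproc.object.create_primitive("MONKEY")

# A point light illuminating the scene.
light = bproc.types.Light()
light.set_type("POINT")
light.set_location([2, -2, 3])
light.set_energy(500)

# Camera resolution and a single camera pose looking at the object.
bproc.camera.set_resolution(640, 480)
cam_pose = bproc.math.build_transformation_mat([0, -5, 0], [np.pi / 2, 0, 0])
bproc.camera.add_camera_pose(cam_pose)

# Render the scene and write the result to an HDF5 container.
data = bproc.renderer.render()
bproc.writer.write_hdf5("output/", data)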

Join the challenge

Call for papers

We invite submissions of unpublished work. Accepted papers will be published in the ICCV workshop proceedings and presented at the workshop.

Papers must be 4–8 pages long, follow the format of the main conference (with the exception of the page limit), and be submitted through the CMT system.

The covered topics include but are not limited to:

Dates

Paper submission deadline: July 24, 2023 (11:59 PM PST)
Paper acceptance notification: August 4, 2023
Paper camera-ready version: August 21, 2023 (11:59 PM PST)
Deadline for submissions to the BOP Challenge 2023: September 26, 2023 (11:59 PM UTC)
Workshop date: October 3 (PM), 2023

Organizers

Tomáš Hodaň, Reality Labs at Meta, tomhodan@meta.com
Martin Sundermeyer, Google, msundermeyer42@gmail.com
Yann Labbé, Reality Labs at Meta, labbe.yann1994@gmail.com
Gu Wang, Tsinghua University
Eric Brachmann, Niantic
Bertram Drost, MVTec
Lingni Ma, Reality Labs at Meta
Sindi Shkodrani, Reality Labs at Meta
Ales Leonardis, University of Birmingham
Carsten Steger, Technical University of Munich, MVTec
Vincent Lepetit, ENPC ParisTech, Technical University Graz
Jiří Matas, Czech Technical University in Prague