MRLabeling: Create RGB-D Datasets On-The-Fly using Mixed Reality
Abstract
One of the best ways to build better vision models is to train large models on big datasets. However, the process of building such datasets is often costly and tedious. With the ever increasing adoption of Mixed Reality in professional settings, and with the performance improvements of headsets in recent years, we see an opportunity for a tool that combines data collection and annotation in a single process, and that leverages both RGB and depth data provided by the headset sensors. Moreover, assisting machine learning predictive models with user inputs through natural mixed reality interactions is a promising prospective for human-artificial intelligence interactions. In this paper, we present MRLabeling, an application developped for the Microsoft Hololens 2 that allows the easy creation and annotation of datasets directly in Mixed Reality. We first describe the design of the system, the way 3D bounding boxes drawn by the user are projected in 2D to create annotated images on the fly, and the use of segmentation algorithms to go beyond bounding boxes. After that, we explore the use of depth data, and the current limitations of the system, as well as avenues for future work.
Citation
Nguyen, Richard, et al. "MRLabeling: Create RGB-D Datasets On-The-Fly using Mixed Reality." 2023 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct). IEEE, 2023.