A Comparative Analysis of Tightly-Coupled Monocular, Binocular, and Stereo VINS

Mrinal Kanti Paul¹, Kejian Wu², Joel A. Hesch³, Esha Nerurkar⁴, Stergios Roumeliotis⁵

¹University of Minnesota - Twin Cities
²XREAL
³Google Inc.
⁴University of Minnesota
⁵Apple Inc.

Details

10:15 - 10:20 | Tue 30 May | Room 4311/4312 | TUA3.5

Session: Computer Vision 1

Abstract

In this paper, a sliding-window two-camera vision-aided inertial navigation system (VINS) is presented in the square-root inverse domain. The performance of the system is assessed for the cases where feature matches across the two-camera images are processed with or without any stereo constraints (i.e., stereo vs. binocular). To support the comparison results, a theoretical analysis on the information gain when transitioning from binocular to stereo is also presented. Additionally, the advantage of using a two-camera (both stereo and binocular) system over a monocular VINS is assessed. Furthermore, the impact on the achieved accuracy of different image-processing frontends and estimator design choices is quantified. Finally, a thorough evaluation of the algorithm's processing requirements, which runs in real-time on a mobile processor, as well as its achieved accuracy as compared to alternative approaches is provided, for various scenes and motion profiles.