How to Engineer and Deploy Mainstream VIO/VSLAM Systems
This article focuses on strategies for turning mainstream VIO (Visual-Inertial Odometry) and VSLAM (Visual Simultaneous Localization and Mapping) systems, such as VINS-Mono and DSO, into deployable engineering products. The content aims for conciseness, but fully absorbing the core methods requires interdisciplinary grounding in software, hardware, and algorithms.
Core Engineering Issues in Mainstream VIO Systems
1. ZUPT (Zero-Velocity Update) and Failure in Special Scenarios
Hardware selection is critical for engineering deployment:
- For indoor environments, use a global-shutter camera and synchronize the IMU and camera with a hardware MCU, driving the camera-IMU time offset Td toward zero (see the sketch after this list);
- For outdoor use with rolling-shutter cameras, exposure time must be precisely controlled to reach commercial-grade loop-closure performance.
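When hardware synchronization is not available, estimators such as VINS-Mono can calibrate the residual offset td online and use it to shift camera timestamps into the IMU clock domain. Below is a minimal sketch of that correction; the names are hypothetical and the convention t_imu = t_cam + td is the one used by VINS-Mono's temporal calibration.

```cpp
// Minimal sketch (names hypothetical) of camera-IMU time alignment.
// With hardware-MCU triggering, td is driven toward zero and this
// correction becomes a no-op; otherwise an estimator such as VINS-Mono
// can calibrate td online.

struct ImuSample {
    double t;        // IMU timestamp, seconds
    double gyro[3];  // angular rate, rad/s
};

// Convention used by VINS-Mono's temporal calibration: t_imu = t_cam + td.
double to_imu_time(double t_cam, double td) { return t_cam + td; }

// Linearly interpolate the gyro rate at the corrected frame time t,
// given two bracketing IMU samples a and b (assumes a.t < t <= b.t).
void gyro_at(const ImuSample& a, const ImuSample& b, double t, double out[3]) {
    const double w = (t - a.t) / (b.t - a.t);
    for (int i = 0; i < 3; ++i)
        out[i] = (1.0 - w) * a.gyro[i] + w * b.gyro[i];
}
```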
Research teams have run a series of engineering validations on mainstream devices such as the RealSense, Kinect V2, and Xiaomi MiVision. Even after thorough software optimization, performance remained subpar even on high-end platforms such as a 12th-generation Intel Core i7 (the Kinect V2 fared relatively better). Ultimately, self-developed camera hardware was required to fully remove the hardware-level bottlenecks.
2. Optimizing the Engineering Overhead of the Main System
As classic academic systems, VINS-Mono and DSO, along with their derivatives (e.g., VINS-Fusion, VI-DSO), have received continuous algorithmic refinement. However, beyond the zero-velocity-update issues above, they still carry heavy overhead in engineering scenarios. Two areas in particular need breakthroughs:
(1) Lightweight Design of Front-End Video Stream Display and Interaction Guidance Modules
The core function of this module is to provide clear video feedback through a human-machine interface, so users can intuitively see the system's initialization status and feature-point tracking results. Yet its optimization is often overlooked.
Laboratory versions (e.g., VINS, DSO) rely heavily on tools like Pangolin and RViz, operating directly on raw image data and using OpenCV on the CPU to overlay and display feature points. This design has significant flaws: processing raw frames consumes substantial compute, and per-point CPU drawing calls add further overhead, making the system unfit for embedded platforms. Even when it does run, the latency of delivering the visual stream to the user grows drastically.
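To make the overhead concrete, the sketch below shows the CPU-bound pattern these lab pipelines follow (a simplified illustration, not their literal code): every frame is copied in full and every tracked feature is rasterized by the CPU before the raw image is pushed to a display window.

```cpp
#include <opencv2/opencv.hpp>
#include <vector>

// Simplified illustration of the CPU-side overlay pattern: each frame is
// copied in full, every feature is drawn pixel-by-pixel on the CPU, and
// the uncompressed result goes to the display. Fine for a debug window
// on a PC; on an embedded target it burns memory bandwidth and CPU time
// that the front end needs.
void show_tracking(const cv::Mat& gray_frame,
                   const std::vector<cv::Point2f>& features) {
    cv::Mat vis;
    cv::cvtColor(gray_frame, vis, cv::COLOR_GRAY2BGR);    // full-frame copy
    for (const auto& p : features)
        cv::circle(vis, p, 2, cv::Scalar(0, 255, 0), -1); // per-point CPU draw
    cv::imshow("tracking", vis);  // raw, uncompressed display path
    cv::waitKey(1);
}
```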
Solution: Encode the video stream with the hardware codec engine of a high-performance vision SoC, and render feature-point overlays and status displays with the hardware OSD blocks. This resolves the long-latency, high-overhead video path on both embedded systems and PCs. The technique is mature in machine vision and supported by mainstream SoCs (e.g., HiSilicon Hi3519A), with diverse implementation routes, though it requires an R&D team with strong PCBA design and embedded development capabilities.
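The hardware path inverts the flow shown above: frames go from the sensor straight into the SoC's codec engine, and the overlay is expressed as a small set of OSD regions that dedicated hardware blends during encode. The sketch below is purely structural; the osd_* and venc_* names are hypothetical stand-ins for a vendor SDK (HiSilicon's MPP, for example, exposes region and video-encode modules for this purpose), not real calls.

```cpp
// Hypothetical sketch of the hardware OSD + codec path. The osd_* and
// venc_* calls stand in for the SoC vendor's SDK; names and signatures
// are illustrative only.
//
// Key point: the CPU never touches frame pixels. It only updates a few
// small overlay regions; blending and encoding happen in hardware, so
// latency and CPU load stay flat as resolution grows.

struct OsdRegion;  // opaque handle to a hardware overlay region

OsdRegion* osd_create_point_layer(int max_points);                 // hypothetical
void       osd_update_points(OsdRegion*, const float* xy, int n);  // hypothetical
void       venc_bind_overlay(int venc_channel, OsdRegion*);        // hypothetical

void setup_feature_overlay(int venc_channel) {
    OsdRegion* layer = osd_create_point_layer(/*max_points=*/256);
    venc_bind_overlay(venc_channel, layer);  // blended in hardware during encode
}

// Called once per frame by the tracker: only coordinates cross over to
// the overlay hardware, never pixels.
void on_tracked_features(OsdRegion* layer, const float* xy, int n) {
    osd_update_points(layer, xy, n);
}
```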
(2) Engineering Implementation of 3D Point Cloud and Pose Display Modules
The core data output by VIO/VSLAM systems comes in two types: 3D spatial coordinates (e.g., inverse depth, homogeneous coordinates) and camera poses. In large-scale systems, this data is typically wrapped in SDKs and data structures, but user-side visualization still requires solving the 3D point-cloud display problem. While VINS and DSO can extend their display via RViz or Pangolin, that approach is not recommended for engineering use.
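As a concrete illustration (the layout is hypothetical, not any specific vendor's API), the per-frame SDK output usually reduces to a pose plus a batch of sparse map points:

```cpp
#include <cstdint>
#include <vector>

// Hypothetical sketch of the per-frame structures such an SDK might
// expose. Field names and layout are illustrative only.

struct Pose {
    double t;     // timestamp, seconds (IMU clock)
    double q[4];  // orientation, unit quaternion (w, x, y, z)
    double p[3];  // position in the world frame, meters
};

struct LandmarkPoint {
    uint32_t id;      // stable feature id across frames
    float    xyz[3];  // 3D position in the world frame, meters
                      // (converted from the estimator's inverse-depth
                      //  parameterization at output time)
};

struct FrameOutput {
    Pose pose;
    std::vector<LandmarkPoint> points;  // sparse map points seen this frame
};
```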
Solutions:
- For embedded development, NVIDIA solutions simplify implementation thanks to their robust ecosystem; Mali-GPU-based solutions require proficiency in OpenCL.
- A better approach is to extend development on the PC side: first complete structured processing of the point-cloud and pose data with SDK encapsulation, then choose a suitable development route:
Option 1: C++ development based on the PCL point-cloud library and the Qt framework.
Option 2: Browser-side development.
Initially, basic interactions like rotation, translation, and scaling are sufficient; a minimal sketch follows.
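For Option 1, PCL's bundled visualizer already provides mouse-driven rotation, translation, and zoom, so a first iteration can be very small. Here is a minimal sketch with placeholder data; in practice the cloud would be filled from the SDK's per-frame output.

```cpp
#include <pcl/point_cloud.h>
#include <pcl/point_types.h>
#include <pcl/visualization/pcl_visualizer.h>

// Minimal PC-side viewer sketch for Option 1 (PCL; a Qt widget can wrap
// this later). PCLVisualizer provides mouse rotation, translation, and
// zoom out of the box, covering the basic interaction needs.
int main() {
    pcl::PointCloud<pcl::PointXYZ>::Ptr cloud(new pcl::PointCloud<pcl::PointXYZ>);
    // Placeholder surface; replace with points from the SDK output.
    for (float x = -1.f; x <= 1.f; x += 0.1f)
        for (float y = -1.f; y <= 1.f; y += 0.1f)
            cloud->push_back(pcl::PointXYZ(x, y, 0.5f * x * y));

    pcl::visualization::PCLVisualizer viewer("VIO map");
    viewer.addPointCloud<pcl::PointXYZ>(cloud, "map");
    viewer.addCoordinateSystem(0.5);  // draw the current pose frame
    while (!viewer.wasStopped())
        viewer.spinOnce(16);          // keep the UI responsive
    return 0;
}
```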
These optimization tasks, while labor-intensive and demanding strong R&D capabilities, are essential for engineering deployment. Without them, VIO/VSLAM systems remain confined to the laboratory and fail the practical needs of industry and end users: over 95% of users will not spare extra main-controller compute for non-core functions.
Note: Core computing resources should prioritize the system’s core modules. In VIO and multi-sensor fusion systems, core modules include only front-end feature processing and back-end state estimation; loop closure detection is merely a "semi-core" function.
RoboBaton-VIOBOT2 provides pure-vision spatial perception cameras designed specifically for robot vision to enhance a robot's environmental awareness. Our cameras deliver real-time spatial perception data, including depth maps, position, and posture, helping robots achieve more efficient spatial localization, object recognition, path planning, dynamic scene understanding, and obstacle avoidance. They are a core hardware component for boosting robot vision performance.