AI-Powered Immersive Assistance for Interactive Task Execution in Industrial Environments

System-level (left) and user-level (right) perspective of the immersive AI assistant.

Abstract

Many industrial sectors rely on well-trained employees who can operate complex machinery. In this work, we demonstrate an AI-powered immersive assistance system that supports users in performing complex tasks in industrial environments. Specifically, our system leverages a VR environment that resembles a juice mixer setup. This digital twin of a physical setup simulates complex industrial machinery used to mix preparations or liquids (e.g., as in the pharmaceutical industry) and includes various containers, sensors, pumps, and flow controllers. The setup demonstrates our system’s capabilities in a controlled environment while acting as a proof of concept for broader industrial applications. The core components of our multimodal AI assistant are a large language model and a speech-to-text model that process a video and audio recording of an expert performing the task in the VR environment. The video and speech input extracted from the expert’s recording enables the assistant to provide step-by-step guidance that supports users in executing complex tasks. This demonstration showcases the potential of our AI-powered assistant to reduce cognitive load, increase productivity, and enhance safety in industrial environments.

Publication
Proceedings of the 27th European Conference on Artificial Intelligence
Tomislav Đuričić
Researcher / Machine Learning Engineer / Software Engineer

My research interests include social-based recommender systems, graph neural networks and user modeling.