Open Source for Real-Time Voice Understanding: An Edge AI Solution

Audience:

Topic:

This talk presents the architecture and implementation of a real-time voice understanding system for retail environments, powered by open source AI technologies and deployed on NVIDIA Jetson devices. The system integrates automatic speech recognition (ASR) and large language models (LLMs) to process customer interactions in real-time, enabling conversation understanding, summarization, and sentiment analysis. We will explore system architecture design, selection and evaluation of ASR and LLM models, design trade-offs between latency and accuracy, and share performance test results.

Room:

Ballroom F

Time:

Sunday, March 9, 2025 - 11:45 to 12:45

Audio/Video: