Device-Directed Speech Detection for Follow-up Conversations Using Large Language Models

November 4, 2024

This paper was accepted at the Adaptive Foundation Models (AFM) workshop at NeurIPS Workshop 2024.
Follow-up conversations with virtual assistants (VAs) enable a user to seamlessly interact with a VA without the need to repeatedly invoke it using a keyword (after the first query). Therefore, accurate Device-Directed Speech Detection (DDSD) from the follow-up queries is critical for enabling naturalistic user experience. To this end, we explore the notion of Large Language Models (LLMs) and model the first query when making inference about the follow-ups (based on the ASR-decoded text), viaâ€¦

Source: Read MoreÂ

Previous ArticleOuteTTS-0.1-350M Released: A Novel Text-to-Speech (TTS) Synthesis Model that Leverages Pure Language Modeling without External Adapters

Next Article This AI Paper from the Technical University of Munich Introduces a Novel Machine Learning Approach to Improving Flow-Based Generative Models with Simulator Feedback

Sunshine And March Vibes (2025 Wallpapers Edition)

The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

How To Fix Largest Contentful Paint Issues With Subpart Analysis

How To Prevent WordPress SQL Injection Attacks

This $4 Steam Deck game includes the most-played classics from my childhood — and it will save you paper

Microsoft shares rare look at radical Windows 11 Start menu designs it explored before settling on the least interesting one of the bunch

NVIDIA’s new GPU driver adds DOOM: The Dark Ages support and improves DLSS in Microsoft Flight Simulator 2024

How to install and use Ollama to run AI LLMs on your Windows 11 PC

Community News: Latest PECL Releases (05.13.2025)

Community News: Latest PECL Releases (05.13.2025)

How We Use Epic Branches. Without Breaking Our Flow.

I think the ergonomics of generators is growing on me.

This $4 Steam Deck game includes the most-played classics from my childhood — and it will save you paper

This $4 Steam Deck game includes the most-played classics from my childhood — and it will save you paper

Microsoft shares rare look at radical Windows 11 Start menu designs it explored before settling on the least interesting one of the bunch

NVIDIA’s new GPU driver adds DOOM: The Dark Ages support and improves DLSS in Microsoft Flight Simulator 2024

Device-Directed Speech Detection for Follow-up Conversations Using Large Language Models

Nmap 7.96 Launches with Lightning-Fast DNS and 612 Scripts

CVE-2024-13940 – Ninja Forms Webhooks SSRF Vulnerability

Voice Interface Design

Test Automation Maintenance Costs: Smart Ways to Reduce

Google Patches 47 Android Security Flaws, Including Actively Exploited CVE-2024-53104

15 Best New Fonts, July 2024

Online Tailor Booking System using PHP and MySQL

XEphem – Serious Interactive Astronomical Software Ephemeris

Amazon Q Apps supports customization and governance of generative AI-powered apps

Build a news recommender application with Amazon Personalize

Device-Directed Speech Detection for Follow-up Conversations Using Large Language Models

Related Posts