SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions

March 16, 2025

In this work, we present and evaluate SELMA, a Speech-Enabled Language Model for virtual Assistant interactions that integrates audio and text as inputs to a Large Language Model (LLM). SELMA is designed to handle three primary and two auxiliary tasks related to interactions with virtual assistants simultaneously within a single end-to-end model. We employ low-rank adaptation modules for parameter-efficient training of both the audio encoder and the LLM. Additionally, we implement a feature pooling strategy enabling the system to recognize global patterns and improve accuracy on tasks less…

Source: Read MoreÂ

Previous ArticleM2R2: Mixture of Multi-Rate Residuals for Efficient Transformer Inference

Next Article Garfield 911 Shirt

Highlights

CVE-2025-4170 – Xavin’s Review Ratings Stored Cross-Site Scripting Vulnerability

May 3, 2025

CVE ID : CVE-2025-4170

Published : May 3, 2025, 3:15 a.m. | 2 hours, 15 minutes ago

Description : The Xavin’s Review Ratings plugin for WordPress is vulnerable to Stored Cross-Site Scripting via the plugin’s ‘xrr’ shortcode in all versions up to, and including, 1.4.0 due to insufficient input sanitization and output escaping on user supplied attributes. This makes it possible for authenticated attackers, with contributor-level access and above, to inject arbitrary web scripts in pages that will execute whenever a user accesses an injected page.

Severity: 6.4 | MEDIUM

Visit the link for more details, such as CVSS details, affected products, timeline, and more…

Sunshine And March Vibes (2025 Wallpapers Edition)

The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

How To Fix Largest Contentful Paint Issues With Subpart Analysis

How To Prevent WordPress SQL Injection Attacks

DistroWatch Weekly, Issue 1120

New Xbox games launching this week, from May 5 through May 11 — Revenge of the Savage Planet hits Xbox Game Pass

I like SteelSeries’ tiniest high performance gaming keyboard, but it’s not the only great magnetic option

Motion Highlights #5

Laravel 11 CRUD Operation

Laravel 11 CRUD Operation

Brisa 0.2.12 – Near 0.3 🔜

Essential Git Command Reference: The Core Operations Every Developer Needs

DistroWatch Weekly, Issue 1120

DistroWatch Weekly, Issue 1120

New Xbox games launching this week, from May 5 through May 11 — Revenge of the Savage Planet hits Xbox Game Pass

FCC clears Surface Laptop 13″, Pro 12″ with Snapdragon, rounded design

SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions

How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark

Google Researchers Advance Diagnostic AI: AMIE Now Matches or Outperforms Primary Care Physicians Using Multimodal Reasoning with Gemini 2.0 Flash

Top Artificial Intelligence AI Courses from Google

How to remove software from a Mac – and why you should do so regularly

Write for Us – Technology, Business & Marketing

The best preorder deals on the Google Pixel Buds Pro 2 we’ve found

CVE-2025-4170 – Xavin’s Review Ratings Stored Cross-Site Scripting Vulnerability

Understanding CSS Box Model Stylesheet

Supercharge your auto scaling for generative AI inference â€“ Introducing Container Caching in SageMaker Inference

hotwired-laravel/turbo-laravel

SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions

Related Posts