Discriminating Form and Meaning in Multilingual Models with Minimal-Pair ABX Tasks

June 20, 2025

We introduce a set of training-free ABX-style discrimination tasks to evaluate how multilingual language models represent language identity (form) and semantic content (meaning). Inspired from speech processing, these zero-shot tasks measure whether minimal differences in representation can be reliably detected. This offers a flexible and interpretable alternative to probing. Applied to XLM-R (Conneau et al, 2020) across pretraining checkpoints and layers, we find that language discrimination declines over training and becomes concentrated in lower layers, while meaning discrimination…

Source: Read MoreÂ

Previous ArticleScaling Laws for Unsupervised Finetuning of LLMs

Next Article web site review

CodeSOD: One Last ID

9 Ways AI Code Generation in React.js Reduces Technical Debt for Product Teams

GitHub details upcoming changes to improve security in wake of Shai-Hulud worm in npm ecosystem

Syncfusion restructures Essential Studio into multiple different suites to provide greater flexibility for developers

Development Release: MX Linux 25 Beta 1

DistroWatch Weekly, Issue 1140

Distribution Release: DietPi 9.17

Development Release: Zorin OS 18 Beta

A Stream-Oriented UI library for interactive web applications

A Stream-Oriented UI library for interactive web applications

billboard.js 3.17.0: ✨ New Axis Customization, Label Styling & Image Labels!

AEM and Cloudflare Workers: The Ultimate Duo for Blazing Fast Pages

How I Configure Polybar to Customize My Linux Desktop

How I Configure Polybar to Customize My Linux Desktop

Development Release: MX Linux 25 Beta 1

DistroWatch Weekly, Issue 1140

Discriminating Form and Meaning in Multilingual Models with Minimal-Pair ABX Tasks

How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark

Announcing the new cluster creation experience for Amazon SageMaker HyperPod

Pointron – Focus time tracker

CVE-2025-3457 – WordPress Ocean Extra Stored Cross-Site Scripting Vulnerability

CVE-2025-7936 – A vulnerability has been found in fuyang_lipengjun

Fake AI Tools Used to Spread Noodlophile Malware, Targeting 62,000+ via Facebook Lures

Vibe Coding and The Illusion of Progress

CVE-2025-4801 – Apache HTTP Server Command Injection

CVE-2025-8611 – AOMEI Cyber Backup Remote Code Execution (RCE) Missing Authentication

From Distrust to Defense – How AI is Strengthening Cybersecurity

Discriminating Form and Meaning in Multilingual Models with Minimal-Pair ABX Tasks

Related Posts