Hey 👋, this weekly update contains the latest info on our new product features, tutorials, and our community.
New Language Support for Speaker Diarization
AssemblyAI’s Speaker Diarization model now supports five additional languages: Chinese 🇨🇳, Hindi 🇮🇳, Japanese 🇯🇵, Korean 🇰🇷, and Vietnamese 🇻🇳. This feature is available in both our Best and Nano tiers.
The Speaker Diarization model detects multiple speakers in an audio file and identifies what each speaker said. To start building with this feature, simply set speaker_labels to true in your transcription configuration. For more examples, check out our documentation.
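As a quick illustration, here is a minimal sketch of what that configuration and the diarized result look like. The request body targets AssemblyAI's transcript endpoint (the audio_url is a placeholder), and the response dict is a trimmed, hypothetical example of the shape you get back: each utterance carries a speaker label alongside its text.

```python
# Sketch of a transcription request with diarization enabled.
# The audio_url is a placeholder; speaker_labels=True is the one
# setting that turns on Speaker Diarization.
request = {
    "audio_url": "https://example.com/meeting.mp3",
    "speaker_labels": True,
}

# Trimmed, illustrative example of the diarized response shape:
# each utterance is attributed to a speaker label ("A", "B", ...).
response = {
    "utterances": [
        {"speaker": "A", "text": "Hello, thanks for joining."},
        {"speaker": "B", "text": "Happy to be here."},
    ]
}

# Print a speaker-attributed transcript line by line.
for u in response["utterances"]:
    print(f"Speaker {u['speaker']}: {u['text']}")
```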
Fresh From Our Blog
Automatically determine video sections with AI using Python: Learn how to detect video sections automatically, generate section titles with LLMs, and format the results as YouTube chapters. Read more>>
Filter profanity from audio files using Python: Learn how to filter profanity out of audio and video files with fewer than 10 lines of code in this tutorial. Read more>>
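Behind the tutorial's brevity is the fact that profanity filtering is a single configuration option. A minimal sketch of the request body, assuming the standard transcript endpoint (the audio_url is a placeholder):

```python
# Sketch: profanity filtering is one flag in the transcription request.
# filter_profanity=True masks profane words in the returned transcript.
request = {
    "audio_url": "https://example.com/podcast.mp3",  # placeholder URL
    "filter_profanity": True,
}
```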
How to use audio data in LlamaIndex with Python: Discover how to incorporate audio files into LlamaIndex and build an LLM-powered query engine in this step-by-step tutorial. Read more>>
Our Trending YouTube Tutorials
Coding an AI Voice Bot from Scratch: Real-Time Conversation with Python: Learn how to build a real-time AI voice assistant using Python that can handle incoming calls, transcribe speech, generate intelligent responses, and provide a human-like conversational experience. Perfect for call centers, customer support, and virtual receptionist applications.
How to use @postman to test LLMs with audio data (Transcribe and Understand): Learn how to transcribe audio and video files using AssemblyAI, and how to apply LeMUR, AssemblyAI’s framework for using Large Language Models on spoken data, without writing any code.
Build A Talking AI with LLAMA 3 (Python tutorial): This tutorial shows you how to build a talking AI using AssemblyAI for real-time transcription, LLAMA 3 (run with Ollama) as the language model, and ElevenLabs for text-to-speech.