Transcribe audio with Java using Universal-1

We recently announced our latest speech recognition model, Universal-1, which achieves state-of-the-art speech-to-text accuracy. Trained on millions of hours of audio data, Universal-1 demonstrates near-human accuracy, even with accented speech, background noise, and difficult phrases like flight numbers and email addresses.

Universal-1 is also an order of magnitude faster than our previous model, Conformer-2, and supports English, Spanish, French, and German, with more languages coming shortly.

Along with Universal-1, we've also introduced two new classes of models: Best and Nano.

Best lets you take advantage of Universal-1 for applications where accuracy is paramount.
Nano is our new cost-effective alternative with support for 99 different languages.

In this post, you'll learn how to transcribe an audio file in your Java applications using Universal-1 and Nano.

Set up the AssemblyAI Java SDK

The easiest way to transcribe audio is by using one of our official SDKs.

To install the AssemblyAI Java SDK, include the latest version of the SDK in your project dependencies:

Maven:

<dependency>
    <groupId>com.assemblyai</groupId>
    <artifactId>assemblyai-java</artifactId>
    <version>1.0.10</version>
</dependency>

Gradle:

dependencies {
    implementation 'com.assemblyai:assemblyai-java:1.0.10'
}

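Then import the SDK classes used in this post, along with java.io.File for local files, in your Java code.

import com.assemblyai.api.AssemblyAI;
import com.assemblyai.api.resources.transcripts.types.SpeechModel;
import com.assemblyai.api.resources.transcripts.types.Transcript;
import com.assemblyai.api.resources.transcripts.types.TranscriptOptionalParams;
import com.assemblyai.api.resources.transcripts.types.TranscriptStatus;
import java.io.File;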

Configure a new authenticated SDK client using your AssemblyAI API key from your account dashboard.

var client = AssemblyAI.builder()
    .apiKey(System.getenv("ASSEMBLYAI_API_KEY"))
    .build();
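The client configuration above reads the key from the ASSEMBLYAI_API_KEY environment variable, and System.getenv returns null when that variable is missing. As a minimal sketch (the ApiKeys class and its require method are hypothetical helpers, not part of the SDK), you could fail fast with a clear message before building the client:

```java
import java.util.Map;

// Hypothetical helper, not part of the AssemblyAI SDK.
final class ApiKeys {
    private ApiKeys() {}

    // Returns the trimmed value of the named variable,
    // or throws if it is missing or blank.
    static String require(Map<String, String> env, String name) {
        String value = env.get(name);
        if (value == null || value.isBlank()) {
            throw new IllegalStateException(
                "Environment variable " + name + " is not set");
        }
        return value.trim();
    }

    public static void main(String[] args) {
        // A fixed map keeps the demo self-contained;
        // application code would pass System.getenv() instead.
        Map<String, String> env = Map.of("ASSEMBLYAI_API_KEY", " demo-key ");
        System.out.println(require(env, "ASSEMBLYAI_API_KEY"));
    }
}
```

In application code, the result of ApiKeys.require(System.getenv(), "ASSEMBLYAI_API_KEY") would be passed to apiKey(...) in the builder.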

You'll find all the operations you need on the AssemblyAI instance.

Transcribe an audio file using Universal-1

All transcriptions use the Best class of models by default, so you'll always get the highest accuracy without any extra configuration.

Use the following code to transcribe an audio file from a URL using Best:

Transcript transcript = client.transcripts().transcribe(
    "https://storage.googleapis.com/aai-web-samples/5_common_sports_injuries.mp3"
);

// Stop if the transcription failed.
// The enclosing method must declare `throws Exception`.
if (transcript.getStatus().equals(TranscriptStatus.ERROR)) {
    throw new Exception(transcript.getError().get());
}

System.out.println(transcript.getText().get());
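The snippet above calls .get() on the error and the text, which suggests both getters return an Optional<String>. Calling .get() on an empty Optional throws a NoSuchElementException with no context. As a rough sketch under that assumption (TranscriptResults and textOrThrow are hypothetical names, and the helper takes plain values so it does not depend on the SDK), the same check could be written without bare .get() calls:

```java
import java.util.Optional;

// Hypothetical helper; it mirrors the status check above using plain values.
final class TranscriptResults {
    private TranscriptResults() {}

    // Returns the transcript text, or throws with a descriptive message when
    // the transcription failed or produced no text.
    static String textOrThrow(boolean failed,
                              Optional<String> error,
                              Optional<String> text) {
        if (failed) {
            throw new IllegalStateException(
                "Transcription failed: " + error.orElse("unknown error"));
        }
        return text.orElseThrow(
            () -> new IllegalStateException("Transcript has no text"));
    }

    public static void main(String[] args) {
        // Stand-in values; real code would pass the transcript's own fields.
        System.out.println(
            textOrThrow(false, Optional.empty(), Optional.of("Hello world")));
    }
}
```

With the SDK, the call would presumably look like textOrThrow(transcript.getStatus().equals(TranscriptStatus.ERROR), transcript.getError(), transcript.getText()), assuming the getters return Optional<String> as the original snippet implies.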

If you instead want to transcribe a local file, create a File object pointing to the path of the file to transcribe:

Transcript transcript = client.transcripts().transcribe(
    new File("./audio.mp3")
);
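When transcribing a local file, it can help to confirm that the path points to a readable file before handing it to the SDK. The following is only a sketch of that check; AudioFiles and requireReadable are hypothetical names, and the code uses nothing beyond the Java standard library:

```java
import java.io.File;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper, not part of the AssemblyAI SDK.
final class AudioFiles {
    private AudioFiles() {}

    // Returns a File for the path, or throws if the path is not a
    // readable regular file.
    static File requireReadable(String path) {
        Path p = Path.of(path);
        if (!Files.isRegularFile(p) || !Files.isReadable(p)) {
            throw new IllegalArgumentException(
                "Not a readable file: " + p.toAbsolutePath());
        }
        return p.toFile();
    }
}
```

The returned File could then be passed to client.transcripts().transcribe(...) in place of new File("./audio.mp3").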

Nano: a cost-effective alternative

To switch between Best and Nano, set the speech model on the TranscriptOptionalParams builder. To use Nano, pass SpeechModel.NANO to speechModel(…):

Transcript transcript = client.transcripts().transcribe(
    "https://storage.googleapis.com/aai-web-samples/5_common_sports_injuries.mp3",
    TranscriptOptionalParams.builder()
        .speechModel(SpeechModel.NANO)
        .build()
);
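If the choice between Best and Nano should come from configuration rather than being hard-coded, one possible approach is to parse a setting string into a model class. This sketch uses a local ModelClass enum as a hypothetical stand-in for the SDK's SpeechModel type, and the setting values "best" and "nano" are assumptions for illustration:

```java
import java.util.Locale;

// Hypothetical helper; ModelClass is a local stand-in, not an SDK type.
final class ModelSettings {
    private ModelSettings() {}

    enum ModelClass { BEST, NANO }

    // Maps a configuration value to a model class. A missing or blank
    // value falls back to BEST, matching the default described in the post.
    static ModelClass parse(String setting) {
        if (setting == null || setting.isBlank()) {
            return ModelClass.BEST;
        }
        switch (setting.trim().toLowerCase(Locale.ROOT)) {
            case "best":
                return ModelClass.BEST;
            case "nano":
                return ModelClass.NANO;
            default:
                throw new IllegalArgumentException(
                    "Unknown model class: " + setting);
        }
    }
}
```

When the result is NANO, you would pass SpeechModel.NANO to speechModel(…) as shown above; when it is BEST, you can omit the parameter, since Best is the default.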

Best, Nano and More with Audio Intelligence

We just transcribed audio with Universal-1 through the Best class of models, and with Nano.

Beyond transcription, AssemblyAI offers many other features to explore, such as:

Entity detection to automatically identify and categorize key information.
Content moderation for detecting inappropriate content in audio files to ensure that your content is safe for all audiences.
PII redaction to minimize sensitive information about individuals by automatically identifying and removing it from your transcript.
LeMUR for applying Large Language Models (LLMs) to audio data in a single line of code.

You can also learn more about our approach to creating superhuman Speech AI models on our Research page.
