This post is co-written with Gordon Campbell, Charles Guan, and Hendra Suryanto from RDC. The mission of Rich Data Co…
Text-to-speech (TTS) technology has made significant strides in recent years, but challenges remain in creating natural, expressive, and high-fidelity speech…
Large Language Models (LLMs) have demonstrated remarkable capabilities in complex reasoning tasks, particularly in mathematical problem-solving and coding applications. Research…
In this tutorial, we demonstrate the workflow for fine-tuning Mistral 7B using QLoRA with Axolotl, showing how to manage limited…
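At the heart of QLoRA is the LoRA update: the base weights stay frozen (and quantized), while two small low-rank matrices are trained and their scaled product is added to the base projection. The sketch below illustrates that update with NumPy; the dimensions, names, and scaling are illustrative assumptions, not Mistral 7B's actual configuration or Axolotl's internals.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes (assumed for the sketch, far smaller than a 7B model)
d_in, d_out, r, alpha = 64, 64, 8, 16

W = rng.normal(size=(d_out, d_in))      # frozen base weight (4-bit quantized in real QLoRA)
A = rng.normal(size=(r, d_in)) * 0.01   # trainable low-rank factor
B = np.zeros((d_out, r))                # zero-initialized so the adapter starts as a no-op

def adapted_forward(x):
    # Base projection plus the scaled low-rank update; alpha / r is the LoRA scaling
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.normal(size=d_in)
# Before any training step (B = 0) the adapted model matches the frozen base model
assert np.allclose(adapted_forward(x), W @ x)
```

Only `A` and `B` receive gradients, so the trainable parameter count is `r * (d_in + d_out)` instead of `d_in * d_out` — the source of QLoRA's memory savings on limited hardware.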
Large foundation models have demonstrated remarkable potential in biomedical applications, offering promising results on various benchmarks and enabling rapid adaptation…
As the need for high-quality training data grows, synthetic data generation has become essential for improving LLM performance. Instruction-tuned models…
Brain-computer interfaces (BCIs) have seen significant progress in recent years, offering communication solutions for individuals with speech or motor impairments.…
Large language models (LLMs) are the foundation for multi-agent systems, allowing multiple AI agents to collaborate, communicate, and solve problems.…
Efficient long-context inference with LLMs requires managing substantial GPU memory due to the high storage demands of key-value (KV) caching.…
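The KV cache stores a key and a value tensor for every layer at every generated position, so its size grows linearly with context length. A back-of-the-envelope estimate makes the scale concrete; the 32-layer, 32-head, 128-dim shape below is an assumed 7B-class configuration, not a figure from the article.

```python
def kv_cache_bytes(num_layers, num_heads, head_dim, seq_len, batch=1, bytes_per_elem=2):
    # Factor of 2 covers the separate key and value tensors; fp16 = 2 bytes/element
    return 2 * num_layers * num_heads * head_dim * seq_len * batch * bytes_per_elem

# Assumed 7B-class transformer shape at a 32k-token context
gb = kv_cache_bytes(32, 32, 128, seq_len=32_768) / 2**30
print(f"KV cache at 32k tokens: {gb:.1f} GiB")  # → 16.0 GiB at fp16
```

At that size the cache alone rivals the weights of the model producing it, which is why long-context work leans on techniques like cache quantization, eviction, and grouped-query attention.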
Diffusion models generate images by progressively refining noise into structured representations. However, the computational cost associated with these models remains…
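The "progressive refinement" runs a learned reverse of a fixed forward process that blends data with Gaussian noise over many steps. The sketch below implements that forward (noising) process with the standard DDPM-style linear schedule; the schedule endpoints and step count are common defaults assumed for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

T = 1000
betas = np.linspace(1e-4, 0.02, T)    # linear noise schedule (a common default)
alpha_bar = np.cumprod(1.0 - betas)   # cumulative fraction of signal retained at step t

def q_sample(x0, t, eps):
    # Forward process: mix clean data with Gaussian noise according to step t
    return np.sqrt(alpha_bar[t]) * x0 + np.sqrt(1.0 - alpha_bar[t]) * eps

x0 = rng.normal(size=10_000)          # stand-in for image pixels
xT = q_sample(x0, T - 1, rng.normal(size=x0.shape))

# By the final step almost no signal survives, so x_T is nearly pure noise,
# and the mixing keeps the overall variance at roughly 1 (variance-preserving)
assert alpha_bar[-1] < 1e-4
assert abs(xT.std() - 1.0) < 0.05
```

Generation reverses this chain step by step, which is exactly where the computational cost the teaser mentions comes from: one network evaluation per denoising step.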
Real-time speech translation presents a complex challenge, requiring seamless integration of speech recognition, machine translation, and text-to-speech synthesis. Traditional cascaded…
Code generation models have made remarkable progress through increased computational power and improved training data quality. State-of-the-art models like Code-Llama,…
Logical reasoning remains a crucial area where AI systems struggle despite advances in processing language and knowledge. Understanding logical reasoning…
In this tutorial, we demonstrate how to efficiently fine-tune the Llama-2 7B Chat model for Python code generation using advanced…
Time series forecasting presents a fundamental challenge due to its intrinsic non-determinism, making it difficult to predict future values accurately.…
As deep learning models continue to grow, quantizing them becomes essential, and the need for effective…
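The simplest version of the idea is symmetric per-tensor int8 quantization: scale the weights so the largest magnitude maps to 127, round to integers, and multiply the scale back at use time. A minimal NumPy sketch, with function names of my own choosing:

```python
import numpy as np

def quantize_int8(w):
    # Symmetric per-tensor quantization: map [-max|w|, max|w|] onto [-127, 127]
    scale = np.abs(w).max() / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.normal(scale=0.05, size=(256, 256)).astype(np.float32)
q, s = quantize_int8(w)

# Round-to-nearest bounds the per-weight reconstruction error by half a step
err = np.abs(dequantize(q, s) - w).max()
assert err <= s / 2 + 1e-8
```

Storage drops 4x versus fp32 at the cost of that bounded rounding error; production schemes refine this with per-channel scales, asymmetric ranges, or sub-8-bit formats.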
Aligning large language models (LLMs) with human values remains difficult due to unclear goals, weak training signals, and the complexity…
Reinforcement learning (RL) for large language models (LLMs) has traditionally relied on outcome-based rewards, which provide feedback only on the…
After the success of large language models (LLMs), research now extends beyond text-based understanding to multimodal reasoning tasks. These…
The integration of visual and textual data in artificial intelligence presents a complex challenge. Traditional models often struggle to interpret…