Machine Learning

Interview with Hamza Tahir: Co-founder and CTO of ZenML

April 10, 2025

Bio: Hamza Tahir is a software developer turned ML engineer. An indie hacker by heart, he loves ideating, implementing, and…

Machine Learning

Boson AI Introduces Higgs Audio Understanding and Higgs Audio Generation: An Advanced AI Solution with Real-Time Audio Reasoning and Expressive Speech Synthesis for Enterprise Applications

April 10, 2025

In today’s enterprise landscape—especially in insurance and customer support —voice and audio data are more than just recordings; they’re valuable…

MM-Ego: Towards Building Egocentric Multimodal LLMs

April 10, 2025

This research aims to comprehensively explore building a multimodal foundation model for egocentric video understanding. To achieve this goal, we…

RelCon: Relative Contrastive Learning for a Motion Foundation Model for Wearable Data

April 10, 2025

We present RelCon, a novel self-supervised Relative Contrastive learning approach for training a motion foundation model from wearable accelerometry sensors.…

Adaptive Batch Size for Privately Finding Second-order Stationary Points

April 10, 2025

There is a gap between finding a first-order stationary point (FOSP) and a second-order stationary point (SOSP) under differential privacy…

Do LLMs Know Internally When They Follow Instructions?

April 10, 2025

Instruction-following is crucial for building AI agents with large language models (LLMs), as these models must adhere strictly to user-provided…

Machine Learning

This AI Paper Introduces a Machine Learning Framework to Estimate the Inference Budget for Self-Consistency and GenRMs (Generative Reward Models)

April 10, 2025

Large Language Models (LLMs) have demonstrated significant advancements in reasoning capabilities across diverse domains, including mathematics and science. However, improving…

Machine Learning

Pixtral Large is now available in Amazon Bedrock

April 10, 2025

Today, we are excited to announce that Mistral AI’s Pixtral Large foundation model (FM) is generally available in Amazon Bedrock.…

Machine Learning

T* and LV-Haystack: A Spatially-Guided Temporal Search Framework for Efficient Long-Form Video Understanding

April 10, 2025

Understanding long-form videos—ranging from minutes to hours—presents a major challenge in computer vision, especially as video understanding tasks expand beyond…

Machine Learning

ByteDance Introduces VAPO: A Novel Reinforcement Learning Framework for Advanced Reasoning Tasks

April 10, 2025

In the Large Language Models (LLM) RL training, value-free methods like GRPO and DAPO have shown great effectiveness. The true…

Machine Learning

Automating regulatory compliance: A multi-agent solution using Amazon Bedrock and CrewAI

April 10, 2025

Financial institutions today face an increasingly complex regulatory world that demands robust, efficient compliance mechanisms. Although organizations traditionally invest countless…

Machine Learning

Generate user-personalized communication with Amazon Personalize and Amazon Bedrock

April 10, 2025

Today, businesses are using AI and generative models to improve productivity in their teams and provide better experiences to their…

Machine Learning

Model customization, RAG, or both: A case study with Amazon Nova

April 10, 2025

As businesses and developers increasingly seek to optimize their language models for specific tasks, the decision between model customization and…

TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining

April 9, 2025

This paper was accepted at the Scalable Continual Learning for Lifelong Foundation Models (SCLLFM) Workshop at NeurIPS 2024. Large Language…

Machine Learning

Implement human-in-the-loop confirmation with Amazon Bedrock Agents

April 9, 2025

Agents are revolutionizing how businesses automate complex workflows and decision-making processes. Amazon Bedrock Agents helps you accelerate generative AI application…

Machine Learning

TorchSim: A Next-Generation PyTorch-Native Atomistic Simulation Engine for the MLIP Era

April 9, 2025

Radical AI has released TorchSim, a next-generation PyTorch-native atomistic simulation engine for the MLIP era. It accelerates materials simulation by…

Machine Learning

Unveiling Attention Sinks: The Functional Role of First-Token Focus in Stabilizing Large Language Models

April 9, 2025

LLMs often show a peculiar behavior where the first token in a sequence draws unusually high attention—known as an “attention…

Google Releases Agent Development Kit (ADK): An Open-Source AI Framework Integrated with Gemini to Build, Manage, Evaluate and Deploy Multi Agents

April 9, 2025

Google has released the Agent Development Kit (ADK), an open-source framework aimed at making it easier for developers to build,…

Machine Learning

Google Introduces Agent2Agent (A2A): A New Open Protocol that Allows AI Agents Securely Collaborate Across Ecosystems Regardless of Framework or Vendor

April 9, 2025

Google AI recently announced Agent2Agent (A2A), an open protocol designed to facilitate secure, interoperable communication among AI agents built on…

Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms

April 9, 2025

Building a generalist model for user interface (UI) understanding is challenging due to various foundational issues, such as platform diversity,…