Machine Learning

Normalizing Flows are Capable Generative Models

June 20, 2025

Normalizing Flows (NFs) are likelihood-based models for continuous inputs. They have demonstrated promising results on both density estimation and generative…

Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results

June 20, 2025

Uncertainty Quantification (UQ) in Language Models (LMs) is key to improving their safety and reliability. Evaluations often use metrics like…

From Backend Automation to Frontend Collaboration: What’s New in AG-UI Latest Update for AI Agent-User Interaction

June 20, 2025

AI agents are increasingly moving from pure backend automators to visible, collaborative elements within modern applications. However, making agents…

This AI Paper from Google Introduces a Causal Framework to Interpret Subgroup Fairness in Machine Learning Evaluations More Reliably

June 20, 2025

Understanding Subgroup Fairness in Machine Learning (ML): Evaluating fairness in machine learning often involves examining how models perform across different…

UC Berkeley Introduces CyberGym: A Real-World Cybersecurity Evaluation Framework to Evaluate AI Agents on Large-Scale Vulnerabilities Across Massive Codebases

June 20, 2025

Cybersecurity has become a significant area of interest in artificial intelligence, driven by the increasing reliance on large software systems…

Build an Intelligent Multi-Tool AI Agent Interface Using Streamlit for Seamless Real-Time Interaction

June 20, 2025

In this tutorial, we’ll build a powerful and interactive Streamlit application that brings together the capabilities of LangChain, the Google…

MiniMax AI Releases MiniMax-M1: A 456B Parameter Hybrid Model for Long-Context and Reinforcement Learning (RL) Tasks

June 19, 2025

The Challenge of Long-Context Reasoning in AI Models: Large reasoning models are not only designed to understand language but are…

Accelerate foundation model training and inference with Amazon SageMaker HyperPod and Amazon SageMaker Studio

June 19, 2025

Modern generative AI model providers require unprecedented computational scale, with pre-training often involving thousands of accelerators running continuously for days,…

Update on the AWS DeepRacer Student Portal

June 19, 2025

The AWS DeepRacer Student Portal will no longer be available starting September 15, 2025. This change comes as part of…

Building trust in AI: The AWS approach to the EU AI Act

June 19, 2025

As AI adoption accelerates and reshapes our future, organizations are adapting to evolving regulatory frameworks. In our report commissioned to…

Build a scalable AI video generator using Amazon SageMaker AI and CogVideoX

June 19, 2025

In recent years, the rapid advancement of artificial intelligence and machine learning (AI/ML) technologies has revolutionized various aspects of digital…

Aligning LLMs by Predicting Preferences from User Writing Samples

June 19, 2025

Accommodating human preferences is essential for creating aligned LLM agents that deliver personalized and effective interactions. Recent work has shown…

Trade-offs in Data Memorization via Strong Data Processing Inequalities

June 19, 2025

Recent research demonstrated that training large language models involves memorization of a significant fraction of training data. Such memorization can…

Variational Rectified Flow Matching

June 19, 2025

We study Variational Rectified Flow Matching, a framework that enhances classic rectified flow matching by modeling multi-modal velocity vector-fields. At…

INRFlow: Flow Matching for INRs in Ambient Space

June 19, 2025

Flow matching models have emerged as a powerful method for generative modeling on domains like images or videos, and even…

HtFLlib: A Unified Benchmarking Library for Evaluating Heterogeneous Federated Learning Methods Across Modalities

June 19, 2025

AI institutions develop heterogeneous models for specific tasks but face data scarcity challenges during training. Traditional Federated Learning (FL) supports…

ReVisual-R1: An Open-Source 7B Multimodal Large Language Model (MLLM) that Achieves Long, Accurate and Thoughtful Reasoning

June 19, 2025

The Challenge of Multimodal Reasoning: Recent breakthroughs in text-based language models, such as DeepSeek-R1, have demonstrated that RL can aid…

OpenAI Releases an Open‑Sourced Version of a Customer Service Agent Demo with the Agents SDK

June 19, 2025

OpenAI has open-sourced a new multi-agent customer service demo on GitHub, showcasing how to build domain-specialized AI agents using its…

Accelerate threat modeling with generative AI

June 18, 2025

In this post, we explore how generative AI can revolutionize threat modeling practices by automating vulnerability identification, generating comprehensive attack…

Building a custom text-to-SQL agent using Amazon Bedrock and Converse API

June 18, 2025

Developing robust text-to-SQL capabilities is a critical challenge in the field of natural language processing (NLP) and database management. The…