Machine Learning

How to Verify Any (Reasonable) Distribution Property: Computationally Sound Argument Systems for Distributions

April 24, 2025

As statistical analyses become more central to science, industry and society, there is a growing need to ensure correctness of…

Machine Learning

Sequential-NIAH: A Benchmark for Evaluating LLMs in Extracting Sequential Information from Long Texts

April 24, 2025

Evaluating how well LLMs handle long contexts is essential, especially for retrieving specific, relevant information embedded in lengthy inputs. Many…

A Coding Guide to Asynchronous Web Data Extraction Using Crawl4AI: An Open-Source Web Crawling and Scraping Toolkit Designed for LLM Workflows

April 24, 2025

In this tutorial, we demonstrate how to harness Crawl4AI, a modern, Python-based web crawling toolkit, to extract structured data from…

A New Citibank Report/Guide Shares How Agentic AI Will Reshape Finance with Autonomous Analysis and Intelligent Automation

April 24, 2025

In its latest “Agentic AI Finance & the ‘Do It For Me’ Economy” report, Citibank explores a significant paradigm shift…

Machine Learning

Protect sensitive data in RAG applications with Amazon Bedrock

April 23, 2025

Retrieval Augmented Generation (RAG) applications have become increasingly popular due to their ability to enhance generative AI tasks with contextually…

Machine Learning

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

April 23, 2025

Archival data in research institutions and national laboratories represents a vast repository of historical knowledge, yet much of it remains…

Machine Learning

NVIDIA AI Releases Describe Anything 3B: A Multimodal LLM for Fine-Grained Image and Video Captioning

April 23, 2025

Challenges in Localized Captioning for Vision-Language Models: Describing specific regions within images or videos remains a persistent challenge in vision-language…

Meet Xata Agent: An Open Source Agent for Proactive PostgreSQL Monitoring, Automated Troubleshooting, and Seamless DevOps Integration

April 23, 2025

Xata Agent is an open-source AI assistant built to serve as a site reliability engineer for PostgreSQL databases. It constantly…

Machine Learning

AWS Introduces SWE-PolyBench: A New Open-Source Multilingual Benchmark for Evaluating AI Coding Agents

April 23, 2025

Recent advancements in large language models (LLMs) have enabled the development of AI-based coding agents that can generate, modify, and…

Machine Learning

Carnegie Mellon University at ICLR 2025

April 23, 2025

CMU researchers are presenting 143 papers at the Thirteenth International Conference on Learning Representations (ICLR 2025), held from April 24…

Open-Source TTS Reaches New Heights: Nari Labs Releases Dia, a 1.6B Parameter Model for Real-Time Voice Cloning and Expressive Speech Synthesis on Consumer Device

April 23, 2025

The development of text-to-speech (TTS) systems has seen significant advancements in recent years, particularly with the rise of large-scale neural…

Machine Learning

LLMs Can Now Learn without Labels: Researchers from Tsinghua University and Shanghai AI Lab Introduce Test-Time Reinforcement Learning (TTRL) to Enable Self-Evolving Language Models Using Unlabeled Data

April 23, 2025

Despite significant advances in reasoning capabilities through reinforcement learning (RL), most large language models (LLMs) remain fundamentally dependent on supervised…

Machine Learning

Muon Optimizer Significantly Accelerates Grokking in Transformers: Microsoft Researchers Explore Optimizer Influence on Delayed Generalization

April 23, 2025

Revisiting the Grokking Challenge: In recent years, the phenomenon of grokking, where deep learning models exhibit a delayed yet sudden transition…

Machine Learning

Atla AI Introduces the Atla MCP Server: A Local Interface of Purpose-Built LLM Judges via Model Context Protocol (MCP)

April 22, 2025

Reliable evaluation of large language model (LLM) outputs is a critical yet often complex aspect of AI system development. Integrating…

Machine Learning

How Infosys improved accessibility for Event Knowledge using Amazon Nova Pro, Amazon Bedrock and Amazon Elemental Media Services

April 22, 2025

This post is co-written with Saibal Samaddar, Tanushree Halder, and Lokesh Joshi from Infosys Consulting. Critical insights and expertise are…

Machine Learning

Use Amazon Bedrock Intelligent Prompt Routing for cost and latency benefits

April 22, 2025

In December, we announced the preview availability for Amazon Bedrock Intelligent Prompt Routing, which provides a single serverless endpoint to efficiently…