Machine Learning

Step by Step Coding Guide to Build a Neural Collaborative Filtering (NCF) Recommendation System with PyTorch

April 12, 2025

This tutorial will walk you through using PyTorch to implement a Neural Collaborative Filtering (NCF) recommendation system. NCF extends traditional…

Task-Adaptive Pretrained Language Models via Clustered-Importance Sampling

April 11, 2025

Specialist language models (LMs) focus on a specific task or domain on which they often outperform generalist LMs of the…

The AdEMAMix Optimizer: Better, Faster, Older

April 11, 2025

Momentum based optimizers are central to a wide range of machine learning applications. These typically rely on an Exponential Moving…

Machine Learning

Building an AIOps chatbot with Amazon Q Business custom plugins

April 11, 2025

Many organizations rely on multiple third-party applications and services for different aspects of their operations, such as scheduling, HR management,…

Machine Learning

This AI Paper from Salesforce Introduces VLM2VEC and MMEB: A Contrastive Framework and Benchmark for Universal Multimodal Embeddings

April 11, 2025

Multimodal embeddings combine visual and textual data into a single representational space, enabling systems to understand and relate images and…

Machine Learning

Can LLMs Debug Like Humans? Microsoft Introduces Debug-Gym for AI Coding Agents

April 11, 2025

The Debugging Problem in AI Coding Tools Despite significant progress in code generation and completion, AI coding tools continue to…

Machine Learning

Allen Institute for AI (Ai2) Launches OLMoTrace: Real-Time Tracing of LLM Outputs Back to Training Data

April 11, 2025

Understanding the Limits of Language Model Transparency As large language models (LLMs) become central to a growing number of applications—ranging…

Simple ReFlow: Improved Techniques for Fast Flow Models

April 11, 2025

Diffusion and flow-matching models achieve remarkable generative performance but at the cost of many sampling steps, this slows inference and…

LLMs No Longer Require Powerful Servers: Researchers from MIT, KAUST, ISTA, and Yandex Introduce a New AI Approach to Rapidly Compress Large Language Models without a Significant Loss of Quality

April 11, 2025

HIGGS — the innovative method for compressing large language models was developed in collaboration with teams at Yandex Research, MIT,…

Machine Learning

Racing beyond DeepRacer: Debut of the AWS LLM League

April 11, 2025

The AWS DeepRacer League is the world’s first autonomous racing league, open to anyone. Announced at re:Invent 2018, it puts…

Machine Learning

How TransPerfect Improved Translation Quality and Efficiency Using Amazon Bedrock

April 11, 2025

This post is co-written with Keith Brazil, Julien Didier, and Bryan Rand from TransPerfect. TransPerfect, a global leader in language…

Machine Learning

Together AI Released DeepCoder-14B-Preview: A Fully Open-Source Code Reasoning Model That Rivals o3-Mini With Just 14B Parameters

April 11, 2025

The demand for intelligent code generation and automated programming solutions has intensified, fueled by a rapid rise in software complexity…

Machine Learning

Complete Guide: Working with CSV/Excel Files and EDA in Python

April 11, 2025

This hands-on tutorial will walk you through the entire process of working with CSV/Excel files and conducting exploratory data analysis…

Machine Learning

RoR-Bench: Revealing Recitation Over Reasoning in Large Language Models Through Subtle Context Shifts

April 11, 2025

In recent years, the rapid progress of LLMs has given the impression that we are nearing the achievement of Artificial…

Machine Learning

Balancing Accuracy and Efficiency in Language Models: A Two-Phase RL Post-Training Approach for Concise Reasoning

April 11, 2025

Recent advancements in LLMs have significantly enhanced their reasoning capabilities, particularly through RL-based fine-tuning. Initially trained with supervised learning for…

Machine Learning

Nvidia Released Llama-3.1-Nemotron-Ultra-253B-v1: A State-of-the-Art AI Model Balancing Massive Scale, Reasoning Power, and Efficient Deployment for Enterprise Innovation

April 11, 2025

As AI adoption increases in digital infrastructure, enterprises and developers face mounting pressure to balance computational costs with performance, scalability,…

MircoNN: An On-device Disk Resident Updatable Vector Database

April 10, 2025

Nearest neighbour search over dense vector collections has important applications in information retrieval, retrieval augmented generation (RAG), and content ranking.…

Machine Learning

Google AI Introduces Ironwood: A Google TPU Purpose-Built for the Age of Inference

April 10, 2025

At the 2025 Google Cloud Next event, Google introduced Ironwood, its latest generation of Tensor Processing Units (TPUs), designed specifically…

Machine Learning

Reduce ML training costs with Amazon SageMaker HyperPod

April 10, 2025

Training a frontier model is highly compute-intensive, requiring a distributed system of hundreds, or thousands, of accelerated instances running for…

Machine Learning

OpenAI Open Sources BrowseComp: A New Benchmark for Measuring the Ability for AI Agents to Browse the Web

April 10, 2025

Despite advances in large language models (LLMs), AI agents still face notable limitations when navigating the open web to retrieve…