Machine Learning

This AI Paper Introduces CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance

February 11, 2025

Large language models (LLMs) struggle with precise computations, symbolic manipulations, and algorithmic tasks, often requiring structured problem-solving approaches. While language…

Machine Learning

Building an AI Research Agent for Essay Writing

February 11, 2025

In this tutorial, we will build an advanced AI-powered research agent that can write essays on given topics. This agent…

Machine Learning

Are Autoregressive LLMs Really Doomed? A Commentary on Yann LeCun’s Recent Keynote at AI Action Summit

February 11, 2025

Yann LeCun, Chief AI Scientist at Meta and one of the pioneers of modern AI, recently argued that autoregressive Large…

Machine Learning

Vintix: Scaling In-Context Reinforcement Learning for Generalist AI Agents

February 11, 2025

Developing AI systems that learn from their surroundings during execution involves creating models that adapt dynamically based on new information.…

Machine Learning

LLMDet: How Large Language Models Enhance Open-Vocabulary Object Detection

February 11, 2025

Open-vocabulary object detection (OVD) aims to detect arbitrary objects with user-provided text labels. Although recent progress has enhanced zero-shot detection…

Machine Learning

Advancing Scalable Text-to-Speech Synthesis: Llasa’s Transformer-Based Framework for Improved Speech Quality and Emotional Expressiveness

February 11, 2025

Recent advancements in LLMs, such as the GPT series and emerging “o1” models, highlight the benefits of scaling training and…

Machine Learning

This AI Paper Explores Long Chain-of-Thought Reasoning: Enhancing Large Language Models with Reinforcement Learning and Supervised Fine-Tuning

February 11, 2025

Large language models (LLMs) have demonstrated proficiency in solving complex problems across mathematics, scientific research, and software engineering. Chain-of-thought (CoT)…

Machine Learning

Shanghai AI Lab Releases OREAL-7B and OREAL-32B: Advancing Mathematical Reasoning with Outcome Reward-Based Reinforcement Learning

February 11, 2025

Mathematical reasoning remains a difficult area for artificial intelligence (AI) due to the complexity of problem-solving and the need for…

Theory, Analysis, and Best Practices for Sigmoid Self-Attention

February 10, 2025

*Primary Contributors Attention is a key part of the transformer architecture. It is a sequence-to-sequence mapping that transforms each sequence…

Machine Learning

Revolutionizing business processes with Amazon Bedrock and Appian’s generative AI skills

February 10, 2025

This blog post is co-written with Louis Prensky and Philip Kang from Appian. The digital transformation wave has compelled enterprises…

Machine Learning

Automate bulk image editing with Crop.photo and Amazon Rekognition

February 10, 2025

Evolphin Software, Inc. is a leading provider of digital and media asset management solutions based in Silicon Valley, California. Crop.photo…

Machine Learning

Build agentic AI solutions with DeepSeek-R1, CrewAI, and Amazon SageMaker AI

February 10, 2025

AI agents are rapidly becoming the next frontier in enterprise transformation, with 82% of organizations planning adoption within the next…

Machine Learning

Efficient Alignment of Large Language Models Using Token-Level Reward Guidance with GenARM

February 10, 2025

Large language models (LLMs) must align with human preferences like helpfulness and harmlessness, but traditional alignment methods require costly retraining…

Machine Learning

Google DeepMind Introduces AlphaGeometry2: A Significant Upgrade to AlphaGeometry Surpassing the Average Gold Medalist in Solving Olympiad Geometry

February 10, 2025

The International Mathematical Olympiad (IMO) is a globally recognized competition that challenges high school students with complex mathematical problems. Among…

Machine Learning

Transforming credit decisions using generative AI with Rich Data Co and AWS

February 10, 2025

This post is co-written with Gordon Campbell, Charles Guan, and Hendra Suryanto from RDC. The mission of Rich Data Co…

Machine Learning

Zyphra Introduces the Beta Release of Zonos: A Highly Expressive TTS Model with High Fidelity Voice Cloning

February 10, 2025

Text-to-speech (TTS) technology has made significant strides in recent years, but challenges remain in creating natural, expressive, and high-fidelity speech…

Machine Learning

Adaptive Inference Budget Management in Large Language Models through Constrained Policy Optimization

February 10, 2025

Large Language Models (LLMs) have demonstrated remarkable capabilities in complex reasoning tasks, particularly in mathematical problem-solving and coding applications. Research…

Machine Learning

Tutorial to Fine-Tuning Mistral 7B with QLoRA Using Axolotl for Efficient LLM Training

February 10, 2025

In this tutorial, we demonstrate the workflow for fine-tuning Mistral 7B using QLoRA with Axolotl, showing how to manage limited…

Machine Learning

Microsoft AI Researchers Release LLaVA-Rad: A Lightweight Open-Source Foundation Model for Advanced Clinical Radiology Report Generation

February 9, 2025

Large foundation models have demonstrated remarkable potential in biomedical applications, offering promising results on various benchmarks and enabling rapid adaptation…

Machine Learning

BARE: A Synthetic Data Generation AI Method that Combines the Diversity of Base Models with the Quality of Instruct-Tuned Models

February 9, 2025

As the need for high-quality training data grows, synthetic data generation has become essential for improving LLM performance. Instruction-tuned models…