Machine Learning

How Lumi streamlines loan approvals with Amazon SageMaker AI

April 4, 2025

This post is co-written with Paul Pagnan from Lumi. Lumi is a leading Australian fintech lender empowering small businesses with…

Machine Learning

Fine-tune large language models with reinforcement learning from human or AI feedback

April 4, 2025

Large language models (LLMs) can be used to perform natural language processing (NLP) tasks ranging from simple dialogues and information…

Machine Learning

Meet Open-Qwen2VL: A Fully Open and Compute-Efficient Multimodal Large Language Model

April 4, 2025

Multimodal Large Language Models (MLLMs) have advanced the integration of visual and textual modalities, enabling progress in tasks such as…

Machine Learning

AI Workforce: using AI and Drones to simplify infrastructure inspections

April 3, 2025

Inspecting wind turbines, power lines, 5G towers, and pipelines is a tough job. It’s often dangerous, time-consuming, and prone to…

Machine Learning

Shaping the future: OMRON’s data-driven journey with AWS

April 3, 2025

This post is co-written with Emrah Kaya and Xinyi Zhou from Omron Europe. Data is one of the most critical…

Machine Learning

How AWS Sales uses generative AI to streamline account planning

April 3, 2025

Every year, AWS Sales personnel draft in-depth, forward looking strategy documents for established AWS customers. These documents help the AWS…

Machine Learning

This AI Paper Unveils a Reverse-Engineered Simulator Model for Modern NVIDIA GPUs: Enhancing Microarchitecture Accuracy and Performance Prediction

April 3, 2025

GPUs are widely recognized for their efficiency in handling high-performance computing workloads, such as those found in artificial intelligence and…

Machine Learning

UB-Mesh: A Cost-Efficient, Scalable Network Architecture for Large-Scale LLM Training

April 3, 2025

As LLMs scale, their computational and bandwidth demands increase significantly, posing challenges for AI training infrastructure. Following scaling laws, LLMs…

Machine Learning

Introduction to MCP: The Ultimate Guide to Model Context Protocol for AI Assistants

April 3, 2025

The Model Context Protocol (MCP) is an open standard (open-sourced by Anthropic) that defines a unified way to connect AI…

Machine Learning

This AI Paper Introduces FASTCURL: A Curriculum Reinforcement Learning Framework with Context Extension for Efficient Training of R1-like Reasoning Models

April 3, 2025

Large language models have transformed how machines comprehend and generate text, especially in complex problem-solving areas like mathematical reasoning. These…

Machine Learning

Researchers from Dataocean AI and Tsinghua University Introduces Dolphin: A Multilingual Automatic Speech Recognition ASR Model Optimized for Eastern Languages and Dialects

April 3, 2025

Automatic speech recognition (ASR) technologies have advanced significantly, yet notable disparities remain in their ability to accurately recognize diverse languages.…

Machine Learning

Advancing Vision-Language Reward Models: Challenges, Benchmarks, and the Role of Process-Supervised Learning

April 3, 2025

Process-supervised reward models (PRMs) offer fine-grained, step-wise feedback on model responses, aiding in selecting effective reasoning paths for complex tasks.…

Machine Learning

Snowflake Proposes ExCoT: A Novel AI Framework that Iteratively Optimizes Open-Source LLMs by Combining CoT Reasoning with off-Policy and on-Policy DPO, Relying Solely on Execution Accuracy as Feedback

April 3, 2025

Text-to-SQL translation, the task of transforming natural language queries into structured SQL statements, is essential for facilitating user-friendly database interactions.…

Mutual Reinforcement of LLM Dialogue Synthesis and Summarization Capabilities for Few-Shot Dialogue Summarization

April 2, 2025

In this work, we propose Mutual Reinforcing Data Synthesis (MRDS) within LLMs to improve few-shot dialogue summarization task. Unlike prior…

Nomic Open Sources State-of-the-Art Multimodal Embedding Model

April 2, 2025

Nomic has announced the release of “Nomic Embed Multimodal,” a groundbreaking embedding model that achieves state-of-the-art performance on visual document…

Machine Learning

Mitigating Hallucinations in Large Vision-Language Models: A Latent Space Steering Approach

April 2, 2025

Hallucination remains a significant challenge in deploying Large Vision-Language Models (LVLMs), as these models often generate text misaligned with visual…