Machine Learning

NuMind AI Releases NuMarkdown-8B-Thinking: A Reasoning Breakthrough in OCR and Document-to-Markdown Conversion

August 11, 2025

NuMind AI has officially released NuMarkdown-8B-Thinking, an open-source (MIT License) reasoning OCR Vision-Language Model (VLM) that redefines how complex documents…

Building a Secure and Memory-Enabled Cipher Workflow for AI Agents with Dynamic LLM Selection and API Integration

August 11, 2025

In this tutorial, we walk through building a compact but fully functional Cipher-based workflow. We start by securely capturing our…

Optimal Corpus Aware Training for Neural Machine Translation

August 11, 2025

Corpus Aware Training (CAT) leverages valuable corpus metadata during training by injecting corpus information into each training example, and has…

AI Agent Trends of 2025: A Transformative Landscape

August 10, 2025

The year 2025 marks a defining moment in the evolution of artificial intelligence, ushering in an era where agentic systems—autonomous…

From 100,000 to Under 500 Labels: How Google AI Cuts LLM Training Data by Orders of Magnitude

August 10, 2025

Google Research has unveiled a groundbreaking method for fine-tuning large language models (LLMs) that slashes the amount of required training…

Using RouteLLM to Optimize LLM Usage

August 10, 2025

RouteLLM is a flexible framework for serving and evaluating LLM routers, designed to maximize performance while minimizing cost. Key features:…

AI-Driven Antitrust and Competition Law: Algorithmic Collusion, Self-Learning Pricing Tools, and Legal Challenges in the US and EU

August 10, 2025

AI in Market Economics and Pricing Algorithms: AI-driven pricing models, particularly those utilizing reinforcement learning (RL), can lead to outcomes…

VL-Cogito: Advancing Multimodal Reasoning with Progressive Curriculum Reinforcement Learning

August 9, 2025

Multimodal reasoning, where models integrate and interpret information from multiple sources such as text, images, and diagrams, is a frontier…

Alibaba Qwen Unveils Qwen3-4B-Instruct-2507 and Qwen3-4B-Thinking-2507: Refreshing the Importance of Small Language Models

August 9, 2025

Smaller Models with Smarter Performance and 256K Context Support: Alibaba’s Qwen team has introduced two powerful additions to its small…

Technical Deep Dive: Automating LLM Agent Mastery for Any MCP Server with MCP-RL and ART

August 9, 2025

Table of contents Introduction What Is MCP-RL? ART: The Agent Reinforcement Trainer Code Walkthrough: Specializing LLMs with MCP-RL…

FAQs: Everything You Need to Know About AI Agents in 2025

August 9, 2025

Table of contents TL;DR 1) What is an AI agent (2025 definition)? 2) What can agents do reliably today? 3)…

Mixture-of-Agents (MoA): A Breakthrough in LLM Performance

August 9, 2025

The Mixture-of-Agents (MoA) architecture is a transformative approach for enhancing large language model (LLM) performance, especially on complex, open-ended tasks…

Graph-R1: An Agentic GraphRAG Framework for Structured, Multi-Turn Reasoning with Reinforcement Learning

August 9, 2025

Large Language Models (LLMs) have set new benchmarks in natural language processing, but their tendency for hallucination—generating inaccurate outputs—remains…

Building an Advanced PaperQA2 Research Agent with Google Gemini for Scientific Literature Analysis

August 9, 2025

In this tutorial, we walk through building an advanced PaperQA2 AI Agent powered by Google’s Gemini model, designed specifically for…

9 Agentic AI Workflow Patterns Transforming AI Agents in 2025

August 9, 2025

Table of contents Why Classic AI Agent Workflows Fail The 9 Agentic Workflow Patterns for 2025 Sequential Intelligence Parallel Processing…

Your LLM Knows the Future: Uncovering Its Multi-Token Prediction Potential

August 8, 2025

Autoregressive language models are constrained by their inherently sequential nature, generating one token at a time. This paradigm limits inference…

Proxy Servers Explained: Types, Use Cases & Trends in 2025 [Technical Deep Dive]

August 8, 2025

Table of contents Introduction What Is a Proxy Server? Technical Architecture: Key Functions (2025): Types…

Meta CLIP 2: The First Contrastive Language-Image Pre-training (CLIP) Trained with Worldwide Image-Text Pairs from Scratch

August 8, 2025

Contrastive Language-Image Pre-training (CLIP) has become important for modern vision and multimodal models, enabling applications such as zero-shot image classification…

A Code Implementation to Build a Multi-Agent Research System with OpenAI Agents, Function Tools, Handoffs, and Session Memory

August 8, 2025

In this tutorial, we begin by showcasing the power of OpenAI Agents as the driving force behind our multi-agent research…

Cloudflare vs Perplexity: The Battle Over AI Web Scraping Heats Up

August 8, 2025

Reading through Cloudflare’s detailed exposé and the extensive media coverage, the controversy surrounding Perplexity AI’s web scraping practices is deeper…