We present Matrix3D, a unified model that performs several photogrammetry subtasks, including pose estimation, depth prediction, and novel view synthesis…
Machine Learning
In the media and entertainment industry, understanding and predicting the effectiveness of marketing campaigns is crucial for success. Marketing campaigns…
AI models today are expected to handle complex tasks such as solving mathematical problems, interpreting logical statements, and assisting with…
Computer science research has evolved into a multidisciplinary effort involving logic, engineering, and data-driven experimentation. With computing systems now deeply…
LLMs have shown advancements in reasoning capabilities through Reinforcement Learning with Verifiable Rewards (RLVR), which relies on outcome-based feedback rather…
As AI agents become more autonomous—capable of writing production code, managing workflows, and interacting with untrusted data sources—their exposure to…
OpenAI has launched Reinforcement Fine-Tuning (RFT) on its o4-mini reasoning model, introducing a powerful new technique for tailoring foundation models…
Multimodal AI rapidly evolves to create systems that can understand, generate, and respond using multiple data types within a single…
LLMs have made significant strides in language-related tasks such as conversational AI, reasoning, and code generation. However, human communication extends…
This post is co-written with Kilian Zimmerer and Daniel Ringler from Deutsche Bahn. Every day, Deutsche Bahn (DB) moves over…
Just ahead of its annual I/O developer conference, Google has released an early preview of Gemini 2.5 Pro (I/O Edition)—a…
In a notable step toward democratizing vision-language model development, Hugging Face has released nanoVLM, a compact and educational PyTorch-based framework…
NVIDIA continues to push the boundaries of open AI development by open-sourcing its Open Code Reasoning (OCR) model suite —…
Large Language Models (LLMs) have gained significant attention in recent years, yet understanding their internal mechanisms remains challenging. When examining…
Recent advancements in LLMs have significantly improved natural language understanding, reasoning, and generation. These models now excel at diverse tasks…
Large reasoning models (LRMs) have shown impressive capabilities in mathematics, coding, and scientific reasoning. However, they face significant limitations when…
This article demonstrates how to build an intelligent routing system powered by Anthropic’s Claude models. This system improves response efficiency…
Google has published the second installment in its Agents Companion series—an in-depth 76-page whitepaper aimed at professionals developing advanced AI…
AgentQL allows you to scrape any website with unstructured data by defining the exact shape of the information you want.…
With Amazon Bedrock Evaluations, you can evaluate foundation models (FMs) and Retrieval Augmented Generation (RAG) systems, whether hosted on Amazon…