DatabricksÂ AnnouncedÂ the Public Preview of Mosaic AIÂ Agent FrameworkÂ andÂ Agent EvaluationÂ

Databricks announced the public preview of the Mosaic AI Agent Framework and Agent Evaluation during the Data + AI Summit 2024. These innovative tools aim to assist developers in building and deploying high-quality Agentic and Retrieval Augmented Generation (RAG) applications on the Databricks Data Intelligence Platform.

Challenges in Building High-Quality Generative AI Applications

Creating a proof of concept for generative AI applications is relatively straightforward. However, delivering a high-quality application that meets the rigorous standards required for customer-facing solutions takes time and effort. Developers often struggle with:

Choosing the right metrics to evaluate application quality.

Efficiently collecting human feedback to measure quality.

Identifying the root causes of quality issues.

Rapidly iterating to improve application quality before deploying to production.

Introducing Mosaic AI Agent Framework and Agent Evaluation

The Mosaic AI Agent Framework and Agent Evaluation address these challenges through several key capabilities:

Human Feedback Integration: Agent Evaluation allows developers to define high-quality responses for their generative AI applications by inviting subject matter experts across their organization to review and provide feedback, even if they are not Databricks users. This process helps in gathering diverse perspectives and insights to refine the application.

Comprehensive Evaluation Metrics: Developed in collaboration with Mosaic Research, Agent Evaluation offers a suite of metrics to measure application quality. These metrics include accuracy, hallucination, harmfulness, and helpfulness. The system automatically logs responses and feedback to an evaluation table, facilitating quick analysis and identifying potential quality issues. AI judges, calibrated using expert feedback, evaluate responses to pinpoint the root causes of problems.

End-to-End Development Workflow: Integrated with MLflow, the Agent Framework allows developers to log and evaluate generative AI applications using standard MLflow APIs. This integration supports seamless transitions from development to production, with continuous feedback loops to enhance application quality.

App Lifecycle Management: The Agent Framework provides a simplified SDK for managing the lifecycle of agentic applications, from permissions management to deployment with Mosaic AI Model Serving. This comprehensive management system ensures that applications remain scalable and maintain high quality throughout their lifecycle.

Image Source

Building a High-Quality RAG Agent

To illustrate the capabilities of the Mosaic AI Agent Framework, Databricks provided an example of building a high-quality RAG application. This example involves creating a simple RAG application that retrieves relevant chunks from a pre-created vector index and summarizes them in response to queries. The process includes connecting to the vector search index, setting the index into a LangChain retriever, and leveraging MLflow to enable traces and deploy the application. This workflow demonstrates the ease with which developers can build, evaluate, and improve generative AI applications using the Mosaic AI tools.

Image Source

Real-World Applications and Testimonials

Several companies have successfully implemented the Mosaic AI Agent Framework to enhance their generative AI solutions. For instance, Corning used the framework to build an AI research assistant that indexes hundreds of thousands of documents, significantly improving retrieval speed, response quality, and accuracy. Lippert leveraged the framework to evaluate the results of their generative AI applications, ensuring data accuracy and control. FordDirect integrated the framework to create a unified chatbot for their dealerships, facilitating better performance assessment and customer engagement.

Pricing and Next Steps

The pricing for Agent Evaluation is based on judge requests, while Mosaic AI Model Serving is priced according to Mosaic AI Model Serving rates. Databricks encourages customers to try the Mosaic AI Agent Framework and Agent Evaluation by accessing various resources such as the Agent Framework documentation, demo notebooks, and the Generative AI Cookbook. These resources provide detailed guidance on building production-quality generative AI applications from proof of concept to deployment.

In conclusion, Databricksâ€™ announcement of the Mosaic AI Agent Framework and Agent Evaluation represents a significant advancement in generative AI. These tools provide developers with the necessary capabilities to efficiently build, evaluate, and deploy high-quality generative AI applications. By addressing common challenges and offering comprehensive support, Databricks empowers developers to create innovative solutions that meet the highest quality and performance standards.

The post DatabricksÂ AnnouncedÂ the Public Preview of Mosaic AIÂ Agent FrameworkÂ andÂ Agent EvaluationÂ appeared first on MarkTechPost.

Source: Read MoreÂ

Sunshine And March Vibes (2025 Wallpapers Edition)

The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

How To Fix Largest Contentful Paint Issues With Subpart Analysis

How To Prevent WordPress SQL Injection Attacks

Microsoft’s allegiance isn’t to OpenAI’s pricey models — Satya Nadella’s focus is selling any AI customers want for maximum profits

If you think you can do better than Xbox or PlayStation in the Console Wars, you may just want to try out this card game

Surviving a 10 year stint in dev hell, this retro-styled hack n’ slash has finally arrived on Xbox

Save $400 on the best Samsung TVs, laptops, tablets, and more when you sign up for Verizon 5G Home or Home Internet

NodeSource N|Solid Runtime Release – May 2025: Performance, Stability & the Final Update for v18

NodeSource N|Solid Runtime Release – May 2025: Performance, Stability & the Final Update for v18

Big Changes at Meteor Software: Our Next Chapter

Apps in Generative AI – Transforming the Digital Experience

Microsoft’s allegiance isn’t to OpenAI’s pricey models — Satya Nadella’s focus is selling any AI customers want for maximum profits

Microsoft’s allegiance isn’t to OpenAI’s pricey models — Satya Nadella’s focus is selling any AI customers want for maximum profits

If you think you can do better than Xbox or PlayStation in the Console Wars, you may just want to try out this card game

Surviving a 10 year stint in dev hell, this retro-styled hack n’ slash has finally arrived on Xbox

DatabricksÂ AnnouncedÂ the Public Preview of Mosaic AIÂ Agent FrameworkÂ andÂ Agent EvaluationÂ

February 2025 Baseline monthly digest

Learn A1 Level Spanish

SteamOS isn’t just for Steam Deck anymore — Here are the pros and cons of it coming to other gaming handhelds like Legion Go S

Google releases reasoning model Gemini 2.5, its “most intelligent AI model” yet

See-Through Parallel Universes with Your Mind’s Eye – The Course Guidebook: Chapter 4

I’ll always recommend my favorite Xbox controller when it’s on sale, and this 50% discount is an absolute no-brainer

Deskflow – keyboard and mouse sharing app

Australian Mining Software Firm Opaxe Faces Unconfirmed Data Breach

This Lenovo laptop handled my various workflows with grace – and it’s surprisingly affordable

Bybit Confirms Record-Breaking $1.46 Billion Crypto Heist in Sophisticated Cold Wallet Attack

DatabricksÂ AnnouncedÂ the Public Preview of Mosaic AIÂ Agent FrameworkÂ andÂ Agent EvaluationÂ

Related Posts