Following up on our previous blog, AI, Vectors, and the Future of Claims Processing: Why Insurance Needs to Understand The Power of Vector Databases, we'll pick up the conversation right where we left off. We discussed at length how Atlas Vector Search can benefit the claims process in insurance and briefly covered Retrieval-Augmented Generation (RAG) and Large Language Models (LLMs).
One of the biggest challenges for claim adjusters is pulling and aggregating information from disparate systems and diverse data formats. PDFs of policy guidelines might be stored in a content-sharing platform, customer information locked in a legacy CRM, and claim-related pictures and voice reports in yet another tool. All of this data is not just fragmented across siloed sources and hard to find but also in formats that have been historically nearly impossible to index with traditional methods. Over the years, insurance companies have accumulated terabytes of unstructured data in their data stores but have failed to capitalize on the possibility of accessing and leveraging it to uncover business insights, deliver better customer experiences, and streamline operations. Some of our customers even admit they're not fully aware of all the data in their archives. There's a tremendous opportunity to leverage this unstructured data to benefit the insurer and its customers.
Our image search post covered part of the solution to these challenges, opening the door to working more easily with unstructured data. RAG takes it a step further by integrating Atlas Vector Search and LLMs, allowing insurers to go beyond the limitations of baseline foundation models and make them context-aware by feeding them proprietary data. Figure 1 shows how the interaction works in practice: through a chat prompt, we can ask the system questions, and the LLM returns an answer along with the references it used to retrieve the information contained in the response. Great! We've got a nice UI, but how can we build a RAG application? Let's pop the hood and see what's inside!
Figure 1: UI of the claim adjuster RAG-powered chatbot
Architecture and flow
Before we start building our application, we need to ensure that our data is easily accessible and in one secure place. Operational Data Layers (ODLs) are the recommended pattern for wrangling data to create single views. This post walks the reader through the process of modernizing insurance data models with Relational Migrator, helping insurers migrate off legacy systems to create ODLs.
Once the data is organized in our MongoDB collections and ready to be consumed, we can start architecting our solution. Building upon the schema developed in the image search post, we augment our documents by adding a few fields that will allow adjusters to ask more complex questions about the data and solve harder business challenges, such as resolving a claim in a fraction of the time with increased accuracy. Figure 2 shows the resulting document with two highlighted fields, "claimDescription" and its vector representation, "claimDescriptionEmbedding". We can now create a Vector Search index on this array, a key step to facilitate retrieving the information fed to the LLM.
Figure 2: Document schema of the claim collection; the highlighted fields are used to retrieve the data that will be passed as context to the LLM
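The post doesn't show the index definition itself, so here is a minimal sketch of creating it with PyMongo. The connection URI, the "insurance.claims" namespace, the index name, and the 1536-dimension cosine configuration are assumptions for illustration (the dimension count must match whichever embedding model you use):

```python
# Hedged sketch: create an Atlas Vector Search index on the embedding field.
# URI, namespace, index name, and dimensions are assumptions, not from the post.
from pymongo import MongoClient
from pymongo.operations import SearchIndexModel

claims = MongoClient("<MONGODB_URI>")["insurance"]["claims"]

index_model = SearchIndexModel(
    name="claim_description_index",
    type="vectorSearch",
    definition={
        "fields": [
            {
                "type": "vector",
                "path": "claimDescriptionEmbedding",
                "numDimensions": 1536,  # must match the embedding model's output size
                "similarity": "cosine",
            }
        ]
    },
)
claims.create_search_index(model=index_model)
```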
With our data prepared, building the RAG interaction is straightforward; refer to this GitHub repository for the implementation details. Here, we'll just discuss the high-level architecture and the data flow, as shown in Figure 3 below (a code sketch of these steps follows the figure):
1. The user enters the prompt, a question in natural language.
2. The prompt is vectorized and sent to Atlas Vector Search; similar documents are retrieved.
3. The prompt and the retrieved documents are passed to the LLM as context.
4. The LLM produces an answer for the user (in natural language), considering both the context and the prompt.
Figure 3: RAG architecture and interaction flow
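Putting the four steps together, a minimal sketch of the flow might look like the following. The OpenAI models, the "claim_description_index" index, and the "insurance.claims" namespace are assumptions carried over from the index sketch above; see the GitHub repository for the actual implementation:

```python
# Hedged sketch of the four-step RAG flow, assuming the OpenAI Python SDK
# and the vector index created earlier; not the repository's exact code.
import os
from openai import OpenAI
from pymongo import MongoClient

ai = OpenAI(api_key=os.environ["OPENAI_API_KEY"])
claims = MongoClient(os.environ["MONGODB_URI"])["insurance"]["claims"]

def answer(prompt: str) -> str:
    # Steps 1-2: vectorize the prompt and retrieve similar claim documents.
    query_vector = ai.embeddings.create(
        model="text-embedding-ada-002", input=prompt
    ).data[0].embedding
    docs = list(claims.aggregate([
        {"$vectorSearch": {
            "index": "claim_description_index",
            "path": "claimDescriptionEmbedding",
            "queryVector": query_vector,
            "numCandidates": 100,
            "limit": 5,
        }},
        {"$project": {"_id": 0, "claimDescription": 1}},
    ]))
    # Step 3: pass the prompt and the retrieved documents to the LLM as context.
    context = "\n".join(d["claimDescription"] for d in docs)
    completion = ai.chat.completions.create(
        model="gpt-4",
        messages=[
            {"role": "system",
             "content": f"Answer using only these claim descriptions:\n{context}"},
            {"role": "user", "content": prompt},
        ],
    )
    # Step 4: return the context-aware answer in natural language.
    return completion.choices[0].message.content
```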
It is important to note how the semantics of the question are preserved throughout the different steps. The reference to "adverse weather"-related accidents in the prompt is captured and passed to Atlas Vector Search, which surfaces claim documents whose claim descriptions relate to similar concepts (e.g., rain) without needing to mention them explicitly. Finally, the LLM consumes the relevant documents to produce a context-aware answer referencing rain, hail, and fire, as we'd expect based on the user's initial question.
So what?
To sum it all up, what's the benefit of combining Atlas Vector Search and LLMs in a claim processing RAG application?
Speed and accuracy: With the data centrally organized and ready to be consumed by LLMs, adjusters can find all the necessary information in a fraction of the time.
Flexibility: LLMs can answer a wide spectrum of questions, meaning applications require less upfront system design. There is no need to build custom APIs for each piece of information you're trying to retrieve; just ask the LLM to do it for you.
Natural interaction: Applications can be queried in plain English, with no programming skills or system training required.
Data accessibility: Insurers can finally leverage and explore unstructured data that was previously hard to access.
Not just claim processing
The same data model and architecture can serve additional personas and use cases within the organization:
Customer service: Operators can quickly pull customer data and answer complex questions without navigating different systems. For example: "Summarize this customer's past interactions," "What coverages does this customer have?" or "What coverages can I recommend to this customer?"
Customer self-service: Simplify your members' experience by enabling them to ask questions themselves. For example: "My apartment is flooded. Am I covered?" or "How long do windshield repairs take on average?"
Underwriting: Underwriters can quickly aggregate and summarize information, providing quotes in a fraction of the time. For example: "Summarize this customer's claim history," or "I am renewing a customer's policy. What are their current coverages? Pull everything related to this customer and policy so I can get baseline info, and find the relevant underwriting guidelines."
If you would like to discover more about Converged AI and Application Data Stores with MongoDB, take a look at the following resources:
RAG for claim processing GitHub repository
From Relational Databases to AI: An Insurance Data Modernization Journey
Modernize your insurance data models with MongoDB and Relational Migrator