    EraRAG: A Scalable, Multi-Layered Graph-Based Retrieval System for Dynamic and Growing Corpora

    July 25, 2025

    Large Language Models (LLMs) have revolutionized many areas of natural language processing, but they still face critical limitations when dealing with up-to-date facts, domain-specific information, or complex multi-hop reasoning. Retrieval-Augmented Generation (RAG) approaches aim to address these gaps by allowing language models to retrieve and integrate information from external sources. However, most existing graph-based RAG systems are optimized for static corpora and struggle with efficiency, accuracy, and scalability when the data is continually growing—such as in news feeds, research repositories, or user-generated online content.

    Introducing EraRAG: Efficient Updates for Evolving Data

    Recognizing these challenges, researchers from Huawei, The Hong Kong University of Science and Technology, and WeBank have developed EraRAG, a novel retrieval-augmented generation framework purpose-built for dynamic, ever-expanding corpora. Rather than rebuilding the entire retrieval structure whenever new data arrives, EraRAG relies on localized, selective updates that touch only those parts of the retrieval graph affected by the changes.

    Core Features:

    • Hyperplane-Based Locality-Sensitive Hashing (LSH):
      The corpus is split into small text chunks, each embedded as a vector. EraRAG then projects these vectors onto randomly sampled hyperplanes and records which side of each hyperplane a vector falls on, producing a binary hash code. Chunks with similar embeddings tend to receive the same code and land in the same “bucket,” so grouping is both semantically coherent and cheap to compute.
    • Hierarchical, Multi-Layered Graph Construction:
      The core retrieval structure in EraRAG is a multi-layered graph. At each layer, segments (or buckets) of similar text are summarized using a language model. Segments that are too large are split, while those too small are merged—ensuring both semantic consistency and balanced granularity. Summarized representations at higher layers enable efficient retrieval for both fine-grained and abstract queries.
    • Incremental, Localized Updates:
      When new data arrives, its embedding is hashed using the original hyperplanes—ensuring consistency with the initial graph construction. Only the buckets/segments directly impacted by new entries are updated, merged, split, or re-summarized, while the rest of the graph remains untouched. The update propagates up the graph hierarchy, but always remains localized to the affected region, saving significant computation and token costs.
    • Reproducibility and Determinism:
      Unlike standard LSH clustering, EraRAG preserves the set of hyperplanes used during initial hashing. This makes bucket assignment deterministic and reproducible, which is crucial for consistent, efficient updates over time.
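The hashing and localized-update ideas above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not EraRAG's actual code: the class name `HyperplaneLSH`, its methods, the seed, and the toy 4-dimensional vectors are all hypothetical, and the summarization, split, and merge steps are left out. It only shows why keeping the hyperplanes fixed makes bucket assignment repeatable, and how an insert can report which buckets it touched.

```python
import random
from collections import defaultdict


class HyperplaneLSH:
    """Toy hyperplane LSH index whose hyperplanes are sampled once and kept."""

    def __init__(self, dim, n_planes, seed=0):
        rng = random.Random(seed)
        # Sampled once and stored, so later inserts hash exactly as the
        # initial build did (hypothetical stand-in for persisted hyperplanes).
        self.planes = [
            [rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_planes)
        ]
        self.buckets = defaultdict(list)  # hash code -> list of chunk ids

    def hash_code(self, vec):
        # One bit per hyperplane: which side of the plane the vector lies on.
        bits = []
        for plane in self.planes:
            dot = sum(p * v for p, v in zip(plane, vec))
            bits.append("1" if dot >= 0 else "0")
        return "".join(bits)

    def insert(self, chunk_id, vec):
        code = self.hash_code(vec)
        self.buckets[code].append(chunk_id)
        return code

    def insert_batch(self, items):
        """Insert (chunk_id, vector) pairs and return the set of touched buckets."""
        return {self.insert(chunk_id, vec) for chunk_id, vec in items}


# Initial build with two toy chunks.
index = HyperplaneLSH(dim=4, n_planes=3, seed=42)
v = [1.0, 0.2, 0.0, -0.5]
index.insert_batch([("a", v), ("b", [-0.9, 0.1, 0.3, 0.4])])

# Later update: a new chunk with the same embedding as "a".
dirty = index.insert_batch([("c", v)])
code = index.hash_code(v)
print(sorted(index.buckets[code]), len(dirty))
```

In this sketch only the buckets listed in `dirty` would need to be re-summarized after the update; every other bucket is left as it was. Rebuilding the index with the same seed reproduces the same hash codes, which is the property the stored hyperplanes are meant to provide.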

    Performance and Impact

    Comprehensive experiments on a variety of question answering benchmarks demonstrate that EraRAG:

    • Reduces Update Costs: Achieves up to 95% reduction in graph reconstruction time and token usage compared to leading graph-based RAG methods (e.g., GraphRAG, RAPTOR, HippoRAG).
    • Maintains High Accuracy: EraRAG is reported to outperform the other retrieval architectures in both accuracy and recall across static, growing, and abstract question answering tasks, so the cheaper incremental updates come with little or no loss in retrieval quality or multi-hop reasoning.
    • Supports Versatile Query Needs: The multi-layered graph design allows EraRAG to efficiently retrieve fine-grained factual details or high-level semantic summaries, tailoring its retrieval pattern to the nature of each query.
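To make the last point concrete, here is a small Python sketch of retrieval over a layered graph. The article does not spell out EraRAG's retrieval procedure, so this assumes one simple option: score nodes from every layer together by cosine similarity and return the best matches. The `Node` type, the `retrieve` function, and the hand-made 3-dimensional vectors are hypothetical and exist only for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class Node:
    text: str
    layer: int   # 0 = raw chunk, higher = summary of lower-layer segments
    vec: list    # toy embedding, hand-made for this example


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query_vec, nodes, k=1):
    """Rank nodes from all layers together and return the top k."""
    ranked = sorted(nodes, key=lambda n: cosine(query_vec, n.vec), reverse=True)
    return ranked[:k]


nodes = [
    Node("chunk: Q2 revenue figure", 0, [1.0, 0.0, 0.1]),
    Node("chunk: new CFO hired", 0, [0.0, 1.0, 0.1]),
    Node("summary: finance and staffing overview", 1, [0.6, 0.6, 0.5]),
]

specific = retrieve([1.0, 0.0, 0.0], nodes)[0]   # narrow, factual query
abstract = retrieve([0.5, 0.5, 0.5], nodes)[0]   # broad, thematic query
print(specific.layer, abstract.layer)
```

With these toy vectors the narrow query lands on a layer-0 chunk and the broad query lands on the layer-1 summary, which is the kind of behavior the multi-layered design is meant to allow.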

    Practical Implications

    EraRAG offers a scalable and robust retrieval framework ideal for real-world settings where data is continuously added—such as live news, scholarly archives, or user-driven platforms. It strikes a balance between retrieval efficiency and adaptability, making LLM-backed applications more factual, responsive, and trustworthy in fast-changing environments.


    Check out the Paper and GitHub. All credit for this research goes to the researchers of this project.

    The post EraRAG: A Scalable, Multi-Layered Graph-Based Retrieval System for Dynamic and Growing Corpora appeared first on MarkTechPost.
