Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      June 3, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      June 3, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      June 3, 2025

      How To Prevent WordPress SQL Injection Attacks

      June 3, 2025

      All the WWE 2K25 locker codes that are currently active

      June 3, 2025

      PSA: You don’t need to spend $400+ to upgrade your Xbox Series X|S storage

      June 3, 2025

      UK civil servants saved 24 minutes per day using Microsoft Copilot, saving two weeks each per year according to a new report

      June 3, 2025

      These solid-state fans will revolutionize cooling in our PCs and laptops

      June 3, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      Community News: Latest PECL Releases (06.03.2025)

      June 3, 2025
      Recent

      Community News: Latest PECL Releases (06.03.2025)

      June 3, 2025

      A Comprehensive Guide to Azure Firewall

      June 3, 2025

      Test Job Failures Precisely with Laravel’s assertFailedWith Method

      June 3, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      All the WWE 2K25 locker codes that are currently active

      June 3, 2025
      Recent

      All the WWE 2K25 locker codes that are currently active

      June 3, 2025

      PSA: You don’t need to spend $400+ to upgrade your Xbox Series X|S storage

      June 3, 2025

      UK civil servants saved 24 minutes per day using Microsoft Copilot, saving two weeks each per year according to a new report

      June 3, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»Machine Learning»Zep AI Introduces a Smarter Memory Layer for AI Agents Outperforming the MemGPT in the Deep Memory Retrieval (DMR) Benchmark

    Zep AI Introduces a Smarter Memory Layer for AI Agents Outperforming the MemGPT in the Deep Memory Retrieval (DMR) Benchmark

    February 4, 2025

    The development of transformer-based large language models (LLMs) has significantly advanced AI-driven applications, particularly conversational agents. However, these models face inherent limitations due to their fixed context windows, which can lead to loss of relevant information over time. While Retrieval-Augmented Generation (RAG) methods provide external knowledge to supplement LLMs, they often rely on static document retrieval, which lacks the flexibility required for adaptive and evolving conversations.

    MemGPT was introduced as an AI memory solution that extends beyond traditional RAG approaches, yet it still struggles with maintaining coherence across long-term interactions. In enterprise applications, where AI systems must integrate information from ongoing conversations and structured data sources, a more effective memory framework is needed—one that can retain and reason over time.

    Introducing Zep: A Memory Layer for AI Agents

    Zep AI Research presents Zep, a memory layer designed to address these challenges by leveraging Graphiti, a temporally-aware knowledge graph engine. Unlike static retrieval methods, Zep continuously updates and synthesizes both unstructured conversational data and structured business information.

    In benchmarking tests, Zep has demonstrated strong performance in the Deep Memory Retrieval (DMR) benchmark, achieving 94.8% accuracy, slightly surpassing MemGPT’s 93.4%. Additionally, it has proven effective in LongMemEval, a benchmark designed to assess AI memory in complex enterprise settings, showing accuracy improvements of up to 18.5% while reducing response latency by 90%.

    Technical Design and Benefits

    1. A Knowledge Graph Approach to Memory

    Unlike traditional RAG methods, Zep’s Graphiti engine structures memory as a hierarchical knowledge graph with three key components:

    • Episode Subgraph: Captures raw conversational data, ensuring a complete historical record.
    • Semantic Entity Subgraph: Identifies and organizes entities to enhance knowledge representation.
    • Community Subgraph: Groups entities into clusters, providing a broader contextual framework.
    2. Handling Time-Based Information

    Zep employs a bi-temporal model to track knowledge with two distinct timelines:

    • Event Timeline (T): Orders events chronologically.
    • System Timeline (T’): Maintains a record of how data has been stored and updated. This approach helps AI systems retain a meaningful understanding of past interactions while integrating new information effectively.
    3. A Multi-Faceted Retrieval Mechanism

    Zep retrieves relevant information using a combination of:

    • Cosine Similarity Search (for semantic matching)
    • Okapi BM25 Full-Text Search (for keyword relevance)
    • Graph-Based Breadth-First Search (for contextual associations) These techniques allow AI agents to retrieve the most relevant information efficiently.
    4. Efficiency and Scalability

    By structuring memory in a knowledge graph, Zep reduces redundant data retrieval, leading to lower token usage and faster responses. This makes it well-suited for enterprise applications where cost and latency are critical factors.

    Performance Evaluation

    Zep’s capabilities have been validated through comprehensive testing in two key benchmarks:

    Hostinger
    1. Deep Memory Retrieval (DMR) Benchmark

    DMR measures how well AI memory systems retain and retrieve past information. Zep achieved:

    • 94.8% accuracy with GPT-4 Turbo, compared to 93.4% for MemGPT.
    • 98.2% accuracy with GPT-4o Mini, demonstrating strong memory retention.
    2. LongMemEval Benchmark

    LongMemEval assesses AI agents in real-world business scenarios, where conversations can span over 115,000 tokens. Zep demonstrated:

    • 15.2% and 18.5% accuracy improvements with GPT-4o Mini and GPT-4o, respectively.
    • Significant latency reduction, making responses 90% faster than traditional full-context retrieval methods.
    • Lower token usage, requiring only 1.6k tokens per response compared to 115k tokens in full-context approaches.
    3. Performance Across Different Question Types

    Zep showed strong performance in complex reasoning tasks:

    • Preference-Based Questions: 184% improvement over full-context retrieval.
    • Multi-Session Queries: 30.7% improvement.
    • Temporal Reasoning: 38.4% improvement, highlighting Zep’s ability to track and infer time-sensitive information.

    Conclusion

    Zep provides a structured and efficient way for AI systems to retain and retrieve knowledge over extended periods. By moving beyond static retrieval methods and incorporating a dynamically evolving knowledge graph, it enables AI agents to maintain coherence across sessions and reason over past interactions.

    With 94.8% DMR accuracy and proven effectiveness in enterprise-level applications, Zep represents an advancement in AI memory solutions. By optimizing data retrieval, reducing token costs, and improving response speed, it offers a practical and scalable approach to enhancing AI-driven applications.


    Check out the Paper. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. Don’t Forget to join our 75k+ ML SubReddit.

    🚨 Marktechpost is inviting AI Companies/Startups/Groups to partner for its upcoming AI Magazines on ‘Open Source AI in Production’ and ‘Agentic AI’.

    The post Zep AI Introduces a Smarter Memory Layer for AI Agents Outperforming the MemGPT in the Deep Memory Retrieval (DMR) Benchmark appeared first on MarkTechPost.

    Source: Read More 

    Facebook Twitter Reddit Email Copy Link
    Previous ArticleNYU Researchers Introduce WILDCHAT-50M: A Large-Scale Synthetic Dataset for Efficient LLM Post-Training
    Next Article Kingdom Come: Deliverance 2 is 2025’s GOTY frontrunner, and it can be yours for less with these deals

    Related Posts

    Machine Learning

    How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark

    June 3, 2025
    Machine Learning

    This AI Paper Introduces LLaDA-V: A Purely Diffusion-Based Multimodal Large Language Model for Visual Instruction Tuning and Multimodal Reasoning

    June 3, 2025
    Leave A Reply Cancel Reply

    Continue Reading

    IBM to set up ‘full stack’ AI facility at university

    Development

    Using Tabulator in Lightning Web Components

    Development

    CVE-2025-4382 – GRUB TPM Auto- decryption Data Exposure

    Common Vulnerabilities and Exposures (CVEs)

    US Health Insurance Website HealthCare.gov Hacked Again? 7,500 Users Potentially Affected

    Development

    Highlights

    Development

    How to use Jmeter Xpath extractor for multiple run in dynamic API elements

    November 19, 2024

    XPath extractor response assertion passed first run but in second run it is getting failed because <name>pc</name> is not coming in API. How to skip this element if it is not coming in second run without failing response assertion or any suggestions for this?

    API response first run:

    API response in second run: missing element < name > pc < /name >

    Response assertion – DB values(name_1)using jdbc request match with xml element(name) in API response

    Dark Web Hacker ‘Tombstone’ Markets Google Subdomain Vulnerabilities on Cybercrime Forum

    June 12, 2024

    As Part of JMeter WEB DRIVER(Selenium Scripts) Integration With Azure Pipeline and how can I add Headless Browser in Azure Pipeline

    July 7, 2024

    Linux Lite just got a useful AI feature for desktop – and it’s more subtle than you think

    November 4, 2024
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.