Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      June 1, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      June 1, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      June 1, 2025

      How To Prevent WordPress SQL Injection Attacks

      June 1, 2025

      My top 5 must-play PC games for the second half of 2025 — Will they live up to the hype?

      June 1, 2025

      A week of hell with my Windows 11 PC really makes me appreciate the simplicity of Google’s Chromebook laptops

      June 1, 2025

      Elden Ring Nightreign Night Aspect: How to beat Heolstor the Nightlord, the final boss

      June 1, 2025

      New Xbox games launching this week, from June 2 through June 8 — Zenless Zone Zero finally comes to Xbox

      June 1, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      Student Record Android App using SQLite

      June 1, 2025
      Recent

      Student Record Android App using SQLite

      June 1, 2025

      When Array uses less memory than Uint8Array (in V8)

      June 1, 2025

      Laravel 12 Starter Kits: Definite Guide Which to Choose

      June 1, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      My top 5 must-play PC games for the second half of 2025 — Will they live up to the hype?

      June 1, 2025
      Recent

      My top 5 must-play PC games for the second half of 2025 — Will they live up to the hype?

      June 1, 2025

      A week of hell with my Windows 11 PC really makes me appreciate the simplicity of Google’s Chromebook laptops

      June 1, 2025

      Elden Ring Nightreign Night Aspect: How to beat Heolstor the Nightlord, the final boss

      June 1, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»Machine Learning»Zep AI Introduces a Smarter Memory Layer for AI Agents Outperforming the MemGPT in the Deep Memory Retrieval (DMR) Benchmark

    Zep AI Introduces a Smarter Memory Layer for AI Agents Outperforming the MemGPT in the Deep Memory Retrieval (DMR) Benchmark

    February 4, 2025

    The development of transformer-based large language models (LLMs) has significantly advanced AI-driven applications, particularly conversational agents. However, these models face inherent limitations due to their fixed context windows, which can lead to loss of relevant information over time. While Retrieval-Augmented Generation (RAG) methods provide external knowledge to supplement LLMs, they often rely on static document retrieval, which lacks the flexibility required for adaptive and evolving conversations.

    MemGPT was introduced as an AI memory solution that extends beyond traditional RAG approaches, yet it still struggles with maintaining coherence across long-term interactions. In enterprise applications, where AI systems must integrate information from ongoing conversations and structured data sources, a more effective memory framework is needed—one that can retain and reason over time.

    Introducing Zep: A Memory Layer for AI Agents

    Zep AI Research presents Zep, a memory layer designed to address these challenges by leveraging Graphiti, a temporally-aware knowledge graph engine. Unlike static retrieval methods, Zep continuously updates and synthesizes both unstructured conversational data and structured business information.

    In benchmarking tests, Zep has demonstrated strong performance in the Deep Memory Retrieval (DMR) benchmark, achieving 94.8% accuracy, slightly surpassing MemGPT’s 93.4%. Additionally, it has proven effective in LongMemEval, a benchmark designed to assess AI memory in complex enterprise settings, showing accuracy improvements of up to 18.5% while reducing response latency by 90%.

    Technical Design and Benefits

    1. A Knowledge Graph Approach to Memory

    Unlike traditional RAG methods, Zep’s Graphiti engine structures memory as a hierarchical knowledge graph with three key components:

    • Episode Subgraph: Captures raw conversational data, ensuring a complete historical record.
    • Semantic Entity Subgraph: Identifies and organizes entities to enhance knowledge representation.
    • Community Subgraph: Groups entities into clusters, providing a broader contextual framework.
    2. Handling Time-Based Information

    Zep employs a bi-temporal model to track knowledge with two distinct timelines:

    • Event Timeline (T): Orders events chronologically.
    • System Timeline (T’): Maintains a record of how data has been stored and updated. This approach helps AI systems retain a meaningful understanding of past interactions while integrating new information effectively.
    3. A Multi-Faceted Retrieval Mechanism

    Zep retrieves relevant information using a combination of:

    • Cosine Similarity Search (for semantic matching)
    • Okapi BM25 Full-Text Search (for keyword relevance)
    • Graph-Based Breadth-First Search (for contextual associations) These techniques allow AI agents to retrieve the most relevant information efficiently.
    4. Efficiency and Scalability

    By structuring memory in a knowledge graph, Zep reduces redundant data retrieval, leading to lower token usage and faster responses. This makes it well-suited for enterprise applications where cost and latency are critical factors.

    Performance Evaluation

    Zep’s capabilities have been validated through comprehensive testing in two key benchmarks:

    1. Deep Memory Retrieval (DMR) Benchmark

    DMR measures how well AI memory systems retain and retrieve past information. Zep achieved:

    • 94.8% accuracy with GPT-4 Turbo, compared to 93.4% for MemGPT.
    • 98.2% accuracy with GPT-4o Mini, demonstrating strong memory retention.
    2. LongMemEval Benchmark

    LongMemEval assesses AI agents in real-world business scenarios, where conversations can span over 115,000 tokens. Zep demonstrated:

    • 15.2% and 18.5% accuracy improvements with GPT-4o Mini and GPT-4o, respectively.
    • Significant latency reduction, making responses 90% faster than traditional full-context retrieval methods.
    • Lower token usage, requiring only 1.6k tokens per response compared to 115k tokens in full-context approaches.
    3. Performance Across Different Question Types

    Zep showed strong performance in complex reasoning tasks:

    • Preference-Based Questions: 184% improvement over full-context retrieval.
    • Multi-Session Queries: 30.7% improvement.
    • Temporal Reasoning: 38.4% improvement, highlighting Zep’s ability to track and infer time-sensitive information.

    Conclusion

    Zep provides a structured and efficient way for AI systems to retain and retrieve knowledge over extended periods. By moving beyond static retrieval methods and incorporating a dynamically evolving knowledge graph, it enables AI agents to maintain coherence across sessions and reason over past interactions.

    With 94.8% DMR accuracy and proven effectiveness in enterprise-level applications, Zep represents an advancement in AI memory solutions. By optimizing data retrieval, reducing token costs, and improving response speed, it offers a practical and scalable approach to enhancing AI-driven applications.


    Check out the Paper. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. Don’t Forget to join our 75k+ ML SubReddit.

    🚨 Marktechpost is inviting AI Companies/Startups/Groups to partner for its upcoming AI Magazines on ‘Open Source AI in Production’ and ‘Agentic AI’.

    The post Zep AI Introduces a Smarter Memory Layer for AI Agents Outperforming the MemGPT in the Deep Memory Retrieval (DMR) Benchmark appeared first on MarkTechPost.

    Source: Read More 

    Facebook Twitter Reddit Email Copy Link
    Previous ArticleNYU Researchers Introduce WILDCHAT-50M: A Large-Scale Synthetic Dataset for Efficient LLM Post-Training
    Next Article Kingdom Come: Deliverance 2 is 2025’s GOTY frontrunner, and it can be yours for less with these deals

    Related Posts

    Machine Learning

    How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark

    June 1, 2025
    Machine Learning

    Enigmata’s Multi-Stage and Mix-Training Reinforcement Learning Recipe Drives Breakthrough Performance in LLM Puzzle Reasoning

    June 1, 2025
    Leave A Reply Cancel Reply

    Continue Reading

    The rumor is true. Microsoft brings Phone Link to your Start menu in Windows 11

    Development

    How to share your location on Android: 3 ways that also work with iPhones

    Development

    40% off Lenovo’s most legendary business laptop? Don’t let tariffs get in the way of a good time.

    News & Updates

    Le notizie minori del mondo GNU/Linux e dintorni della settimana nr 1/2025

    Development
    GetResponse

    Highlights

    News & Updates

    I just bought this 34% off 1TB SSD for my gaming PC because it’s fast, cheap, and well-rated — Now I can finally upgrade my gaming PC

    March 21, 2025

    Thanks to a deal going on right now, I bought an internal SSD for my…

    Windows Central Podcast: Could we see the future of Windows and Copilot next month?

    March 16, 2025

    The Linux Foundation launches an initiative to support open-source Chromium-based browsers

    January 9, 2025

    It only took Call of Duty: Warzone going back in time to make the game so much better — to the point even I enjoy it now

    April 7, 2025
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.