Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      May 13, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      May 13, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      May 13, 2025

      How To Prevent WordPress SQL Injection Attacks

      May 13, 2025

      This $4 Steam Deck game includes the most-played classics from my childhood — and it will save you paper

      May 13, 2025

      Microsoft shares rare look at radical Windows 11 Start menu designs it explored before settling on the least interesting one of the bunch

      May 13, 2025

      NVIDIA’s new GPU driver adds DOOM: The Dark Ages support and improves DLSS in Microsoft Flight Simulator 2024

      May 13, 2025

      How to install and use Ollama to run AI LLMs on your Windows 11 PC

      May 13, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      Community News: Latest PECL Releases (05.13.2025)

      May 13, 2025
      Recent

      Community News: Latest PECL Releases (05.13.2025)

      May 13, 2025

      How We Use Epic Branches. Without Breaking Our Flow.

      May 13, 2025

      I think the ergonomics of generators is growing on me.

      May 13, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      This $4 Steam Deck game includes the most-played classics from my childhood — and it will save you paper

      May 13, 2025
      Recent

      This $4 Steam Deck game includes the most-played classics from my childhood — and it will save you paper

      May 13, 2025

      Microsoft shares rare look at radical Windows 11 Start menu designs it explored before settling on the least interesting one of the bunch

      May 13, 2025

      NVIDIA’s new GPU driver adds DOOM: The Dark Ages support and improves DLSS in Microsoft Flight Simulator 2024

      May 13, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»Qdrant Unveils BM42: A Cutting-Edge Pure Vector-Based Hybrid Search Algorithm Optimizing RAG and AI Applications

    Qdrant Unveils BM42: A Cutting-Edge Pure Vector-Based Hybrid Search Algorithm Optimizing RAG and AI Applications

    July 5, 2024

    Qdrant, a leading provider of vector search technology, has introduced BM42, a new algorithm designed to revolutionize hybrid search. For the past four decades, BM25 has been the standard algorithm used by search engines, from Google to Yahoo. However, the advent of vector search and the introduction of Retrieval-Augmented Generation (RAG) have highlighted the need for a more advanced solution. BM42 aims to bridge this gap by combining the strengths of BM25 with modern transformer models, offering a significant upgrade for search applications.

    The Legacy of BM25

    BM25 has remained relevant for a long time due to its simple yet effective formula, which calculates the relevance of documents based on term frequency and inverse document frequency (IDF). This method excels in traditional web search environments where document length and query structures are consistent. However, the landscape of text retrieval has shifted dramatically with the rise of RAG systems, which require handling shorter, more varied documents and queries. BM25’s reliance on document statistics, such as term frequency and document length, becomes less effective in these scenarios.

    The Introduction of BM42

    BM42 addresses these challenges by integrating the core principles of BM25 with the capabilities of transformer models. The key innovation in BM42 is using attention matrices from transformers to determine the importance of the term within documents. Transformers generate a range of outputs, including embeddings and attention matrices, highlighting each token’s significance in the input sequence. By leveraging the attention row corresponding to the special [CLS] token, BM42 can accurately gauge the importance of each token in a document, even for shorter texts typical in RAG applications.

    Advantages of BM42

    BM42 offers several advantages over BM25 and SPLADE, another modern alternative that uses transformers to create sparse embeddings. While SPLADE has shown superior performance in academic benchmarks, it needs to improve its performance, including the need for extensive computational resources and issues with tokenization and domain dependency. BM42, on the other hand, retains the interpretability and simplicity of BM25 while overcoming SPLADE’s limitations.

    One of BM42’s primary benefits is its efficiency. The algorithm can perform document and query inferences quickly, making it suitable for real-time applications. It also has a low memory footprint, ensuring it can handle large datasets without significant resource demands. BM42 supports multiple languages and domains, provided a suitable transformer model is available, making it highly versatile.

    Image Source

    Practical Implementation

    BM42 can be seamlessly integrated into Qdrant’s vector search engine. The implementation involves setting up a collection for hybrid search with BM42 and using dense embeddings from models like jina.ai. This combination allows for a balanced approach, where sparse and dense embeddings complement each other to enhance retrieval accuracy. Benchmarks conducted by Qdrant demonstrate that BM42 outperforms BM25 in scenarios involving short texts, a common use case in modern search applications.

    Encouraging Community Engagement

    Qdrant’s release of BM42 introduces a new algorithm and fosters community engagement and innovation. The company invites developers and researchers to experiment with BM42, share their projects, and contribute to its ongoing development. By providing this powerful tool, Qdrant aims to empower its community to push the boundaries of what is possible in search technology.

    Conclusion

    The release of BM42 by Qdrant marks a significant milestone in the evolution of search algorithms. By combining the robustness of BM25 with the intelligence of transformers, BM42 sets a new standard for hybrid search. It addresses the limitations of earlier methods and modern alternatives, offering a versatile, efficient, and highly accurate solution for today’s search applications.

    The post Qdrant Unveils BM42: A Cutting-Edge Pure Vector-Based Hybrid Search Algorithm Optimizing RAG and AI Applications appeared first on MarkTechPost.

    Source: Read More 

    Facebook Twitter Reddit Email Copy Link
    Previous ArticleHow AI Scales with Data Size? This Paper from Stanford Introduces a New Class of Individualized Data Scaling Laws for Machine Learning
    Next Article Applying RLAIF for Code Generation with API-usage in Lightweight LLMs

    Related Posts

    Security

    Nmap 7.96 Launches with Lightning-Fast DNS and 612 Scripts

    May 13, 2025
    Common Vulnerabilities and Exposures (CVEs)

    CVE-2025-3744 – Nomad Sentinel Policy Bypass

    May 13, 2025
    Leave A Reply Cancel Reply

    Continue Reading

    🚀 How JavaScript Works: Understanding V8 JIT and Its Impact on UI Performance

    Development

    Advanced API Testing Part 2: JSON Schema Validation, Serialization & Deserialization Techniques

    Development

    Researchers Uncover Symlink Exploit Allowing TCC Bypass in iOS and macOS

    Development

    Dash to Panel GNOME Extension Gets Big Update

    Linux

    Highlights

    Development

    A Brief Introduction to Web Components

    May 9, 2025

    In a previous article, I gave a brief introduction to React. This tutorial introduces an…

    What happens when facial recognition gets it wrong – Week in security with Tony Anscombe

    June 1, 2024

    Claude 3.5 Sonnet launch on Bedrock doesn’t open AWS to OpenAI, Google models

    June 24, 2024

    Using Relative Date Helpers in Laravel’s Query Builder

    February 13, 2025
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.