Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      June 2, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      June 2, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      June 2, 2025

      How To Prevent WordPress SQL Injection Attacks

      June 2, 2025

      How Red Hat just quietly, radically transformed enterprise server Linux

      June 2, 2025

      OpenAI wants ChatGPT to be your ‘super assistant’ – what that means

      June 2, 2025

      The best Linux VPNs of 2025: Expert tested and reviewed

      June 2, 2025

      One of my favorite gaming PCs is 60% off right now

      June 2, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      `document.currentScript` is more useful than I thought.

      June 2, 2025
      Recent

      `document.currentScript` is more useful than I thought.

      June 2, 2025

      Adobe Sensei and GenAI in Practice for Enterprise CMS

      June 2, 2025

      Over The Air Updates for React Native Apps

      June 2, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      You can now open ChatGPT on Windows 11 with Win+C (if you change the Settings)

      June 2, 2025
      Recent

      You can now open ChatGPT on Windows 11 with Win+C (if you change the Settings)

      June 2, 2025

      Microsoft says Copilot can use location to change Outlook’s UI on Android

      June 2, 2025

      TempoMail — Command Line Temporary Email in Linux

      June 2, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»Machine Learning»Baidu Research Introduces EICopilot: An Intelligent Agent-based Chatbot to Retrieve and Interpret Enterprise Information from Massive Graph Databases

    Baidu Research Introduces EICopilot: An Intelligent Agent-based Chatbot to Retrieve and Interpret Enterprise Information from Massive Graph Databases

    January 31, 2025

    Knowledge graphs have been used tremendously in the field of enterprise lately, with their applications realized in multiple data forms from legal persons to registered capital and shareholder’s details. Although graphs have high utility, they have been criticized for intricate text-based queries and manual exploration, which obstruct the extraction of pertinent information.

    With the massive strides in natural language processing and generative intelligence in the past years, LLMs have been used to perform complex queries and summarization based on their language comprehension and exploration skill set. This article discusses the latest research that uses language models to streamline information extraction from graph databases.

    Researchers from Baidu presented  “EICopilot,” an agent-based solution that streamlines search, exploration, and summarization of corporate data stored in knowledge graph databases to gain valuable insights about enterprises efficiently. To appreciate the work more, we must look at the scale of data handled by EICopilot. A typical graph dataset of this nature consists of hundreds of millions of nodes, tens of billions of edges, hundreds of billions of attributes, and millions of subgraphs as company communities representing a country’s registered corporations, organizations, and companies.

    EICopilot is an LLM-based chatbot that utilizes a novel data preprocessing pipeline that optimizes database queries. To achieve this, the authors first gather real-world queries related to companies from general-purpose search engines. Post collection, developers reserve some representative queries exclusively as seed datasets and write search scripts for every query using Gremlin language for the graph dataset. Finally, the authors systematically annotate and augment the above queries and scripts to form a vector database that enhances search accuracy.EICopilot utilizes this vector database to generate search spaces in real-time for effective retrieval and exploration of graphs.

    In addition to the above data processing pipeline, EICopilot employs a comprehensive reasoning pipeline to provide precise query responses. This pipeline uses Chain-of-Thought (CoT) and In-Context Learning (ICL) to provide more accurate responses.

    The authors also highlight the importance of an entity name in the query rather than the intent in a vector database query matching. The authors also proposed a novel query masking strategy that masks entity names in queries to combat this.EICopilot ensures that queries are understood in their complexity and executed with greater precision and relevance to user intent.

    The authors provided us with an extensive empirical analysis and real-world experimentation that validate the utility of the proposed framework. They obtained data from Baidu’s internal data platform and processed it rigorously to construct a dataset involving a query and graph database query pair. The authors introduce a length complexity score based on the traversal length of the query. Based on the above score, the query was categorized as simple, moderate, or complex. To assess the performance of 𝐸𝐼𝐶𝑜𝑝𝑖𝑙𝑜𝑡, authors considered the SyntaxErrorRate and Execution Correctness of the generated Gremlin scripts. For the LLMs, EICopilot utilized  ErnieBot, ErnieBot-Speed, and Llama3-8b models.

    The empirical results from the above experiments proved the superior performance of EICopilot over baselines, especially in terms of speed and accuracy; notably, the Full Mask variant of EICopilot achieved a syntax error rate reduction to as low as 10.00% and an execution correctness of up to 82.14%. These results highlighted the critical role of the method’s components in enhancing query and summarization processes.

    Conclusion: This paper introduced EICopilot, an agent-based chatbot that enhances querying and summarization processes from massive knowledge graph databases in corporations. The authors proposed a series of innovations like script generation, novel data pre-processing, and masking techniques. The proposed method superseded baseline methods in speed and accuracy, thus revolutionizing large-scale knowledge graph exploration.


    Check out the Paper. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. Don’t Forget to join our 70k+ ML SubReddit.

    🚨 [Recommended Read] Nebius AI Studio expands with vision models, new language models, embeddings and LoRA (Promoted)

    The post Baidu Research Introduces EICopilot: An Intelligent Agent-based Chatbot to Retrieve and Interpret Enterprise Information from Massive Graph Databases appeared first on MarkTechPost.

    Source: Read More 

    Facebook Twitter Reddit Email Copy Link
    Previous ArticleFrom Deep Knowledge Tracing to DKT2: A Leap Forward in Educational AI
    Next Article DeepSeek-R1 model now available in Amazon Bedrock Marketplace and Amazon SageMaker JumpStart

    Related Posts

    Machine Learning

    How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark

    June 2, 2025
    Machine Learning

    MiMo-VL-7B: A Powerful Vision-Language Model to Enhance General Visual Understanding and Multimodal Reasoning

    June 2, 2025
    Leave A Reply Cancel Reply

    Continue Reading

    Forget Anker: This 240W power bank is the premium charging accessory your iPhone deserves

    News & Updates

    How to Build a GraphQL API in Django

    Development

    U.S. Agencies Warn of Iranian Hacking Group’s Ongoing Ransomware Attacks

    Development

    How Rocket Companies modernized their data science solution on AWS

    Machine Learning

    Highlights

    Global Cybersecurity Agencies Warn of Spyware Targeting Uyghur, Tibetan, and Taiwanese Communities Development

    Global Cybersecurity Agencies Warn of Spyware Targeting Uyghur, Tibetan, and Taiwanese Communities

    April 9, 2025

    In a joint advisory, the United Kingdom’s National Cyber Security Centre (NCSC), along with international…

    User Research Is Storytelling

    June 1, 2024

    Fake job offers target software developers with infostealers

    February 22, 2025

    Windows Server Build 26244 is out now with only one, but important change

    July 1, 2024
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.