Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      May 16, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      May 16, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      May 16, 2025

      How To Prevent WordPress SQL Injection Attacks

      May 16, 2025

      Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

      May 16, 2025

      Bing Search APIs to be “decommissioned completely” as Microsoft urges developers to use its Azure agentic AI alternative

      May 16, 2025

      Microsoft might kill the Surface Laptop Studio as production is quietly halted

      May 16, 2025

      Minecraft licensing robbed us of this controversial NFL schedule release video

      May 16, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      The power of generators

      May 16, 2025
      Recent

      The power of generators

      May 16, 2025

      Simplify Factory Associations with Laravel’s UseFactory Attribute

      May 16, 2025

      This Week in Laravel: React Native, PhpStorm Junie, and more

      May 16, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

      May 16, 2025
      Recent

      Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

      May 16, 2025

      Bing Search APIs to be “decommissioned completely” as Microsoft urges developers to use its Azure agentic AI alternative

      May 16, 2025

      Microsoft might kill the Surface Laptop Studio as production is quietly halted

      May 16, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»DrBenchmark: The First-Ever Publicly Available French Biomedical Large Language Understanding Benchmark

    DrBenchmark: The First-Ever Publicly Available French Biomedical Large Language Understanding Benchmark

    April 29, 2024

    A group of researchers in France introduced Dr.Benchmark to address the need for the evaluation of masked language models in French, particularly in the biomedical domain. There have been significant advances in the field of NLP, particularly in pre-trained language models (PLMs), but evaluating these models remains difficult due to variations in evaluation protocols. The scarcity of evaluation benchmarks in the biomedical domain in languages other than English and Chinese has made this even more challenging. These issues created a gap in evaluating the accuracy of the latest French biomedical models.

    The existing method for evaluating French language models failed to provide standardized protocols and comprehensive benchmark datasets, leading to inconsistent results and stalling advancement in NLP research. DrBenchmark is the first publicly available French biomedical language understanding benchmark. This benchmark comprises 20 diversified tasks, including named-entity recognition, part-of-speech tagging, question-answering, semantic textual similarity, and classification. The primary contribution of DrBenchmark is its aggregation of diverse downstream tasks into a single benchmark, allowing the assessment of pre-trained language models’ intrinsic qualities from various perspectives. The paper also tests eight cutting-edge pre-trained masked language models (MLMs) on both general and biomedical data. The MLMs include French generalist models, cross-lingual generalist models, French biomedical models, and an English biomedical model.

    DrBenchmark offers a modular, reproducible, and easily customizable automated protocol for fair comparison among language models. It leverages the HuggingFace Datasets and the Transformers library for data loading, pre-training, and evaluation. The experimental protocol ensures consistency by fine-tuning all models using the same hyperparameters for each downstream task. Results from the experiments reveal that no single model excels across all tasks, highlighting the importance of domain-specific models for achieving peak performance in the biomedical field. Interestingly, even though French biomedical models exhibit superior performance in most tasks, certain out-of-domain models or models trained in different languages maintain competitiveness in specific tasks.

    In conclusion, the paper presents DrBenchmark to solve the lack of evaluation resources for French biomedical NLP models. By aggregating diverse downstream tasks into a comprehensive benchmark, DrBenchmark enables fair comparison among pre-trained language models. The evaluation results highlight the importance of employing domain-specific models for optimal performance in biomedical NLP tasks. The study also shows that certain models trained in different languages or outside of the domain can still compete in specific tasks, underscoring the need for more study in this field.

    Check out the Paper and Project page. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter. Join our Telegram Channel, Discord Channel, and LinkedIn Group.

    If you like our work, you will love our newsletter..

    Don’t Forget to join our 40k+ ML SubReddit

    The post DrBenchmark: The First-Ever Publicly Available French Biomedical Large Language Understanding Benchmark appeared first on MarkTechPost.

    Source: Read More 

    Hostinger
    Facebook Twitter Reddit Email Copy Link
    Previous ArticleTop Data Science Courses in 2024
    Next Article 50+ Free Seamless Pattern Packs for Designers

    Related Posts

    Security

    Nmap 7.96 Launches with Lightning-Fast DNS and 612 Scripts

    May 17, 2025
    Common Vulnerabilities and Exposures (CVEs)

    CVE-2025-48187 – RAGFlow Authentication Bypass

    May 17, 2025
    Leave A Reply Cancel Reply

    Continue Reading

    CVE-2025-3647 – Moodle Information Disclosure

    Common Vulnerabilities and Exposures (CVEs)

    CVE-2025-2905 (CVSS 9.1): Critical XXE Vulnerability Found in WSO2 API Manager

    Security

    Chinese State-Backed Cyber Espionage Targets Southeast Asian Government

    Development

    Qwen2-Math Released: A Comprehensive AI Suite Featuring Models Ranging from 1.5B to 72B Parameters, Transforming Mathematical Computation

    Development
    GetResponse

    Highlights

    Development

    defineExpose and in Vue 3 for component interaction and theming

    November 7, 2024

    As Vue.js applications scale up, component interaction management and dynamic styling can become progressively more…

    Create a custom JavaScript sparkle cursor

    April 17, 2025

    Access control for vector stores using metadata filtering with Knowledge Bases for Amazon Bedrock

    July 2, 2024

    User Research is the Secret Weapon of Great Design

    June 24, 2024
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.