Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      May 16, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      May 16, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      May 16, 2025

      How To Prevent WordPress SQL Injection Attacks

      May 16, 2025

      Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

      May 16, 2025

      Bing Search APIs to be “decommissioned completely” as Microsoft urges developers to use its Azure agentic AI alternative

      May 16, 2025

      Microsoft might kill the Surface Laptop Studio as production is quietly halted

      May 16, 2025

      Minecraft licensing robbed us of this controversial NFL schedule release video

      May 16, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      The power of generators

      May 16, 2025
      Recent

      The power of generators

      May 16, 2025

      Simplify Factory Associations with Laravel’s UseFactory Attribute

      May 16, 2025

      This Week in Laravel: React Native, PhpStorm Junie, and more

      May 16, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

      May 16, 2025
      Recent

      Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

      May 16, 2025

      Bing Search APIs to be “decommissioned completely” as Microsoft urges developers to use its Azure agentic AI alternative

      May 16, 2025

      Microsoft might kill the Surface Laptop Studio as production is quietly halted

      May 16, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»Assessing Noise Impact on Machine Learning Models for Voice Disorder Evaluation

    Assessing Noise Impact on Machine Learning Models for Voice Disorder Evaluation

    August 3, 2024

    Deep learning has become a powerful tool for classifying pathological voices, particularly in the GRBAS (Grade, Roughness, Breathiness, Asthenia, Strain) scale assessment. The GRBAS scale is a standardized method clinicians use to evaluate voice disorders based on auditory-perceptual judgment. Traditional methods for classifying pathological voices often rely on manual feature extraction and subjective analysis, which can be time-consuming and inconsistent. Deep learning techniques such as 1D convolutional neural networks (1D-CNNs) offer significant advantages by automatically learning relevant features from raw audio data, capturing complex patterns and nuances indicative of specific pathological conditions.

    However, noise can significantly impact the accuracy of these models. Since they rely on extracting subtle features from voice signals, any background noise or distortion can obscure important characteristics, leading to misclassification. Noise from recording environments, equipment, or background sounds poses a critical challenge in developing reliable voice pathology detection systems. Preprocessing techniques like noise reduction and signal enhancement are often employed, but they may only sometimes be sufficient to eliminate the effects of noise on classification performance.

    In this context, a new paper was recently published in the journal The Laryngoscope, which aims to assess the impact of background noise on machine learning models used for evaluating the GRBAS scale in voice disorder assessments.

    In this study, the authors created a unique dataset from clinical patients’ voice samples recorded in a soundproof room. These samples were rated according to the GRBAS scale by otolaryngologists and an expert speech and language therapist. The ratings’ median values were adopted as the correct answers, and inter-rater agreement was evaluated using Krippendorff’s alpha.

    The machine learning model was a 5-layer 1D-CNN, constructed and evaluated using TensorFlow. The dataset was divided into 80% training, 10% validation, and 10% test data. The training process was conducted without noise data. Gaussian noise of various intensities was added to the test samples to assess noise resilience. The model’s performance was evaluated using accuracy, F1 score, and quadratic weighted Cohen’s kappa score under different noise conditions. The study highlights the significance of noise as a challenge in applying machine learning models to real-world scenarios like examination rooms.

    The dataset of voice samples, balanced for age and gender, showed that the deep learning model performed well with noise-free data. As Gaussian noise intensity increased, performance metrics dropped significantly, with accuracy falling dramatically at the highest noise level. This degradation was observed across all GRBAS parameters, with certain scales showing the most significant declines.

    The study found that background noise severely affects the model’s accuracy and performance metrics. The model’s effectiveness decreased as noise levels increased, highlighting its vulnerability to real-world conditions. Certain GRBAS components were more sensitive to noise. The study suggests incorporating noise-resilient techniques such as data augmentation and noise reduction to improve model robustness. Limitations include the small number of evaluators and using only one type of vocal sample, which may not fully capture the variability in voice disorders. Future work should address these issues to enhance the model’s generalizability and performance in noisy environments.

    To conclude, the model’s performance significantly declined with increased background noise, impacting the evaluation metrics. Future research should focus on developing noise-tolerant methods, such as data augmentation, to enhance the model’s resilience in real-world conditions. Improving the GRBAS scale’s reliability can make it a valuable tool for both physicians and patients. Automated evaluations can facilitate earlier disease detection, leading to more effective treatments and better support for rehabilitation.

    Check out the Paper. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter..

    Don’t Forget to join our 47k+ ML SubReddit

    Find Upcoming AI Webinars here

    Arcee AI Released DistillKit: An Open Source, Easy-to-Use Tool Transforming Model Distillation for Creating Efficient, High-Performance Small Language Models

    The post Assessing Noise Impact on Machine Learning Models for Voice Disorder Evaluation appeared first on MarkTechPost.

    Source: Read More 

    Facebook Twitter Reddit Email Copy Link
    Previous ArticleWolf: A Mixture-of-Experts Video Captioning Framework that Outperforms GPT-4V and Gemini-Pro-1.5 in General Scenes, Autonomous Driving, and Robotics Videos
    Next Article SPRITE (Spatial Propagation and Reinforcement of Imputed Transcript Expression): Enhancing Spatial Gene Expression Predictions and Downstream Analyses Through Meta-Algorithmic Integration

    Related Posts

    Machine Learning

    Salesforce AI Releases BLIP3-o: A Fully Open-Source Unified Multimodal Model Built with CLIP Embeddings and Flow Matching for Image Understanding and Generation

    May 16, 2025
    Security

    Nmap 7.96 Launches with Lightning-Fast DNS and 612 Scripts

    May 16, 2025
    Leave A Reply Cancel Reply

    Continue Reading

    Facebook Banna GNU/Linux: Cosa Sta Succedendo e Alternative per la Comunità GNU/Linux

    Linux

    Prime Day 2024: Amazon finally confirms a start date, but some early deals are already live — Here’s everything you need to know

    Development

    Microsoft just made Windows 10 worse, and there’s (almost) nothing you can do about it

    News & Updates

    Long-Context Multimodal Understanding No Longer Requires Massive Models: NVIDIA AI Introduces Eagle 2.5, a Generalist Vision-Language Model that Matches GPT-4o on Video Tasks Using Just 8B Parameters

    Machine Learning
    GetResponse

    Highlights

    Development

    NATO Innovation Fund announces its new investment team

    May 24, 2024

    The NATO Innovation Fund (NIF) has announced its new investment team with experience that spans…

    OpenAI CEO Sam Altman anticipates GPT-5 as a “significant leap forward” over GPT-4, which occasionally “goes off the rails” with mistakes even a six-year-old wouldn’t make

    July 1, 2024

    Both of Getty’s commercial-safe AI image generators just got smarter and faster

    July 29, 2024

    A Developer’s Guide to Protecting Personal Data: Best Practices and Tools

    April 17, 2025
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.