Sleep is a vital physiological process that is intricately linked to overall health. However, accurately assessing sleep and diagnosing sleep disorders remains complex, since it requires interpreting multi-modal data, typically obtained through polysomnography (PSG). Current approaches to sleep monitoring and analysis rely heavily on manual evaluation by trained technicians, which is time-consuming and susceptible to variability. To fully capture the richness of sleep recordings, researchers from Stanford University and the Technical University of Denmark have introduced SleepFM.
Existing deep learning methods for sleep analysis predominantly involve end-to-end convolutional neural networks (CNNs) trained on raw PSG data. While these models can automate some aspects of sleep analysis, their performance often falls short, particularly when handling multi-modal data from different physiological sources. SleepFM, the first multi-modal foundation model for sleep analysis, addresses these limitations. It leverages a large dataset of PSG records from over 14,000 participants to learn robust embeddings through contrastive learning (CL). The model employs a novel leave-one-out approach to CL, which improves downstream-task performance compared to standard pairwise CL.
The researchers curated an extensive PSG dataset encompassing 100,000 hours of recordings and employed a multi-step preprocessing strategy to preserve crucial signal characteristics. SleepFM’s architecture comprises three 1D CNNs, each generating embeddings for a different modality (brain activity signals, ECG, and respiratory signals). These CNNs are based on the EfficientNet architecture, optimized for efficiency and reduced complexity. The leave-one-out CL framework lets the model learn representations by aligning each modality with an aggregate representation of the remaining modalities, encouraging holistic learning of multi-modal data.
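To make the leave-one-out objective concrete, here is a minimal PyTorch sketch of how such a loss could be computed. The function name, the mean-based aggregation of the remaining modalities, the temperature value, and the symmetric InfoNCE form are illustrative assumptions, not the authors’ exact implementation.

```python
import torch
import torch.nn.functional as F

def leave_one_out_contrastive_loss(embeddings, temperature=0.1):
    """Hypothetical leave-one-out contrastive loss.

    embeddings: list of (batch, dim) tensors, one per modality
    (e.g., brain activity, ECG, respiratory signals).
    """
    total = 0.0
    for i, anchor in enumerate(embeddings):
        # Aggregate the remaining modalities; mean pooling is one simple choice.
        rest = torch.stack(
            [e for j, e in enumerate(embeddings) if j != i]
        ).mean(dim=0)
        a = F.normalize(anchor, dim=-1)
        r = F.normalize(rest, dim=-1)
        logits = a @ r.T / temperature  # (batch, batch) similarity matrix
        # Matching clips from the same recording sit on the diagonal.
        targets = torch.arange(a.size(0), device=a.device)
        # Symmetric InfoNCE: anchor -> aggregate and aggregate -> anchor.
        total += 0.5 * (F.cross_entropy(logits, targets)
                        + F.cross_entropy(logits.T, targets))
    return total / len(embeddings)

# Toy usage with three modalities and random per-clip embeddings:
embs = [torch.randn(8, 128) for _ in range(3)]
print(leave_one_out_contrastive_loss(embs))
```

By contrast, standard pairwise CL would apply this loss between every pair of modalities separately; aligning each modality against the aggregate of the others is what encourages the holistic cross-modal representation described above.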
In performance evaluations, SleepFM demonstrated significant improvements over end-to-end CNNs. For sleep stage classification, the logistic regression model trained on SleepFM’s embeddings achieved a macro AUROC of 0.88 compared to 0.72 from CNNs, and a macro AUPRC of 0.72 versus 0.48. In sleep-disordered breathing (SDB) detection, SleepFM similarly outperformed CNNs, with an AUROC of 0.85 and an AUPRC of 0.77. Additionally, SleepFM excelled in retrieving corresponding recording clips from different modalities, showcasing a 48% top-1 average accuracy among 90,000 candidates. These results underscore the model’s ability to capture rich, multi-modal sleep data representations effectively.
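These downstream results come from a linear probe: the pretrained encoders are frozen and a simple classifier is fit on the embeddings. Below is a minimal sketch of that evaluation pattern using scikit-learn, with random placeholder arrays standing in for the real SleepFM embeddings and sleep-stage labels.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score, average_precision_score
from sklearn.preprocessing import label_binarize

# Placeholder data: in practice these would be frozen SleepFM clip embeddings
# and the corresponding sleep-stage labels (wake, N1, N2, N3, REM).
rng = np.random.default_rng(0)
X_train, y_train = rng.normal(size=(1000, 128)), rng.integers(0, 5, size=1000)
X_test, y_test = rng.normal(size=(200, 128)), rng.integers(0, 5, size=200)

# Linear probe: logistic regression on top of the frozen embeddings.
clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
probs = clf.predict_proba(X_test)

# Macro-averaged AUROC and AUPRC, the metrics reported above.
print("macro AUROC:", roc_auc_score(y_test, probs, multi_class="ovr", average="macro"))
y_bin = label_binarize(y_test, classes=list(range(5)))
print("macro AUPRC:", average_precision_score(y_bin, probs, average="macro"))
```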
In summary, the proposed model addresses the challenges of sleep monitoring and disorder diagnosis and significantly outperforms traditional CNNs in various sleep-related tasks. The innovative leave-one-out contrastive learning approach and robust dataset curation highlight the potential of holistic multi-modal modeling to advance sleep analysis. SleepFM’s superior performance in sleep stage classification and SDB detection, along with its robust generalization to external datasets, makes it a promising tool for enhancing sleep research and clinical applications.
Check out the Paper and GitHub. All credit for this research goes to the researchers of this project.