DoRM: A Brain-Inspired Approach to Generative Domain Adaptation

Few-shot Generative Domain Adaptation (GDA) addresses the challenge of adapting a model trained on a source domain to perform well on a target domain using only a few examples from that target domain. The technique is particularly useful when obtaining a large amount of labeled data from the target domain is expensive or impractical.

The main existing solution for GDA focuses on improving a special AI model called a “generator,” which creates new data samples that resemble the target domain, even with only a few examples. Techniques like consistency loss and GAN inversion help the generator produce high-quality and diverse data. These methods ensure the generated data maintains similarities and differences accurately across domains. However, challenges arise when the source and target domains have significant differences. In such cases, ensuring the generator can adapt and accurately generate data that fits both domains remains a considerable challenge.

To address these challenges, a recent paper presented at NeurIPS introduces Domain Re-Modulation (DoRM) for GDA. Unlike prior methods, DoRM enhances image synthesis quality, diversity, and cross-domain consistency while integrating memory and domain association capabilities inspired by human learning. By modifying the style space through new mapping and affine modules, DoRM can generate high-fidelity images across multiple domains, including hybrids not seen in training. The paper’s authors also introduced a novel similarity-based structure loss for better cross-domain alignment, showcasing superior performance in experimental evaluations compared to existing approaches.

Concretely, DoRM enhances the generator’s capabilities for GDA by introducing several key innovations:

1. Source Generator Preparation: The method starts from a pre-trained StyleGAN2 generator, which serves as the foundation for subsequent adaptations.

2. Introducing M&A Modules: To adapt to a new target domain, the source generator is frozen and new Mapping and Affine (M&A) modules are introduced. These modules specialize in capturing the attributes unique to the target domain. By selectively activating them, the generator can finely adjust its output to match the nuances of different domains.

3. Style Space Adjustment: DoRM transforms the source domain’s latent code into a new space tailored to the visual style of the target domain. This adjustment enables the generator to synthesize outputs that accurately reflect the characteristics of the target domain.

4. Linear Domain Shift: DoRM facilitates a linearly combinable domain shift in the generator’s style space using multiple M&A modules. These modules enable precise adjustments for specific domains, enhancing the generator’s flexibility to synthesize images across diverse domains and create seamless blends of attributes from multiple training sources.

5. Cross-Domain Consistency Enhancement: DoRM introduces a novel similarity-based structure loss (Lss) to ensure consistency across domains. This loss leverages CLIP image encoder tokens to align auto-correlation maps between source and target images, preserving structural coherence and fidelity to the target domain’s characteristics in the generated outputs.

6. Training Framework: DoRM trains with a combined loss function that adds Lss to StyleGAN2’s original adversarial loss. This integrated framework optimizes generator and discriminator learning, ensuring stable training dynamics and robust adaptation to complex domain shifts.
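
The style-space shift and the structure loss described in the list above can be sketched roughly as follows. This is a toy NumPy illustration, not the authors' implementation. The class `MappingAffine`, the functions `remodulated_style`, `self_similarity`, and `structure_loss`, the toy dimensions, the single-layer mapping network, and the mean-absolute-difference distance are all assumptions made for illustration. Random arrays stand in for StyleGAN2 latents and CLIP image tokens.

```python
import numpy as np

rng = np.random.default_rng(0)
LATENT_DIM, STYLE_DIM = 8, 16  # toy sizes, not StyleGAN2's real dimensions


class MappingAffine:
    """Hypothetical stand-in for one Mapping and Affine (M&A) module."""

    def __init__(self, rng):
        self.w_map = rng.normal(scale=0.1, size=(LATENT_DIM, LATENT_DIM))
        self.w_aff = rng.normal(scale=0.1, size=(LATENT_DIM, STYLE_DIM))
        self.b_aff = np.zeros(STYLE_DIM)

    def __call__(self, z):
        w = np.tanh(z @ self.w_map)         # mapping network (one layer here)
        return w @ self.w_aff + self.b_aff  # affine layer producing a style code


def remodulated_style(z, source, domain_modules, strengths):
    """Source style plus a weighted sum of per-domain style shifts."""
    style = source(z)  # the source branch is kept frozen during adaptation
    for module, alpha in zip(domain_modules, strengths):
        style = style + alpha * module(z)
    return style


def self_similarity(tokens):
    """Cosine self-similarity (auto-correlation) map of an (n, d) token matrix."""
    unit = tokens / np.linalg.norm(tokens, axis=1, keepdims=True)
    return unit @ unit.T


def structure_loss(src_tokens, tgt_tokens):
    """Mean absolute gap between the two maps (the distance is an assumption)."""
    gap = self_similarity(src_tokens) - self_similarity(tgt_tokens)
    return float(np.mean(np.abs(gap)))


# Usage with random stand-ins for latents and image-encoder tokens.
source = MappingAffine(rng)
sketch, baby = MappingAffine(rng), MappingAffine(rng)
z = rng.normal(size=(4, LATENT_DIM))

src_style = remodulated_style(z, source, [sketch, baby], [0.0, 0.0])    # source only
only_sketch = remodulated_style(z, source, [sketch, baby], [1.0, 0.0])  # one domain
only_baby = remodulated_style(z, source, [sketch, baby], [0.0, 1.0])    # other domain
hybrid = remodulated_style(z, source, [sketch, baby], [1.0, 1.0])       # both active

tokens = rng.normal(size=(5, 12))
```

Because the shifts are added in style space, activating two modules at once yields the sum of their individual shifts, which is one way to read the "linearly combinable" property in item 4. Setting every strength to zero recovers the source style, so the frozen generator's behavior is preserved.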

The research team evaluated the proposed DoRM method using the Flickr-Faces-HQ Dataset (FFHQ). They applied a pre-trained StyleGAN2 model to enable stable training in 10-shot GDA. DoRM demonstrated superior synthesis quality and cross-domain consistency compared to other methods, especially in domains like Sketches and FFHQ-Babies. Quantitative metrics such as Fréchet Inception Distance (FID) and Identity similarity consistently showed DoRM outperforming competitors. The method also excelled in multi-domain and hybrid-domain generation, showcasing its ability to integrate diverse domains and synthesize novel hybrid outputs efficiently. Ablation studies confirmed the effectiveness of DoRM’s generator structure across various experimental setups.
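
For readers unfamiliar with FID, its standard definition is given below. It is the general formula for the metric, not something specific to this paper. The symbols are introduced here for illustration: \mu_r, \Sigma_r and \mu_g, \Sigma_g denote the mean and covariance of Inception features for real and generated images, respectively. Lower values indicate that the generated distribution is closer to the real one.

```latex
\mathrm{FID} = \lVert \mu_r - \mu_g \rVert_2^2
  + \operatorname{Tr}\!\left( \Sigma_r + \Sigma_g - 2\,(\Sigma_r \Sigma_g)^{1/2} \right)
```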

To conclude, the research team introduces DoRM, a streamlined generator structure tailored for GDA. DoRM incorporates a novel similarity-based structure loss to ensure robust cross-domain consistency. Through rigorous evaluations, the method demonstrates superior synthesis quality, diversity, and cross-domain consistency compared to existing approaches. Like the human brain, DoRM integrates knowledge across domains, enabling the generation of images in novel hybrid domains not encountered during training.

Check out the Paper. All credit for this research goes to the researchers of this project.

The post DoRM: A Brain-Inspired Approach to Generative Domain Adaptation appeared first on MarkTechPost.
