Bilevel optimization (BO) is a growing field of research, gaining attention for its success in machine learning tasks such as hyperparameter optimization, meta-learning, and reinforcement learning. BO involves a two-level structure in which the solution to the outer problem depends on the solution to the inner problem. Despite its flexibility and broad applicability, BO is rarely used at large scale: the mutual dependence between the upper- and lower-level problems introduces significant computational overhead, which hinders scalability.
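Concretely, a generic bilevel problem nests an inner minimization inside an outer one (standard textbook notation, not ScaleBiO's specific formulation):

```latex
\min_{x}\; F\bigl(x,\, y^{*}(x)\bigr)
\quad \text{subject to} \quad
y^{*}(x) \in \arg\min_{y}\; G(x, y)
```

Here x is the outer (upper-level) variable and y the inner (lower-level) variable; in the data-reweighting setting, x would roughly correspond to the data-source weights and y to the model parameters. Every outer update needs information about the inner solution y*(x), which is precisely what becomes expensive at LLM scale.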
This paper touches on two main areas of related work. The first is bilevel optimization, whose methods can be divided into two families: (a) approximate implicit differentiation (AID) methods and (b) iterative differentiation (ITD) methods. Both families follow a two-loop structure and incur substantial computational cost on large-scale problems. The second area is data reweighting, where the proportion of each training data source strongly affects the performance of large language models (LLMs). Various methods have been proposed to reweight data sources toward an optimal training mixture; however, none of them guarantees optimal data weights, and no scalable experiments have been reported on models larger than 30 billion parameters.
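To make the two-loop structure concrete, below is a minimal, hedged sketch of iterative differentiation (ITD) on a toy scalar problem in PyTorch; the functions, variable names, and hyperparameters are illustrative and not taken from the paper. The inner loop is unrolled and the outer gradient (hypergradient) is obtained by backpropagating through the whole unrolled trajectory, which is why memory and compute blow up for large models.

```python
import torch

# Toy bilevel problem (illustrative only):
#   inner:  w*(lam) = argmin_w  0.5 * (w - 1)^2 + lam * w^2
#   outer:  minimize (w*(lam) - 0.5)^2 over the outer variable lam
lam = torch.tensor(0.1, requires_grad=True)

def inner_loss(w, lam):
    return 0.5 * (w - 1.0) ** 2 + lam * w ** 2

def outer_loss(w):
    return (w - 0.5) ** 2

# ITD: unroll T inner gradient-descent steps, keeping the graph
# (create_graph=True) so the outer gradient can flow through them.
w = torch.tensor(0.0, requires_grad=True)
inner_lr, T = 0.1, 50
for _ in range(T):
    g = torch.autograd.grad(inner_loss(w, lam), w, create_graph=True)[0]
    w = w - inner_lr * g

# Hypergradient: d outer_loss / d lam through the unrolled inner loop.
hypergrad = torch.autograd.grad(outer_loss(w), lam)[0]
print(f"approx. w*(lam) = {w.item():.3f}, hypergradient = {hypergrad.item():.3f}")
```

AID methods instead estimate the hypergradient via the implicit function theorem at an approximate inner solution, avoiding full unrolling, but they still require a separate inner solve per outer step, so both families remain costly for large models.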
Researchers from The Hong Kong University of Science and Technology and the University of Illinois Urbana-Champaign have introduced ScaleBiO, a new bilevel optimization method capable of scaling to 34B LLMs on data reweighting tasks. ScaleBiO can run these large models on eight A40 GPUs by incorporating a memory-efficient training technique called LISA. This is the first time BO has been successfully applied to LLMs of this size, demonstrating its potential in real-world applications. ScaleBiO effectively optimizes the learned data weights and provides a convergence guarantee comparable to conventional first-order BO methods for smooth, strongly convex objectives.
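At a high level, LISA keeps memory down by making only a small subset of layers trainable at a time. The snippet below is a rough, hedged sketch of that layer-sampling pattern on a stack of dummy layers; it is not the official LISA or ScaleBiO implementation, and the layer counts and names are made up.

```python
import random
import torch.nn as nn

def sample_trainable_layers(layers, n_active=2, seed=None):
    """Freeze all layers except a randomly sampled subset.

    Rough sketch of layerwise sampling in the spirit of LISA; the real
    method's sampling schedule and which modules remain always-trainable
    differ and are not reproduced here.
    """
    rng = random.Random(seed)
    active = set(rng.sample(range(len(layers)), n_active))
    for i, layer in enumerate(layers):
        for p in layer.parameters():
            p.requires_grad = i in active
    return active

# Dummy "transformer" made of 12 identical blocks, for illustration only.
blocks = nn.ModuleList([nn.Linear(64, 64) for _ in range(12)])
print("trainable layers this period:", sample_trainable_layers(blocks, n_active=2, seed=0))
```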
Experiments on data reweighting show that ScaleBiO works well across model sizes, including GPT-2, LLaMA-3-8B, GPT-NeoX-20B, and Yi-34B, with BO effectively filtering out irrelevant data and selecting only informative samples. Two sets of experiments were conducted: (a) small-scale experiments to better understand ScaleBiO and (b) real-world application experiments to validate its effectiveness and scalability. To test ScaleBiO's effectiveness on small-scale language models, experiments were carried out with GPT-2 (124M) on three synthetic data tasks: data denoising, multilingual training, and instruction-following fine-tuning.
To evaluate ScaleBiO, 3,000 examples are sampled from each source for reweighting, and 10,000 examples are then sampled according to the final weights from BO to train the model. To demonstrate ScaleBiO's effectiveness, the learned sampling weights are applied to fine-tune the LLaMA-3-8B and LLaMA-3-70B models. The LLMs' instruction-following abilities are evaluated with MT-Bench using single-answer grading, a benchmark that challenges chat assistants with complex, multi-turn, open-ended questions and uses LLM-as-a-judge for evaluation. MT-Bench is notable for its alignment with human preferences and contains 80 questions spread uniformly across eight categories: Writing, Roleplay, Extraction, Reasoning, Math, Coding, Knowledge I (STEM), and Knowledge II (humanities/social science).
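A hedged sketch of how such a weight-driven sampling step could look in Python; the source names, weight values, and pool sizes below are purely illustrative and not the paper's actual mixture.

```python
import random

# Purely illustrative source names and learned weights (not from the paper).
learned_weights = {"web": 0.15, "code": 0.40, "math": 0.30, "chat": 0.15}
pools = {name: [f"{name}_example_{i}" for i in range(3000)] for name in learned_weights}

def build_training_set(pools, weights, total=10_000):
    """Draw roughly `total` examples, giving each source a share proportional
    to its learned weight (rounding may leave the count off by a few)."""
    dataset = []
    for name, pool in pools.items():
        n = round(weights[name] * total)
        dataset.extend(random.choices(pool, k=n))  # sample with replacement
    random.shuffle(dataset)
    return dataset

train_set = build_training_set(pools, learned_weights)
print(len(train_set), train_set[:3])
```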
In summary, the researchers have proposed ScaleBiO, a bilevel optimization instantiation capable of scaling to 34B LLMs on data reweighting tasks. ScaleBiO enables data reweighting for models with at least 7 billion parameters, providing an efficient data filtering and selection pipeline that boosts model performance on various tasks. Moreover, the sampling weights learned on LLaMA-3-8B transfer to larger models such as LLaMA-3-70B, yielding significant performance improvements. However, ScaleBiO's effectiveness in large-scale pre-training remains to be tested, as that would require extensive computational resources; demonstrating its success in large-scale fine-tuning settings is therefore an important first step.
Check out the Paper and GitHub. All credit for this research goes to the researchers of this project.