Researchers at Stanford Propose a Family of Representation Finetuning (ReFT) Methods That Operate on a Frozen Base Model and Learn Task-Specific Interventions on Hidden Representations

Pretrained language models (LMs) are commonly finetuned to adapt them to new domains or tasks. While finetuning allows adaptation to a variety of tasks with small amounts of in-domain data, it can be prohibitively expensive for large LMs.

Parameter-efficient finetuning (PEFT) methods offer a solution by updating only a fraction of the weights, reducing memory usage and training time. Adapters, a common PEFT approach, learn edits that can be added to a subset of model weights or operate alongside the frozen base model. Recent advancements like LoRA and its variants reduce the number of trainable parameters by using low-rank approximations during adapter training.
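To make the low-rank idea concrete, here is a minimal NumPy sketch of a LoRA-style layer. It is an illustration under stated assumptions, not the reference implementation: the function name `lora_forward`, the layer sizes, the rank, and the initialisation scale are all hypothetical choices made for this example.

```python
import numpy as np

def lora_forward(x, W, A, B, scale=1.0):
    """Apply a frozen weight W plus a trainable low-rank update B @ A to x."""
    # Base path uses the frozen weight; the adapter path goes through rank-r factors.
    return x @ W.T + scale * (x @ A.T) @ B.T

rng = np.random.default_rng(0)
d_in, d_out, rank = 64, 64, 4             # illustrative sizes (assumptions)

W = rng.normal(size=(d_out, d_in))        # frozen pretrained weight
A = rng.normal(size=(rank, d_in)) * 0.01  # trainable down-projection
B = np.zeros((d_out, rank))               # trainable up-projection, zero-initialised

x = rng.normal(size=(2, d_in))
y = lora_forward(x, W, A, B)

full_params = W.size                      # parameters updated by full finetuning
lora_params = A.size + B.size             # parameters updated by the adapter
print(full_params, lora_params)
```

Because `B` starts at zero, the adapted layer initially reproduces the frozen layer exactly, and only the two small factors would need gradients during training.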

However, a defining characteristic of current PEFT methods is that they modify weights rather than representations, even though prior research indicates that representations encode rich semantic information. In response, a team of researchers from Stanford and the Pr(Ai)2R Group has proposed Representation Finetuning (ReFT) methods.

Instead of adapting model weights, ReFT methods train interventions that manipulate a small fraction of model representations, steering model behavior at inference time to solve downstream tasks. The approach draws inspiration from recent work in LM interpretability, which intervenes on representations to identify causal mechanisms and steer model behavior.

One notable instance of the ReFT family is Low-rank Linear Subspace ReFT (LoReFT), which intervenes on hidden representations in the linear subspace spanned by a low-rank projection matrix. LoReFT builds directly on existing methods such as distributed alignment search (DAS). The researchers report state-of-the-art performance on various benchmarks while using significantly fewer parameters than traditional PEFT methods. These results suggest that ReFT methods may offer more efficient and effective alternatives to weight-based PEFTs and deserve further exploration across different model families and domains.
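The subspace intervention can be sketched in a few lines of NumPy. This assumes the edit takes the form h + Rᵀ(Wh + b − Rh), where R is a low-rank projection with orthonormal rows and W, b are a learned linear map; the function name `loreft_intervention`, the dimensions, and the random parameters are hypothetical stand-ins for illustration, not the authors' code.

```python
import numpy as np

def loreft_intervention(h, R, W, b):
    """Edit h only inside the subspace spanned by the rows of R.

    h: (d,) hidden representation
    R: (r, d) projection with orthonormal rows
    W: (r, d) and b: (r,) learned linear map giving the target subspace values
    """
    target = W @ h + b    # where the subspace coordinates should end up
    current = R @ h       # where they are now
    return h + R.T @ (target - current)

rng = np.random.default_rng(0)
d, r = 16, 4                                  # illustrative sizes (assumptions)

Q, _ = np.linalg.qr(rng.normal(size=(d, r)))  # Q has orthonormal columns
R = Q.T                                       # so R has orthonormal rows
W = rng.normal(size=(r, d))
b = rng.normal(size=r)
h = rng.normal(size=d)

h_new = loreft_intervention(h, R, W, b)

# Projector onto the orthogonal complement of the subspace.
P_orth = np.eye(d) - R.T @ R
print(np.allclose(P_orth @ h_new, P_orth @ h))
```

In this sketch the base model's hidden state is changed only along r directions: the subspace coordinates are moved to Wh + b, while everything orthogonal to the subspace passes through unchanged.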

Future research directions for ReFT include testing its effectiveness on other model families and on vision-language models, and automating hyperparameter search. Investigating more effective interventions for specific tasks and exploring the power of learned orthogonal subspaces are also areas of interest. ReFT builds on neural network interpretability research and in turn contributes insights back to the field, challenging traditional approaches that interpret individual neurons in isolation.

In terms of evaluation practices, it's essential to establish benchmarks that allow fair comparisons of PEFTs and ReFTs. These should include compute- or time-matched hyperparameter-tuning comparisons and should disallow tuning or model selection based on the test set, to mitigate overfitting and reflect real-world performance.
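As a rough illustration of the model-selection rule, the sketch below picks a configuration using validation scores only and reads the test score once, after the choice is made. The function name `select_config`, the configuration labels, and all scores are made up for this example and do not come from the paper.

```python
def select_config(results):
    """Choose the config with the best validation score; test scores play no part in the choice."""
    best = max(results, key=lambda name: results[name]["val"])
    # The test score is looked up only after the selection is final.
    return best, results[best]["test"]

# Hypothetical scores for three hyperparameter settings (made-up numbers).
results = {
    "rank=4": {"val": 0.81, "test": 0.79},
    "rank=8": {"val": 0.84, "test": 0.80},
    "rank=16": {"val": 0.83, "test": 0.82},
}

best_name, reported_test = select_config(results)
print(best_name, reported_test)
```

Note that the setting with the highest test score is not the one selected here; choosing it instead would amount to tuning on the test set, which is exactly what the recommended practice rules out.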

This post first appeared on MarkTechPost.
