
    Researchers at Stanford Propose a Family of Representation Finetuning (ReFT) Methods that Operate on a Frozen Base Model and Learn Task-Specific Interventions on Hidden Representations

    April 16, 2024

    Pretrained language models (LMs) are commonly adapted to new domains or tasks through finetuning. While finetuning allows adaptation to a wide range of tasks with only small amounts of in-domain data, it can be prohibitively expensive for large LMs.

    Parameter-efficient finetuning (PEFT) methods offer a solution by updating only a fraction of the weights, reducing memory usage and training time. Adapters, a common PEFT approach, learn edits that can be added to a subset of model weights or operate alongside the frozen base model. Recent advancements like LoRA and its variants reduce the number of trainable parameters by using low-rank approximations during adapter training.
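    To make the low-rank adapter idea concrete, here is a minimal, illustrative sketch of a LoRA-style layer in PyTorch (the class and parameter names are hypothetical and not taken from any particular library): a frozen pretrained projection is augmented with a trainable low-rank update, so only a small fraction of the parameters are trained.

    import torch
    import torch.nn as nn

    class LoRALinear(nn.Module):
        """Frozen linear layer plus a trainable low-rank edit (illustrative sketch)."""
        def __init__(self, d_in: int, d_out: int, r: int = 8, alpha: float = 16.0):
            super().__init__()
            self.base = nn.Linear(d_in, d_out, bias=False)
            self.base.weight.requires_grad_(False)               # frozen pretrained weight
            self.A = nn.Parameter(torch.randn(r, d_in) * 0.01)   # trainable factor (r x d_in)
            self.B = nn.Parameter(torch.zeros(d_out, r))         # trainable factor (d_out x r)
            self.scale = alpha / r

        def forward(self, x: torch.Tensor) -> torch.Tensor:
            # y = W x + scale * B A x: the low-rank edit rides on top of the frozen weight
            return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

    layer = LoRALinear(768, 768, r=8)
    trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
    print(trainable)  # 12,288 trainable parameters instead of 768 * 768 = 589,824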

    However, a key limitation of current PEFT methods is that they modify weights rather than representations, even though prior research indicates that representations encode rich semantic information. In response, a team of researchers from Stanford and the Pr(Ai)2R Group has proposed Representation Finetuning (ReFT) methods.

    Instead of adapting model weights, ReFT methods train interventions that manipulate a small fraction of model representations, steering model behavior to solve downstream tasks at inference time. The approach draws inspiration from recent work in LM interpretability, which intervenes on representations to identify causal mechanisms and steer model behavior.
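    As an illustration of what such an intervention can look like, here is a hedged sketch assuming a PyTorch-style transformer block whose output can be edited via a forward hook; the Intervention module and the wiring are hypothetical, not the authors' code. The base model stays frozen, and a small trainable module edits the hidden states flowing through a chosen layer.

    import torch
    import torch.nn as nn

    class Intervention(nn.Module):
        """A small trainable edit applied to hidden states; base model weights stay frozen."""
        def __init__(self, d_model: int):
            super().__init__()
            self.delta = nn.Linear(d_model, d_model)

        def forward(self, h: torch.Tensor) -> torch.Tensor:
            return h + self.delta(h)  # steer the representation instead of editing weights

    def attach(block: nn.Module, intervention: Intervention):
        # A forward hook rewrites the block's output on every pass, so the
        # intervention shapes behavior at inference time.
        def hook(_module, _inputs, output):
            hidden = output[0] if isinstance(output, tuple) else output
            edited = intervention(hidden)
            return (edited,) + output[1:] if isinstance(output, tuple) else edited
        return block.register_forward_hook(hook)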

    One notable instance of the ReFT family is the Low-rank Linear Subspace ReFT (LoReFT), which intervenes on hidden representations in the linear subspace spanned by a low-rank projection matrix. LoReFT builds directly on existing methods like distributed alignment search (DAS), demonstrating state-of-the-art performance on various benchmarks while using significantly fewer parameters than traditional PEFT methods. Their results suggest that ReFT methods offer more efficient and effective alternatives to weight-based PEFTs, deserving further exploration across different model families and domains.
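    Concretely, the paper's LoReFT edit rule takes the form Phi(h) = h + R^T (W h + b - R h), where R is a low-rank projection matrix with orthonormal rows and W, b are learned: the component of h lying in the subspace spanned by R is replaced by a learned target. The sketch below is a minimal, illustrative rendering of that rule; dimensions, initialization, and the (omitted) orthonormality constraint on R are simplified.

    import torch
    import torch.nn as nn

    class LoReFTIntervention(nn.Module):
        """Illustrative LoReFT-style edit: Phi(h) = h + R^T (W h + b - R h)."""
        def __init__(self, d_model: int, r: int):
            super().__init__()
            # In the original method R is kept orthonormal during training;
            # a plain parameter is used here for brevity.
            self.R = nn.Parameter(torch.randn(r, d_model) * 0.01)
            self.proj = nn.Linear(d_model, r)  # computes W h + b

        def forward(self, h: torch.Tensor) -> torch.Tensor:
            # Project h into the r-dimensional subspace, swap in the learned
            # target, and map the difference back to the original space.
            return h + (self.proj(h) - h @ self.R.T) @ self.R

    h = torch.randn(2, 16, 768)                # (batch, sequence, hidden)
    phi = LoReFTIntervention(d_model=768, r=4)
    print(phi(h).shape)                        # torch.Size([2, 16, 768])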

    Future research directions for ReFT include exploring its effectiveness on other model families and on vision-language models, automating the hyperparameter search, investigating more effective interventions for specific tasks, and exploring the power of learned orthogonal subspaces. ReFT also advances neural network interpretability research and contributes insights back to that field, challenging the traditional approach of interpreting individual neurons in isolation.

    In terms of evaluation practices, it is essential to establish benchmarks that allow fair comparisons between PEFTs and ReFTs: hyperparameter tuning should be compute- or time-matched, and tuning or model selection on the test set should be disallowed, to mitigate overfitting and give a realistic picture of real-world performance.

    Check out the Paper and GitHub. All credit for this research goes to the researchers of this project.
