
    This Machine Learning Paper Introduces PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models

    April 10, 2024

    Fine-tuning large language models (LLMs) improves task performance, strengthens instruction following, and can modify model behavior. However, the process is costly because of its high GPU memory requirements, especially for large models such as LLaMA 65B and GPT-3 175B. Consequently, various parameter-efficient fine-tuning (PEFT) methods have been proposed. One of them, low-rank adaptation (LoRA), reduces the number of trainable parameters and the memory usage without increasing inference latency.
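To make the parameter savings concrete, the sketch below counts trainable parameters for a single weight matrix and checks that a low-rank update can be folded back into the frozen weight, which is why no extra inference latency is needed. The layer size (4096 x 4096), the rank (16), and all function and variable names are illustrative assumptions, not values from the paper.

```python
import numpy as np

def full_param_count(d: int, k: int) -> int:
    """Trainable parameters when the whole d x k matrix is fine-tuned."""
    return d * k

def adapter_param_count(d: int, k: int, r: int) -> int:
    """Trainable parameters for two low-rank factors of shape (d, r) and (r, k)."""
    return r * (d + k)

# Hypothetical layer size and rank, chosen only for illustration.
d, k, r = 4096, 4096, 16
full = full_param_count(d, k)            # 16,777,216
adapter = adapter_param_count(d, k, r)   # 131,072
print(f"trainable fraction: {adapter / full:.2%}")  # about 0.78%

# Merging: after training, the product A0 @ B0 can be added to the frozen
# weight once, so inference uses a single matrix multiply as before.
rng = np.random.default_rng(0)
W0 = rng.standard_normal((8, 6))   # frozen weight (small toy size)
A0 = rng.standard_normal((8, 2))   # low-rank factor, shape (d, r)
B0 = rng.standard_normal((2, 6))   # low-rank factor, shape (r, k)
x0 = rng.standard_normal((3, 8))   # a batch of 3 inputs

separate = x0 @ W0 + (x0 @ A0) @ B0   # adapter kept as a side branch
merged = x0 @ (W0 + A0 @ B0)          # adapter folded into the weight
```

Here the two outputs `separate` and `merged` agree up to floating-point error, and the adapter trains fewer than 1% of the parameters of the full matrix.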

    Researchers from the Institute for Artificial Intelligence, Peking University; the School of Intelligence Science and Technology, Peking University; and the National Key Laboratory of General Artificial Intelligence introduce Principal Singular values and Singular vectors Adaptation (PiSSA). The method optimizes a reduced parameter space by representing a weight matrix in the model as the product of two trainable matrices plus a residual matrix for error correction. It uses Singular Value Decomposition (SVD) to factorize the matrix, initializes the two trainable matrices from the principal singular values and vectors, and keeps the residual matrix frozen during fine-tuning. PiSSA shares its architecture with LoRA and relies on the same hypothesis: that the change in model parameters during fine-tuning forms a low-rank matrix.
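The decomposition described above can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' implementation: the function name `pissa_init`, the matrix shape, and the rank are assumptions, and splitting each principal singular value as a square root between the two factors is one natural choice assumed here.

```python
import numpy as np

def pissa_init(W: np.ndarray, r: int):
    """Split W into trainable factors A, B (top-r part) and a frozen residual."""
    U, S, Vt = np.linalg.svd(W, full_matrices=False)
    sqrt_S = np.sqrt(S[:r])
    A = U[:, :r] * sqrt_S                  # (d, r): principal left vectors, scaled
    B = sqrt_S[:, None] * Vt[:r]           # (r, k): principal right vectors, scaled
    W_res = (U[:, r:] * S[r:]) @ Vt[r:]    # (d, k): remaining singular components
    return A, B, W_res

rng = np.random.default_rng(0)
W = rng.standard_normal((64, 48))          # hypothetical weight matrix
A, B, W_res = pissa_init(W, r=8)

# At initialization the adapter plus the residual reproduce W exactly
# (up to floating-point error), so the model's output is unchanged.
reconstruction_error = np.abs(A @ B + W_res - W).max()
```

During fine-tuning only `A` and `B` would receive gradient updates, while `W_res` stays fixed.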

    The PiSSA method applies SVD to the weight matrices in the self-attention and MLP layers. It initializes an adapter with the principal singular values and vectors, and a residual matrix with the remaining singular values and vectors. The adapter therefore encapsulates the model’s primary capabilities while using fewer parameters during fine-tuning. Because PiSSA shares LoRA’s architecture, it inherits LoRA’s benefits, such as fewer trainable parameters, quantization of the residual model, and easy deployment. At initialization, the adapter and the residual matrix together reproduce the original weights, so introducing PiSSA preserves the model’s capabilities, and since the residual holds only the smaller singular values, most of those capabilities sit in the adapter. Fine-tuning the adapter thus resembles full fine-tuning more closely than LoRA does. LoRA’s adapter starts from Gaussian noise and zeros, which may lead to wasted gradient steps and suboptimal outcomes.
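One way to see the difference between the two initializations is to compare the gradients at the very first training step. The sketch below is a toy setup under stated assumptions: a single linear layer, a squared-error loss, random data, and hypothetical shapes and names. With a LoRA-style start (one factor Gaussian, the other zero) one factor receives an exactly zero gradient, whereas with a PiSSA-style start both factors receive a signal immediately.

```python
import numpy as np

rng = np.random.default_rng(0)
d, k, r, n = 32, 24, 4, 16          # hypothetical sizes
W = rng.standard_normal((d, k))     # pretrained weight (toy stand-in)
x = rng.standard_normal((n, d))     # toy inputs
y = rng.standard_normal((n, k))     # toy targets

def adapter_grads(W_frozen, A, B):
    """Gradients of L = 0.5 * ||x (W_frozen + A B) - y||^2 w.r.t. A and B."""
    out = x @ W_frozen + (x @ A) @ B
    G = out - y                     # dL/d(out)
    dW = x.T @ G                    # gradient w.r.t. the effective weight
    return dW @ B.T, A.T @ dW       # (dL/dA, dL/dB)

# LoRA-style start: Gaussian down-projection, zero up-projection, W frozen.
A_lora = 0.01 * rng.standard_normal((d, r))
B_lora = np.zeros((r, k))
gA_lora, gB_lora = adapter_grads(W, A_lora, B_lora)

# PiSSA-style start: principal components in the adapter, residual frozen.
U, S, Vt = np.linalg.svd(W, full_matrices=False)
A_pissa = U[:, :r] * np.sqrt(S[:r])
B_pissa = np.sqrt(S[:r])[:, None] * Vt[:r]
W_res = W - A_pissa @ B_pissa
gA_pissa, gB_pissa = adapter_grads(W_res, A_pissa, B_pissa)

print("LoRA  |grad A|:", np.linalg.norm(gA_lora))   # exactly zero, since B is zero
print("PiSSA |grad A|:", np.linalg.norm(gA_pissa))  # nonzero from the first step
```

Both setups compute the same output as the original layer before any update; they differ only in which directions the first gradient step can move.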

    Comparative experiments between PiSSA, LoRA, and full-parameter fine-tuning on LLaMA 2-7B, Mistral-7B-v0.1, and Gemma-7B across various tasks favor PiSSA. Adapters initialized with principal singular values and vectors yield better outcomes, which indicates that directly fine-tuning the model’s principal components is beneficial. With a similar number of trainable parameters, PiSSA outperforms LoRA, converges faster, and fits the training data more closely. In addition, the Fast SVD technique lets PiSSA balance initialization speed and performance.
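The article does not say which algorithm the Fast SVD technique uses. As an illustration of how a truncated SVD can be computed faster than a full one, the sketch below assumes a standard randomized range-finder SVD with a few power iterations; the function name, the oversampling and iteration defaults, and the test matrix are all hypothetical.

```python
import numpy as np

def randomized_svd(W, r, n_oversample=10, n_iter=4, seed=0):
    """Approximate the top-r singular triplets of W with a randomized range finder."""
    rng = np.random.default_rng(seed)
    # Random probe of the column space, slightly wider than r for accuracy.
    Y = W @ rng.standard_normal((W.shape[1], r + n_oversample))
    for _ in range(n_iter):
        # Power iterations sharpen the separation between large and small
        # singular values; QR keeps the basis numerically stable.
        Y, _ = np.linalg.qr(Y)
        Z, _ = np.linalg.qr(W.T @ Y)
        Y = W @ Z
    Q, _ = np.linalg.qr(Y)                 # orthonormal basis for the range
    U_small, S, Vt = np.linalg.svd(Q.T @ W, full_matrices=False)
    U = Q @ U_small
    return U[:, :r], S[:r], Vt[:r]

# Toy matrix with 8 dominant directions plus small noise (hypothetical data).
rng = np.random.default_rng(1)
W = rng.standard_normal((64, 8)) @ rng.standard_normal((8, 48))
W += 0.01 * rng.standard_normal((64, 48))

U_fast, S_fast, Vt_fast = randomized_svd(W, r=8)
S_exact = np.linalg.svd(W, compute_uv=False)[:8]
max_rel_error = np.max(np.abs(S_fast - S_exact) / S_exact)
```

The trade-off sits in `n_iter` and `n_oversample`: fewer iterations make initialization faster, while more iterations bring the factors closer to those of an exact SVD.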

    In conclusion, the research introduces PiSSA, a parameter-efficient fine-tuning technique that utilizes singular value decomposition to initialize adapters with principal components. Through extensive experiments, PiSSA demonstrates superior fine-tuning performance compared to LoRA, offering a promising approach to PEFT. Analogous to slicing and re-baking the richest pizza slice, PiSSA efficiently identifies and fine-tunes the model’s principal components. Sharing LoRA’s architecture, PiSSA presents an easy-to-use and efficient initialization method.

    All credit for this research goes to the researchers of this project.

    The post This Machine Learning Paper Introduces PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models appeared first on MarkTechPost.
