
    Meet GRAPE: A Plug-and-Play Algorithm to Generalize Robot Policies via Preference Alignment

    December 8, 2024

    Vision-language-action (VLA) models have changed the field of robotic manipulation, showing significant potential for executing complex manipulation tasks across diverse environments. Despite these capabilities, VLA models still struggle to generalize to novel contexts, including different objects, environments, and semantic scenarios.

    The fundamental limitation stems from current training methodologies, particularly supervised fine-tuning (SFT), which predominantly relies on behavioral imitation through successful action rollouts. This approach restricts models from developing a comprehensive understanding of task objectives and potential failure mechanisms. Consequently, the models often struggle to adapt to nuanced variations and unforeseen scenarios, highlighting the critical need for more sophisticated training strategies.

    Previous research in robotic learning predominantly employed hierarchical planning strategies, with models like Code as Policies and EmbodiedGPT using large language models and vision-language models to generate high-level action plans. In these approaches, the language model produces an action sequence, and low-level controllers then resolve the local trajectory. However, such methods show significant limitations in skill adaptability and generalization across everyday robotic manipulation tasks.

    VLA models have pursued two primary approaches to action planning: action space discretization and diffusion models. The discretization approach, exemplified by OpenVLA, uniformly discretizes continuous actions into tokens while preserving the autoregressive language decoding objective. Diffusion models, conversely, generate action sequences through multiple denoising steps rather than producing a single action per step. Despite these structural differences, both families rely on supervised training using successful action rollouts, which constrains their generalizability to novel manipulation scenarios.
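
    To make the discretization idea concrete, here is a minimal sketch of uniform action binning. It is an illustration, not OpenVLA's actual tokenizer: the function names, the action range, and the default of 256 bins are assumptions chosen for the example.

```python
def discretize(value, low, high, n_bins=256):
    """Map one continuous action value to an integer bin index (a 'token').

    Hypothetical helper: `low`, `high`, and `n_bins` are illustrative
    assumptions, not values taken from the article.
    """
    clipped = min(max(value, low), high)        # keep the value inside the range
    scaled = (clipped - low) / (high - low)     # normalise to [0, 1]
    return min(int(scaled * n_bins), n_bins - 1)  # top edge falls in the last bin


def undiscretize(token, low, high, n_bins=256):
    """Map a bin index back to the centre of its bin."""
    width = (high - low) / n_bins
    return low + (token + 0.5) * width


if __name__ == "__main__":
    for action in (-1.0, 0.0, 0.37, 1.0):
        token = discretize(action, -1.0, 1.0)
        print(action, token, undiscretize(token, -1.0, 1.0))
```

    Decoding to the bin centre bounds the round-trip error at half a bin width, which is the precision a policy gives up when it predicts actions as tokens.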

    Researchers from UNC Chapel Hill, the University of Washington, and the University of Chicago introduce GRAPE (Generalizing Robot Policy via Preference Alignment), an approach designed to address these limitations in VLA model training. GRAPE uses trajectory-wise preference optimization (TPO), which aligns robotic policies by implicitly modeling rewards from both successful and unsuccessful trial sequences. Because the policy also learns from failures, it can generalize across diverse manipulation tasks better than a policy trained only to imitate successes.
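
    The article does not give the TPO objective, so the following is a sketch under a stated assumption: that TPO resembles a DPO-style preference loss applied to whole trajectories, where a trajectory's log-probability is the sum of its per-step action log-probabilities. The function name `tpo_style_loss` and the `beta` value are hypothetical.

```python
import math


def tpo_style_loss(policy_win, policy_lose, ref_win, ref_lose, beta=0.1):
    """DPO-style preference loss over a (preferred, rejected) trajectory pair.

    Each argument is a list of per-step action log-probabilities under the
    policy being trained (`policy_*`) or a frozen reference policy (`ref_*`).
    This is an assumed form, not the formula from the GRAPE paper.
    """
    # Log-probability of a trajectory = sum of its step log-probabilities.
    win_ratio = sum(policy_win) - sum(ref_win)
    lose_ratio = sum(policy_lose) - sum(ref_lose)
    margin = beta * (win_ratio - lose_ratio)
    # -log(sigmoid(margin)), written in a numerically stable way.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


if __name__ == "__main__":
    ref = [-0.5, -0.5]
    print(tpo_style_loss(ref, ref, ref, ref))                    # policy == reference
    print(tpo_style_loss([-0.1, -0.2], [-1.0, -1.0], ref, ref))  # policy prefers the winner
```

    In this form the loss falls as the policy raises the likelihood of the preferred trajectory relative to the rejected one, measured against the reference policy. No explicit reward model is trained, which is what "implicitly modeling rewards" usually means.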

    At the core of GRAPE’s approach is a decomposition strategy that breaks complex manipulation tasks into multiple independent stages. A large vision model proposes critical keypoints for each stage and associates them with spatial-temporal constraints. These constraints are customizable, so the same procedure can align a policy with different manipulation objectives, including task completion, robot interaction safety, and operational cost-efficiency.
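
    One way to picture how stage-wise constraints could yield preference pairs is sketched below. Everything in it is hypothetical: the keypoint layout, the two constraint functions, the weights, and the exponential aggregation are assumptions for illustration and are not taken from the GRAPE paper or its code.

```python
import math


def reach_cost(keypoints):
    """Task-completion constraint: distance from gripper to target object."""
    return math.dist(keypoints["gripper"], keypoints["object"])


def height_penalty(keypoints, min_height=0.05):
    """Safety constraint: penalise a gripper that drops below `min_height`."""
    return max(0.0, min_height - keypoints["gripper"][2])


def trajectory_score(stages, constraints_per_stage):
    """Aggregate weighted constraint costs over all stages into one score.

    `stages` holds one keypoint dict per stage; `constraints_per_stage` holds
    a list of (weight, cost_function) pairs for each stage. Lower total cost
    gives a score closer to 1.
    """
    total = 0.0
    for keypoints, constraints in zip(stages, constraints_per_stage):
        total += sum(weight * fn(keypoints) for weight, fn in constraints)
    return math.exp(-total)


def pick_preference_pair(trajectories, constraints_per_stage):
    """Return (best, worst) trajectories by score, as a chosen/rejected pair."""
    ranked = sorted(
        trajectories,
        key=lambda t: trajectory_score(t, constraints_per_stage),
        reverse=True,
    )
    return ranked[0], ranked[-1]
```

    Changing the weights, for example raising the weight on `height_penalty`, shifts which trajectories are preferred. In this sketch that is how one pipeline could be steered toward safety or efficiency rather than task completion alone.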

    The research team conducted comprehensive evaluations of GRAPE across simulation and real-world robotic environments to validate its performance and generalizability. In simulation environments like Simpler-Env and LIBERO, GRAPE demonstrated remarkable capabilities, outperforming existing models Octo-SFT and OpenVLA-SFT by significant margins. Specifically, in Simpler-Env, GRAPE exceeded the performance of previous models by an average of 24.48% and 13.57%, respectively, across various generalization aspects including subject, physical, and semantic domains.


    The real-world experiments further supported GRAPE’s effectiveness across diverse task scenarios. In in-domain tasks, GRAPE achieved a 67.5% success rate, a 22.5% improvement over OpenVLA-SFT and well above Octo-SFT. In the more challenging generalization tasks, it maintained superior results across visual, action, and language grounding scenarios, with a total average success rate of 52.3%, a 19% advancement over existing approaches.

    This research introduces GRAPE as a response to a central weakness of VLA models: their limited generalizability and adaptability across manipulation tasks. Through trajectory-level policy alignment, GRAPE learns from both successful and unsuccessful trial sequences. Its spatiotemporal constraint mechanism allows robotic policies to be aligned with diverse objectives, including safety, efficiency, and task completion. The experimental findings show substantial performance improvements in both in-domain and unseen task environments.


    Check out the Paper and GitHub Page. All credit for this research goes to the researchers of this project.


    The post Meet GRAPE: A Plug-and-Play Algorithm to Generalize Robot Policies via Preference Alignment appeared first on MarkTechPost.

