Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      May 14, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      May 14, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      May 14, 2025

      How To Prevent WordPress SQL Injection Attacks

      May 14, 2025

      I test a lot of AI coding tools, and this stunning new OpenAI release just saved me days of work

      May 14, 2025

      How to use your Android phone as a webcam when your laptop’s default won’t cut it

      May 14, 2025

      The 5 most customizable Linux desktop environments – when you want it your way

      May 14, 2025

      Gen AI use at work saps our motivation even as it boosts productivity, new research shows

      May 14, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      Strategic Cloud Partner: Key to Business Success, Not Just Tech

      May 14, 2025
      Recent

      Strategic Cloud Partner: Key to Business Success, Not Just Tech

      May 14, 2025

      Perficient’s “What If? So What?” Podcast Wins Gold at the 2025 Hermes Creative Awards

      May 14, 2025

      PIM for Azure Resources

      May 14, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      Windows 11 24H2’s Settings now bundles FAQs section to tell you more about your system

      May 14, 2025
      Recent

      Windows 11 24H2’s Settings now bundles FAQs section to tell you more about your system

      May 14, 2025

      You can now share an app/browser window with Copilot Vision to help you with different tasks

      May 14, 2025

      Microsoft will gradually retire SharePoint Alerts over the next two years

      May 14, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»This AI Paper from Google DeepMind Introduces Enhanced Learning Capabilities with Many-Shot In-Context Learning

    This AI Paper from Google DeepMind Introduces Enhanced Learning Capabilities with Many-Shot In-Context Learning

    April 28, 2024

    In-context learning (ICL) in large language models (LLMs) utilizes input-output examples to adapt to new tasks without altering the underlying model architecture. This method has transformed how models handle various tasks by learning from direct examples provided during inference. The problem at hand is the limitation of a few-shot ICL in handling intricate tasks. These tasks often demand a deep comprehension that few-shot learning cannot provide, as it operates under the restriction of minimal input data. This scenario could be better for applications requiring detailed analysis and decision-making based on extensive data sets, such as advanced reasoning or language translation.

    Existing research in the field of ICL has primarily focused on the few-shot learning capabilities of models like GPT-3, which adapt to new tasks with a limited set of examples. Studies have investigated the performance limits of these models within small context windows, revealing constraints in task complexity and scalability. The development of models with larger context windows, such as Gemini 1.5 Pro, which supports up to 1 million tokens, represents a significant evolution. This expansion allows for exploring many-shot ICL, greatly enhancing the models’ ability to process and learn from a larger dataset.

    Researchers from Google Deepmind have introduced a shift toward many-shot ICL, leveraging larger context windows of models like Gemini 1.5 Pro. This move from few-shot to many-shot learning utilizes increased input examples, significantly enhancing model performance and adaptability across complex tasks. The unique aspect of this methodology is the integration of Reinforced ICL and Unsupervised ICL, which reduce reliance on human-generated content by employing model-generated data and domain-specific inputs alone.

    In terms of methodology, the Gemini 1.5 Pro model was employed to handle an expanded array of input-output examples, supporting up to 1 million tokens in its context window. This allowed the exploration of Reinforced ICL, where the model generates and evaluates its rationales for correctness, and Unsupervised ICL, which challenges the model to operate without explicit rationales. The experiments were conducted across diverse domains, including machine translation, summarization, and complex reasoning tasks, using datasets like MATH for mathematical problem-solving and FLORES for machine translation tasks to test and validate the effectiveness of the many-shot ICL framework.

    The results from implementing many-shot ICL demonstrate significant performance enhancements. In machine translation tasks, the Gemini 1.5 Pro model outperformed previous benchmarks, achieving a 4.5% increase in accuracy for Kurdish and a 1.5% increase for Tamil translations compared to earlier models. In mathematical problem-solving, the MATH dataset showed a 35% improvement in solution accuracy when using many-shot settings. These quantitative outcomes validate the effectiveness of many-shot ICL in enhancing the model’s adaptability and accuracy across diverse and complex cognitive tasks.

    In conclusion, the research marks a significant step forward in ICL by transitioning from few-shot to many-shot ICL using the Gemini 1.5 Pro model. By expanding the context window and integrating innovative methodologies like Reinforced and Unsupervised ICL, the study has successfully enhanced model performance across various tasks, including machine translation and mathematical problem-solving. These advancements not only improve the adaptability and efficiency of large language models but also pave the way for more sophisticated applications in AI.

    Check out the Paper. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter. Join our Telegram Channel, Discord Channel, and LinkedIn Group.

    If you like our work, you will love our newsletter..

    Don’t Forget to join our 40k+ ML SubReddit

    The post This AI Paper from Google DeepMind Introduces Enhanced Learning Capabilities with Many-Shot In-Context Learning appeared first on MarkTechPost.

    Source: Read More 

    Facebook Twitter Reddit Email Copy Link
    Previous ArticleLMSYS ORG Introduces Arena-Hard: A Data Pipeline to Build High-Quality Benchmarks from Live Data in Chatbot Arena, which is a Crowd-Sourced Platform for LLM Evals
    Next Article Okta Warns of Unprecedented Surge in Proxy-Driven Credential Stuffing Attacks

    Related Posts

    Common Vulnerabilities and Exposures (CVEs)

    CVE-2025-47785 – Emlog SQL Injection and Remote Code Execution

    May 15, 2025
    Common Vulnerabilities and Exposures (CVEs)

    CVE-2025-47928 – Spotify/Github Spotipy Untrusted Code Execution Vulnerability

    May 15, 2025
    Leave A Reply Cancel Reply

    Continue Reading

    Kingdom Come: Deliverance 2 interactive map — Every quest, point of interest, and more

    News & Updates

    Facteur is mail-merge software

    Linux

    Microsoft Store’s new individual download for video games is an awesome concept, but what does it mean?

    Operating Systems

    How to Get Investors for Your Software Product Business: Part 1

    Development

    Highlights

    This smart ring for fitness freaks nearly beats Oura. Here’s how

    July 29, 2024

    The Ultrahuman Ring Air is a subscription-free smart ring that rivals the Oura Ring with…

    GitHub Availability Report: December 2024

    January 15, 2025

    Website Performance Optimization

    January 31, 2025

    Creating and verifying stable AI-controlled systems in a rigorous and flexible way

    July 27, 2024
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.