    01.AI Introduces Yi-1.5-34B Model: An Upgraded Version of Yi with a High-Quality Corpus of 500B Tokens and Fine-Tuned on 3M Diverse Fine-Tuning Samples

    May 18, 2024

    01.AI’s recently introduced Yi-1.5-34B model is positioned as a major improvement over its predecessors and as a mid-sized option that sits between Llama 3 8B and Llama 3 70B. It promises better performance in areas such as multimodal capability, code generation, and logical reasoning. This article looks at how the model was built, how it performs, and what it may mean for the AI community.

    Yi-1.5-34B builds on Yi-34B, which was recognized for its strong performance and served as an unofficial benchmark in the AI community. The upgrade comes from additional training and optimization: the model was continually pre-trained on a further 500 billion tokens of high-quality data. Added to the 3.1 trillion tokens used for the original Yi pretraining, that comes to roughly 3.6 trillion tokens in total.

    Yi-1.5-34B is designed as a middle ground: it aims to stay closer to the computational cost of Llama 3 8B-sized models while approaching the broad capabilities of 70B-sized models. The intent is that the model can carry out intricate tasks without the enormous computational resources generally associated with the largest models.
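
To make the resource trade-off concrete, here is a minimal back-of-the-envelope sketch. It assumes the nominal parameter counts implied by the model names and counts only the memory needed to hold the weights; activations, KV cache, and framework overhead are ignored, so real requirements will be higher. The function and dictionary names are illustrative and not part of any 01.AI tooling.

```python
# Rough weight-memory estimate: parameters x bytes per parameter.
# Assumption: parameter counts are the nominal sizes in the model names.
# Activations, KV cache, and framework overhead are deliberately ignored.

BYTES_PER_PARAM = {"fp16": 2, "int8": 1, "int4": 0.5}

NOMINAL_PARAMS = {
    "Llama 3 8B": 8e9,
    "Yi-1.5-34B": 34e9,
    "Llama 3 70B": 70e9,
}


def weight_memory_gb(num_params: float, dtype: str = "fp16") -> float:
    """Return the approximate size of the weights in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    for name, params in NOMINAL_PARAMS.items():
        sizes = ", ".join(
            f"{dtype}: {weight_memory_gb(params, dtype):.0f} GB"
            for dtype in BYTES_PER_PARAM
        )
        print(f"{name:12s} {sizes}")
```

Under these assumptions, a 34B model in 16-bit precision needs about 68 GB for its weights, roughly half of what a 70B model needs and about four times what an 8B model needs.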

    On benchmarks, the Yi-1.5-34B model has shown strong performance. Its large vocabulary is said to help it solve logic puzzles and grasp nuanced ideas. One of its most notable reported properties is its ability to produce longer code snippets than GPT-4, which points to its usefulness in practical applications. Users who have tested it through demos have praised its speed and efficiency, making it an appealing option for a variety of AI-driven tasks.

    The Yi family includes both language and multimodal models, extending beyond text to vision-language tasks. This is done by combining a vision transformer encoder with the chat language model and aligning the visual representations with the language model’s semantic space. The Yi models are also not limited to standard context lengths: with lightweight continual pretraining, they have been extended to handle long contexts of up to 200,000 tokens.
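
The article does not say how the visual representations are aligned with the language model. One common approach, assumed here purely for illustration, is to project each image-patch feature from the vision encoder into the language model's embedding dimension and prepend the results to the text token embeddings. The sketch below uses plain Python lists and random weights standing in for learned ones; all names and dimensions are hypothetical.

```python
import random
from typing import List

Vector = List[float]
Matrix = List[Vector]


def make_projection(d_vision: int, d_model: int, seed: int = 0) -> Matrix:
    """Create a random (d_vision x d_model) matrix standing in for learned weights."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 0.02) for _ in range(d_model)] for _ in range(d_vision)]


def project(patches: Matrix, weights: Matrix) -> Matrix:
    """Map each patch feature (length d_vision) to a vector of length d_model."""
    d_model = len(weights[0])
    return [
        [sum(p * weights[i][j] for i, p in enumerate(patch)) for j in range(d_model)]
        for patch in patches
    ]


def build_input_sequence(
    patches: Matrix, text_embeddings: Matrix, weights: Matrix
) -> Matrix:
    """Prepend projected image patches to the text token embeddings."""
    return project(patches, weights) + text_embeddings


if __name__ == "__main__":
    d_vision, d_model = 4, 3                   # toy sizes, not real model dimensions
    patches = [[0.1] * d_vision for _ in range(2)]       # 2 image patches
    text_embeddings = [[0.0] * d_model for _ in range(5)]  # 5 text tokens
    sequence = build_input_sequence(
        patches, text_embeddings, make_projection(d_vision, d_model)
    )
    print(len(sequence), "positions of width", len(sequence[0]))
```

After projection, image patches and text tokens share one vector space, so the language model can attend over both in a single sequence.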

    One of the main reasons for the Yi models’ effectiveness is the careful data engineering behind them. The models were pretrained on 3.1 trillion tokens from English and Chinese corpora. To ensure high-quality inputs, this data was selected using a cascaded deduplication and quality filtering pipeline.
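
The idea of a cascaded pipeline can be sketched as a chain of stages, each passing its survivors to the next. The example below is a toy illustration and not 01.AI's actual pipeline: the three stages (exact deduplication by hash, near-duplicate removal by Jaccard similarity over word 3-grams, and simple heuristic quality checks) and all thresholds are assumptions chosen to keep the code short.

```python
import hashlib
import re
from typing import Iterable, Iterator, List, Set


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivially different copies match."""
    return re.sub(r"\s+", " ", text.strip().lower())


def exact_dedup(docs: Iterable[str]) -> Iterator[str]:
    """Stage 1: drop documents whose normalized text has been seen before."""
    seen = set()
    for doc in docs:
        digest = hashlib.sha256(normalize(doc).encode("utf-8")).hexdigest()
        if digest not in seen:
            seen.add(digest)
            yield doc


def shingles(text: str, n: int = 3) -> Set[tuple]:
    """Return the set of word n-grams used to compare two documents."""
    words = normalize(text).split()
    if len(words) < n:
        return {tuple(words)}
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def near_dedup(docs: Iterable[str], threshold: float = 0.8) -> Iterator[str]:
    """Stage 2: drop documents too similar (Jaccard) to one already kept.

    This pairwise comparison is O(n^2) and only suitable for illustration;
    large-scale pipelines typically use approximate methods such as MinHash.
    """
    kept: List[Set[tuple]] = []
    for doc in docs:
        current = shingles(doc)
        is_duplicate = any(
            len(current & other) / len(current | other) >= threshold
            for other in kept
        )
        if not is_duplicate:
            kept.append(current)
            yield doc


def quality_filter(
    docs: Iterable[str], min_words: int = 5, max_symbol_ratio: float = 0.3
) -> Iterator[str]:
    """Stage 3: drop very short documents and ones dominated by symbols."""
    for doc in docs:
        if len(doc.split()) < min_words:
            continue
        symbols = sum(1 for ch in doc if not (ch.isalnum() or ch.isspace()))
        if symbols / len(doc) > max_symbol_ratio:
            continue
        yield doc


def clean_corpus(docs: Iterable[str]) -> List[str]:
    """Run the stages in order; each stage only sees what the previous one kept."""
    return list(quality_filter(near_dedup(exact_dedup(docs))))
```

Ordering the cheap exact-match stage first means the more expensive similarity comparison runs on fewer documents, which is the main practical reason to cascade the stages.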

    Fine-tuning enhanced the models’ capabilities further. For the original Yi models, machine learning engineers iteratively refined and validated a small-scale instruction dataset of fewer than 10,000 instances; Yi-1.5 was fine-tuned on 3 million diverse samples. This hands-on approach to data verification is intended to make the fine-tuned models precise and dependable.

    With its combination of strong performance and practicality, the Yi-1.5-34B model is a notable development in artificial intelligence. Its capacity for complex tasks such as multimodal integration, code generation, and logical reasoning makes it a flexible tool for both researchers and practitioners.

    Check out the Model Card and Demo. All credit for this research goes to the researchers of this project.

    The post 01.AI Introduces Yi-1.5-34B Model: An Upgraded Version of Yi with a High-Quality Corpus of 500B Tokens and Fine-Tuned on 3M Diverse Fine-Tuning Samples appeared first on MarkTechPost.
