
    TRANSMI: A Machine Learning Framework to Create Baseline Models Adapted for Transliterated Data from Existing Multilingual Pretrained Language Models (mPLMs) without Any Training

    May 20, 2024

    The increasing availability of digital text in diverse languages and scripts presents a significant challenge for natural language processing (NLP). Multilingual pre-trained language models (mPLMs) often struggle to handle transliterated data effectively, leading to performance degradation. Addressing this issue is crucial for improving cross-lingual transfer learning and ensuring accurate NLP applications across various languages and scripts, which is essential for global communication and information processing.

    Current methods, including models like XLM-R and Glot500, perform well with text in their original scripts but struggle significantly with transliterated text due to ambiguities and tokenization issues. These limitations degrade their performance in cross-lingual tasks, making them less effective when handling text converted into a common script such as Latin. The inability of these models to accurately interpret transliterations poses a significant barrier to their utility in multilingual settings. 
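
    The failure mode is easy to reproduce. The sketch below (an illustration of the problem, not code from the paper; `unidecode` stands in here for whatever romanizer a pipeline uses) shows how a tokenizer trained on original scripts fragments romanized text into short, semantically weak Latin subwords:

    ```python
    # Illustrative sketch of the tokenization problem with transliterated text.
    # `unidecode` is a generic Latin romanizer used here as a stand-in; the
    # paper's transliteration tooling may differ.
    from unidecode import unidecode
    from transformers import AutoTokenizer

    tok = AutoTokenizer.from_pretrained("xlm-roberta-base")

    original = "Москва — столица России"   # Cyrillic input
    romanized = unidecode(original)        # e.g. "Moskva -- stolitsa Rossii"

    # The tokenizer learned subwords for the original script, so the
    # romanized form tends to split into uninformative Latin fragments.
    print(tok.tokenize(original))
    print(tok.tokenize(romanized))
    ```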

    Researchers from the Center for Information and Language Processing, LMU Munich, and Munich Center for Machine Learning (MCML) introduced TRANSMI, a framework designed to enhance mPLMs for transliterated data without requiring additional training. TRANSMI modifies existing mPLMs using three merge modes—Min-Merge, Average-Merge, and Max-Merge—to incorporate transliterated subwords into their vocabularies, thereby addressing transliteration ambiguities and improving cross-lingual task performance.
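
    On one plausible reading of these merge modes (the paper defines the exact aggregation rule; treat this as a hedged, schematic sketch with a hypothetical `merge_embeddings` helper), each new transliterated subword receives an embedding combined element-wise from the embeddings of the original-script subwords that map to it:

    ```python
    import numpy as np

    def merge_embeddings(source_vecs, mode="max"):
        """Combine embeddings of original-script subwords that all
        transliterate to the same Latin subword. The element-wise
        min/mean/max here mirrors the Min-/Average-/Max-Merge naming;
        the paper's precise rule may differ, so this is a schematic."""
        stacked = np.stack(source_vecs)      # shape: (n_sources, dim)
        if mode == "min":
            return stacked.min(axis=0)
        if mode == "avg":
            return stacked.mean(axis=0)
        if mode == "max":
            return stacked.max(axis=0)
        raise ValueError(f"unknown merge mode: {mode!r}")
    ```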

    TRANSMI integrates new subwords tailored for transliterated data into the mPLMs’ vocabularies, particularly excelling in the Max-Merge mode for high-resource languages. The framework is tested using datasets that include transliterated versions of texts in scripts such as Cyrillic, Arabic, and Devanagari, showing that TRANSMI-modified models outperform their original versions in various tasks like sentence retrieval, text classification, and sequence labeling. This modification ensures that models retain their original capabilities while adapting to the nuances of transliterated text, thus enhancing their overall performance in multilingual NLP applications.
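
    In Hugging Face terms, the integration step might look like the following minimal sketch (the token mapping is a toy placeholder, and `xlm-roberta-base` stands in for whichever mPLM is being adapted):

    ```python
    import torch
    from transformers import AutoModel, AutoTokenizer

    tok = AutoTokenizer.from_pretrained("xlm-roberta-base")
    model = AutoModel.from_pretrained("xlm-roberta-base")

    # Hypothetical mapping from a new Latin subword to the original-script
    # subwords that transliterate to it; the real mapping would come from
    # running a transliterator over the existing vocabulary.
    new_subwords = {"moskva": ["москва"]}

    old_emb = model.get_input_embeddings().weight.detach().clone()

    tok.add_tokens(list(new_subwords))
    model.resize_token_embeddings(len(tok))  # existing rows are preserved

    with torch.no_grad():
        for latin, sources in new_subwords.items():
            src_ids = [tok.convert_tokens_to_ids(s) for s in sources]
            merged = old_emb[src_ids].max(dim=0).values  # Max-Merge-style seed
            model.get_input_embeddings().weight[
                tok.convert_tokens_to_ids(latin)
            ] = merged
    ```

    Note that no gradient updates are involved: the model keeps its original weights and only the embedding matrix gains rows, which is consistent with the "without any training" claim.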

    The datasets used to validate TRANSMI span a variety of scripts, providing a comprehensive assessment of its effectiveness. For example, the FURINA model using Max-Merge mode shows significant improvements in sequence labeling tasks, demonstrating TRANSMI’s capability to handle phonetic scripts and mitigate issues arising from transliteration ambiguities. This approach ensures that mPLMs can process a wide range of languages more accurately, enhancing their utility in multilingual contexts.

    The results show that TRANSMI-modified models consistently achieve higher accuracy than their unmodified counterparts, with the FURINA Max-Merge variant again posting the clearest gains in sequence labeling across languages and scripts. These improvements position TRANSMI as a practical tool for making multilingual NLP models more robust to transliterated data and more accurate in cross-lingual processing.

    In conclusion, TRANSMI addresses the critical challenge of improving mPLMs’ performance on transliterated data by modifying existing models without additional training. This framework enhances mPLMs’ ability to process transliterations, leading to significant improvements in cross-lingual tasks. TRANSMI offers a practical and innovative solution to a complex problem, providing a strong foundation for further advancements in multilingual NLP and improving global communication and information processing.

    Check out the Paper and GitHub. All credit for this research goes to the researchers of this project.

    The post TRANSMI: A Machine Learning Framework to Create Baseline Models Adapted for Transliterated Data from Existing Multilingual Pretrained Language Models (mPLMs) without Any Training appeared first on MarkTechPost.
