    Researchers at FPT Software AI Center Introduce XMainframe: A State-of-the-Art Large Language Model (LLM) Specialized for Mainframe Modernization to Address the $100B Legacy Code Modernization

    August 13, 2024

    Introduction

    Mainframe operating systems, originating in the 1940s, remain essential to critical sectors such as finance and government. However, the vast legacy of COBOL code—estimated by IBM to be around 200 to 220 billion lines—needs to be migrated to modern platforms and rewritten in contemporary programming languages. This task is monumental, with the cost of rewriting COBOL code using human resources estimated at 32 to 50 cents per line, presenting a $100 billion challenge. The time required for a complete rewrite by human programmers is still uncertain. These systems are often perceived as outdated, requiring significant maintenance and modernization. Addressing this challenge demands innovative tools capable of understanding and interacting with legacy codebases, a long-standing obstacle for the industry. The advent of Large Language Models (LLMs) offers a potential solution to this enduring problem. However, there are several concerns when applying LLMs to mainframe modernization.
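
    As a rough sanity check on those figures, the short calculation below (a minimal sketch; the line counts and per-line costs are simply the estimates quoted above) multiplies the estimated amount of legacy COBOL by the per-line rewrite cost:

        # Back-of-envelope estimate of the COBOL rewrite cost quoted above.
        # Inputs are the cited ranges: 200-220 billion lines at 32-50 cents per line.
        lines_low, lines_high = 200e9, 220e9   # estimated lines of legacy COBOL
        cost_low, cost_high = 0.32, 0.50       # USD per line for a human rewrite

        low_total = lines_low * cost_low       # most optimistic combination
        high_total = lines_high * cost_high    # most pessimistic combination

        print(f"Estimated rewrite cost: ${low_total / 1e9:.0f}B to ${high_total / 1e9:.0f}B")
        # Prints roughly $64B to $110B, i.e. on the order of the $100 billion cited above.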

    Challenges in Using LLMs for Mainframe Modernization:

      1. Limited Training on Mainframe Languages: While existing LLMs are trained on a wide range of languages, both natural and programming, they lack sufficient training on languages used in mainframes, such as COBOL. The relatively small amount of COBOL code available online leads to inadequate understanding and reasoning in these models. Additionally, organizations tend to keep their mainframe codebases private because of the high security demands of financially critical sectors, further limiting the available training data.

    2. Lack of Proper Benchmarks: The absence of comprehensive documentation and clear business goals for mainframe systems makes it difficult to develop benchmarks to evaluate the quality of LLMs in this domain. This hinders the ability to measure their effectiveness and reliability in mainframe modernization tasks.

    3. Complexity Beyond Code Generation: LLMs for coding are primarily trained for code generation, the most common use case in software engineering tasks. However, mainframe modernization involves more than just generating COBOL code—organizations aim to migrate their systems to other languages. Thus, LLMs must possess knowledge beyond code generation to effectively modernize these systems.

    XMainframe

    To address these challenges, researchers at FPT Software AI Center have developed XMainframe, a state-of-the-art LLM built specifically for mainframe legacy systems and COBOL codebases. The work includes an extensive data collection pipeline that produces high-quality training datasets, significantly improving XMainframe’s performance in this specialized domain. The researchers also introduce MainframeBench, a comprehensive benchmark that evaluates mainframe knowledge through multiple-choice questions, question answering, and COBOL code summarization. Empirical evaluations show that XMainframe consistently outperforms existing state-of-the-art LLMs on these tasks, achieving 30% higher accuracy than DeepSeek-Coder on multiple-choice questions, doubling the BLEU score of Mixtral-Instruct 8x7B on question answering, and scoring six times higher than GPT-3.5 on COBOL summarization. This work underscores XMainframe’s potential to drive significant advances in managing and modernizing legacy systems, ultimately improving productivity and saving time for software developers.
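
    To make the intended workflow concrete, the sketch below shows how one might ask an instruction-tuned code LLM such as XMainframe to summarize a small COBOL program through the Hugging Face transformers API. This is a minimal illustration: the checkpoint identifier and the COBOL snippet are assumed placeholders, not the authors’ published artifacts.

        # Hypothetical sketch: querying an instruction-tuned code LLM (e.g. XMainframe)
        # for COBOL code summarization. The model ID below is a placeholder, not the
        # official checkpoint name; substitute whatever the authors have released.
        from transformers import AutoModelForCausalLM, AutoTokenizer

        MODEL_ID = "your-org/xmainframe-instruct"  # placeholder, assumed checkpoint ID

        tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
        model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

        cobol_snippet = """
               IDENTIFICATION DIVISION.
               PROGRAM-ID. PAYROLL.
               PROCEDURE DIVISION.
                   COMPUTE GROSS-PAY = HOURS-WORKED * HOURLY-RATE.
                   DISPLAY GROSS-PAY.
                   STOP RUN.
        """

        prompt = f"Summarize what the following COBOL program does:\n{cobol_snippet}\nSummary:"
        inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
        outputs = model.generate(**inputs, max_new_tokens=80)

        # Decode only the newly generated tokens, skipping the prompt.
        print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))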

    The original article includes figures illustrating the data collection steps and reporting results on multiple-choice questions, question answering, and COBOL code summarization.
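
    For a sense of how those metrics are computed, the sketch below scores hypothetical model outputs with multiple-choice accuracy and corpus BLEU (via the sacrebleu package). The predictions and references here are invented for illustration; MainframeBench’s actual format and evaluation scripts are described in the paper.

        # Illustrative scoring of MainframeBench-style tasks. All data below is made up.
        import sacrebleu

        # Multiple-choice questions: exact-match accuracy on predicted option letters.
        mcq_predictions = ["B", "A", "D", "C"]
        mcq_gold        = ["B", "A", "C", "C"]
        accuracy = sum(p == g for p, g in zip(mcq_predictions, mcq_gold)) / len(mcq_gold)

        # Question answering / COBOL summarization: corpus BLEU against reference texts.
        hypotheses = ["The program computes gross pay from hours worked and hourly rate."]
        references = [["This COBOL program calculates gross pay as hours worked times the hourly rate."]]
        bleu = sacrebleu.corpus_bleu(hypotheses, references)

        print(f"MCQ accuracy: {accuracy:.0%}")
        print(f"BLEU: {bleu.score:.1f}")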

    Check out the Paper and GitHub. All credit for this research goes to the researchers of this project.

    Thanks to FPT Software AI Center for the thought leadership and resources behind this article; FPT Software AI Center supported this content.

    This article appeared first on MarkTechPost.
