SemiKong: An Open Source Foundation Model for Semiconductor Manufacturing Process

Semiconductors power a wide range of electronic devices and drive development across telecommunications, automotive, healthcare, renewable energy, and IoT industries. In semiconductor manufacturing and design, the two main phases, front-end-of-line (FEOL) and back-end-of-line (BEOL), each present unique challenges. LLMs are trained on vast amounts of text data using self-supervised learning techniques and can capture rich domain knowledge. They can also help with tasks such as design rule checking, layout generation, and design space exploration in Integrated Circuit (IC) design. By learning from large datasets of IC layouts and design rules, LLMs can generate new designs that adhere to specified constraints and optimize for desired performance metrics. However, most models are general-purpose and lack knowledge specific to the semiconductor industry, which poses unique problems such as the complex physics and chemistry of semiconductor devices and processes.

Currently, LLMs are general-purpose models that, despite their power, lack the specialized knowledge needed for tasks specific to the semiconductor industry. Artificial Intelligence (AI) has already improved semiconductor manufacturing, advancing mask optimization and hotspot detection through machine learning, deep reinforcement learning, and datasets like LithoBench. Within the industry, domain-specific LLMs such as ChipGPT and ChatEDA have outperformed general models in tasks like code generation, debugging, and chatbot assistance. LLMs have also been used to evaluate natural language generation tasks, with expert feedback used to improve benchmarks and address the challenges of complex domain-specific evaluation.

To bring the power of LLMs to the semiconductor industry, researchers from Aitomatic Inc., FPT Software AI Center, and Tokyo Electron Ltd proposed SemiKong, the first industry-specific LLM for the semiconductor domain, which provides a foundation for developing customized proprietary models. SemiKong 1.0 focuses on building a foundational model with an expert-level understanding of etching problems. The approach involves training models on comprehensive domain-specific data, with the training process divided into two stages: pretraining and fine-tuning.

There are very few high-quality datasets for the semiconductor domain. To address this, the researchers assembled a large-scale text-based dataset focused on semiconductor concepts and etching problems. It includes pretraining data from technical books, papers, and patents, along with instruction data featuring 50,000 questions. GPT-4o-mini handled formatting, while GPT-4o generated and answered some of the questions. The SemiKong model was trained in three steps. First, it was pre-trained from Llama3 checkpoints to learn about the semiconductor industry. Then, it went through supervised fine-tuning to improve its ability to handle tasks like answering questions and reasoning. Finally, the model was fine-tuned with quantization to make it ready for real-world use, gaining deeper knowledge about semiconductor manufacturing along the way. Training ran on 8 NVIDIA A100 80GB GPUs for better performance and training speed.
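To make the quantization step more concrete, here is a minimal sketch of what quantizing weights means and why it matters for deployment. The article does not say which quantization scheme SemiKong uses, so the symmetric per-tensor int8 rounding below is an assumption chosen purely for illustration, and the function names `quantize_int8`, `dequantize`, and `weight_memory_gb` are hypothetical.

```python
def quantize_int8(weights):
    """Symmetric per-tensor int8 quantization (illustrative assumption,
    not SemiKong's documented scheme). Returns (integer codes, scale)."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    # Round each weight to the nearest integer step and clamp to [-127, 127].
    codes = [max(-127, min(127, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Map integer codes back to approximate floating-point weights."""
    return [c * scale for c in codes]


def weight_memory_gb(num_params, bits_per_weight):
    """Storage needed for the weights alone, in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    toy_weights = [0.5, -1.27, 0.0, 1.0]  # made-up values, not model weights
    codes, scale = quantize_int8(toy_weights)
    print("codes:", codes)
    print("restored:", dequantize(codes, scale))

    # Weight storage for a 70B-parameter model at different precisions.
    for bits in (16, 8, 4):
        print(bits, "bits:", weight_memory_gb(70_000_000_000, bits), "GB")
```

The memory arithmetic shows the motivation: at 16 bits per weight, a 70B-parameter model needs about 140 GB for weights alone, more than a single 80 GB A100 can hold, while 4-bit weights need about 35 GB.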

The evaluation of the SemiKong model compared its performance across several criteria: Clarity and Directness (C&D), Practicality and Immediate Usability (PIU), Efficiency and Brevity (E&B), Logical Flow and Coherence (LFC), Expert-to-Expert Communication (EEC), and Use of Examples and Specificity (UES). Experiments showed that fine-tuning alone did not significantly improve performance, as domain-specific knowledge was crucial. When pretraining was combined with fine-tuning, performance improved. Larger models with 70B parameters outperformed smaller ones, with the SemiKong 70B model excelling in all criteria.
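A comparison across these six criteria can be sketched as a simple per-criterion average. This is a hedged illustration, not the researchers' evaluation code: the rating scale, the helper names `average_by_criterion` and `wins_all_criteria`, and the scores below are all placeholders invented for the example and are not results from the paper.

```python
from statistics import mean

# Criterion abbreviations as listed in the article.
CRITERIA = ["C&D", "PIU", "E&B", "LFC", "EEC", "UES"]


def average_by_criterion(ratings):
    """Average each criterion over a list of per-response rating dicts."""
    return {c: mean(r[c] for r in ratings) for c in CRITERIA}


def wins_all_criteria(scores_a, scores_b):
    """True if model A scores strictly higher than model B on every criterion."""
    return all(scores_a[c] > scores_b[c] for c in CRITERIA)


if __name__ == "__main__":
    # Placeholder ratings for two hypothetical models (not real results).
    model_a = [{c: 4 for c in CRITERIA}, {c: 5 for c in CRITERIA}]
    model_b = [{c: 3 for c in CRITERIA}, {c: 4 for c in CRITERIA}]

    avg_a = average_by_criterion(model_a)
    avg_b = average_by_criterion(model_b)
    print(avg_a)
    print(avg_b)
    print("A wins on all criteria:", wins_all_criteria(avg_a, avg_b))
```

Keeping the scores separate per criterion, rather than collapsing them into one number, makes it possible to state claims such as "excels in all criteria" directly.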

In summary, the proposed method offers a robust way to integrate LLM technology with the semiconductor industry, and it performed better than the open-source foundation model. However, SemiKong is in its initial phase, and significant work remains. This work of integrating the latest LLM technology in manufacturing can serve as a baseline for future research in the semiconductor domain.

Check out the Paper and GitHub Page. All credit for this research goes to the researchers of this project.

The post SemiKong: An Open Source Foundation Model for Semiconductor Manufacturing Process appeared first on MarkTechPost.
