Many people believe that intelligence and compression go hand in hand, and some experts go so far as to say the two are essentially the same thing. Recent progress in LLMs and their impact on AI makes this idea even more appealing, prompting researchers to view language modeling through the lens of compression. In theory, any prediction model can be converted into a lossless compressor (for example, via arithmetic coding) and vice versa. Since LLMs have proven remarkably effective at compressing data, language modeling can be regarded as a form of compression.
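The link between prediction and compression follows from standard source coding: a model that assigns probability p to the next symbol can encode it in roughly -log2(p) bits with an arithmetic coder, so a sharper predictor is automatically a better compressor. The following minimal Python sketch (not from the paper; the function name is our own) illustrates the idea.

```python
import math

def code_length_bits(next_token_probs):
    """Total bits an idealized arithmetic coder needs to losslessly encode a
    sequence, given the probability the model assigned to each actual symbol.

    Each symbol predicted with probability p costs about -log2(p) bits,
    so better next-token prediction directly means fewer bits."""
    return sum(-math.log2(p) for p in next_token_probs)

# Toy illustration: a confident model compresses the same sequence
# into far fewer bits than an uncertain one.
print(code_length_bits([0.9, 0.8, 0.95, 0.85]))   # ~0.78 bits
print(code_length_bits([0.25, 0.3, 0.2, 0.25]))   # ~8.06 bits
```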
For the present LLM-based AI paradigm, this makes the claim that compression leads to intelligence all the more compelling. However, despite extensive theoretical debate, there is still little empirical evidence of a causal link between compression and intelligence. If a language model can losslessly encode a text corpus in fewer bits, does that indicate greater intelligence? That is the question a new study by Tencent and The Hong Kong University of Science and Technology aims to answer empirically. The study takes a pragmatic approach to the concept of “intelligence,” focusing on the model’s ability to perform downstream tasks rather than straying into philosophical or even contradictory ground. Intelligence is assessed along three main abilities: knowledge and common sense, coding, and mathematical reasoning.
More precisely, the team measured how effectively different LLMs compress external raw corpora in each relevant domain (e.g., GitHub code for coding ability). They then evaluated the same models on a range of downstream tasks and used the average benchmark scores to quantify their domain-specific intelligence.
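In practice, compression efficiency is typically reported as bits per character (BPC) on the raw corpus: the cross-entropy an LLM assigns to the text equals the length of an ideal arithmetic code, so no actual compressor has to be run. The sketch below is a simplified illustration using the Hugging Face transformers API; the specific models, corpora, and context-window chunking are those of the paper, not this snippet, and the helper name is hypothetical.

```python
import math
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

def bits_per_character(model_name: str, text: str) -> float:
    """Approximate compression efficiency of a causal LM on raw text,
    as bits per character (lower means the model compresses better).

    Note: a real evaluation would chunk long corpora to fit the model's
    context window; this sketch assumes the text fits in one pass."""
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)
    enc = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        # Passing labels makes the model return mean cross-entropy in nats/token.
        loss = model(**enc, labels=enc["input_ids"]).loss.item()
    total_bits = loss * enc["input_ids"].numel() / math.log(2)  # nats -> bits
    return total_bits / len(text)

# Hypothetical usage on a code-domain sample:
# print(bits_per_character("gpt2", open("github_sample.txt").read()))
```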
Based on experiments with 30 public LLMs and 12 benchmarks, the researchers establish a striking result: the downstream ability of LLMs is almost linearly related to their compression efficiency, with a Pearson correlation coefficient of about -0.95 in each evaluated intelligence domain. Importantly, the linear relationship also holds for most individual benchmarks. Recent and parallel investigations have examined the relationship between benchmark scores and compression-equivalent metrics such as validation loss, but only within a single model series, where checkpoints share most configurations, including architecture, tokenizer, and training data.
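To make the reported relationship concrete, the correlation in question is simply the Pearson coefficient between per-model compression efficiency (bits per character) and average benchmark score within a domain. The numbers below are invented purely for illustration; only the roughly -0.95 figure comes from the paper.

```python
import numpy as np

# Hypothetical per-model values: compression efficiency (bits per character,
# lower is better) and average benchmark score in one intelligence domain.
bpc = np.array([0.95, 0.88, 0.80, 0.72, 0.65])
avg_score = np.array([31.0, 38.5, 46.0, 55.5, 62.0])

# Pearson correlation is negative: fewer bits per character (better
# compression) goes with higher downstream scores.
r = np.corrcoef(bpc, avg_score)[0, 1]
print(f"Pearson r = {r:.2f}")

# A simple linear fit captures the near-linear trend the study reports.
slope, intercept = np.polyfit(bpc, avg_score, 1)
print(f"score ~ {slope:.1f} * BPC + {intercept:.1f}")
```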
This study is the first to show that intelligence in LLMs correlates linearly with compression regardless of model size, tokenizer, context window length, or pretraining data distribution. By demonstrating a universal linear association between the two, the research supports the long-standing belief that superior compression signifies greater intelligence. Compression efficiency is also a useful unsupervised metric for LLMs, since the evaluation corpora can be easily refreshed to avoid overfitting and test contamination. Given its linear correlation with the models’ abilities, the findings support compression efficiency as a stable, flexible, and reliable metric for assessing LLMs. To make it easy for researchers to collect and update compression corpora in the future, the team has open-sourced their data collection and processing pipelines.
The researchers highlight a few caveats to their study. First, they restrict their attention to base models, since fine-tuned models are not suitable as general-purpose text compressors. Nevertheless, they argue that there are intriguing connections between the compression efficiency of a base model and the benchmark scores of its fine-tuned counterparts that deserve further investigation. Furthermore, the findings may only hold for sufficiently trained models and may not extend to LLMs whose assessed abilities have not yet emerged. The team’s work opens up exciting avenues for future research, encouraging the community to explore these questions further.
Check out the Paper and GitHub. All credit for this research goes to the researchers of this project.