This AI Paper from Princeton and the University of Warwick Proposes a Novel Artificial Intelligence Approach to Enhance the Utility of LLMs as Cognitive Models

Scientists studying Large Language Models (LLMs) have found that LLMs perform similarly to humans in cognitive tasks, often making judgments and decisions that deviate from rational norms, such as risk and loss aversion. LLMs also exhibit human-like biases and errors, particularly in probability judgments and arithmetic operations tasks. These similarities suggest the potential for using LLMs as models of human cognition. However, significant challenges remain, including the extensive data LLMs are trained on and the unclear origins of these behavioural similarities.

The suitability of LLMs as models of human cognition is debated due to several issues. LLMs are trained on much larger datasets than humans and may have been exposed to test questions, leading to artificial enhancements in human-like behaviors through value alignment processes. Despite these challenges, fine-tuning LLMs, such as the LLaMA-1-65B model, on human choice datasets has improved accuracy in predicting human behavior. Prior research has also highlighted the importance of synthetic datasets in enhancing LLM capabilities, particularly in problem-solving tasks like arithmetic. Pretraining on such datasets can significantly improve performance in predicting human decisions.

Researchers from Princeton University and Warwick University propose enhancing the utility of LLMs as cognitive models by (i) utilizing computationally equivalent tasks that both LLMs and rational agents must master for cognitive problem-solving and (ii) examining task distributions required for LLMs to exhibit human-like behaviors. Applied to decision-making, specifically risky and intertemporal choice, Arithmetic-GPT, an LLM pretrained on an ecologically valid arithmetic dataset, predicts human behavior better than many traditional cognitive models. This pretraining suffices to align LLMs closely with human decision-making.

Researchers address challenges in using LLMs as cognitive models by defining a data generation algorithm for creating synthetic datasets and gaining access to neural activation patterns crucial for decision-making. A small LM with a Generative Pretrained Transformer (GPT) architecture, named Arithmetic-GPT, was pretrained on arithmetic tasks. Synthetic datasets reflecting realistic probabilities and values were generated for training. Pretraining details include a context length of 26, batch size of 2048, and a learning rate of 10â»Â³. Human decision-making datasets in risky and intertemporal choices were reanalyzed to evaluate the modelâ€™s performance.

The experimental results show that embeddings from the Arithmetic-GPT model, pretrained on ecologically valid synthetic datasets, most accurately predict human choices in decision-making tasks. Logistic regression using embeddings as independent variables and human choice probabilities as the dependent variable demonstrates higher adjusted RÂ² values compared to other models, including LLaMA-3-70bInstruct. Benchmarks against behavioral models and MLPs reveal that while MLPs generally outperform other models, Arithmetic-GPT embeddings still provide a strong correspondence with human data, particularly in intertemporal choice tasks. Robustness is confirmed with 10-fold cross-validation.

The study concludes that LLMs, specifically Arithmetic-GPT pretrained on ecologically valid synthetic datasets, can closely model human cognitive behaviors in decision-making tasks, outperforming traditional cognitive models and some advanced LLMs like LLaMA-3-70bInstruct. This approach addresses key challenges by using synthetic datasets and neural activation patterns. The findings underscore the potential of LLMs as cognitive models, providing valuable insights for both cognitive science and machine learning, with robustness verified through extensive validation techniques.

Check out theÂ Paper. All credit for this research goes to the researchers of this project. Also,Â donâ€™t forget to follow us onÂ Twitter.Â Join ourÂ Telegram Channel,Â Discord Channel, andÂ LinkedIn Group.

If you like our work, you will love ourÂ newsletter..

Donâ€™t Forget to join ourÂ 43k+ ML SubReddit | Also, check out our AI Events Platform

Large language models show promise as cognitive models. The behaviors they produce often mirror human behaviors, suggesting we might gain insight into human cognition by studying LLMs. But why do LLMs behave like humans at all?https://t.co/0nTwekSGlj

â€” Zhu Jian-Qiao (@JQ_Zhu) May 30, 2024

The post This AI Paper from Princeton and the University of Warwick Proposes a Novel Artificial Intelligence Approach to Enhance the Utility of LLMs as Cognitive Models appeared first on MarkTechPost.

Source: Read MoreÂ

Sunshine And March Vibes (2025 Wallpapers Edition)

The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

How To Fix Largest Contentful Paint Issues With Subpart Analysis

How To Prevent WordPress SQL Injection Attacks

Intel’s latest Arc graphics driver is ready for DOOM: The Dark Ages, launching for Premium Edition owners on PC today

NVIDIA’s drivers are causing big problems for DOOM: The Dark Ages, but some fixes are available

Capcom breaks all-time profit records with 10% income growth after Monster Hunter Wilds sold over 10 million copies in a month

Microsoft plans to lay off 3% of its workforce, reportedly targeting management cuts as it changes to fit a “dynamic marketplace”

A cross-platform Markdown note-taking application

A cross-platform Markdown note-taking application

AI Assistant Demo & Tips for Enterprise Projects

Celebrating Global Accessibility Awareness Day (GAAD)

Intel’s latest Arc graphics driver is ready for DOOM: The Dark Ages, launching for Premium Edition owners on PC today

Intel’s latest Arc graphics driver is ready for DOOM: The Dark Ages, launching for Premium Edition owners on PC today

NVIDIA’s drivers are causing big problems for DOOM: The Dark Ages, but some fixes are available

Capcom breaks all-time profit records with 10% income growth after Monster Hunter Wilds sold over 10 million copies in a month

This AI Paper from Princeton and the University of Warwick Proposes a Novel Artificial Intelligence Approach to Enhance the Utility of LLMs as Cognitive Models

Nmap 7.96 Launches with Lightning-Fast DNS and 612 Scripts

CVE-2025-4732 – TOTOLINK A3002R/A3002RU HTTP POST Request Handler Buffer Overflow

Top 10 Data Extraction Tools in 2024

Composable Martech: Experience Builders

Revisit Large-Scale Image–Caption Data in Pre-training Multimodal Foundation Models

Qilin Becomes Top Ransomware Group Amid RansomHub Uncertainty

Boost Productivity with Custom Command Shortcuts Using Linux Aliases

Understanding Cyberconflict in the Geopolitical Context

Laravel News 2024 Recap

Distribution Release: Archcraft 2025.04.24

This AI Paper from Princeton and the University of Warwick Proposes a Novel Artificial Intelligence Approach to Enhance the Utility of LLMs as Cognitive Models

Related Posts