Machine Learning

AutoSculpt: A Pattern-based Automated Pruning Framework Designed to Enhance Efficiency and Accuracy by Leveraging Graph Learning and Deep Reinforcement Learning

December 30, 2024

Deploying Deep Neural Networks (DNNs) on edge devices, such as smartphones and autonomous vehicles, remains a significant challenge due to…

Development

Researchers from MIT, Sakana AI, OpenAI and Swiss AI Lab IDSIA Propose a New Algorithm Called Automated Search for Artificial Life (ASAL) to Automate the Discovery of Artificial Life Using Vision-Language Foundation Models

December 30, 2024

Artificial Life (ALife) research explores the emergence of lifelike behaviors through computational simulations, providing a unique framework to study “life…

Development

CMU Researchers Introduce TNNGen: An AI Framework that Automates Design of Temporal Neural Networks (TNNs) from PyTorch Software Models to Post-Layout Netlists

December 30, 2024

Designing neuromorphic sensory processing units (NSPUs) based on Temporal Neural Networks (TNNs) is a highly challenging task due to the…

Development

Advancing Parallel Programming with HPC-INSTRUCT: Optimizing Code LLMs for High-Performance Computing

December 29, 2024

LLMs have revolutionized software development by automating coding tasks and bridging the natural language and programming gap. While highly effective…

Development

This AI Paper Introduces XMODE: An Explainable Multi-Modal Data Exploration System Powered by LLMs for Enhanced Accuracy and Efficiency

December 29, 2024

Researchers are focusing increasingly on creating systems that can handle multi-modal data exploration, which combines structured and unstructured data. This…

Development

B-STAR: A Self-Taught AI Reasoning Framework for LLMs

December 29, 2024

A direct correlation exists between an LLM’s training corpus quality and its capabilities. Consequently, researchers have invested a great deal…

Development

NeuralOperator: A New Python Library for Learning Neural Operators in PyTorch

December 29, 2024

Operator learning is a transformative approach in scientific computing. It focuses on developing models that map functions to other functions,…

Development

Researchers from Tsinghua University Propose ReMoE: A Fully Differentiable MoE Architecture with ReLU Routing

December 29, 2024

The development of Transformer models has significantly advanced artificial intelligence, delivering remarkable performance across diverse tasks. However, these advancements often…

Development

This AI Paper Proposes TALE: An AI Framework that Reduces Token Redundancy in Chain-of-Thought (CoT) Reasoning by Incorporating Token Budget Awareness

December 29, 2024

Large Language Models (LLMs) have shown significant potential in reasoning tasks, using methods like Chain-of-Thought (CoT) to break down complex…

Development

This AI Paper Explores How Formal Systems Could Revolutionize Math LLMs

December 28, 2024

Formal mathematical reasoning represents a significant frontier in artificial intelligence, addressing fundamental logic, computation, and problem-solving challenges. This field focuses…

Development

Hypernetwork Fields: Efficient Gradient-Driven Training for Scalable Neural Network Optimization

December 28, 2024

Hypernetworks have gained attention for their ability to efficiently adapt large models or train generative models of neural representations. Despite…

Development

aiXplain Introduces a Multi-AI Agent Autonomous Framework for Optimizing Agentic AI Systems Across Diverse Industries and Applications

December 28, 2024

Agentic AI systems have revolutionized industries by enabling complex workflows through specialized agents working in collaboration. These systems streamline operations,…

Development

YuLan-Mini: A 2.42B Parameter Open Data-efficient Language Model with Long-Context Capabilities and Advanced Training Techniques

December 28, 2024

Large language models (LLMs) built using transformer architectures heavily depend on pre-training with large-scale data to predict sequential tokens. This…

Development

Quasar-1: A Rigorous Mathematical Framework for Temperature-Guided Reasoning in Language Models

December 28, 2024

Large language models (LLMs) encounter significant difficulties in performing efficient and logically consistent reasoning. Existing methods, such as CoT prompting,…

Development

Collective Monte Carlo Tree Search (CoMCTS): A New Learning-to-Reason Method for Multimodal Large Language Models

December 28, 2024

In today’s world, Multimodal large language models (MLLMs) are advanced systems that process and understand multiple input forms, such as…

Development

Camel-AI Open Sourced OASIS: A Next Generation Simulator for Realistic Social Media Dynamics with One Million Agents

December 28, 2024

Social media platforms have revolutionized human interaction, creating dynamic environments where millions of users exchange information, form communities, and influence…

Development

Meet SemiKong: The World’s First Open-Source Semiconductor-Focused LLM

December 27, 2024

The semiconductor industry enables advancements in consumer electronics, automotive systems, and cutting-edge computing technologies. The production of semiconductors involves sophisticated…

Development

Google DeepMind Introduces Differentiable Cache Augmentation: A Coprocessor-Enhanced Approach to Boost LLM Reasoning and Efficiency

December 27, 2024

Large language models (LLMs) are integral to solving complex problems across language processing, mathematics, and reasoning domains. Enhancements in computational…

Development

Unveiling Privacy Risks in Machine Unlearning: Reconstruction Attacks on Deleted Data

December 27, 2024

Machine unlearning is driven by the need for data autonomy, allowing individuals to request the removal of their data’s influence…

Development

Microsoft and Tsinghua University Researchers Introduce Distilled Decoding: A New Method for Accelerating Image Generation in Autoregressive Models without Quality Loss

December 27, 2024

Autoregressive (AR) models have changed the field of image generation, setting new benchmarks in producing high-quality visuals. These models break…