Knowledge Distillation (KD) has become a key technique in the field of Artificial Intelligence, especially in the context of Large…
Machine Learning
Deep learning has revolutionized various domains, with Transformers emerging as a dominant architecture. However, Transformers must improve the processing of…
Traditional molecular representations, primarily focused on covalent bonds, have neglected crucial aspects like delocalization and non-covalent interactions. Existing machine learning…
Recently, our CMU-MATH team proudly clinched 2nd place in the Artificial Intelligence Mathematical Olympiad (AIMO) out of 1,161 participating teams,…
The development of autonomous agents capable of performing complex tasks across various environments has gained significant traction in artificial intelligence…
Multimodal generative models represent an exciting frontier in artificial intelligence, focusing on integrating visual and textual data to create systems…
The European Artificial Intelligence Act came into force on August 1, 2024. It is a significant milestone in the global…
Unit testing aims to identify and resolve bugs at the earliest stages by testing individual components or units of code.…
Unstructured file types include about 80% of all company data, such as spreadsheets and PDFs. PDFs constitute the de facto…
Managing and optimizing API calls to various Large Language Model (LLM) providers can be complex, especially when dealing with different…
Traditional biomedical AI models are often specialized and need more flexibility, making them less effective for real-world applications requiring integrating…
Large Language Models (LLMs) have demonstrated exceptional performance on isolated code tasks, such as HumanEval and MBPP, but they struggle…
RGB-D cameras have a difficult time accurately capturing the depth of transparent objects because of the optical effects of reflection…
A key goal in the development of AI is the creation of general-purpose assistants utilizing Large Multimodal Models (LMMs). Building…
Introduction: Code Large Language Models (CodeLLMs) have demonstrated remarkable proficiency in generating code. However, they struggle with complex software engineering…
The Qwen Team has recently released the Qwen 2-Math series. This release, encompassing several model variants tailored for distinct applications,…
Human reward-guided learning is often modeled using simple RL algorithms that summarize past experiences into key variables like Q-values, representing…
Parler-TTS has emerged as a robust text-to-speech (TTS) library, offering two powerful models: Parler-TTS Large v1 and Parler-TTS Mini v1.…
Abacus.AI, a prominent player in AI, has recently unveiled its latest innovation: LiveBench AI. This new tool is designed to…
Migel Tissera has recently unveiled two groundbreaking projects on Hugging Face: Trinity-2-Codestral-22B and Tess-3-Mistral-Large-2-123B. These projects represent a leap forward…