Machine Learning

Enhancing Mobile Ad Hoc Network Security: A Hybrid Deep Learning Model for Flooding Attack Detection

February 6, 2025

Ad hoc networks are decentralized, self-configuring networks where nodes communicate without fixed infrastructure. They are commonly used in military, disaster…

Reinforcement Learning for Long-Horizon Interactive LLM Agents

February 5, 2025

Interactive digital agents (IDAs) leverage APIs of stateful digital environments to perform tasks in response to user requests. While IDAs…

Machine Learning

Build a multi-interface AI assistant using Amazon Q and Slack with Amazon CloudFront clickable references from an Amazon S3 bucket

February 5, 2025

There is consistent customer feedback that AI assistants are the most useful when users can interface with them within the…

Machine Learning

Google DeepMind Achieves State-of-the-Art Data-Efficient Reinforcement Learning (RL) with Improved Transformer World Models

February 5, 2025

Reinforcement Learning (RL) trains agents to maximize rewards by interacting with an environment. Online RL alternates between taking actions, collecting…

Machine Learning

Enhancing LLM Capabilities with NeMo Guardrails on Amazon SageMaker JumpStart

February 5, 2025

As large language models (LLMs) become increasingly integrated into customer-facing applications, organizations are exploring ways to leverage their natural language…

Machine Learning

Meet Satori: A New AI Framework for Advancing LLM Reasoning through Deep Thinking without a Strong Teacher Model

February 5, 2025

Large Language Models (LLMs) have demonstrated notable reasoning capabilities in mathematical problem-solving, logical inference, and programming. However, their effectiveness is…

Machine Learning

OfferUp improved local results by 54% and relevance recall by 27% with multimodal search on Amazon Bedrock and Amazon OpenSearch Service

February 5, 2025

This post is co-written with Andrés Vélez Echeveri and Sean Azlin from OfferUp. OfferUp is an online, mobile-first marketplace designed…

Machine Learning

Trellix lowers cost, increases speed, and adds delivery flexibility with cost-effective and performant Amazon Nova Micro and Amazon Nova Lite models

February 5, 2025

This post is co-written with Martin Holste from Trellix. Security teams are dealing with an evolving universe of cybersecurity threats.…

Machine Learning

ByteDance Proposes OmniHuman-1: An End-to-End Multimodality Framework Generating Human Videos based on a Single Human Image and Motion Signals

February 5, 2025

Despite progress in AI-driven human animation, existing models often face limitations in motion realism, adaptability, and scalability. Many models struggle…

Machine Learning

Meet Crossfire: An Elastic Defense Framework for Graph Neural Networks under Bit Flip Attacks

February 5, 2025

Graph Neural Networks (GNNs) have found applications in various domains, such as natural language processing, social network analysis, recommendation systems,…

Machine Learning

Creating an AI Agent-Based System with LangGraph: Putting a Human in the Loop

February 5, 2025

In our previous tutorial, we built an AI agent capable of answering queries by surfing the web and added persistence…

Machine Learning

Meta AI Introduces VideoJAM: A Novel AI Framework that Enhances Motion Coherence in AI-Generated Videos

February 5, 2025

Despite recent advancements, generative video models still struggle to represent motion realistically. Many existing models focus primarily on pixel-level reconstruction,…

Machine Learning

Zep AI Introduces a Smarter Memory Layer for AI Agents Outperforming the MemGPT in the Deep Memory Retrieval (DMR) Benchmark

February 4, 2025

The development of transformer-based large language models (LLMs) has significantly advanced AI-driven applications, particularly conversational agents. However, these models face…

Machine Learning

NYU Researchers Introduce WILDCHAT-50M: A Large-Scale Synthetic Dataset for Efficient LLM Post-Training

February 4, 2025

Large language model (LLM) post-training focuses on refining model behavior and enhancing capabilities beyond their initial training phase. It includes…

Machine Learning

Deep Agent Released R1-V: Reinforcing Super Generalization in Vision-Language Models with Cost-Effective Reinforcement Learning to Outperform Larger Models

February 4, 2025

Vision-language models (VLMs) face a critical challenge in achieving robust generalization beyond their training data while maintaining computational resources and…

Machine Learning

Fine-Tuning Llama 3.2 3B Instruct for Python Code: A Comprehensive Guide with Unsloth

February 4, 2025

In this tutorial, we’ll walk through how to set up and perform fine-tuning on the Llama 3.2 3B Instruct model…

Adaptive Training Distributions with Scalable Online Bilevel Optimization

February 4, 2025

Large neural networks pretrained on web-scale corpora are central to modern machine learning. In this paradigm, the distribution of the…

Machine Learning

Orchestrate seamless business systems integrations using Amazon Bedrock Agents

February 4, 2025

Generative AI has revolutionized technology through generating content and solving complex problems. To fully take advantage of this potential, seamless…

Machine Learning

Neural SpaceTimes (NSTs): A Class of Trainable Deep Learning-based Geometries that can Universally Represent Nodes in Weighted Directed Acyclic Graphs (DAGs) as Events in a Spacetime Manifold

February 4, 2025

Directed graphs are crucial in modeling complex real-world systems, from gene regulatory networks and flow networks to stochastic processes and…

Machine Learning

University of Bath Researchers Developed an Efficient and Stable Machine Learning Training Method for Neural ODEs with O(1) Memory Footprint

February 4, 2025

Neural Ordinary Differential Equations are significant in scientific modeling and time-series analysis where data changes every other moment. This neural…