LWiAI Podcast #201 - GPT 4.5, Sonnet 3.7, Grok 3, Phi 4

Our 201st episode with a summary and discussion of last week’s big AI news!
Recorded on 03/02/2025

Join our brand new Discord here! https://discord.gg/nTyezGSKwP

Hosted by Andrey Kurenkov and guest host Sharon Zhou
Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

In this episode:

– The releases of OpenAI’s GPT-4.5, Anthropic’s Claude 3.7 Sonnet, and xAI’s Grok 3, with a comparison of their features, costs, and capabilities.
– Discussion of new tools and applications, including Sesame’s new voice assistant and Google’s AI coding assistant, Gemini Code Assist, and what sets each apart.
– OpenAI’s continued user growth despite competition, pricing for Google’s text-to-video model Veo 2, and HP acquiring Humane and shutting down the AI Pin.
– Insights into new research, including Google’s multi-agent system for scientific collaboration and papers on specification gaming in reasoning models and on narrow fine-tuning causing broad misalignment.

Timestamps + Links:

(00:00:00) Intro / Banter
(00:01:36) News Preview

Tools & Apps
- (00:02:33) OpenAI announces GPT-4.5, warns it’s not a frontier AI model
- (00:07:22) Anthropic launches a new AI model that ‘thinks’ as long as you want
- (00:11:14) New Grok 3 release tops LLM leaderboards
- (00:16:43) Sesame is the first voice assistant I’ve ever wanted to talk to more than once
- (00:18:30) Google launches a free AI coding assistant with very high usage caps
- (00:20:45) Rabbit shows off the AI agent it should have launched with
- (00:22:23) Mistral’s Le Chat tops 1M downloads in just 14 days

Applications & Business
- (00:24:06) OpenAI Tops 400 Million Users Despite DeepSeek’s Emergence
- (00:27:37) Google’s new AI video model Veo 2 will cost 50 cents per second
- (00:29:52) HP is buying Humane and shutting down the AI Pin

Projects & Open Source

Research & Advancements
- (00:40:00) Towards an AI co-scientist
- (00:42:52) Magma: A Foundation Model for Multimodal AI Agents

Policy & Safety
- (00:47:32) Demonstrating specification gaming in reasoning models
- (00:51:03) Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs
