    LWiAI Podcast #201 – GPT 4.5, Sonnet 3.7, Grok 3, Phi 4

    May 16, 2025

    Our 201st episode with a summary and discussion of last week’s big AI news!
    Recorded on 03/02/2025

    Join our brand new Discord here! https://discord.gg/nTyezGSKwP

    Hosted by Andrey Kurenkov and guest host Sharon Zhou
    Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

    In this episode:

    – The releases of GPT-4.5 from OpenAI, Claude 3.7 from Anthropic, and Grok 3 from xAI, with a comparison of their features, costs, and capabilities.
    – Discussion on new tools and applications including Sesame’s new voice assistant and Google’s AI coding assistant, Gemini Code Assist, highlighting their unique benefits.
    – OpenAI’s continued user growth despite competition, pricing for Google’s text-to-video model Veo 2, and HP acquiring Humane and shutting down its AI Pin.
    – Insights into new research on alignment and specification gaming in LLMs, including papers on fine-tuning causing broad misalignment and Google’s multi-agent system for scientific collaboration.

    Timestamps + Links:

    • (00:00:00) Intro / Banter

    • (00:01:36) News Preview

    • Tools & Apps

      • (00:02:33) OpenAI announces GPT-4.5, warns it’s not a frontier AI model

      • (00:07:22) Anthropic launches a new AI model that ‘thinks’ as long as you want

      • (00:11:14) New Grok 3 release tops LLM leaderboards

      • (00:16:43) Sesame is the first voice assistant I’ve ever wanted to talk to more than once

      • (00:18:30) Google launches a free AI coding assistant with very high usage caps

      • (00:20:45) Rabbit shows off the AI agent it should have launched with

      • (00:22:23) Mistral’s Le Chat tops 1M downloads in just 14 days

    • Applications & Business

      • (00:24:06) OpenAI Tops 400 Million Users Despite DeepSeek’s Emergence

      • (00:27:37) Google’s new AI video model Veo 2 will cost 50 cents per second

      • (00:29:52) HP is buying Humane and shutting down the AI Pin

    • Projects & Open Source

      • (00:31:44) Microsoft launches next-gen Phi AI models

      • (00:33:47) OpenAI introduces SWE-Lancer: A Benchmark for Evaluating Model Performance on Real-World Freelance Software Engineering Work

      • (00:37:12) SWE-Bench+: Enhanced Coding Benchmark for LLMs

    • Research & Advancements

      • (00:40:00) Towards an AI co-scientist

      • (00:42:52) Magma: A Foundation Model for Multimodal AI Agents

    • Policy & Safety

      • (00:47:32) Demonstrating specification gaming in reasoning models

      • (00:51:03) Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs

