Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      May 13, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      May 13, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      May 13, 2025

      How To Prevent WordPress SQL Injection Attacks

      May 13, 2025

      This $4 Steam Deck game includes the most-played classics from my childhood — and it will save you paper

      May 13, 2025

      Microsoft shares rare look at radical Windows 11 Start menu designs it explored before settling on the least interesting one of the bunch

      May 13, 2025

      NVIDIA’s new GPU driver adds DOOM: The Dark Ages support and improves DLSS in Microsoft Flight Simulator 2024

      May 13, 2025

      How to install and use Ollama to run AI LLMs on your Windows 11 PC

      May 13, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      Community News: Latest PECL Releases (05.13.2025)

      May 13, 2025
      Recent

      Community News: Latest PECL Releases (05.13.2025)

      May 13, 2025

      How We Use Epic Branches. Without Breaking Our Flow.

      May 13, 2025

      I think the ergonomics of generators is growing on me.

      May 13, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      This $4 Steam Deck game includes the most-played classics from my childhood — and it will save you paper

      May 13, 2025
      Recent

      This $4 Steam Deck game includes the most-played classics from my childhood — and it will save you paper

      May 13, 2025

      Microsoft shares rare look at radical Windows 11 Start menu designs it explored before settling on the least interesting one of the bunch

      May 13, 2025

      NVIDIA’s new GPU driver adds DOOM: The Dark Ages support and improves DLSS in Microsoft Flight Simulator 2024

      May 13, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»Anthropic introduces prompt caching to reduce latency and costs

    Anthropic introduces prompt caching to reduce latency and costs

    August 14, 2024

    Anthropic has introduced a new feature to some of its Claude models that will allow developers to cut down on prompt costs and latency.

    Prompt caching allows users to cache frequently used context so that it can be used in future API calls. According to the company, by equipping the model with background knowledge and example outputs from the past, costs can be reduced by up to 90% and latency by up to 85% for long prompts.

    There are several use cases where prompt caching would be useful, including being able to keep a summarized version of a codebase for coding assistants to use, providing long-form documents in prompts, and providing detailed instruction sets with several examples of desired outputs. 

    Users could also use it to essentially converse with long-form content like books, papers, documentation, and podcast transcripts. According to Anthropic’s testing, chatting with a book with 100,000 tokens cached takes 2.4 seconds, whereas doing the same without information cached takes 11.5 seconds. This equates to a 79% reduction in latency. 

    It costs 25% more to cache an input token compared to the base input token price, but costs 10% less to actually use that cached content. Actual prices vary based on the specific model.

    Prompt caching is now available as a public beta on Claude 3.5 Sonnet and Claude 3 Haiku, and Claude 3 Opus will be supported soon.

    You may also like…

    Anthropic adds prompt evaluation feature to Console

    Anthropic updates Claude with new features to improve collaboration

    The post Anthropic introduces prompt caching to reduce latency and costs appeared first on SD Times.

    Source: Read More 

    Hostinger
    news
    Facebook Twitter Reddit Email Copy Link
    Previous ArticleCodeium’s new Cortex assistant utilizes complex reasoning engine for coding help
    Next Article Infragistics Ultimate 24.1 adds React code generation to App Builder

    Related Posts

    Security

    Nmap 7.96 Launches with Lightning-Fast DNS and 612 Scripts

    May 13, 2025
    Common Vulnerabilities and Exposures (CVEs)

    CVE-2025-3744 – Nomad Sentinel Policy Bypass

    May 13, 2025
    Leave A Reply Cancel Reply

    Continue Reading

    CVE-2025-3358 – CVE-2022-36337 Oracle WebLogic Server Cross-Site Scripting

    Common Vulnerabilities and Exposures (CVEs)

    CSS Hover Effects: 40 Engaging Animations To Try

    Development

    Privacy and security post-Snowden: Pew Research parallels ESET findings

    Development

    Threat Actor Offers Unauthorized Korean National Police Agency (KNPA) Access for $4000

    Development
    GetResponse

    Highlights

    Development

    Google Cloud Is the New Way to the Cloud

    August 3, 2024

    Explore Google Cloud’s powerful and versatile services, from AI to data storage, for businesses and…

    Lua – Laravel powered open-source URL shortener

    January 15, 2025

    A pattern for composable UI in Flask

    February 8, 2025

    Not all Echo devices will get Alexa+ – see if yours made the list

    February 26, 2025
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.