Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      May 27, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      May 27, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      May 27, 2025

      How To Prevent WordPress SQL Injection Attacks

      May 27, 2025

      Don’t make this costly thermostat mistake – and the best place to put it

      May 27, 2025

      68% of tech vendor customer support to be handled by AI by 2028, says Cisco report

      May 27, 2025

      These $130 Anker earbuds have no business sounding this good for the price

      May 27, 2025

      Pocket is shutting down – here’s how to retrieve what little data you still can

      May 27, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      Last Call: Early Access for NativePHP Ends This Week

      May 27, 2025
      Recent

      Last Call: Early Access for NativePHP Ends This Week

      May 27, 2025

      Setup Social Auth Redirects with Laravel Herd

      May 27, 2025

      Community News: Latest PECL Releases (05.27.2025)

      May 27, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      Microsoft wants to make GamePad gaming faster on Chrome for Windows 11

      May 27, 2025
      Recent

      Microsoft wants to make GamePad gaming faster on Chrome for Windows 11

      May 27, 2025

      Windows 11 KB5058502 restores Win + C, direct download links for version 23H2

      May 27, 2025

      Leak hints at Windows 11’s new feature that optimizes performance, tied to Copilot branding (?)

      May 27, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»Cohere Rerank 3.5 is now available in Amazon Bedrock through Rerank API

    Cohere Rerank 3.5 is now available in Amazon Bedrock through Rerank API

    December 1, 2024

    We are excited to announce the availability of Cohere’s advanced reranking model Rerank 3.5 through our new Rerank API in Amazon Bedrock. This powerful reranking model enables AWS customers to significantly improve their search relevance and content ranking capabilities. This model is also available for Amazon Bedrock Knowledge Base users. By incorporating Cohere’s Rerank 3.5 in Amazon Bedrock, we’re making enterprise-grade search technology more accessible and empowering organizations to enhance their information retrieval systems with minimal infrastructure management.

    In this post, we discuss the need for Reranking, the capabilities of Cohere’s Rerank 3.5, and how to get started using it on Amazon Bedrock.

    Reranking for advanced retrieval

    Reranking is a vital enhancement to Retrieval Augmented Generation (RAG) systems that adds a sophisticated second layer of analysis to improve search result relevance beyond what traditional vector search can achieve. Unlike embedding models that rely on pre-computed static vectors, rerankers perform dynamic query-time analysis of document relevance, enabling more nuanced and contextual matching. This capability allows RAG systems to effectively balance between broad document retrieval and precise context selection, ultimately leading to more accurate and reliable outputs from language models while reducing the likelihood of hallucinations.

    Existing search systems significantly benefit from reranking technology by providing more contextually relevant results that directly impact user satisfaction and business outcomes. Unlike traditional keyword matching or basic vector search, reranking performs an intelligent second-pass analysis that considers multiple factors, including semantic meaning, user intent, and business rules to optimize search result ordering. In ecommerce specifically, reranking helps surface the most relevant products by understanding nuanced relationships between search queries and product attributes, while also incorporating crucial business metrics like conversion rates and inventory levels. This advanced relevance optimization leads to improved product discovery, higher conversion rates, and enhanced customer satisfaction across digital commerce platforms, making reranking an essential component for any modern enterprise search infrastructure.

    Introducing Cohere Rerank 3.5

    Cohere’s Rerank 3.5 is designed to enhance search and RAG systems. This intelligent cross-encoding model takes a query and a list of potentially relevant documents as input, then returns the documents sorted by semantic similarity to the query. Cohere Rerank 3.5 excels in understanding complex information requiring reasoning and is able to understand the meaning behind enterprise data and user questions. Its ability to comprehend and analyze enterprise data and user questions across over 100 languages including Arabic, Chinese, English, French, German, Hindi, Japanese, Korean, Portuguese, Russian, and Spanish, makes it particularly valuable for global organizations in sectors such as finance, healthcare, hospitality, energy, government, and manufacturing.

    One of the key advantages of Cohere Rerank 3.5 is its ease of implementation. Through a single Rerank API call in Amazon Bedrock, you can integrate Rerank into existing systems at scale, whether keyword-based or semantic. Reranking strictly improves first-stage retrievals on standard text retrieval benchmarks.

    Cohere Rerank 3.5 is state of the art in the financial domain, as illustrated in the following figure.

    Cohere Rerank 3.5 is also state of the art in the ecommerce domain, as illustrated in the following figure. Cohere’s ecommerce benchmarks revolve around retrieval on various products, including fashion, electronics, food, and more.

    Products were structured as strings in a key-value pair format such as the following:

    “Title”: “Title” 
    “Description”: “Long-form description” “Type”: <Some categorical data> etc.....

    Cohere Rerank 3.5 also excels in hospitality, as shown in the following figure. Hospitality benchmarks revolve around retrieval on hospitality experiences and lodging options.

    Documents were structured as strings in a key-value pairs format such as the following:

    “Listing Title”: “Rental unit in Toronto” “Location”: “171 John Street, Toronto, Ontario, Canada”
    
    “Description”: “Escape to our serene villa with stunning downtown views....”

    We see noticeable gains in project management performance across all types of issue tracking tasks, as illustrated in the following figure.

    Cohere’s project management benchmarks span a variety of retrieval tasks, such as:

    • Search through engineering tickets from various project management and issue tracking software tools
    • Search through GitHub issues on popular open source repos

    Get started with Cohere Rerank 3.5

    To start using Cohere Rerank 3.5 with Rerank API and Amazon Bedrock Knowledge Bases, navigate to the Amazon Bedrock console, and click on Model Access on the left hand pane. Click on Modify Access, select Cohere Rerank 3.5, click Next and hit submit.

    Get Started with Amazon Bedrock Rerank API

    The Cohere Rerank 3.5 model, powered by the Amazon Bedrock Rerank API, allows you to rerank input documents directly based on their semantic relevance to a user query – without requiring a pre-configured knowledge base. The flexibility makes it a powerful tool for various use cases.

    To begin, set up your environment by importing the necessary libraries and initializing Boto3 clients:

    Hostinger
    import boto3
    import json
    region = boto3.Session().region_name
    
    bedrock_agent_runtime = boto3.client('bedrock-agent-runtime',region_name=region)
    
    modelId = "cohere.rerank-v3-5:0"
    model_package_arn = f"arn:aws:bedrock:{region}::foundation-model/{modelId}”
    

    Next, define a main function that reorders a list of text documents by computing relevance scores based on the user query:

    def rerank_text(text_query, text_sources, num_results, model_package_arn):
        response = bedrock_agent_runtime.rerank(
            queries=[
                {
                    "type": "TEXT",
                    "textQuery": {
                        "text": text_query
                    }
                }
            ],
            sources=text_sources,
            rerankingConfiguration={
                "type": "BEDROCK_RERANKING_MODEL",
                "bedrockRerankingConfiguration": {
                    "numberOfResults": num_results,
                    "modelConfiguration": {
                        "modelArn": model_package_arn,
                    }
                }
            }
        )
        return response['results']
    

    For instance, imagine a scenario where you need to identify emails related to returning items from a multilingual dataset. The example below demonstrates this process:

    example_query = "What emails have been about returning items?"
    
    documents = [
        "Hola, llevo una hora intentando acceder a mi cuenta y sigue diciendo que mi contraseña es incorrecta. ¿Puede ayudarme, por favor?",
        "Hi, I recently purchased a product from your website but I never received a confirmation email. Can you please look into this for me?",
        "مرحبًا، لدي سؤال حول سياسة إرجاع هذا المنتج. لقد اشتريته قبل بضعة أسابيع وهو معيب",
        "Good morning, I have been trying to reach your customer support team for the past week but I keep getting a busy signal. Can you please help me?",
        "Hallo, ich habe eine Frage zu meiner letzten Bestellung. Ich habe den falschen Artikel erhalten und muss ihn zurückschicken.",
        "Hello, I have been trying to reach your customer support team for the past hour but I keep getting a busy signal. Can you please help me?",
        "Hi, I have a question about the return policy for this product. I purchased it a few weeks ago and it is defective.",
        "早上好,关于我最近的订单,我有一个问题。我收到了错误的商品",
        "Hello, I have a question about the return policy for this product. I purchased it a few weeks ago and it is defective."
    ]

    Now, prepare the list of text sources that will be passed into the rerank_text() function:

    text_sources = []
    for text in documents:
        text_sources.append({
            "type": "INLINE",
            "inlineDocumentSource": {
                "type": "TEXT",
                "textDocument": {
                    "text": text,
                }
            }
        })

    You can then invoke rerank_text() by specifying the user query, the text resources, the desired number of top-ranked results, and the model ARN:

    response = rerank_text(example_query, text_sources, 3, model_package_arn)
    print(response)

    The output generated by the Amazon Bedrock Rerank API with Cohere Rerank 3.5 for this query is:

    [{'index': 4, 'relevanceScore': 0.1122397780418396},
     {'index': 8, 'relevanceScore': 0.07777658104896545},
     {'index': 2, 'relevanceScore': 0.0770234540104866}]

    The relevance scores provided by the API are normalized to a range of [0, 1], with higher scores indicating higher relevance to the query. Here the 5th item in the list of documents is the most relevant. (Translated from German to English: Hello, I have a question about my last order. I received the wrong item and need to return it.)

    You can also get started using Cohere Rerank 3.5 with Amazon Bedrock Knowledge Bases by completing the following steps:

    1. In the Amazon Bedrock console, choose Knowledge bases under Builder tools in the navigation pane.
    2. Choose Create knowledge base.
    3. Provide your knowledge base details, such as name, permissions, and data source.
    1. To configure your data source, specify the location of your data.
    2. Select an embedding model to convert the data into vector embeddings, and have Amazon Bedrock create a vector store in your account to store the vector data.

    When you select this option (available only in the Amazon Bedrock console), Amazon Bedrock creates a vector index in Amazon OpenSearch Serverless (by default) in your account, removing the need to manage anything yourself.

    1. Review your settings and create your knowledge base.
    2. In the Amazon Bedrock console, choose your knowledge base and choose Test knowledge base.
    3. Choose the icon for additional configuration options for testing your knowledge base.
    4. Choose your model (for this post, Cohere Rerank 3.5) and choose Apply.

    The configuration pane shows the new Reranking section menu with additional configuration options. The number of reranked source chunks returns the specified number of highest relevant chunks.

    Conclusion

    In this post, we explored how to use Cohere’s Rerank 3.5 model in Amazon Bedrock, demonstrating its powerful capabilities for enhancing search relevance and robust reranking capabilities for enterprise applications, enhancing user experience and optimizing information retrieval workflows. Start improving your search relevance today with Cohere’s Rerank model on Amazon Bedrock.

    Cohere Rerank 3.5 in Amazon Bedrock is available in the following AWS Regions: in us-west-2 (US West – Oregon), ca-central-1 (Canada – Central), eu-central-1 (Europe – Frankfurt), and ap-northeast-1 (Asia Pacific – Tokyo).

    Share your feedback to AWS re:Post for Amazon Bedrock or through your usual AWS Support contacts.

    To learn more about Cohere Rerank 3.5’s features and capabilities, view the Cohere in Amazon Bedrock product page.


    About the Authors

    Karan Singh is a Generative AI Specialist for third-party models at AWS, where he works with top-tier third-party foundation model (FM) providers to develop and execute joint Go-To-Market strategies, enabling customers to effectively train, deploy, and scale FMs to solve industry specific challenges. Karan holds a Bachelor of Science in Electrical and Instrumentation Engineering from Manipal University, a master’s in science in Electrical Engineering from Northwestern University and is currently an MBA Candidate at the Haas School of Business at University of California, Berkeley.

    James Yi is a Senior AI/ML Partner Solutions Architect at Amazon Web Services. He spearheads AWS’s strategic partnerships in Emerging Technologies, guiding engineering teams to design and develop cutting-edge joint solutions in generative AI. He enables field and technical teams to seamlessly deploy, operate, secure, and integrate partner solutions on AWS. James collaborates closely with business leaders to define and execute joint Go-To-Market strategies, driving cloud-based business growth. Outside of work, he enjoys playing soccer, traveling, and spending time with his family.

    Source: Read More 

    Facebook Twitter Reddit Email Copy Link
    Previous ArticleHandBrake 1.9 Brings Lossless VP9 Encoding, Intel QSV VVC Decoding
    Next Article AWS DeepRacer: How to master physical racing?

    Related Posts

    Security

    Nmap 7.96 Launches with Lightning-Fast DNS and 612 Scripts

    May 28, 2025
    Common Vulnerabilities and Exposures (CVEs)

    CVE-2025-48749 – Netwrix Directory Manager Data Exfiltration Vulnerability

    May 28, 2025
    Leave A Reply Cancel Reply

    Continue Reading

    Top 10 Python Web Frameworks to Use in 2024

    Development

    URI Path Components Using Laravel’s pathSegments() Method

    Development

    CVE-2025-21204: SYSTEM-Level Privilege Escalation in Windows Update Stack Exposed, PoC Released

    Security

    The power of App Inventor: Democratizing possibilities for mobile applications

    Artificial Intelligence

    Highlights

    Development

    Critical SQLi Vulnerability Found in Fortra FileCatalyst Workflow Application

    June 27, 2024

    A critical security flaw has been disclosed in Fortra FileCatalyst Workflow that, if left unpatched,…

    Opera One on iPhone: Minimalist Design Meets AI Assistance

    August 14, 2024

    Beyond Next-Token Prediction: Overcoming AI’s Foresight and Decision-Making Limits

    July 12, 2024

    Microsoft Warns: North Korean Hackers Turn to AI-Fueled Cyber Espionage

    April 25, 2024
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.