Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      May 27, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      May 27, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      May 27, 2025

      How To Prevent WordPress SQL Injection Attacks

      May 27, 2025

      Buy a Samsung Galaxy Watch 7 and get a free SmartTag2 Bluetooth tracker – here’s how

      May 27, 2025

      I changed 8 settings on my Pixel phone to significantly improve the battery life

      May 27, 2025

      Should you ever pay for Linux? 5 times I would – and why

      May 27, 2025

      I replaced my iPad with a $100 Android tablet, and here’s my verdict after a week

      May 27, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      Community News: Latest PECL Releases (05.27.2025)

      May 27, 2025
      Recent

      Community News: Latest PECL Releases (05.27.2025)

      May 27, 2025

      JavaScript Formatter

      May 27, 2025

      How to Master Recursion in JavaScript with Practical Examples

      May 27, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      Windows 11 24H2’s Task Manager new CPU usage formula rolls out to everyone

      May 27, 2025
      Recent

      Windows 11 24H2’s Task Manager new CPU usage formula rolls out to everyone

      May 27, 2025

      I’ve Seen Things

      May 27, 2025

      Windows 11 is getting a built-in Color Picker tool for designers

      May 27, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»Cohere Rerank 3.5 is now available in Amazon Bedrock through Rerank API

    Cohere Rerank 3.5 is now available in Amazon Bedrock through Rerank API

    December 1, 2024

    We are excited to announce the availability of Cohere’s advanced reranking model Rerank 3.5 through our new Rerank API in Amazon Bedrock. This powerful reranking model enables AWS customers to significantly improve their search relevance and content ranking capabilities. This model is also available for Amazon Bedrock Knowledge Base users. By incorporating Cohere’s Rerank 3.5 in Amazon Bedrock, we’re making enterprise-grade search technology more accessible and empowering organizations to enhance their information retrieval systems with minimal infrastructure management.

    In this post, we discuss the need for Reranking, the capabilities of Cohere’s Rerank 3.5, and how to get started using it on Amazon Bedrock.

    Reranking for advanced retrieval

    Reranking is a vital enhancement to Retrieval Augmented Generation (RAG) systems that adds a sophisticated second layer of analysis to improve search result relevance beyond what traditional vector search can achieve. Unlike embedding models that rely on pre-computed static vectors, rerankers perform dynamic query-time analysis of document relevance, enabling more nuanced and contextual matching. This capability allows RAG systems to effectively balance between broad document retrieval and precise context selection, ultimately leading to more accurate and reliable outputs from language models while reducing the likelihood of hallucinations.

    Existing search systems significantly benefit from reranking technology by providing more contextually relevant results that directly impact user satisfaction and business outcomes. Unlike traditional keyword matching or basic vector search, reranking performs an intelligent second-pass analysis that considers multiple factors, including semantic meaning, user intent, and business rules to optimize search result ordering. In ecommerce specifically, reranking helps surface the most relevant products by understanding nuanced relationships between search queries and product attributes, while also incorporating crucial business metrics like conversion rates and inventory levels. This advanced relevance optimization leads to improved product discovery, higher conversion rates, and enhanced customer satisfaction across digital commerce platforms, making reranking an essential component for any modern enterprise search infrastructure.

    Introducing Cohere Rerank 3.5

    Cohere’s Rerank 3.5 is designed to enhance search and RAG systems. This intelligent cross-encoding model takes a query and a list of potentially relevant documents as input, then returns the documents sorted by semantic similarity to the query. Cohere Rerank 3.5 excels in understanding complex information requiring reasoning and is able to understand the meaning behind enterprise data and user questions. Its ability to comprehend and analyze enterprise data and user questions across over 100 languages including Arabic, Chinese, English, French, German, Hindi, Japanese, Korean, Portuguese, Russian, and Spanish, makes it particularly valuable for global organizations in sectors such as finance, healthcare, hospitality, energy, government, and manufacturing.

    One of the key advantages of Cohere Rerank 3.5 is its ease of implementation. Through a single Rerank API call in Amazon Bedrock, you can integrate Rerank into existing systems at scale, whether keyword-based or semantic. Reranking strictly improves first-stage retrievals on standard text retrieval benchmarks.

    Cohere Rerank 3.5 is state of the art in the financial domain, as illustrated in the following figure.

    Cohere Rerank 3.5 is also state of the art in the ecommerce domain, as illustrated in the following figure. Cohere’s ecommerce benchmarks revolve around retrieval on various products, including fashion, electronics, food, and more.

    Products were structured as strings in a key-value pair format such as the following:

    “Title”: “Title” 
    “Description”: “Long-form description” “Type”: <Some categorical data> etc.....

    Cohere Rerank 3.5 also excels in hospitality, as shown in the following figure. Hospitality benchmarks revolve around retrieval on hospitality experiences and lodging options.

    Documents were structured as strings in a key-value pairs format such as the following:

    “Listing Title”: “Rental unit in Toronto” “Location”: “171 John Street, Toronto, Ontario, Canada”
    
    “Description”: “Escape to our serene villa with stunning downtown views....”

    We see noticeable gains in project management performance across all types of issue tracking tasks, as illustrated in the following figure.

    Cohere’s project management benchmarks span a variety of retrieval tasks, such as:

    • Search through engineering tickets from various project management and issue tracking software tools
    • Search through GitHub issues on popular open source repos

    Get started with Cohere Rerank 3.5

    To start using Cohere Rerank 3.5 with Rerank API and Amazon Bedrock Knowledge Bases, navigate to the Amazon Bedrock console, and click on Model Access on the left hand pane. Click on Modify Access, select Cohere Rerank 3.5, click Next and hit submit.

    Get Started with Amazon Bedrock Rerank API

    The Cohere Rerank 3.5 model, powered by the Amazon Bedrock Rerank API, allows you to rerank input documents directly based on their semantic relevance to a user query – without requiring a pre-configured knowledge base. The flexibility makes it a powerful tool for various use cases.

    To begin, set up your environment by importing the necessary libraries and initializing Boto3 clients:

    import boto3
    import json
    region = boto3.Session().region_name
    
    bedrock_agent_runtime = boto3.client('bedrock-agent-runtime',region_name=region)
    
    modelId = "cohere.rerank-v3-5:0"
    model_package_arn = f"arn:aws:bedrock:{region}::foundation-model/{modelId}”
    

    Next, define a main function that reorders a list of text documents by computing relevance scores based on the user query:

    def rerank_text(text_query, text_sources, num_results, model_package_arn):
        response = bedrock_agent_runtime.rerank(
            queries=[
                {
                    "type": "TEXT",
                    "textQuery": {
                        "text": text_query
                    }
                }
            ],
            sources=text_sources,
            rerankingConfiguration={
                "type": "BEDROCK_RERANKING_MODEL",
                "bedrockRerankingConfiguration": {
                    "numberOfResults": num_results,
                    "modelConfiguration": {
                        "modelArn": model_package_arn,
                    }
                }
            }
        )
        return response['results']
    

    For instance, imagine a scenario where you need to identify emails related to returning items from a multilingual dataset. The example below demonstrates this process:

    example_query = "What emails have been about returning items?"
    
    documents = [
        "Hola, llevo una hora intentando acceder a mi cuenta y sigue diciendo que mi contraseña es incorrecta. ¿Puede ayudarme, por favor?",
        "Hi, I recently purchased a product from your website but I never received a confirmation email. Can you please look into this for me?",
        "مرحبًا، لدي سؤال حول سياسة إرجاع هذا المنتج. لقد اشتريته قبل بضعة أسابيع وهو معيب",
        "Good morning, I have been trying to reach your customer support team for the past week but I keep getting a busy signal. Can you please help me?",
        "Hallo, ich habe eine Frage zu meiner letzten Bestellung. Ich habe den falschen Artikel erhalten und muss ihn zurückschicken.",
        "Hello, I have been trying to reach your customer support team for the past hour but I keep getting a busy signal. Can you please help me?",
        "Hi, I have a question about the return policy for this product. I purchased it a few weeks ago and it is defective.",
        "早上好,关于我最近的订单,我有一个问题。我收到了错误的商品",
        "Hello, I have a question about the return policy for this product. I purchased it a few weeks ago and it is defective."
    ]

    Now, prepare the list of text sources that will be passed into the rerank_text() function:

    text_sources = []
    for text in documents:
        text_sources.append({
            "type": "INLINE",
            "inlineDocumentSource": {
                "type": "TEXT",
                "textDocument": {
                    "text": text,
                }
            }
        })

    You can then invoke rerank_text() by specifying the user query, the text resources, the desired number of top-ranked results, and the model ARN:

    response = rerank_text(example_query, text_sources, 3, model_package_arn)
    print(response)

    The output generated by the Amazon Bedrock Rerank API with Cohere Rerank 3.5 for this query is:

    [{'index': 4, 'relevanceScore': 0.1122397780418396},
     {'index': 8, 'relevanceScore': 0.07777658104896545},
     {'index': 2, 'relevanceScore': 0.0770234540104866}]

    The relevance scores provided by the API are normalized to a range of [0, 1], with higher scores indicating higher relevance to the query. Here the 5th item in the list of documents is the most relevant. (Translated from German to English: Hello, I have a question about my last order. I received the wrong item and need to return it.)

    You can also get started using Cohere Rerank 3.5 with Amazon Bedrock Knowledge Bases by completing the following steps:

    1. In the Amazon Bedrock console, choose Knowledge bases under Builder tools in the navigation pane.
    2. Choose Create knowledge base.
    3. Provide your knowledge base details, such as name, permissions, and data source.
    1. To configure your data source, specify the location of your data.
    2. Select an embedding model to convert the data into vector embeddings, and have Amazon Bedrock create a vector store in your account to store the vector data.

    When you select this option (available only in the Amazon Bedrock console), Amazon Bedrock creates a vector index in Amazon OpenSearch Serverless (by default) in your account, removing the need to manage anything yourself.

    1. Review your settings and create your knowledge base.
    2. In the Amazon Bedrock console, choose your knowledge base and choose Test knowledge base.
    3. Choose the icon for additional configuration options for testing your knowledge base.
    4. Choose your model (for this post, Cohere Rerank 3.5) and choose Apply.

    The configuration pane shows the new Reranking section menu with additional configuration options. The number of reranked source chunks returns the specified number of highest relevant chunks.

    Conclusion

    In this post, we explored how to use Cohere’s Rerank 3.5 model in Amazon Bedrock, demonstrating its powerful capabilities for enhancing search relevance and robust reranking capabilities for enterprise applications, enhancing user experience and optimizing information retrieval workflows. Start improving your search relevance today with Cohere’s Rerank model on Amazon Bedrock.

    Cohere Rerank 3.5 in Amazon Bedrock is available in the following AWS Regions: in us-west-2 (US West – Oregon), ca-central-1 (Canada – Central), eu-central-1 (Europe – Frankfurt), and ap-northeast-1 (Asia Pacific – Tokyo).

    Share your feedback to AWS re:Post for Amazon Bedrock or through your usual AWS Support contacts.

    To learn more about Cohere Rerank 3.5’s features and capabilities, view the Cohere in Amazon Bedrock product page.


    About the Authors

    Karan Singh is a Generative AI Specialist for third-party models at AWS, where he works with top-tier third-party foundation model (FM) providers to develop and execute joint Go-To-Market strategies, enabling customers to effectively train, deploy, and scale FMs to solve industry specific challenges. Karan holds a Bachelor of Science in Electrical and Instrumentation Engineering from Manipal University, a master’s in science in Electrical Engineering from Northwestern University and is currently an MBA Candidate at the Haas School of Business at University of California, Berkeley.

    James Yi is a Senior AI/ML Partner Solutions Architect at Amazon Web Services. He spearheads AWS’s strategic partnerships in Emerging Technologies, guiding engineering teams to design and develop cutting-edge joint solutions in generative AI. He enables field and technical teams to seamlessly deploy, operate, secure, and integrate partner solutions on AWS. James collaborates closely with business leaders to define and execute joint Go-To-Market strategies, driving cloud-based business growth. Outside of work, he enjoys playing soccer, traveling, and spending time with his family.

    Source: Read More 

    Facebook Twitter Reddit Email Copy Link
    Previous ArticleHandBrake 1.9 Brings Lossless VP9 Encoding, Intel QSV VVC Decoding
    Next Article AWS DeepRacer: How to master physical racing?

    Related Posts

    Security

    Nmap 7.96 Launches with Lightning-Fast DNS and 612 Scripts

    May 27, 2025
    Common Vulnerabilities and Exposures (CVEs)

    CVE-2025-47672 – miniOrange Discord Integration PHP Remote File Inclusion Vulnerability

    May 27, 2025
    Leave A Reply Cancel Reply

    Continue Reading

    How Atomic’s Center Parcs App is Transforming Guest Experiences

    Development

    6 WhatsApp Security Tips

    Development

    Microsoft rolls KB5039302 out again, allowing users to access CloudPC in Windows 11 without issues

    Development

    CVE-2025-24338 – CtrlX OS Cross-Site Scripting (XSS)

    Common Vulnerabilities and Exposures (CVEs)

    Highlights

    Development

    FreeBSD Releases Urgent Patch for High-Severity OpenSSH Vulnerability

    August 12, 2024

    The maintainers of the FreeBSD Project have released security updates to address a high-severity flaw…

    Bisheng: An Open-Source LLM DevOps Platform Revolutionizing LLM Application Development

    May 20, 2024

    Spf Permerror Troubleshooting Guide For Better Email Deliverability Today

    May 2, 2025

    Build a Real-Time Multiplayer Tic-Tac-Toe Game Using WebSockets and Microservices

    November 19, 2024
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.