Accelerate generative AI innovation in Canada with Amazon Bedrock cross-Region inference

Generative AI has created unprecedented opportunities for Canadian organizations to transform their operations and customer experiences. We are excited to announce that customers in Canada can now access advanced foundation models including Anthropic’s Claude Sonnet 4.5 and Claude Haiku 4.5 on Amazon Bedrock through cross-Region inference (CRIS). This post explores how Canadian organizations can use … Read more

Power up your ML workflows with interactive IDEs on SageMaker HyperPod

Amazon SageMaker HyperPod clusters with Amazon Elastic Kubernetes Service (EKS) orchestration now support creating and managing interactive development environments such as JupyterLab and open source Visual Studio Code, streamlining the ML development lifecycle by providing managed environments for familiar tools to data scientists. This feature introduces a new add-on called Amazon SageMaker Spaces for AI developers to create and … Read more

Claude Opus 4.5 now in Amazon Bedrock

Anthropic’s newest foundation model, Claude Opus 4.5, is now available in Amazon Bedrock, a fully managed service that offers a choice of high-performing foundation models from leading AI companies. Opus 4.5 is a meaningful step forward in what AI systems can do and sets a new standard across coding, agents, computer use, and office tasks. … Read more

Deploy GPT-OSS models with Amazon Bedrock Custom Model Import

Amazon Bedrock Custom Model Import now supports OpenAI models with open weights, including GPT-OSS variants with 20 billion and 120 billion parameters. GPT-OSS models offer reasoning capabilities and can be used with the OpenAI Chat Completions API. By preserving full OpenAI API compatibility, organizations can migrate their existing applications to AWS, gaining enterprise-grade security, scaling, and cost control. … Read more

Streamline AI operations with the Multi-Provider Generative AI Gateway reference architecture

As organizations increasingly adopt AI capabilities across their applications, centralized management, security, and cost control of AI model access become a required step in scaling AI solutions. The Generative AI Gateway on AWS guidance addresses these challenges with a unified gateway that supports multiple AI providers while offering comprehensive … Read more

Deploy geospatial agents with Foursquare Spatial H3 Hub and Amazon SageMaker AI

Organizations have used geospatial machine learning (ML) for property risk assessment, disaster response, and infrastructure planning. These systems worked well but couldn’t scale beyond specialized use cases. Each question required multiple geospatial datasets, each with its own model and often its own workflow, limiting these capabilities to a handful of high-value use cases at the … Read more

How Wipro PARI accelerates PLC code generation using Amazon Bedrock

This post is co-written with Rejin Surendran from Wipro Enterprises Limited and Bakrudeen K from ShellKode. In manufacturing environments, industrial automation engineers face a significant challenge: how to rapidly convert complex process requirements into Programmable Logic Controller (PLC) ladder text code. This traditional, manual process typically requires 3-4 days per query, creating bottlenecks in production … Read more

MSD explores applying generative AI to improve the deviation management process using AWS services

This post is co-written with Hossein Salami and Jwalant Vyas from MSD. In the biopharmaceutical industry, deviations in the manufacturing process are rigorously addressed. Each deviation is thoroughly documented, and its various aspects and potential impacts are closely examined to help ensure drug product quality, patient safety, and compliance. For leading pharmaceutical companies, managing these … Read more

Accelerating genomics variant interpretation with AWS HealthOmics and Amazon Bedrock AgentCore

Genomic research stands at a transformative crossroads where the exponential growth of sequencing data demands equally sophisticated analytical capabilities. According to the 1000 Genomes Project, a typical human genome differs from the reference at 4.1–5.0 million sites, with most variants being SNPs and short indels. These variants, when aggregated across individuals, contribute to differences in … Read more

How Rufus scales conversational shopping experiences to millions of Amazon customers with Amazon Bedrock

Our team at Amazon builds Rufus, an AI-powered shopping assistant that delivers intelligent, conversational experiences to delight our customers. More than 250 million customers have used Rufus this year. Monthly users are up 140% YoY and interactions are up 210% YoY. Additionally, customers that use Rufus during a shopping journey are 60% more likely to … Read more

How Care Access achieved 86% data processing cost reductions and 66% faster data processing with Amazon Bedrock prompt caching

This post is co-written with Michelle Tat, Christopher Penrose, Rasmus Buchmann, and Daniel Hansen from Care Access. Organizations operating in the healthcare industry face a critical challenge as they process large volumes of medical records: how to efficiently analyze thousands of medical records daily while maintaining strict security and compliance … Read more

Claude Code deployment patterns and best practices with Amazon Bedrock

Claude Code is an AI-powered coding assistant from Anthropic that helps developers write, review, and modify code through natural language interactions. Amazon Bedrock is a fully managed service that provides access to foundation models from leading AI companies through a single API. This post shows you how to deploy Claude Code with Amazon Bedrock. You’ll … Read more

Amazon Bedrock Guardrails expands support for code domain

Amazon Bedrock Guardrails now supports protection against undesirable content within code elements including user prompts, comments, variables, function names, and string literals. Amazon Bedrock Guardrails provides configurable safeguards for building generative AI applications at scale. These safety controls work seamlessly whether you’re using foundation models from Amazon Bedrock, or applying them at various intervention points … Read more

Announcing the AWS Well-Architected Responsible AI Lens 

As AI applications grow more complex, many builders struggle to appropriately and responsibly balance AI benefits and risks. Few resources exist that help non-experts articulate and resolve the key design decisions they must make. However, it doesn’t have to be this way. Today, we’re announcing the AWS Well-Architected Responsible AI Lens—a set of thoughtful questions … Read more

How Amazon uses AI agents to support compliance screening of billions of transactions per day

At Amazon, we screen customers and transactions across our global business and its subsidiaries to comply with sanctions and other global laws. Failure to comply with these laws can result in severe financial penalties and reputational harm. Amazon’s Compliance team has developed an AI-driven screening and investigations system that has transformed Amazon’s compliance processes into … Read more

Build an agentic solution with Amazon Nova, Snowflake, and LangGraph

This post was written with Bharath Suresh and Mary Law from Snowflake. Agentic AI is a type of AI that functions autonomously, automating a broader range of tasks with minimal supervision. It combines traditional AI and generative AI capabilities to make decisions, perform tasks, and adapt to its environment without constant human intervention. These autonomous … Read more

Using Spectrum fine-tuning to improve FM training efficiency on Amazon SageMaker AI

Optimizing generative AI applications relies on tailoring foundation models (FMs) using techniques such as prompt engineering, RAG, continued pre-training, and fine-tuning. Efficient fine-tuning is achieved by strategically managing hardware, training time, data volume, and model quality to reduce resource demands and maximize value. Spectrum is a new approach designed to pinpoint the most informative layers … Read more

HyperPod enhances ML infrastructure with security and storage

Amazon SageMaker HyperPod is purpose-built infrastructure for optimizing foundation model training and inference at scale. SageMaker HyperPod removes the undifferentiated heavy lifting involved in building and optimizing machine learning (ML) infrastructure for training foundation models (FMs). As AI moves toward deployment, adapting to a multitude of domains and use cases, the need for security … Read more

Accelerating generative AI applications with a platform engineering approach

Over the past two years, I’ve worked with many customers using generative AI to transform their organizations. Most stall at experimentation, because costs stack up and timelines extend before delivering demonstrable value. A 2023 AWS MIT Chief Data Officer (CDO) Symposium survey backs this up, reporting that while 71% of Chief Data Officers were experimenting … Read more

Your complete guide to Amazon Quick Suite at AWS re:Invent 2025

What if you could answer complex business questions in minutes instead of weeks, automate workflows without writing code, and empower every employee with enterprise AI—all while maintaining security and governance? That’s the power of Amazon Quick Suite, and at AWS re:Invent 2025, we are showcasing how organizations are making it a reality. Launched in October … Read more

Accelerate enterprise solutions with agentic AI-powered consulting: Introducing AWS Professional Service Agents

AWS Professional Services set out to help organizations accelerate their cloud adoption with expert guidance and proven methodologies. Today, we’re at a pivotal moment in consulting. Just as cloud computing transformed how enterprises build technology, agentic AI is transforming how consulting services deliver value. We believe in a future where intelligent agents work alongside expert … Read more

Amazon Bedrock AgentCore and Claude: Transforming business with agentic AI

The enterprise AI conversation has fundamentally shifted. We’re no longer asking “Can AI understand language?” but rather “Can AI autonomously execute complex business processes that drive real value?” According to McKinsey research, agentic AI has the potential to generate $450 billion to $650 billion in additional annual revenue by 2030, representing a 5 to 10 … Read more

Build a biomedical research agent with Biomni tools and Amazon Bedrock AgentCore Gateway

This post is co-authored with the Biomni group from Stanford. Biomedical researchers spend approximately 90% of their time manually processing massive volumes of scattered information. This is evidenced by Genentech’s challenge of processing 38 million biomedical publications in PubMed, public repositories like the Human Protein Atlas, and their internal repository of hundreds of millions of … Read more

Make your web apps hands-free with Amazon Nova Sonic

Graphical user interfaces have carried the torch for decades, but today’s users increasingly expect to talk to their applications. Amazon Nova Sonic is a state-of-the-art foundation model from Amazon Bedrock that helps enable this shift by providing natural, low-latency, bidirectional speech conversations over a simple streaming API. Users can collaborate with the applications through voice … Read more

Harnessing the power of generative AI: Druva’s multi-agent copilot for streamlined data protection

This post is co-written with David Gildea and Tom Nijs from Druva. Generative AI is transforming the way businesses interact with their customers and revolutionizing conversational interfaces for complex IT operations. Druva, a leading provider of data security solutions, is at the forefront of this transformation. In collaboration with Amazon Web Services (AWS), Druva is … Read more

Introducing agent-to-agent protocol support in Amazon Bedrock AgentCore Runtime

We recently announced support for the Agent-to-Agent (A2A) protocol on Amazon Bedrock AgentCore Runtime. With this addition, agents can discover peers, share capabilities, and coordinate actions across platforms using standardized communication. Amazon Bedrock AgentCore Runtime provides a secure, serverless environment designed for deploying AI agents and tools. It works with any framework and model, supports … Read more

Powering enterprise search with the Cohere Embed 4 multimodal embeddings model in Amazon Bedrock

The Cohere Embed 4 multimodal embeddings model is now available as a fully managed, serverless option in Amazon Bedrock. Users can choose between cross-Region inference (CRIS) or Global cross-Region inference to manage unplanned traffic bursts by utilizing compute resources across different AWS Regions. Real-time information requests and time zone concentrations are example events that can cause … Read more

A guide to building AI agents in GxP environments

Healthcare and life sciences organizations are transforming drug discovery, medical devices, and patient care with generative AI agents. In regulated industries, any system that impacts product quality or patient safety must comply with GxP (Good Practice) regulations, such as Good Clinical Practice (GCP), Good Laboratory Practice (GLP), and Good Manufacturing Practice (GMP). Organizations must demonstrate to … Read more

Multi-Agent collaboration patterns with Strands Agents and Amazon Nova

Multi-agent generative AI systems use multiple specialized AI agents working together to handle complex, multi-faceted tasks that exceed the capabilities of any single model. By combining agents with different skills or modalities (for example, language, vision, audio, video), these systems can tackle tasks in parallel or sequence, yielding more robust results. Recent research shows that … Read more