How TP ICAP transformed CRM data into real-time insights with Amazon Bedrock

This post is co-written with Ross Ashworth at TP ICAP. The ability to quickly extract insights from customer relationship management systems (CRMs) and vast amounts of meeting notes can mean the difference between seizing opportunities and missing them entirely. TP ICAP faced this challenge, having thousands of vendor meeting records stored in their CRM. Using … Read more

Principal Financial Group accelerates build, test, and deployment of Amazon Lex V2 bots through automation

This guest post was written by Mulay Ahmed and Caroline Lima-Lane of Principal Financial Group. The content and opinions in this post are those of the third-party authors and AWS is not responsible for the content or accuracy of this post. With US contact centers that handle millions of customer calls annually, Principal Financial Group® … Read more

Beyond vibes: How to properly select the right LLM for the right task

Choosing the right large language model (LLM) for your use case is becoming both increasingly challenging and essential. Many teams rely on one-time (ad hoc) evaluations based on limited samples from trending models, essentially judging quality on “vibes” alone. This approach involves experimenting with a model’s responses and forming subjective opinions about its performance. However, … Read more

Splash Music transforms music generation using AWS Trainium and Amazon SageMaker HyperPod

Generative AI is rapidly reshaping the music industry, empowering creators—regardless of skill—to create studio-quality tracks with foundation models (FMs) that personalize compositions in real time. As demand for unique, instantly generated content grows and creators seek smarter, faster tools, Splash Music collaborated with AWS to develop and scale music generation FMs, making professional music creation … Read more

Iterative fine-tuning on Amazon Bedrock for strategic model improvement

Organizations often face challenges when implementing single-shot fine-tuning approaches for their generative AI models. The single-shot fine-tuning method involves selecting training data, configuring hyperparameters, and hoping the results meet expectations without the ability to make incremental adjustments. Single-shot fine-tuning frequently leads to suboptimal results and requires starting the entire process from scratch when improvements are … Read more

Voice AI-powered drive-thru ordering with Amazon Nova Sonic and dynamic menu displays

Artificial Intelligence (AI) is transforming the quick-service restaurant industry, particularly in drive-thru operations where efficiency and customer satisfaction intersect. Traditional systems create significant obstacles in service delivery, from staffing limitations and order accuracy issues to inconsistent customer experiences across locations. These challenges, combined with rising labor costs and demand fluctuations, have pushed the industry to … Read more

Optimizing document AI and structured outputs by fine-tuning Amazon Nova Models and on-demand inference

Multimodal fine-tuning represents a powerful approach for customizing vision large language models (LLMs) to excel at specific tasks that involve both visual and textual information. Although base multimodal models offer impressive general capabilities, they often fall short when faced with specialized visual tasks, domain-specific content, or output formatting requirements. Fine-tuning addresses these limitations by adapting … Read more

Transforming enterprise operations: Four high-impact use cases with Amazon Nova

Since the launch of Amazon Nova at AWS re:Invent 2024, we have seen adoption trends across industries, with notable gains in operational efficiency, compliance, and customer satisfaction. With its capabilities in secure, multimodal AI and domain customization, Nova is enhancing workflows and enabling cost efficiencies across core use cases. In this post, we share four … Read more

Building smarter AI agents: AgentCore long-term memory deep dive

Building AI agents that remember user interactions requires more than just storing raw conversations. While Amazon Bedrock AgentCore short-term memory captures immediate context, the real challenge lies in transforming these interactions into persistent, actionable knowledge that spans across sessions. This is the information that transforms fleeting interactions into meaningful, continuous relationships between users and AI … Read more

Configure and verify a distributed training cluster with AWS Deep Learning Containers on Amazon EKS

Training state-of-the-art large language models (LLMs) demands massive, distributed compute infrastructure. Meta’s Llama 3, for instance, ran on 16,000 NVIDIA H100 GPUs for over 30.84 million GPU hours. Amazon Elastic Kubernetes Service (Amazon EKS) is a managed service that simplifies the deployment, management, and scaling of Kubernetes clusters that can scale up to the ranges … Read more

Scala development in Amazon SageMaker Studio with Almond kernel

Scala stands out as a versatile programming language that combines object-oriented and functional programming approaches. By running on the Java Virtual Machine (JVM), it maintains seamless compatibility with Java libraries while offering a concise and scalable development experience. The language has distinguished itself in the realm of distributed computing and big data processing, with the … Read more

Build a device management agent with Amazon Bedrock AgentCore

The proliferation of Internet of Things (IoT) devices has transformed how we interact with our environments, from homes to industrial settings. However, as the number of connected devices grows, so does the complexity of managing them. Traditional device management interfaces often require navigating through multiple applications, each with its own UI and learning curve. This … Read more

How Amazon Bedrock Custom Model Import streamlined LLM deployment for Salesforce

This post is cowritten by Salesforce’s AI Platform team members Srikanta Prasad, Utkarsh Arora, Raghav Tanaji, Nitin Surya, Gokulakrishnan Gopalakrishnan, and Akhilesh Deepak Gotmare. Salesforce’s Artificial Intelligence (AI) platform team runs customized large language models (LLMs)—fine-tuned versions of Llama, Qwen, and Mistral—for agentic AI applications like Agentforce. Deploying these models creates operational overheads: teams spend … Read more

Transforming the physical world with AI: the next frontier in intelligent automation 

The convergence of artificial intelligence with physical systems marks a pivotal moment in technological evolution. Physical AI, where algorithms transcend digital boundaries to perceive, understand, and manipulate the tangible world, will fundamentally transform how enterprises operate across industries. These intelligent systems bridge the gap between digital intelligence and physical reality, unlocking unprecedented opportunities for efficiency … Read more

Medical reports analysis dashboard using Amazon Bedrock, LangChain, and Streamlit

In healthcare, the ability to quickly analyze and interpret medical reports is crucial for both healthcare providers and patients. While medical reports contain valuable information, they often remain underutilized due to their complex nature and the time-intensive process of analysis. This complexity manifests in several ways: the interpretation of multiple parameters and their relationships (such … Read more

Kitsa transforms clinical trial site selection with Amazon Quick Automate

This post was written with Ajay Nyamati from Kitsa. The clinical trial industry conducts medical research studies to evaluate the safety, efficacy, and effectiveness of new drugs, treatments, or medical devices before they reach the market. The industry is a cornerstone of medical innovation, yet it continues to face a fundamental bottleneck: selection of the … Read more

Connect Amazon Quick Suite to enterprise apps and agents with MCP

Organizations need solutions for people and AI agents to securely collaborate through a single interface to the organization’s data and take actions across enterprise applications to improve productivity. The ability of an AI agent to securely and seamlessly connect with organizational knowledge bases, enterprise applications, and other AI agents is foundational to drive adoption and … Read more

Make agents a reality with Amazon Bedrock AgentCore: Now generally available

Get agents out of prototype purgatory and into production with security, scalability, and reliability When we launched AWS in 2006, we believed that cloud computing would transform how organizations build and scale technology. We’re now at a similar inflection point with AI agents. We envision a world where billions of agents work together, transforming everything … Read more

Use Amazon SageMaker HyperPod and Anyscale for next-generation distributed computing

This post was written with Dominic Catalano from Anyscale. Organizations building and deploying large-scale AI models often face critical infrastructure challenges that can directly impact their bottom line: unstable training clusters that fail mid-job, inefficient resource utilization driving up costs, and complex distributed computing frameworks requiring specialized expertise. These factors can lead to unused GPU … Read more

Customizing text content moderation with Amazon Nova

Consider a growing social media platform that processes millions of user posts daily. Their content moderation team faces a familiar challenge: their rule-based system flags a cooking video discussing “knife techniques” as violent content, frustrating users, while simultaneously missing a veiled threat disguised as a restaurant review. When they try a general-purpose AI moderation service, … Read more

Vxceed builds the perfect sales pitch for sales teams at scale using Amazon Bedrock

This post was co-written with Cyril Ovely from Vxceed. Consumer packaged goods (CPG) companies face a critical challenge in emerging economies: how to effectively retain revenue and grow customer loyalty at scale. Although these companies invest 15–20% of their revenue in trade promotions and retailer loyalty programs, the uptake of these programs has historically remained … Read more

Implement a secure MLOps platform based on Terraform and GitHub

Machine learning operations (MLOps) is the combination of people, processes, and technology to productionize ML use cases efficiently. To achieve this, enterprise customers must develop MLOps platforms to support reproducibility, robustness, and end-to-end observability of the ML use case’s lifecycle. Those platforms are based on a multi-account setup by adopting strict security constraints, development best … Read more

Automate Amazon QuickSight data stories creation with agentic AI using Amazon Nova Act

Amazon QuickSight data stories support global customers by transforming complex data into interactive narratives for faster decisions. However, manual creation of multiple daily data stories consumes significant time and resources, delaying critical decisions and preventing teams from focusing on valuable analysis. Each organization has multiple business units, and each business unit creates and operates multiple … Read more

Implement automated monitoring for Amazon Bedrock batch inference

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies through a single API, along with capabilities to build generative AI applications with security, privacy, and responsible AI. Batch inference in Amazon Bedrock is for larger workloads where immediate responses aren’t critical. With a batch … Read more

Responsible AI: How PowerSchool safeguards millions of students with AI-powered content filtering using Amazon SageMaker AI

This post is cowritten with Gayathri Rengarajan and Harshit Kumar Nyati from PowerSchool. PowerSchool is a leading provider of cloud-based software for K-12 education, serving over 60 million students in more than 90 countries and over 18,000 customers, including more than 90 of the top 100 districts by student enrollment in the United States. When … Read more

Unlock global AI inference scalability using new global cross-Region inference on Amazon Bedrock with Anthropic’s Claude Sonnet 4.5

Organizations are increasingly integrating generative AI capabilities into their applications to enhance customer experiences, streamline operations, and drive innovation. As generative AI workloads continue to grow in scale and importance, organizations face new challenges in maintaining consistent performance, reliability, and availability of their AI-powered applications. Customers are looking to scale their AI inference workloads across … Read more

Secure ingress connectivity to Amazon Bedrock AgentCore Gateway using interface VPC endpoints

Agentic AI applications represent a significant development in enterprise automation, where intelligent agents autonomously execute complex workflows, access sensitive datasets, and make real-time decisions across your organization’s infrastructure. Amazon Bedrock AgentCore accelerates enterprise AI transformation by providing fully managed services that remove infrastructure complexity, maintain session isolation, and enable seamless integration with enterprise tools so … Read more

Enhance agentic workflows with enterprise search using Kore.ai and Amazon Q Business

This post was written with Meghana Chintalapudi and Surabhi Sankhla of Kore.ai. As organizations struggle with exponentially growing volumes of data distributed across multiple repositories and applications, employees lose significant time—approximately 30% according to the International Data Corporation (IDC)—searching for information that could be spent on higher-value work. The complexity of modern enterprise data networks … Read more

Accelerate development with the Amazon Bedrock AgentCore MCP server

Today, we’re excited to announce the Amazon Bedrock AgentCore Model Context Protocol (MCP) Server. With built-in support for runtime, gateway integration, identity management, and agent memory, the AgentCore MCP Server is purpose-built to speed up creation of components compatible with Bedrock AgentCore. You can use the AgentCore MCP server for rapid prototyping, production AI solutions, … Read more

How Hapag-Lloyd improved schedule reliability with ML-powered vessel schedule predictions using Amazon SageMaker

This post is cowritten with Thomas Voss and Bernhard Hersberger from Hapag-Lloyd. Hapag-Lloyd is one of the world’s leading shipping companies with more than 308 modern vessels, 11.9 million TEUs (twenty-foot equivalent units) transported per year, and 16,700 motivated employees in more than 400 offices in 139 countries. They connect continents, businesses, and people through … Read more