Machine Learning – Page 2

OpenAI models and Codex on Amazon Bedrock are now generally available

June 1, 2026 by kamal

GPT-5.5, GPT-5.4, and Codex are now generally available on Amazon Bedrock. Deploy them in production applications and agents today, on Bedrock’s high-performance inference engine. Key takeaways: GPT-5.5, the most advanced frontier model from OpenAI, is generally available on Amazon Bedrock. Pricing matches OpenAI first-party rates. Codex on Amazon Bedrock is generally available with pay-per-token pricing. Inference runs through Bedrock, and … Read more

Extending MCP support for Amazon Bedrock AgentCore Gateway

June 1, 2026 by kamal

While deploying Model Context Protocol (MCP) servers in production, enterprises need fine-grained access control across servers, observability into which teams use which tools, security guarantees against data exfiltration, and centralized credential management, all at scale. Amazon Bedrock AgentCore Gateway sits between MCP servers and the clients that consume them, centralizing credential management, observability, and secure … Read more

Secure AI agents with Policy and Lambda interceptors in Amazon Bedrock AgentCore Gateway

June 1, 2026 by kamal

Securing AI agent behavior is a key customer challenge in building agentic solutions. As enterprises rapidly adopt AI agents to automate workflows, they face a scaling challenge in managing secure access to tools across the organization. Modern unified enterprise AI platforms have hundreds of agents serving users across the organization. These agents need to access … Read more

Enable safe agentic payments with built-in guardrails using Amazon Bedrock AgentCore payments

June 1, 2026 by kamal

Agents increasingly take actions on behalf of their end users, whether that’s selecting tools, browsing the web, or calling MCP servers autonomously to achieve a goal. When the tools, MCP endpoints, or web resources an agent reaches are paid, the agent gets stuck without the ability to transact. Amazon Bedrock AgentCore payments, announced in preview … Read more

AgentOps: Operationalize agentic AI at scale with Amazon Bedrock AgentCore

June 1, 2026 by kamal

When you build agentic AI solutions, you face unique operational challenges. Agents make unpredictable decisions, costs spiral unexpectedly, and debugging non-deterministic failures seems impossible. Agentic AI applications don’t just execute predetermined workflows. They reason, adapt, and make autonomous decisions, and DevOps practices need to be adapted. That’s where AgentOps comes in, the operational discipline for … Read more

Accelerate LLM model loading and increase context windows with GPUDirect on Amazon FSx for Lustre and TurboQuant

June 1, 2026 by kamal

If you’re iterating on deploying large language models (LLMs) on AWS GPU instances, you’ve probably noticed that the larger the model to be loaded into GPU High Bandwidth Memory (HBM), the longer the painful wait until the GPUs are ready for inference. As models grow to hundreds of billions of parameters and GPU environments grow ever … Read more

Amazon Quick integration with time-series databases for market intelligence using MCP

June 1, 2026 by kamal

Model Context Protocol (MCP) integration in Amazon Quick transforms how financial analysts access time-series market intelligence, removing the need for complex database queries. As a financial analyst, you navigate millions of stock trades flowing through markets every second, searching for patterns that drive trading decisions. Financial institutions often use time series databases to analyze high-frequency … Read more

Comprehensive observability for Amazon SageMaker AI LLM inference: From GPU utilization to LLM quality

May 29, 2026 by kamal

Deploying large language models (LLMs) at scale on Amazon SageMaker AI Inference makes observability a critical pillar of any production machine learning (ML) strategy. Unlike conventional software that returns deterministic outputs, LLMs generate variable, free-form responses that are difficult to validate with standard metrics. LLM output quality can change over time as input distributions shift, … Read more

Training Azerbaijani language models on Amazon SageMaker AI

May 28, 2026 by kamal

This solution builds on open source tools including PyTorch, Hugging Face Transformers, and Liger Kernels. The authors would also like to thank Aiham Taleb, Arefeh Ghahvechi, Manav Choudhary, Rohit Thekkanal, Daz Akbarov, Jamila Jamilova, Ross Povelikin, Almas Moldakanov, Christelle Xu, and Ivan Khvostishkov for their contributions in making this project possible. Azercell Telecom LLC, Azerbaijan’s … Read more

Build a custom portal with embedded Amazon SageMaker AI MLflow Apps

May 28, 2026 by kamal

As ML teams grow, embedding Amazon SageMaker AI MLflow Apps into a custom portal requires a scalable approach to access management. Distributing presigned URLs doesn’t scale for teams with dozens of data scientists, and granting individual AWS Management Console access adds operational overhead for administrators managing access controls. Teams who rely on SSO-integrated internal portals … Read more

Streamline external access to Amazon SageMaker MLflow using a REST API proxy

May 28, 2026 by kamal

Machine learning (ML) teams use MLflow to manage their ML lifecycle effectively. Amazon SageMaker MLflow provides comprehensive ML experiment tracking and model management capabilities. However, many enterprises have existing infrastructure requirements that need HTTPS-based integrations rather than direct SDK usage. Many organizations need to integrate Amazon SageMaker MLflow with their established systems while maintaining their … Read more

Evaluating Deep Agents using LangSmith on AWS

May 28, 2026 by kamal

This post was co-authored with Karan Singh, Head of Partnerships at LangChain. Validating AI agent behavior before production is one of the hardest problems in applied AI. Agents are non-deterministic and multi-step, so errors in early steps can affect downstream results. A single bad tool call can cascade through an entire workflow. LangSmith on AWS gives … Read more

Build a test suite that grows with your agent with dataset management in Amazon Bedrock AgentCore

May 28, 2026 by kamal

Agent evaluation is most powerful when you combine fast-moving online signals with stable offline baselines. To understand whether your agent is truly improving over time, you need a fixed benchmark alongside your changing real-world traffic. Managing test cases for evaluation baselines as a dataset in Amazon Bedrock AgentCore brings the discipline of versioned test fixtures … Read more

Claude Opus 4.8 is now available on AWS

May 28, 2026 by kamal

Today, we’re excited to announce the availability of Anthropic’s most advanced Opus model, Claude Opus 4.8, on Amazon Bedrock and the Claude Platform on AWS. Claude Opus 4.8 represents a meaningful step forward, delivering improvements across the workflows teams run in production, from agentic coding and deep knowledge work to multi-stage autonomous tasks that span … Read more

Automate AML alert triage with Amazon Quick and Snowflake Cortex AI

May 28, 2026 by kamal

Financial institutions running on AWS and Snowflake benefit from a deeply integrated framework that combines Snowflake’s AI Data Cloud with AWS cloud infrastructure, including integrations with AWS services such as Amazon Simple Storage Service (Amazon S3), AWS Glue, Amazon SageMaker, and Amazon Bedrock. With over 50 native integrations between AWS services and Snowflake, organizations can … Read more

Process financial documents using Amazon Bedrock Data Automation

May 27, 2026 by kamal

Financial institutions process thousands of documents daily, including tax forms, loan statements, and purchase orders. Each has a unique format, structure, and field names, making it challenging to create automation workflows using optical character recognition (OCR) software. Amazon Bedrock Data Automation (BDA) helps solve these challenges by automating the extraction, validation, and analysis of data … Read more

Building AI agents for business support using Amazon Bedrock AgentCore

May 27, 2026 by kamal

Developing AI agents for business support presents unique challenges that many organizations face when trying to automate routine HR tasks. Works Human Intelligence (WHI) develops, sells, and supports the integrated HR system “COMPANY” for major Japanese corporations and public interest corporations. In this post, we share how the AWS Generative AI Innovation Center (GenAIIC) collaborated … Read more

From data overload to actionable insights: How Verizon Connect scaled agentic AI to 100,000 users

May 27, 2026 by kamal

Special thanks go to the Verizon Connect team, who have been working very hard on the project: Matteo Simoncini, Luca Bravi, Alberto Rossettini, Martin Villarruel, Ceyhun Unlu, Adriel Zuquini, and Andrea Benericetti. Fleet managers today face an overwhelming challenge: transforming data overload into actionable insights. When you’re managing thousands of vehicles, each generating hundreds of daily … Read more

How AWS SMGS uses an AI-powered conversational assistant to transform business management with Amazon Bedrock AgentCore

May 27, 2026 by kamal

AWS leaders manage complex data across multiple hierarchies while making time-sensitive decisions that impact global operations. Traditional business intelligence relies on static dashboards and manual reports, which creates delays and limits organizational agility. NarrateAI, our intelligent conversational solution, addresses this through conversational agentic AI powered by our data lake and Amazon Bedrock AgentCore. Accessible through … Read more

Powering agentic AI sales strategy with Amazon Bedrock AgentCore

May 27, 2026 by kamal

As agent adoption scaled, we saw a common pattern emerge across enterprises, including our own sales organization: specialized agents deliver value, but without orchestration, users carry the cognitive load of choosing between them. At AWS Sales, this meant more than 20 domain-specific agents deployed across the global organization, with representatives context-switching between systems instead of … Read more

Technical deep dive: AgentCore payments and innovation in agentic commerce

May 26, 2026 by kamal

The industry is entering a world where billions of generative AI agents operate autonomously, acting on behalf of humans, making decisions, and completing tasks without human intervention. To support this shift, Amazon Bedrock AgentCore provides a modular, fully managed platform that helps developers build, deploy, and operate generative AI agents at scale. By abstracting the … Read more

Build highly scalable serverless LangGraph multi-agent systems in AWS with Amazon Bedrock AgentCore

May 26, 2026 by kamal

Generative AI has rapidly evolved from experimental prototypes into systems that are expected to operate reliably in production, at scale, and under real-world performance constraints. As organizations move beyond demos and proofs of concept, they increasingly encounter challenges related to inference latency, scalability, state management, and operational visibility. Building high-performance AI agents today requires more … Read more

Build high-performance generative AI systems with Strands Agents, NVIDIA NIM, and Amazon Bedrock AgentCore

May 26, 2026 by kamal

Building high-performance generative AI agents requires architecture that can deliver fast inference, coordinate multiple agents, and operate reliably under production workloads. If you are building generative AI agents to automate reviews, power digital assistants, and support complex decision-making workflows, you need these agents to perform well. They must reduce manual effort, respond in near real … Read more

AgentWatch: Proactive AWS monitoring with ambient agents

May 26, 2026 by kamal

AgentWatch delivers ambient AWS resource monitoring for your DevOps team, moving beyond the reactive cycle of managing Amazon CloudWatch alarms across multiple accounts. CloudWatch alarms trigger too late, AWS Lambda errors accumulate unnoticed, and Amazon Elastic Compute Cloud (Amazon EC2) performance degradation goes undetected until customers report problems. This leaves your team constantly firefighting rather … Read more

From idea to AI app: Creating intelligent research assistants with Strands

May 26, 2026 by kamal

Building an AI app shouldn’t require a PhD in machine learning (ML) or months of wrestling with complex architectures. Yet that’s exactly what happens when you try to orchestrate multiple API calls, manage conversation state, and create agents that can reason on their own. I’ve seen straightforward AI ideas balloon into sprawling projects that demand … Read more

Build an enterprise observability solution for Amazon Quick

May 26, 2026 by kamal

When hundreds to thousands of users are onboarded to an enterprise AI platform, business leaders and platform owners need visibility into who is using the platform, whether users are satisfied with the answers they receive, and which capabilities are driving the most engagement. Without a centralized observability solution, this data is scattered across multiple AWS … Read more