AWSML – Page 17 – Kamal Reader

Effective cost optimization strategies for Amazon Bedrock

June 10, 2025 by kamal

Customers are increasingly using generative AI to enhance efficiency, personalize experiences, and drive innovation across various industries. For instance, generative AI can be used to perform text summarization, facilitate personalized marketing strategies, create business-critical chat-based assistants, and so on. However, as generative AI adoption grows, associated costs can escalate in several areas including cost in … Read more

How E.ON saves £10 million annually with AI diagnostics for smart meters powered by Amazon Textract

June 10, 2025 by kamal

E.ON—headquartered in Essen, Germany—is one of Europe’s largest energy companies, with over 72,000 employees serving more than 50 million customers across 15 countries. As a leading provider of energy networks and customer solutions, E.ON focuses on accelerating the energy transition across Europe. A key part of this mission involves the Smart Energy Solutions division, which … Read more

Building intelligent AI voice agents with Pipecat and Amazon Bedrock – Part 1

June 9, 2025 by kamal

Voice AI is transforming how we interact with technology, making conversational interactions more natural and intuitive than ever before. At the same time, AI agents are becoming increasingly sophisticated, capable of understanding complex queries and taking autonomous actions on our behalf. As these trends converge, you see the emergence of intelligent AI voice agents that … Read more

Stream multi-channel audio to Amazon Transcribe using the Web Audio API

June 9, 2025 by kamal

Multi-channel transcription streaming is a feature of Amazon Transcribe that can be used in many cases with a web browser. Creating this stream source has it challenges, but with the JavaScript Web Audio API, you can connect and combine different audio sources like videos, audio files, or hardware like microphones to obtain transcripts. In this … Read more

How Kepler democratized AI access and enhanced client services with Amazon Q Business

June 9, 2025 by kamal

This is a guest post co-authored by Evan Miller, Noah Kershaw, and Valerie Renda of Kepler Group At Kepler, a global full-service digital marketing agency serving Fortune 500 brands, we understand the delicate balance between creative marketing strategies and data-driven precision. Our company name draws inspiration from the visionary astronomer Johannes Kepler, reflecting our commitment … Read more

Build a serverless audio summarization solution with Amazon Bedrock and Whisper

June 6, 2025 by kamal

Recordings of business meetings, interviews, and customer interactions have become essential for preserving important information. However, transcribing and summarizing these recordings manually is often time-consuming and labor-intensive. With the progress in generative AI and automatic speech recognition (ASR), automated solutions have emerged to make this process faster and more efficient. Protecting personally identifiable information (PII) … Read more

Implement semantic video search using open source large vision models on Amazon SageMaker and Amazon OpenSearch Serverless

June 6, 2025 by kamal

As companies and individual users deal with constantly growing amounts of video content, the ability to perform low-effort search to retrieve videos or video segments using natural language becomes increasingly valuable. Semantic video search offers a powerful solution to this problem, so users can search for relevant video content based on textual queries or descriptions. … Read more

Multi-account support for Amazon SageMaker HyperPod task governance

June 6, 2025 by kamal

GPUs are a precious resource; they are both short in supply and much more costly than traditional CPUs. They are also highly adaptable to many different use cases. Organizations building or adopting generative AI use GPUs to run simulations, run inference (both for internal or external usage), build agentic workloads, and run data scientists’ experiments. … Read more

Build a Text-to-SQL solution for data consistency in generative AI using Amazon Nova

June 6, 2025 by kamal

Businesses rely on precise, real-time insights to make critical decisions. However, enabling non-technical users to access proprietary or organizational data without technical expertise remains a challenge. Text-to-SQL bridges this gap by generating precise, schema-specific queries that empower faster decision-making and foster a data-driven culture. The problem lies in obtaining deterministic answers—precise, consistent results needed for … Read more

Modernize and migrate on-premises fraud detection machine learning workflows to Amazon SageMaker

June 5, 2025 by kamal

This post is co-written with Qing Chen and Mark Sinclair from Radial. Radial is the largest 3PL fulfillment provider, also offering integrated payment, fraud detection, and omnichannel solutions to mid-market and enterprise brands. With over 30 years of industry expertise, Radial tailors its services and solutions to align strategically with each brand’s unique needs. Radial … Read more

Contextual retrieval in Anthropic using Amazon Bedrock Knowledge Bases

June 5, 2025 by kamal

For an AI model to perform effectively in specialized domains, it requires access to relevant background knowledge. A customer support chat assistant, for instance, needs detailed information about the business it serves, and a legal analysis tool must draw upon a comprehensive database of past cases. To equip large language models (LLMs) with this knowledge, … Read more

Run small language models cost-efficiently with AWS Graviton and Amazon SageMaker AI

June 5, 2025 by kamal

As organizations look to incorporate AI capabilities into their applications, large language models (LLMs) have emerged as powerful tools for natural language processing tasks. Amazon SageMaker AI provides a fully managed service for deploying these machine learning (ML) models with multiple inference options, allowing organizations to optimize for cost, latency, and throughput. AWS has always … Read more

Impel enhances automotive dealership customer experience with fine-tuned LLMs on Amazon SageMaker

June 4, 2025 by kamal

This post is co-written with Tatia Tsmindashvili, Ana Kolkhidashvili, Guram Dentoshvili, Dachi Choladze from Impel. Impel transforms automotive retail through an AI-powered customer lifecycle management solution that drives dealership operations and customer interactions. Their core product, Sales AI, provides all-day personalized customer engagement, handling vehicle-specific questions and automotive trade-in and financing inquiries. By replacing their … Read more

How climate tech startups are building foundation models with Amazon SageMaker HyperPod

June 4, 2025 by kamal

Climate tech startups are companies that use technology and innovation to address the climate crisis, with a primary focus on either reducing greenhouse gas emissions or helping society adapt to climate change impacts. Their unifying mission is to create scalable solutions that accelerate the transition to a sustainable, low-carbon future. Solutions to the climate crisis … Read more

Supercharge your development with Claude Code and Amazon Bedrock prompt caching

June 4, 2025 by kamal

Prompt caching in Amazon Bedrock is now generally available, delivering performance and cost benefits for agentic AI applications. Coding assistants that process large codebases represent an ideal use case for prompt caching. In this post, we’ll explore how to combine Amazon Bedrock prompt caching with Claude Code—a coding agent released by Anthropic that is now … Read more

Unlocking the power of Model Context Protocol (MCP) on AWS

June 3, 2025 by kamal

We’ve witnessed remarkable advances in model capabilities as generative AI companies have invested in developing their offerings. Language models such as Anthropic’s Claude Opus 4 & Sonnet 4, Amazon Nova, and Amazon Bedrock can reason, write, and generate responses with increasing sophistication. But even as these models grow more powerful, they can only work with … Read more

Build a scalable AI assistant to help refugees using AWS

June 3, 2025 by kamal

This post is co-written with Taras Tsarenko, Vitalil Bozadzhy, and Vladyslav Horbatenko. As organizations worldwide seek to use AI for social impact, the Danish humanitarian organization Bevar Ukraine has developed a comprehensive virtual generative AI-powered assistant called Victor, aimed at addressing the pressing needs of Ukrainian refugees integrating into Danish society. This post details our … Read more

Enhanced diagnostics flow with LLM and Amazon Bedrock agent integration

June 3, 2025 by kamal

Noodoe is a global leader in EV charging innovation, offering advanced solutions that empower operators to optimize their charging station operations and provide exceptional user experiences. Their universal charging stations are compatible with all EV brands and feature intuitive payment options, including credit cards and Apple Pay. Powered by the Noodoe EV OS cloud management … Read more

Build GraphRAG applications using Amazon Bedrock Knowledge Bases

June 2, 2025 by kamal

In these days, it is more common to companies adopting AI-first strategy to stay competitive and more efficient. As generative AI adoption grows, the technology’s ability to solve problems is also improving (an example is the use case to generate comprehensive market report). One way to simplify the growing complexity of problems to be solved … Read more

Streamline personalization development: How automated ML workflows accelerate Amazon Personalize implementation

June 2, 2025 by kamal

Crafting unique, customized experiences that resonate with customers is a potent strategy for boosting engagement and fostering brand loyalty. However, creating dynamic personalized content is challenging and time-consuming because of the need for real-time data processing, complex algorithms for customer segmentation, and continuous optimization to adapt to shifting behaviors and preferences—all while providing scalability and … Read more

Fast-track SOP processing using Amazon Bedrock

June 2, 2025 by kamal

Standard operating procedures (SOPs) are essential documents in the context of regulations and compliance. SOPs outline specific steps for various processes, making sure practices are consistent, efficient, and compliant with regulatory standards. SOP documents typically include key sections such as the title, scope, purpose, responsibilities, procedures, documentation, citations (references), and a detailed approval and revision … Read more

Deploy Amazon SageMaker Projects with Terraform Cloud

May 30, 2025 by kamal

Amazon SageMaker Projects empower data scientists to self-serve Amazon Web Services (AWS) tooling and infrastructure to organize all entities of the machine learning (ML) lifecycle, and further enable organizations to standardize and constrain the resources available to their data science teams in pre-packaged templates. For AWS customers using Terraform to define and manage their infrastructure-as-code (IaC), … Read more

How ZURU improved the accuracy of floor plan generation by 109% using Amazon Bedrock and Amazon SageMaker

May 30, 2025 by kamal

ZURU Tech is on a mission to change the way we build, from town houses and hospitals to office towers, schools, apartment blocks, and more. Dreamcatcher is a user-friendly platform developed by ZURU that allows users with any level of experience to collaborate in the building design and construction process. With the simple click of … Read more

Going beyond AI assistants: Examples from Amazon.com reinventing industries with generative AI

May 30, 2025 by kamal

Generative AI revolutionizes business operations through various applications, including conversational assistants such as Amazon’s Rufus and Amazon Seller Assistant. Additionally, some of the most impactful generative AI applications operate autonomously behind the scenes, an essential capability that empowers enterprises to transform their operations, data processing, and content creation at scale. These non-conversational implementations, often in … Read more

Architect a mature generative AI foundation on AWS

May 30, 2025 by kamal

Generative AI applications seem simple—invoke a foundation model (FM) with the right context to generate a response. In reality, it’s a much more complex system involving workflows that invoke FMs, tools, and APIs and that use domain-specific data to ground responses with patterns such as Retrieval Augmented Generation (RAG) and workflows involving agents. Safety controls … Read more

Using Amazon OpenSearch ML connector APIs

May 30, 2025 by kamal

When ingesting data into Amazon OpenSearch, customers often need to augment data before putting it into their indexes. For instance, you might be ingesting log files with an IP address and want to get a geographic location for the IP address, or you might be ingesting customer comments and want to identify the language they … Read more

Bridging the gap between development and production: Seamless model lifecycle management with Amazon Bedrock

May 30, 2025 by kamal

In the landscape of generative AI, organizations are increasingly adopting a structured approach to deploy their AI applications, mirroring traditional software development practices. This approach typically involves separate development and production environments, each with its own AWS account, to create logical separation, enhance security, and streamline workflows. Amazon Bedrock is a fully managed service that … Read more

Revolutionizing earth observation with geospatial foundation models on AWS

May 29, 2025 by kamal

Emerging transformer-based vision models for geospatial data—also called geospatial foundation models (GeoFMs)—offer a new and powerful technology for mapping the earth’s surface at a continental scale, providing stakeholders with the tooling to detect and monitor surface-level ecosystem conditions such as forest degradation, natural disaster impact, crop yield, and many others. GeoFMs represent an emerging research … Read more

Create an agentic RAG application for advanced knowledge discovery with LlamaIndex, and Mistral in Amazon Bedrock

May 29, 2025 by kamal

Agentic Retrieval Augmented Generation (RAG) applications represent an advanced approach in AI that integrates foundation models (FMs) with external knowledge retrieval and autonomous agent capabilities. These systems dynamically access and process information, break down complex tasks, use external tools, apply reasoning, and adapt to various contexts. They go beyond simple question answering by performing multi-step … Read more

Text-to-image basics with Amazon Nova Canvas

May 29, 2025 by kamal

AI image generation has emerged as one of the most transformative technologies in recent years, revolutionizing how you create and interact with visual content. Amazon Nova Canvas is a generative model in the suite of Amazon Nova creative models that enables you to generate realistic and creative images from plain text descriptions. This post serves … Read more