Machine Learning – Page 23

How 123RF saved over 90% of their translation costs by switching to Amazon Bedrock

November 25, 2024 by

In the rapidly evolving digital content industry, multilingual accessibility is crucial for global reach and user engagement. 123RF, a leading provider of royalty-free digital content, is an online resource for creative assets, including AI-generated images from text. In 2023, they used Amazon OpenSearch Service to improve discovery of images by using vector-based semantic search. Building … Read more

Connect SharePoint Online to Amazon Q Business using OAuth 2.0 ROPC flow authentication

November 25, 2024 by

Enterprises face significant challenges accessing and utilizing the vast amounts of information scattered across organization’s various systems. What if you could simply ask a question and get instant, accurate answers from your company’s entire knowledge base, while accounting for an individual user’s data access levels? Amazon Q Business is a game changing AI assistant that’s … Read more

John Snow Labs Medical LLMs are now available in Amazon SageMaker JumpStart

November 25, 2024 by

Today, we are excited to announce that John Snow Labs’ Medical LLM – Small and Medical LLM – Medium large language models (LLMs) are now available on Amazon SageMaker Jumpstart. Medical LLM is optimized for the following medical language understanding tasks: Summarizing clinical encounters – Summarizing discharge notes, progress notes, radiology reports, pathology reports, and … Read more

Accelerating Mixtral MoE fine-tuning on Amazon SageMaker with QLoRA

November 22, 2024 by

Companies across various scales and industries are using large language models (LLMs) to develop generative AI applications that provide innovative experiences for customers and employees. However, building or fine-tuning these pre-trained LLMs on extensive datasets demands substantial computational resources and engineering effort. With the increase in sizes of these pre-trained LLMs, the model customization process … Read more

Amazon SageMaker Inference now supports G6e instances

November 22, 2024 by

As the demand for generative AI continues to grow, developers and enterprises seek more flexible, cost-effective, and powerful accelerators to meet their needs. Today, we are thrilled to announce the availability of G6e instances powered by NVIDIA’s L40S Tensor Core GPUs on Amazon SageMaker. You will have the option to provision nodes with 1, 4, and … Read more

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

November 22, 2024 by

Companies across all industries are harnessing the power of generative AI to address various use cases. Cloud providers have recognized the need to offer model inference through an API call, significantly streamlining the implementation of AI within applications. Although a single API call can address simple use cases, more complex ones may necessitate the use … Read more

Build generative AI applications on Amazon Bedrock with the AWS SDK for Python (Boto3)

November 22, 2024 by

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI. With … Read more

Improve factual consistency with LLM Debates

November 22, 2024 by

In this post, we demonstrate the potential of large language model (LLM) debates using a supervised dataset with ground truth. In this LLM debate, we have two debater LLMs, each one taking one side of an argument and defending it based on the previous arguments for N(=3) rounds. The arguments are saved for a judge … Read more

Governing the ML lifecycle at scale, Part 3: Setting up data governance at scale

November 22, 2024 by

This post is part of an ongoing series about governing the machine learning (ML) lifecycle at scale. To view this series from the beginning, start with Part 1. This post dives deep into how to set up data governance at scale using Amazon DataZone for the data mesh. The data mesh is a modern approach … Read more

Amazon Bedrock Flows is now generally available with enhanced safety and traceability

November 22, 2024 by

Today, we are excited to announce the general availability of Amazon Bedrock Flows (previously known as Prompt Flows). With Bedrock Flows, you can quickly build and execute complex generative AI workflows without writing code. Key benefits include: Simplified generative AI workflow development with an intuitive visual interface. Seamless integration of latest foundation models (FMs), Prompts, … Read more

Implement secure API access to your Amazon Q Business applications with IAM federation user access management

November 22, 2024 by

Amazon Q Business is a conversational assistant powered by generative AI that enhances workforce productivity by answering questions and completing tasks based on information in your enterprise systems, which each user is authorized to access. AWS recommends using AWS IAM Identity Center when you have a large number of users in order to achieve a … Read more

Enhance speech synthesis and video generation models with RLHF using audio and video segmentation in Amazon SageMaker

November 21, 2024 by

As generative AI models advance in creating multimedia content, the difference between good and great output often lies in the details that only human feedback can capture. Audio and video segmentation provides a structured way to gather this detailed feedback, allowing models to learn through reinforcement learning from human feedback (RLHF) and supervised fine-tuning (SFT). … Read more

Using responsible AI principles with Amazon Bedrock Batch Inference

November 21, 2024 by

Revolutionizing knowledge management: VW’s AI prototype journey with AWS

November 21, 2024 by

Today, we’re excited to share the journey of the VW—an innovator in the automotive industry and Europe’s largest car maker—to enhance knowledge management by using generative AI, Amazon Bedrock, and Amazon Kendra to devise a solution based on Retrieval Augmented Generation (RAG) that makes internal information more easily accessible by its users. This solution efficiently … Read more

Fine-tune large language models with Amazon SageMaker Autopilot

November 21, 2024 by

Fine-tuning foundation models (FMs) is a process that involves exposing a pre-trained FM to task-specific data and fine-tuning its parameters. It can then develop a deeper understanding and produce more accurate and relevant outputs for that particular domain. In this post, we show how to use an Amazon SageMaker Autopilot training job with the AutoMLV2 … Read more

Unify structured data in Amazon Aurora and unstructured data in Amazon S3 for insights using Amazon Q

November 20, 2024 by

In today’s data-intensive business landscape, organizations face the challenge of extracting valuable insights from diverse data sources scattered across their infrastructure. Whether it’s structured data in databases or unstructured content in document repositories, enterprises often struggle to efficiently query and use this wealth of information. In this post, we explore how you can use Amazon … Read more

Automate Q&A email responses with Amazon Bedrock Knowledge Bases

November 20, 2024 by

Email remains a vital communication channel for business customers, especially in HR, where responding to inquiries can use up staff resources and cause delays. The extensive knowledge required can make it overwhelming to respond to email inquiries manually. In the future, high automation will play a crucial role in this domain. Using generative AI allows … Read more

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

November 20, 2024 by

Retrieval Augmented Generation (RAG) has become a crucial technique for improving the accuracy and relevance of AI-generated responses. The effectiveness of RAG heavily depends on the quality of context provided to the large language model (LLM), which is typically retrieved from vector stores based on user queries. The relevance of this context directly impacts the … Read more

Embedding secure generative AI in mission-critical public safety applications

November 20, 2024 by

This post is co-written with Lawrence Zorio III from Mark43. Public safety organizations face the challenge of accessing and analyzing vast amounts of data quickly while maintaining strict security protocols. First responders need immediate access to relevant data across multiple systems, while command staff require rapid insights for operational decisions. Mission-critical public safety applications require … Read more

How FP8 boosts LLM training by 18% on Amazon SageMaker P5 instances

November 20, 2024 by

Large language models (LLMs) are AI systems trained on vast amounts of text data, enabling them to understand, generate, and reason with natural language in highly capable and flexible ways. LLM training has seen remarkable advances in recent years, with organizations pushing the boundaries of what’s possible in terms of model size, performance, and efficiency. … Read more

Racing into the future: How AWS DeepRacer fueled my AI and ML journey

November 19, 2024 by

In 2018, I sat in the audience at AWS re:Invent as Andy Jassy announced AWS DeepRacer—a fully autonomous 1/18th scale race car driven by reinforcement learning. At the time, I knew little about AI or machine learning (ML). As an engineer transitioning from legacy networks to cloud technologies, I had never considered myself a developer. … Read more

Your guide to generative AI and ML at AWS re:Invent 2024

November 19, 2024 by

The excitement is building for the fourteenth edition of AWS re:Invent, and as always, Las Vegas is set to host this spectacular event. This year, generative AI and machine learning (ML) will again be in focus, with exciting keynote announcements and a variety of sessions showcasing insights from AWS experts, customer stories, and hands-on experiences … Read more

Customize small language models on AWS with automotive terminology

November 19, 2024 by

In the rapidly evolving world of AI, the ability to customize language models for specific industries has become more important. Although large language models (LLMs) are adept at handling a wide range of tasks with natural language, they excel at general purpose tasks as compared with specialized tasks. This can create challenges when processing text … Read more

Automate emails for task management using Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails

November 19, 2024 by

In this post, we demonstrate how to create an automated email response solution using Amazon Bedrock and its features, including Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, and Amazon Bedrock Guardrails. Amazon Bedrock is a fully managed service that makes foundation models (FMs) from leading AI startups and Amazon Web Services available through an API, … Read more

Accelerate analysis and discovery of cancer biomarkers with Amazon Bedrock Agents

November 19, 2024 by

According to the National Cancer Institute, a cancer biomarker is a “biological molecule found in blood, other body fluids, or tissues that is a sign of a normal or abnormal process, or of a condition or disease such as cancer.” Biomarkers typically differentiate an affected patient from a person without the disease. Well-known cancer biomarkers … Read more

Automate building guardrails for Amazon Bedrock using test-driven development

November 19, 2024 by

As companies of all sizes continue to build generative AI applications, the need for robust governance and control mechanisms becomes crucial. With the growing complexity of generative AI models, organizations face challenges in maintaining compliance, mitigating risks, and upholding ethical standards. This is where the concept of guardrails comes into play, providing a comprehensive framework … Read more

Build cost-effective RAG applications with Binary Embeddings in Amazon Titan Text Embeddings V2, Amazon OpenSearch Serverless, and Amazon Bedrock Knowledge Bases

November 18, 2024 by

Today, we are happy to announce the availability of Binary Embeddings for Amazon Titan Text Embeddings V2 in Amazon Bedrock Knowledge Bases and Amazon OpenSearch Serverless. With support for binary embedding in Amazon Bedrock and a binary vector store in OpenSearch Serverless, you can use binary embeddings and binary vector store to build Retrieval Augmented … Read more

Automate cloud security vulnerability assessment and alerting using Amazon Bedrock

November 18, 2024 by

Cloud technologies are progressing at a rapid pace. Businesses are adopting new innovations and technologies to create cutting-edge solutions for their customers. However, security is a big risk when adopting the latest technologies. Enterprises often rely on reactive security monitoring and notification techniques, but those techniques might not be sufficient to safeguard your enterprises from … Read more

DXC transforms data exploration for their oil and gas customers with LLM-powered tools

November 18, 2024 by

This post is co-written with Kaustubh Kambli from DXC Technology. DXC Technology is an IT services leader with more than 130,000 employees supporting 6,000 customers in over 70 countries. DXC builds offerings across industry verticals to deliver accelerated value to their customers. One of the sectors DXC has deep expertise in is energy. The oil … Read more

How MSD uses Amazon Bedrock to translate natural language into SQL for complex healthcare databases

November 18, 2024 by

This post is co-written with Vladimir Turzhitsky, Varun Kumar Nomula and Yezhou Sun from MSD. Generative AI is transforming the way healthcare organizations interact with their data. Large language models (LLMs) can help uncover insights from structured data such as a relational database management system (RDBMS) by generating complex SQL queries from natural language questions, … Read more