Evaluating generative AI models with Amazon Nova LLM-as-a-Judge on Amazon SageMaker AI

Evaluating the performance of large language models (LLMs) goes beyond statistical metrics like perplexity or bilingual evaluation understudy (BLEU) scores. For most real-world generative AI scenarios, it’s crucial to understand whether a model is producing better outputs than a baseline or an earlier iteration. This is especially important for applications such as summarization, content generation, … Read more

Building cost-effective RAG applications with Amazon Bedrock Knowledge Bases and Amazon S3 Vectors

Vector embeddings have become essential for modern Retrieval Augmented Generation (RAG) applications, but organizations face significant cost challenges as they scale. As knowledge bases grow and require more granular embeddings, many vector databases that rely on high-performance storage such as SSDs or in-memory solutions become prohibitively expensive. This cost barrier often forces organizations to limit … Read more

In Cancer Research, AI Models Learn to See What Scientists Might Miss

Table of Links Abstract and I. Introduction Materials and Methods 2.1. Multiple Instance Learning 2.2. Model Architectures Results 3.1. Training Methods 3.2. Datasets 3.3. WSI Preprocessing Pipeline 3.4. Classification and RoI Detection Results Discussion 4.1. Tumor Detection Task 4.2. Gene Mutation Detection Task Conclusions Acknowledgements Author Declaration and References 5. Conclusions In this work, we … Read more