Tag: rag

  • RAG with Hybrid Search: How Does Keyword Search Work?

    RAG with Hybrid Search: How Does Keyword Search Work? Understanding keyword search, TF-IDF, and BM25 The post RAG with Hybrid Search: How Does Keyword Search Work? appeared first on Towards Data Science. Maria Mouschoutzi Go to original source

  • Agentic RAG vs Classic RAG: From a Pipeline to a Control Loop

    Agentic RAG vs Classic RAG: From a Pipeline to a Control Loop A practical guide to choosing between single-pass pipelines and adaptive retrieval loops based on your use case’s complexity, cost, and reliability requirements The post Agentic RAG vs Classic RAG: From a Pipeline to a Control Loop appeared first on Towards Data Science. Mostafa…

  • Chunk Size as an Experimental Variable in RAG Systems

    Chunk Size as an Experimental Variable in RAG Systems Understanding retrieval in RAG systems by experimenting with different chunk sizes The post Chunk Size as an Experimental Variable in RAG Systems appeared first on Towards Data Science. Sarah Schürch Go to original source

  • Six Lessons Learned Building RAG Systems in Production

    Six Lessons Learned Building RAG Systems in Production Best practices for data quality, retrieval design, and evaluation in production RAG systems The post Six Lessons Learned Building RAG Systems in Production appeared first on Towards Data Science. Sabrine Bendimerad Go to original source

  • How to Evaluate Retrieval Quality in RAG Pipelines (Part 3): DCG@k and NDCG@k

    How to Evaluate Retrieval Quality in RAG Pipelines (Part 3): DCG@k and NDCG@k The third and final part for evaluating the retrieval quality of your RAG pipeline with graded measures The post How to Evaluate Retrieval Quality in RAG Pipelines (Part 3): DCG@k and NDCG@k appeared first on Towards Data Science. Maria Mouschoutzi Go to…

  • Multi-Agent SQL Assistant, Part 2: Building a RAG Manager

    Multi-Agent SQL Assistant, Part 2: Building a RAG Manager A hands-on guide to comparing multiple RAG strategies — Keyword, FAISS, and Chroma The post Multi-Agent SQL Assistant, Part 2: Building a RAG Manager appeared first on Towards Data Science. Alle Sravani Go to original source

  • How to Evaluate Retrieval Quality in RAG Pipelines (part 2): Mean Reciprocal Rank (MRR) and Average Precision (AP)

    How to Evaluate Retrieval Quality in RAG Pipelines (part 2): Mean Reciprocal Rank (MRR) and Average Precision (AP) Evaluating the retrieval quality of your RAG pipeline with binary, order-aware measures The post How to Evaluate Retrieval Quality in RAG Pipelines (part 2): Mean Reciprocal Rank (MRR) and Average Precision (AP) appeared first on Towards Data…

  • Building a Multimodal RAG That Responds with Text, Images, and Tables from Sources

    Building a Multimodal RAG That Responds with Text, Images, and Tables from Sources Why do few chatbots return figures from source documents in their responses? The post Building a Multimodal RAG That Responds with Text, Images, and Tables from Sources appeared first on Towards Data Science. Partha Sarkar Go to original source

  • Graph RAG vs SQL RAG

    Graph RAG vs SQL RAG Evaluating RAGs on graph and SQL databases The post Graph RAG vs SQL RAG appeared first on Towards Data Science. Reinhard Sellmair Go to original source

  • How to Evaluate Retrieval Quality in RAG Pipelines: Precision@k, Recall@k, and F1@k

    How to Evaluate Retrieval Quality in RAG Pipelines: Precision@k, Recall@k, and F1@k In my previous posts, I have walked you through putting together a very basic RAG pipeline in Python, as well as chunking large text documents. We’ve also looked into how documents are transformed into embeddings, allowing us to quickly search for similar documents…

  • RAG Explained: Reranking for Better Answers

    RAG Explained: Reranking for Better Answers How reranking improves retrieval-augmented generation by surfacing the most relevant results The post RAG Explained: Reranking for Better Answers appeared first on Towards Data Science. Maria Mouschoutzi Go to original source

  • RAG Explained: Understanding Embeddings, Similarity, and Retrieval

    RAG Explained: Understanding Embeddings, Similarity, and Retrieval Let’s take a closer look at how the retrieval mechanism works The post RAG Explained: Understanding Embeddings, Similarity, and Retrieval appeared first on Towards Data Science. Maria Mouschoutzi Go to original source

  • Evaluating Your RAG Solution

    Evaluating Your RAG Solution A guide to building and evaluating RAG solutions by leveraging LLM-as-a-Judge capabilities. The post Evaluating Your RAG Solution appeared first on Towards Data Science. Alex Davis Go to original source

  • Hitchhiker’s Guide to RAG: From Tiny Files to Tolstoy with OpenAI’s API and LangChain

    Hitchhiker’s Guide to RAG: From Tiny Files to Tolstoy with OpenAI’s API and LangChain Scaling a simple RAG pipeline from simple notes to full books The post Hitchhiker’s Guide to RAG: From Tiny Files to Tolstoy with OpenAI’s API and LangChain appeared first on Towards Data Science. Maria Mouschoutzi Go to original source

  • Hitchhiker’s Guide to RAG with ChatGPT API and LangChain

    Hitchhiker’s Guide to RAG with ChatGPT API and LangChain Build a simple Python RAG pipeline using your local files as context The post Hitchhiker’s Guide to RAG with ChatGPT API and LangChain appeared first on Towards Data Science. Maria Mouschoutzi Go to original source

  • Agentic RAG Applications: Company Knowledge Slack Agents

    Agentic RAG Applications: Company Knowledge Slack Agents Lessons learnt using LlamaIndex and Modal The post Agentic RAG Applications: Company Knowledge Slack Agents appeared first on Towards Data Science. Ida Silfverskiöld Go to original source

  • Overcome Failing Document Ingestion & RAG Strategies with Agentic Knowledge Distillation

    Overcome Failing Document Ingestion & RAG Strategies with Agentic Knowledge Distillation Introduction Many generative AI use cases still revolve around Retrieval Augmented Generation (RAG), yet consistently fall short of user expectations. Despite the growing body of research on RAG improvements and even adding Agents into the process, many solutions still fail to return exhaustive results,…

  • Enhancing RAG: Beyond Vanilla Approaches

    Enhancing RAG: Beyond Vanilla Approaches Retrieval-Augmented Generation (RAG) is a powerful technique that enhances language models by incorporating external information retrieval mechanisms. While standard RAG implementations improve response relevance, they often struggle in complex retrieval scenarios. This article explores the limitations of a vanilla RAG setup and introduces advanced techniques to enhance its accuracy and…

  • Synthetic Data Generation with LLMs

    Synthetic Data Generation with LLMs Popularity of RAG Over the past two years while working with financial firms, I’ve observed firsthand how they identify and prioritize Generative AI use cases, balancing complexity with potential value. Retrieval-Augmented Generation (RAG) often stands out as a foundational capability across many LLM-driven solutions, striking a balance between ease of implementation…

  • Supercharge Your RAG with Multi-Agent Self-RAG

    Supercharge Your RAG with Multi-Agent Self-RAG Introduction Many of us might have tried to build a RAG application and noticed it falls significantly short of addressing real-life needs. Why is that? It’s because many real-world problems require multiple steps of information retrieval and reasoning. We need our agent to perform those as humans normally do,…

  • Multimodal RAG: Process Any File Type with AI

    Multimodal RAG: Process Any File Type with AI A beginner-friendly guide with example (Python) code This is the third article in a larger series on multimodal AI. In the previous posts, we discussed multimodal LLMs and embedding models, respectively. In this article, we will combine these ideas to enable the development of multimodal RAG systems. I’ll…

  • RAG: Hybrid Search Based on Two Indexes

    RAG: Hybrid Search Based on Two Indexes The proposition I will be talking about in this article is something I already have implemented and I am currently testing in a personal… Continue reading on Towards Data Science » Jérôme DIAZ Go to original source