2026-01-08

RAG Evaluation with RAGAS and MLflow - A Practical Guide

A comprehensive tutorial demonstrating RAG evaluation using RAGAS metrics through MLflow integration. Learn to build a minimal RAG pipeline with LangChain, create golden evaluation datasets, and systematically assess retrieval quality using Faithfulness, Context Precision, Context Recall, and Factual Correctness metrics. Supports OpenAI, Azure OpenAI, and Ollama backends.

RAG Evaluation with RAGAS and MLflow - A Practical Guide

To cite this article:

@article{Saf2026RAG,
    author  = {Krystian Safjan},
    title   = {RAG Evaluation with RAGAS and MLflow - A Practical Guide},
    journal = {Krystian's Safjan Blog},
    year    = {2026},
}