PHAROS Training Series - Course 9 "RAG End-to-End: Architecture, Retrieval, Generation and Evaluation"

Name: PHAROS Training Series - Course 9 "RAG End-to-End: Architecture, Retrieval, Generation and Evaluation"
Start: 2026-07-07T11:00:00+03:00
End: 2026-07-07T15:15:00+03:00
Location: No location set

7 July 2026

Europe/Athens timezone

RAG evaluation: retrieval quality, faithfulness, groundedness and RAGAS/DeepEval concepts

7 Jul 2026, 14:40

20m

George Drosatos (ATHENA RC) Sotiris Gyftopoulos (ATHENA RC)

This evaluation session explains how to assess RAG systems beyond a single final answer score. Participants will learn why evaluation must decompose the pipeline into retrieval, generation, citation quality and end-to-end behaviour. The session introduces retrieval metrics such as Precision@k, Recall@k, Mean Reciprocal Rank and NDCG, showing how they describe the evidence made available to the model. It then discusses answer-level criteria, including correctness, faithfulness, groundedness, citation quality, completeness and relevance. Participants will also be introduced to RAGAS and DeepEval concepts for automated RAG evaluation, regression testing and structured comparison of system variants. The emphasis is diagnostic: metrics should identify failure modes and guide concrete improvements, such as better chunking, query handling, filtering, reranking or prompt constraints. By the end, participants will understand how to evaluate and iterate RAG systems systematically. This prepares them to maintain RAG quality as data, prompts and models evolve in operational settings, after deployment too.

There are no materials yet.

PHAROS Training Series - Course 9 "RAG End-to-End: Architecture, Retrieval, Generation and Evaluation"

RAG evaluation: retrieval quality, faithfulness, groundedness and RAGAS/DeepEval concepts

Speakers

Description

Presentation Materials