BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//CERN//INDICO//EN
BEGIN:VEVENT
SUMMARY:Hands-on 2: chunking\, metadata and vector indexing
DTSTART;VALUE=DATE-TIME:20260707T093500Z
DTEND;VALUE=DATE-TIME:20260707T095000Z
DTSTAMP;VALUE=DATE-TIME:20260703T161438Z
UID:indico-contribution-1151@events.grnet.gr
DESCRIPTION:Speakers: Sotiris  Gyftopoulos (ATHENA RC)\nThis second hands-
 on tutorial applies the knowledge preparation concepts introduced in the m
 ethodology session. Participants will create document chunks from the load
 ed public-service dataset\, attach useful metadata and generate embeddings
  for each chunk. The session demonstrates how metadata such as document ti
 tle\, section\, source identifier and chunk position helps preserve tracea
 bility and makes later retrieval more controllable. Participants will then
  build a local vector index in the Colab environment and run an initial si
 milarity search to inspect the retrieved evidence. The emphasis is on visi
 bility and debugging: participants will read example chunks\, check whethe
 r they are coherent\, verify that metadata is correct and examine whether 
 the first retrieval results are meaningful. By the end\, participants will
  have a working indexed knowledge base and will understand how data prepar
 ation decisions shape everything that follows in the RAG pipeline. This re
 sult becomes the basis for hybrid retrieval\, reranking and answer generat
 ion exercises.\n\nhttps://events.grnet.gr/event/213/contributions/1151/
LOCATION:
URL:https://events.grnet.gr/event/213/contributions/1151/
END:VEVENT
END:VCALENDAR
