PHAROS Training Series - Course 12 "Compute-Efficient Methods for Large Language Models"

Name: PHAROS Training Series - Course 12 "Compute-Efficient Methods for Large Language Models"
Start: 2026-07-17T11:00:00+03:00
End: 2026-07-17T14:00:00+03:00
Location: No location set

17 July 2026

Europe/Athens timezone

Description
Program
Registration

Efficient training, fine-tuning and inference of large-scale ML models

17 Jul 2026, 11:00

Constantine Dovrolis (The Cyprus Institute)

This talk presents model-centric methods for efficient generative AI. It explains why training and inference of LLMs are computationally heavy, then covers model compression methods such as quantization, neural network pruning, low-rank approximations, and knowledge distillation. It also introduces efficient pre-training with mixed-precision acceleration and PHEW, parameter-efficient fine-tuning methods such as LLM-Adapters, LLaMA-Adapter, P-Tuning, and LoraHub, and efficient inference techniques including speculative decoding and KV-cache optimization.

There are no materials yet.

PHAROS Training Series - Course 12 "Compute-Efficient Methods for Large Language Models"

Efficient training, fine-tuning and inference of large-scale ML models

Speaker

Description

Presentation Materials