Seminar "Reinforcemnent Learning in LLM"

19 Dicembre 2025, ore 9:30 - 11:00

nella Sala Seminari, edificio Abacus (U14)

il Dott. Matteo Hessel

Google DeepMind

terrà un seminario dal titolo

Reinforcemnent Learning in LLM

Abstract

In this lecture we will discuss opportunities and challenges of using RL for fine-tuning LLMs. We will cover foundational ideas in RLHF (RL from human feedback) and RLVF (RL from Verifiable Rewards), and then discuss case studies and recent success in using RL for equipping LLMs with powerful Reasoning capabilities.

Breve bio

Matteo Hessel is a Senior Staff Researcher at Google DeepMind, working on Gemini. His research focuses on reinforcement learning, particularly its intersections with deep learning, large language models, and meta-learning.

Persona di contatto per questo seminario: enza.messina@unimib.it

Seminario Reinforcemnent Learning in LLM - 19 dciembre 2025.pdf

Argomento

Seminario