Skip to Main Content (Press Enter)

Logo UNIMI
  • ×
  • Home
  • Persone
  • Attività
  • Ambiti
  • Strutture
  • Pubblicazioni
  • Terza Missione

Expertise & Skills
Logo UNIMI

|

Expertise & Skills

unimi.it
  • ×
  • Home
  • Persone
  • Attività
  • Ambiti
  • Strutture
  • Pubblicazioni
  • Terza Missione
  1. Pubblicazioni

A Unified Analysis of Nonstochastic Delayed Feedback for Combinatorial Semi-Bandits, Linear Bandits, and MDPs

Articolo
Data di Pubblicazione:
2025
Citazione:
A Unified Analysis of Nonstochastic Delayed Feedback for Combinatorial Semi-Bandits, Linear Bandits, and MDPs / L. Zierahn, D. Van Der Hoeven, T. Lancewicki, A. Rosenberg, N.A. Cesa Bianchi. - In: JOURNAL OF MACHINE LEARNING RESEARCH. - ISSN 1533-7928. - 26:(2025), pp. 104.1-104.60.
Abstract:
We derive a new analysis of Follow The Regularized Leader (FTRL) for online learning with delayed bandit feedback. By separating the cost of delayed feedback from that of bandit feedback, our analysis allows us to obtain new results in four important settings. We derive the first optimal (up to logarithmic factors) regret bounds for combinatorial semi-bandits with delay and adversarial Markov Decision Processes with delay (both known and unknown transition functions). Furthermore, we use our analysis to develop an efficient algorithm for linear bandits with delay achieving near-optimal regret bounds. In order to derive these results we show that FTRL remains stable across multiple rounds under mild assumptions on the regularizer.
Tipologia IRIS:
01 - Articolo su periodico
Elenco autori:
L. Zierahn, D. Van Der Hoeven, T. Lancewicki, A. Rosenberg, N.A. Cesa Bianchi
Autori di Ateneo:
CESA BIANCHI NICOLO' ANTONIO ( autore )
Link alla scheda completa:
https://air.unimi.it/handle/2434/1175556
Link al Full Text:
https://air.unimi.it/retrieve/handle/2434/1175556/3110968/24-0496.pdf
Progetto:
European Lighthouse of AI for Sustainability (ELIAS)
  • Aree Di Ricerca

Aree Di Ricerca

Settori


Settore INFO-01/A - Informatica
  • Informazioni
  • Assistenza
  • Accessibilità
  • Privacy
  • Utilizzo dei cookie
  • Note legali

Realizzato con VIVO | Progettato da Cineca | 25.11.5.0