Skip to Main Content (Press Enter)

Logo UNIMI
  • ×
  • Home
  • Persone
  • Attività
  • Ambiti
  • Strutture
  • Pubblicazioni
  • Terza Missione

Expertise & Skills
Logo UNIMI

|

Expertise & Skills

unimi.it
  • ×
  • Home
  • Persone
  • Attività
  • Ambiti
  • Strutture
  • Pubblicazioni
  • Terza Missione
  1. Pubblicazioni

Goal-Directed Planning via Hindsight Experience Replay

Contributo in Atti di convegno
Data di Pubblicazione:
2022
Citazione:
Goal-Directed Planning via Hindsight Experience Replay / L. Moro, A. Likmeta, M. Restelli, E. Prati - In: ICLR 2022 - 10th International Conference on Learning Representations[s.l] : International Conference on Learning Representations, ICLR, 2022. - pp. 1-16 (( Intervento presentato al 10. convegno International Conference on Learning Representations tenutosi a on line nel 2022.
Abstract:
We consider the problem of goal-directed planning under a deterministic transition model. Monte Carlo Tree Search has shown remarkable performance in solving deterministic control problems. By using function approximators to bias the search of the tree, MCTS has been extended to complex continuous domains, resulting in the AlphaZero family of algorithms. Nonetheless, these algorithms still struggle with control problems with sparse rewards such as goal-directed domains, where a positive reward is awarded only when reaching a goal state. In this work, we extend AlphaZero with Hindsight Experience Replay to tackle complex goal-directed planning tasks. We demonstrate the effectiveness of the proposed approach through an extensive empirical evaluation in several simulated domains, including a novel application to a quantum compiling domain.
Tipologia IRIS:
03 - Contributo in volume
Elenco autori:
L. Moro, A. Likmeta, M. Restelli, E. Prati
Autori di Ateneo:
PRATI ENRICO ( autore )
Link alla scheda completa:
https://air.unimi.it/handle/2434/991816
Titolo del libro:
ICLR 2022 - 10th International Conference on Learning Representations
  • Aree Di Ricerca

Aree Di Ricerca

Settori


Settore FIS/02 - Fisica Teorica, Modelli e Metodi Matematici
  • Informazioni
  • Assistenza
  • Accessibilità
  • Privacy
  • Utilizzo dei cookie
  • Note legali

Realizzato con VIVO | Progettato da Cineca | 26.1.3.0