SHAC++: A Neural Network to Rule All Differentiable Simulators

Conference proceedings contribution
Publication date:
2025
Citation:
SHAC++: A Neural Network to Rule All Differentiable Simulators / F. Bertolotti, G. Aguzzi, W. Cazzola, M. Viroli (FRONTIERS IN ARTIFICIAL INTELLIGENCE AND APPLICATIONS). - In: ECAI 2025 / [edited by] I. Lynce, N. Murano, M. Vallati, S. Villata, F. Chesani, M. Milano, A. Omicini, M. Dastani. - [s.l.] : IOS Press BV, 2025. - ISBN 9781643686318. - pp. 2818-2825 (28th European Conference on Artificial Intelligence, ECAI 2025, including 14th Conference on Prestigious Applications of Intelligent Systems, PAIS 2025, Bologna, 2025) [10.3233/faia251138].
Abstract:
Reinforcement learning (RL) algorithms show promise in robotics and multi-agent systems but often suffer from low sample efficiency. While methods like SHAC leverage differentiable simulators to improve efficiency, they are limited to specific settings: they require fully differentiable environments, including transition and reward functions, and have primarily been demonstrated in single-agent scenarios. To overcome these limitations, we introduce SHAC++, a novel framework inspired by SHAC. SHAC++ removes the need for differentiable simulator components by using neural networks to approximate the required gradients, training these networks alongside the standard policy and value networks. This enables the core SHAC approach to be applied in both non-differentiable and multi-agent environments. We evaluate SHAC++ on challenging multi-agent tasks from the VMAS suite, comparing it against SHAC (where applicable) and PPO, a standard algorithm for non-differentiable settings. Our results demonstrate that SHAC++ significantly outperforms PPO in both single- and multi-agent scenarios. Furthermore, in differentiable environments where SHAC operates, SHAC++ achieves comparable performance despite lacking direct access to simulator gradients, thus successfully extending SHAC's benefits to a broader class of problems. The full implementation is openly available at https://github.com/f14-bertolotti/shacpp.
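The central idea in the abstract, replacing simulator gradients with gradients of a learned model that is trained alongside the policy, can be illustrated with a deliberately small sketch. This is not the code from the linked repository and not the authors' architecture. It assumes a toy one-dimensional simulator, a linear surrogate in place of a neural network, a one-step horizon, and no value network. Every name in it (`black_box_step`, `train_surrogate_policy`, `theta`, `w_a`) is hypothetical.

```python
import random


def black_box_step(state, action):
    """Toy simulator step. We may call it, but we pretend its gradient is unavailable."""
    return 0.9 * state + 0.5 * action


def train_surrogate_policy(steps=5000, lr_model=0.05, lr_policy=0.01,
                           target=1.0, seed=0):
    rng = random.Random(seed)
    w_s, w_a, b = 0.0, 0.0, 0.0  # surrogate: next_state ~ w_s*state + w_a*action + b
    theta = 0.0                  # policy parameter: the policy always outputs theta

    for _ in range(steps):
        state = rng.uniform(-1.0, 1.0)
        action = theta + rng.uniform(-1.0, 1.0)  # exploration noise
        next_state = black_box_step(state, action)

        # (1) One SGD step on the surrogate's loss 0.5 * error**2,
        #     using only the observed transition.
        error = (w_s * state + w_a * action + b) - next_state
        w_s -= lr_model * error * state
        w_a -= lr_model * error * action
        b -= lr_model * error

        # (2) One ascent step on reward = -(next_state - target)**2,
        #     with the chain rule applied through the surrogate.
        predicted = w_s * state + w_a * theta + b
        d_reward_d_next = -2.0 * (predicted - target)
        d_next_d_action = w_a  # surrogate gradient stands in for the simulator's
        theta += lr_policy * d_reward_d_next * d_next_d_action

    return theta, (w_s, w_a, b)


if __name__ == "__main__":
    theta, (w_s, w_a, b) = train_surrogate_policy()
    print(f"policy action: {theta:.3f}")
    print(f"surrogate d(next_state)/d(action): {w_a:.3f} (toy simulator uses 0.5)")
```

In this toy setting the states average to zero, so the expected reward is maximized at an action of 2.0, and `theta` should drift toward that value even though the update never differentiates `black_box_step` itself.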
IRIS type:
03 - Book contribution
Author list:
F. Bertolotti, G. Aguzzi, W. Cazzola, M. Viroli
University authors:
CAZZOLA WALTER (author)
Link to full record:
https://air.unimi.it/handle/2434/1231675
Link to full text:
https://air.unimi.it/retrieve/handle/2434/1231675/3294023/ecai25-published.pdf
Book title:
ECAI 2025
Project:
Typeful Language Adaptation for Dynamic, Interacting and Evolving Systems
Research areas

Sectors

Sector INFO-01/A - Computer Science