Pierriccardo Olivieri

Ph.D. candidate, Politecnico di Milano

About

I am a Ph.D. candidate at the Department of Electronics, Information and Bioengineering (DEIB) at Politecnico di Milano under the supervision of Professor Nicola Gatti. Currently, I am a visiting Ph.D. student at the Machine Intelligence through Decision-making and Interaction (MIDI) Lab at the University of Texas at Austin, supervised by Professor Amy Zhang.

During my Ph.D., I studied adversarial attack settings for online learning algorithms. I am now interested in reinforcement learning (RL), more precisely in multi-task adaptation via unsupervised RL and in continual RL.

Previously, I received my M.Sc. in Computer Science and Engineering from Politecnico di Milano in 2022 and subsequently worked as a research engineer at the AI Research and Innovation Center (AIRIC).


News

  • Nov 2025 — Paper accepted at AAAI with oral presentation
  • Jul 2025 — Paper accepted at the Conference on Decision and Control (CDC)
  • Apr 2025 — Started my visiting period at UT Austin
  • Mar 2025 — Paper accepted for publication at CDC25
  • Mar 2025 — Paper accepted for publication at RLC25 Workshop

Selected publications

Conferences

  1. Do it for HER: First-order Logic Reward Specification in Reinforcement Learning
    P Olivieri, F Lasca, A Gianola, M Papini
    AAAI 2026 (Oral)
  2. Precision UAV Formation Control via PGPE-enhanced NMPC
    P Olivieri, A Sanchini, R Spica, N Gatti, S Formentin
    CDC 2025
  3. Online Markov Decision Processes Configuration with Continuous Decision Space
    D Maran*, P Olivieri*, FE Stradi*, G Urso, N Gatti, M Restelli
    AAAI 2024
  4. Subgame Solving in Adversarial Team Games
    B Zhang, L Carminati, F Cacciamani, G Farina, P Olivieri, N Gatti, T Sandholm
    NeurIPS 2022

Workshops

  1. Online Configuration in Continuous Decision Space
    D Maran, P Olivieri, FE Stradi, G Urso, N Gatti, M Restelli
    EWRL 2023
  2. Do it for HER: First-order Logic Reward Specification in Reinforcement Learning
    P Olivieri, F Lasca, A Gianola, M Papini
    RLC 2025 Workshop on Programmatic Reinforcement Learning
  3. Delayed Adversarial Attacks on Stochastic Multi-Armed Bandits
    P Olivieri, M Castiglioni, N Gatti
    ICML 2024 Workshop Aligning Reinforcement Learning Experimentalists and Theorists
  4. Experimental Implementation of Discrete Time Quantum Walk with the IBM Qiskit Library
    P Olivieri, M Askarpour, E Di Nitto
    IEEE/ACM QSE Workshop 2021

Full list on Google Scholar.