Antonio Guillen-Perez

Antonio Guillen-Perez

AI Research Scientist at Hewlett-Packard Enterprise (HPE)

Email: antonio_algaida [at] hotmail [dot] com

About Me

I am an AI Research Scientist at Hewlett-Packard Enterprise (HPE), specializing in the applied development and deployment of Multi-Agent Deep Reinforcement Learning (MADRL) systems to solve real-world challenges in both Data Center Sustainability and Autonomous Vehicle (AV) optimization. My work has integrated advanced AI techniques to optimize energy efficiency and reduce carbon footprint in large-scale data centers, as well as enhance mobility and safety in urban traffic scenarios for connected autonomous vehicles.

With a Ph.D. in Computer Science, I have translated cutting-edge AI research into impactful applications, including trajectory planning for autonomous driving using imitation learning and reinforcement learning. My research has led to significant reductions in traffic delays and improved coordination at busy intersections.

Autonomous Systems Research

My doctoral work focused on using deep reinforcement learning and imitation learning techniques to generate optimal driving trajectories in multi-agent traffic environments. Key contributions include:

  • Intersection Management: Developed algorithms that reduced traffic delays by up to 50% and emissions by 45% in complex urban intersections.
  • Learning from Demonstrations: Improved AV behavior by combining expert expertise with multi-agent DRL for safer, more efficient driving strategies.
  • Integration with 5G/6G: Leveraged next-generation communication to enable faster model inference and robust vehicle-to-infrastructure connections.

Recent News

  • 2024/11: Paper Hierarchical Multi-Agent Framework for Carbon-Efficient Liquid-Cooled Data Center Cluster accepted for the Demonstration Track at AAAI 2025. The paper introduces a hierarchical RL framework for improving the energy efficiency of liquid-cooled data center clusters.
  • 2024/10: Paper SustainDC: Benchmarking for Sustainable Data Center Control accepted for presentation at the NeurIPS 2024 Datasets and Benchmarks Track. The paper introduces a novel multi-agent reinforcement learning benchmark for sustainable data center control. Link.
  • 2023/12: Received the Best Paper Award at the NeurIPS 2023 Climate Change AI Workshop for work on real-time carbon footprint minimization in data centers using deep reinforcement learning.
  • 2022/09: Joined HPE AI Labs as an AI Research Scientist. Initiated and led a pioneering project on Multi-Agent Deep Reinforcement Learning (MADRL) focused on data center sustainability, achieving significant improvements in energy efficiency and carbon footprint reduction.
  • 2022/06: Successfully defended my Ph.D. thesis on enhancing the cognitive capability of Intelligent Transportation Systems (ITS) using Artificial Intelligence, awarded Cum Laude. Link.
  • 2022/03: Demonstrated a multi-agent AV intersection management system that reduced wait times by 50% in simulation at an international ITS conference.
  • 2022/01: Completed my stay at the University of California, Davis as Researcher Visitor. Thanks Prof. J. Sebastian Gomez-Diaz. Focused on AI in healthcare, specifically early detection of throat cancer using a multimodal approach, achieving over 90% accuracy. Link.

Selected Publications
Full publication list at Google Scholar

* denotes equal contribution.
Journal Articles (Autonomous Vehicles)
Multi-Agent Deep Reinforcement Learning to Manage Connected Autonomous Vehicles at Tomorrow's Intersections
Antonio Guillen-Perez Maria-Dolores Cano
IEEE Transactions on Vehicular Technology, 2022, Vol. 71, No. 7, pp. 7033-7043
Learning From Oracle Demonstrations—A New Approach to Develop Autonomous Intersection Management Control Algorithms Based on Multiagent Deep Reinforcement Learning
Antonio Guillen-Perez Maria-Dolores Cano
IEEE Access, 2022, Vol. 10, pp. 53601-53613
Journal Articles (Other)
Flying Ad Hoc Networks: A New Domain for UAV Network Communications
Antonio Guillen-Perez Maria-Dolores Cano
Sensors, 2018, Vol. 18, No. 10, Article 3571
Conferences
SustainDC: Benchmarking for Sustainable Data Center Control
Avisek Naug* Antonio Guillen-Perez* Ricardo Luna Gutierrez* Vineet Gundecha* Sahand Ghorbanpour Sajad Mousavi Dejan Markovikj Ashwin Ramesh Babu Soumyendu Sarkar
NeurIPS 2024
Carbon Footprint Reduction for Sustainable Data Centers in Real-Time
Soumyendu Sarkar* Avisek Naug* Ricardo Luna* Antonio Guillen-Perez* Vineet Gundecha Sahand Ghorbanpour Sajad Mousavi Dejan Markovikj Ashwin Ramesh Babu
Proceedings of the AAAI Conference on Artificial Intelligence, 2024, 38(20), pp. 22322-22330
Concurrent Carbon Footprint Reduction (C2FR) Reinforcement Learning Approach for Sustainable Data Center Digital Twin
Avisek Naug* Antonio Guillen-Perez* Ricardo Luna-Gutierrez* Sahand Ghorbanpour Sajad Mousavi Ashwin Ramesh Babu Vineet Gundecha Soumyendu Sarkar
2023 IEEE 19th International Conference on Automation Science and Engineering (CASE), 2023, pp. 1-8
WiFi Networks on Drones
Antonio Guillen-Perez Ramon Sanchez-Iborra Maria-Dolores Cano Juan Carlos Sanchez-Aarnoutse Joan Garcia-Haro
2016 ITU Kaleidoscope: ICTs for a Sustainable World (ITU WT), 2016, pp. 1-8
Workshops
Real-time Carbon Footprint Minimization in Sustainable Data Centers with Reinforcement Learning
Avisek Naug* Antonio Guillen-Perez* Ricardo Luna-Gutierrez* Vineet Gundecha Ashwin Ramesh Babu Soumyendu Sarkar Cullen Bash
NeurIPS 2023 Workshop on Tackling Climate Change with Machine Learning, 2023
Sustainable Data Center Modeling: A Multi-Agent Reinforcement Learning Benchmark
Avisek Naug* Antonio Guillen-Perez* Ricardo Luna-Gutierrez* Vineet Gundecha Sahand Ghorbanpour Sajad Mousavi Ashwin Ramesh Babu Soumyendu Sarkar
NeurIPS 2023 Workshop on Tackling Climate Change with Machine Learning, 2023

Experience

Hewlett-Packard Enterprise (HPE) - AI Labs

Sep 2022 - Present

Research Scientist, Milpitas-San Jose, CA

  • Led multi-agent DRL projects to optimize data center sustainability, achieving >20% reduction in energy consumption.
  • Investigated large-scale ML pipelines, including fine-tuning transformer-based models for robustness.
  • Collaborated with cross-functional teams: simulation engineers, researchers, and software developers.

Polytechnic University of Cartagena

Sep 2018 - Jun 2022

Associate Lecturer, Cartagena, Spain

  • Developed imitation learning and DRL techniques for autonomous vehicles at urban intersections.
  • Analyzed 5G/6G communication influence on AV performance and real-time trajectory planning.
  • Mentored graduate students in AI, computer science, and robotics.

Dolphin Wave | Startup

Feb 2018 - Sep 2018

ML Engineer and Data Scientist, Murcia, Spain

  • Implemented data-driven solutions for client-facing applications, focusing on analytics and modeling.
  • Collaborated with a small, agile team to iterate quickly on new ML product features.

Polytechnic University of Cartagena

Sep 2014 - Sep 2018

Research Intern, Department of Information and Communications Technologies, Cartagena, Spain

  • Studied UAV-based networking solutions, leading to the publication on Flying Ad Hoc Networks.
  • Acquired foundational ML and research skills used later for AV research.

Future Directions

My aim is to further integrate learned trajectory planning and behavior prediction into real autonomous fleets. I am passionate about bridging theory and deployment for safer, more efficient urban mobility. Ultimately, I strive to make robust reinforcement learning solutions a practical reality in next-generation transportation systems.