2024
Accelerating Machine Learning Inference on GPUs with SYCL
Proceedings of the 12th International Workshop on OpenCL and SYCL. April 2024.
2022
FPGA Roofline modeling and its Application to Visual SLAM
International Conference on Field Programmable Logic and Applications (FPL). August 2022. Belfast, United Kingdom .
2020
Dynamic Undervolting to Improve Energy Efficiency on Multicore X86 CPUs
IEEE Transactions on Parallel and Distributed Systems. December 2020.
2019
Exploiting CPU Voltage Margins to Increase the Profit of Cloud Infrastructure Providers
2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID). May 2019. Larnaca, Cyprus.
2018
A Framework for Evaluating Software on Reduced Margins Hardware
48th International Conference on Dependable Systems and Networks (DSN). June 2018. Luxemburg.
AcHEe: Evaluating approximate computing and heterogeneity for energy efficiency
Parallel Computing. April 2018.
2017
Edge and Cloud Provider Cost Minimization by Exploiting Extended Voltage and Frequency Margins
Parallel Computing (PARCO). September 2017. Bologna, Italy.
2016
A Software Stack for Operation Beyond Conservative Scaling Boundaries
. .
2015
Energy Minimization on Heterogeneous Systems through Approximate Computing
International Conference on Parallel Computing (PARCO). September 2015. Edinburgh, UK.
An Energy-Efficient and Error-Resilient Server Ecosystem Exceeding Conservative Scaling Limits
Design, Automation & Test in Europe Conference & Exhibition, (DATE). March 2018. Dresden, Germany.
2011
Massively parallel programming models used as hardware description languages: The OpenCL case
International Conference on Computer-Aided Design (ICCAD). November 2011. San Jose, CA.
Synthesis of Platform Architectures from OpenCL Programs
IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM). May 2011. Salt Lake City, UT.
Implementation and Performance Comparison of the Motion Compensation Kernel of the AVS Video Decoder on FPGA, GPU and Multicore Processors
IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM). May 2011. Salt Lake City, UT.
GLOpenCL: OpenCL support on hardware- and software-managed cache multicores
6th International Conference on High Performance Embedded Architectures & Compilers (HiPEAC). January 2011. Heraklion, Greece.
2010
Fisheye lens distortion correction on multicore and hardware accelerator platforms
24th International Parallel and Distributed Processing Symposium (IPDPS). April 2010. Atlanta, GA.
2009
Implementation of a wide-angle lens distortion correction algorithm on the cell broadband engine
23rd International Conference on Supercomputing (ICS). June 2009. New York Metro Area, NY.