2024
Accelerating Machine Learning Inference on GPUs with SYCL
Proceedings of the 12th International Workshop on OpenCL and SYCL. April 2024.
2023
Reconfigurable System-on-Chip Architectures for Robust Visual SLAM on Humanoid Robots
ACM Transactions on Embedded Computing Systems. March-April/2023.
2022
FPGA Roofline modeling and its Application to Visual SLAM
International Conference on Field Programmable Logic and Applications (FPL). August 2022. Belfast, United Kingdom .
FPGA Accelerators for Robust Visual SLAM on Humanoid Robots
International Symposium on Field-Programmable Gate Arrays (FPGA). February 2022. Virtual Conference.
2021
Architectures for SLAM and Augmented Reality Computing
International Conference on Field-Programmable Logic and Applications (FPL). September 2021. Virtual Conference.
FPGA Architectures for Approximate Dense SLAM Computing
Proceedings of the 24th Conference on Design, Automation and Test in Europe. February 2021. Virtual Conference.
2019
Comparative Performance Analysis of Vulkan Implementations of Computational Applications
Proceedings of the International Workshop on OpenCL. May 2019. Boston, MA.
2015
Exploring Automatically Generated Platforms in High Performance FPGAs
Parallel Computing (PARCO). September 2015. Edinburgh, UK.
Enhancing Design Space Exploration by Extending CPU/GPU Specifications onto FPGAs
ACM Transactions on Embedded Computing Systems (TECS). March 2015.
2014
On the characterization of OpenCL dwarfs on fixed and reconfigurable platforms
25th IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP). June 2014. Zurich, Switzerland.
A Grammar Induction Method for Clustering of Operations in Complex FPGA Designs
IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM). May 2014. Boston, MA.
2013
On the Portability of the OpenCL Dwarfs on Fixed and Reconfigurable Parallel Platforms
19th IEEE International Conference on Parallel and Distributed Systems (ICPADS). December 2013. Seoul, Korea (South).
2012
Shortening Design Time through Multiplatform Simulations with a Portable OpenCL Golden-model: The LDPC Decoder Case
ium on Field-Programmable Custom Computing Machines (FCCM). April 2012. Toronto, ON.
2011
Massively parallel programming models used as hardware description languages: The OpenCL case
International Conference on Computer-Aided Design (ICCAD). November 2011. San Jose, CA.
Implementation and Performance Comparison of the Motion Compensation Kernel of the AVS Video Decoder on FPGA, GPU and Multicore Processors
IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM). May 2011. Salt Lake City, UT.
Synthesis of Platform Architectures from OpenCL Programs
IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM). May 2011. Salt Lake City, UT.