2011
Implementation and Performance Analysis of SEAL Encryption on FPGA, GPU and Multi-core Processors
IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM). May 2011. Salt Lake City, UT.
Implementation and Performance Comparison of the Motion Compensation Kernel of the AVS Video Decoder on FPGA, GPU and Multicore Processors
IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM). May 2011. Salt Lake City, UT.
GLOpenCL: OpenCL support on hardware- and software-managed cache multicores
6th International Conference on High Performance Embedded Architectures & Compilers (HiPEAC). January 2011. Heraklion, Greece.
2010
Mapping and optimization of the AVS video decoder on a high performance chip multiprocessor
July 2010. Singapore.
Fisheye lens distortion correction on multicore and hardware accelerator platforms
24th International Parallel and Distributed Processing Symposium (IPDPS). April 2010. Atlanta, GA.
2009
Algorithm, software, and hardware optimizations for Delaunay mesh generation on simultaneous multithreaded architectures
Journal of Parallel and Distributed Computing. July 2009.
A multigrain Delaunay mesh generation method for multicore SMT-based architectures
Journal of Parallel and Distributed Computing. July 2009.
Implementation of a wide-angle lens distortion correction algorithm on the cell broadband engine
23rd International Conference on Supercomputing (ICS). June 2009. New York Metro Area, NY.
2008
On SDMA Transmission Considering an Efficient IP Based IEEE 802.16 Relay Station
IEEE International Conference on Wireless and Mobile Computing (WiMob). October 2008. Avignon, France.
Prediction-Based Power-Performance Adaptation of Multithreaded Scientific Codes
IEEE Transactions on Parallel and Distributed Systems (TPDS). October 2008.
2007
Runtime scheduling of dynamic parallelism on accelerator-based multi-core systems
Parallel Computing. November 2007.
Exploring New Search Algorithms and Hardware for Phylogenetics: RAxML Meets the IBM Cell
The Journal of VLSI Signal Processing. September 2007.
Dynamic multigrain parallelization on the cell broadband engine
12th ACM Symposium on Principles and Practice of Parallel Programming (PPoPP). March 2007. San Jose, CA.
RAxML-Cell: Parallel Phylogenetic Tree Inference on the Cell Broadband Engine
21st International Parallel and Distributed Processing Symposium (IPDPS). March 2007. Long Beach, CA.
A comparison of online and offline strategies for program adaptation
45th Annual Southeast Regional Conference. March 2007. Winston-Salem, NC.
2006
PACMAN: A PerformAnce Counters MANager for Intel Hyperthreaded Processors
Third International Conference on the Quantitative Evaluation of Systems (QEST). September 2006. Riverside, CA.
Online power-performance adaptation of multithreaded programs using hardware event-based prediction
Proceedings of the 20th Annual International Conference on Supercomputing (ICS). June 2006. Cairns, Queensland, Australia.
Scalable locality-conscious multithreaded memory allocation
Proceedings of the 5th International Symposium on Memory Management (ISMM). June 2006. Ottawa, Canada.
Online strategies for high-performance power-aware thread execution on emerging multiprocessors
20th International Parallel and Distributed Processing Symposium (IPDPS). April 2006. Rhodes Island, Greece.
2005
A Case for Dynamic Page Migration in Multiple-Writer Software DSM Systems
IEEE International Conference on Cluster Computing (CLUSTER). September 2005. Boston, MA.