Exploring a Multithreaded Methodology to Implement a Network Communication Protocol on the Cyclops-64 Multithreaded Architecture
Workshop on Multi-Threaded Architectures and Applications (MTAAP), held in conjunction with the 21st International Parallel and Distributed Processing System (IPDPS'07), March 30, 2007
Ge Gan, Zinag Hu, Juan del Cuvillo, Guang R. Gao
The complete paper is available in PDF format.
Optimization of Dense Matrix Multiplication on IBM Cyclops-64: Challenges and Experiences
The 11th International Euro-Par Conference, August 29 - September 1, 2006
Ziang Hu, Juan del Cuvillo, Weirong Zhu, and Guang R. Gao
The complete paper is available in PDF format.
User-Friendly Methodology for Automatic Exploration of Compiler Options: A Case Study on the Intel XScale Microarchitecture
The 2006 International Conference on Programming Languages and Compilers (PLC'06), June 26 - 29, 2006
Haiping Wu, Eunjung Park, Long Chen, Juan del Cuvillo, and Guang R. Gao
The complete paper is available in PDF format.
Performance Characteristics of OpenMP Language Constructs on a Many-core-on-a-chip Architecture
The 2nd International Workshop on OpenMP (IWOMP'06), June 12 - 15, 2006
Weirong Zhu, Juan del Cuvillo, and Guang R. Gao
The complete paper is available in PDF format.
Towards a Software Infrastructure for the Cyclops-64 Cellular Architecture
The 20th International Symposium on High Performance Computing Systems and Applications (HPCS'06), May 14 - 17, 2006
Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang R. Gao
The complete paper is available in PDF format.
Landing OpenMP on Cyclops-64: An Efficient Mapping of OpenMP to a Many-Core System-on-a-Chip
The 3rd ACM International Conference on Computing Frontiers (CF'06), May 2 - 5, 2006
Juan del Cuvillo, Weirong Zhu, and Guang R. Gao
The complete paper is available in PDF format.
FAST: A Functionally Accurate Simulation Toolset for the Cyclops-64 Cellular Architecture
Workshop on Modeling, Benchmarking and Simulation (MoBS), held in conjunction with the 32nd Annual International Symposium on Computer Architecture (ISCA'05), June 4, 2005
Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang R. Gao
The complete paper is available in PDF format.
TiNy Threads: A Thread Virtual Machine for the Cyclops64 Cellular Architecture
Fifth Workshop on Massively Parallel Processing (WMPP), held in conjunction with the 19th International Parallel and Distributed Processing System (IPDPS'05), April 3 - 8, 2005
Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang R. Gao
The complete paper is available in PDF format.
Physical Experimentation with Prefetching Helper Threads on Intel's Hyper-Threaded Processors
The 2004 International Symposium on Code Generation and Optimization with Special Emphasis on Feedback-Directed and Runtime Optimization (CGO2004), March 20 - 24, 2004
Dongkeun Kim, Steve Shih-wei Liao, Perry Wang, Juan del Cuvillo, Xinmin Tian, Xiang Zou, Hong Wang, Donald Yeung, Milind Girkar, and John Shen
The complete paper is available in PDF format.
EmonLite: User-Level Library Routines for Dynamic Performance Monitoring with Low Profiling Overhead
The First Intel Programming Technology Conference(IPTC'03) on Dynamic Compilation and Profiling Guided Optimizations, November, 2003
Dongkeun Kim, Juan del Cuvillo, Steve Shih-wei Liao, Perry Wang, Xinmin Tian, Hong Wang, and John Shen
AutoHelper: Profile-Guided Generation of Helper Threads
The First Intel Programming Technology Conference(IPTC'03) on Dynamic Compilation and Profiling Guided Optimizations, November, 2003
Steve Shih-wei Liao, Xinmin Tian, Perry Wang, Dongkeun Kim, Juan del Cuvillo, Hong Wang, and John Shen
Performance Study of a Whole Genome Comparison Tool on a Hyper-Threading Multiprocessor
The 5th International Symposium on High Performance Computing, October 20 - 22, 2003
Juan del Cuvillo, Xinmin Tian, Guang Gao and Milind Girkar
The complete paper is available in PDF format.
Programming Models and System Software for Future High-End Computing Systems: Work-in-Progress
The 17th International Parallel and Distributed Processing Symposium (IPDPS'03), April 22 - 26, 2003
Guang R. Gao, Kevin B. Theobald, R. Govindarajan, Clement Leung, Ziang Hu, Haiping Wu, Jizhu Lu, Juan del Cuvillo, Adeline Jacquet, Vincent Janot and Thomas L. Sterling
The abstract is available in HTML format.
Whole Genome Alignment using a Multithreaded Parallel Implementation
The 13th Symposium on Computer Architecture and High Performance Computing, September 10 - 12, 2001
Wellington S. Martins, Juan B. del Cuvillo, Wenwu Cui and Guang R. Gao
The complete paper is available in PS format.
ATGC: Another Tool for Genome Comparison
Currents in Computational Molecular Biology 2001, April 22 - 25, 2001
Juan B. del Cuvillo, Wellington S. Martins, Guang R. Gao, Wenwu Cui and Sun Kim
The poster abstract is available in PDF format.
A Multithreaded Parallel Implementation of a Dynamic Programming Algorithm for Sequence Comparison
Pacific Symposium on Biocomputing 2001, January 3 - 7, 2001
W.S. Martins, J.B. del Cuvillo, F.J. Useche, K.B. Theobald and G.R. Gao
The complete paper is available in PS format.