Wu-chun Feng

Wu-chun Feng (a.k.a. "Wu")

Director
SyNeRGy Laboratory

Professor
Dept. of CS
Dept. of ECE
Health Sciences

Other Affiliations
CHREC | VBI |
Wireless @ VT

Profiles
Google Scholar

Contact

Office:
Torgersen Hall 2050
620 Drillfield Drive (Alumni Mall)
Blacksburg, VA.
24061. [map]
Phone:
(540) 231-1192
Fax:
(540) 231-9218
Email:
feng [at] cs.vt.edu

Virginia Tech

Publications

For electronic version of publications please visit the Synergy Lab. web site.

  • AutoMatch: An Automated Framework for Relative Performance Estimation and Workload Distribution on Heterogeneous HPC Systems.
    Ahmed E. Helal, Wu-chun Feng, Changhee Jung, Yasser Y. Hanafy.
    In Proceedings of the IEEE International Symposium on Workload Characterization (IISWC), Seattle, WA, October 2017.
       
  • A Framework for Fast and Fair Evaluation of Automata Processing Hardware.
    Xiaodong Yu, Kaixi Hou, Hao Wang, Wu-chun Feng.
    In IEEE International Symposium on Workload Characterization (IISWC), Seattle, WA, October 2017.
       
  • A Runtime Estimation Framework for ALICE.
    Sarunya Pumma, Wu-chun Feng, Phond Phunchongharn, Sylvain Chapeland, Tiranee Achalakul.
    In Future Generation Computer Systems, 72 65-77, July 2017.
       
  • Fast Segmented Sort on GPUs.
    Kaixi Hou, Weifeng Liu, Hao Wang, Wu-chun Feng.
    In Proceedings of the International Conference on Supercomputing, Chicago, IL, June 2017.
       
  • A Non-Invasive Approach for Realizing Resilience in MPI.
    Umar Kalim, Mark K. Gardner, Wu-chun Feng.
    In Proceedings of the 7th Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS), in conjunction with the 26th ACM Symposium on High-Performance Parallel and Distributed Computing, Washington DC, June 2017.
       
  • Demystifying Automata Processing: GPUs, FPGAs or Micron’s AP?.
    Marziyeh Nourian, Xiang Wang, Xiaodong Yu, Wu-chun Feng, Michela Becchi.
    In Proceedings of the International Conference on Supercomputing, Chicago, IL, June 2017.
       
  • Directive-Based Partitioning and Pipelining for Graphics Processing Units.
    Xuewen Cui, Thomas R. W. Scogland, Bronis R. de Supinski, Wu-chun Feng.
    In Proceedings of the IEEE International Parallel and Distributed Processing Symposium, Orlando, Florida, May 2017.
       
  • Characterizing and Modeling Power and Energy for Extreme-Scale In-situ Visualization.
    Vignesh Adhinarayanan, Wu-chun Feng, David Rogers, James Ahrens, Scott Pakin.
    In Proceedings of the IEEE International Parallel and Distributed Processing Symposium, Orlando, Florida, May 2017.
       
  • Auto-Tuning Strategies for Parallelizing Sparse Matrix-Vector (SpMV) Multiplication on Multi- and Many-Core Processors.
    Kaixi Hou, Wu-chun Feng, Shuai Che.
    In Proceedings of the 7th International Workshop on Accelerators and Hybrid Exascale Systems (AsHES), Orlando, Florida, May 2017.
       
  • GPU-UniCache: Automatic Code Generation of Spatial Blocking for Stencils on GPUs.
    Kaixi Hou, Hao Wang, Wu-chun Feng.
    In Proceedings of the Proceedings of the ACM Computing Frontiers, Siena, Italy, May 2017.
       
  • PaPar: A Parallel Data Partitioning Framework for Big Data Applications.
    Hao Wang, Jing Zhang, Da Zhang, Sarunya Pumma, Wu-chun Feng.
    In Proceedings of the IEEE International Parallel and Distributed Processing Symposium, Orlando, Florida, May 2017.
       
  • Eliminating Irregularities of Protein Sequence Search on Multicore Architectures.
    Jing Zhang, Sanchit Misra, Hao Wang, Wu-chun Feng.
    In Proceedings of the IEEE International Parallel and Distributed Processing Symposium, Orlando, Florida, May 2017.
       
  • An Enhanced Image Reconstruction Tool for Computed Tomography on GPUs.
    Xiaodong Yu, Hao Wang, Wu-chun Feng, Hao Gong, Guohua Cao.
    In Proceedings of the ACM Computing Frontiers, Siena, Italy, May 2017.
       
  • ChIP-GMM: A Gaussian Mixture Model for Inferring Binding Regions in ChIP-seq Pro.
    Sharmi Banerjee, Xi Chen, Xiaowei Wu, Hehuang Xie, Jianhua Xuan, Wu-chun Feng.
    In Proceedings of the 9th International Conference on Bioinformatics and Computational Biology (BiCoB), Honolulu, HI, March 2017.
       
  • Accelerating 3D-Structured Grid on FPGA via OpenCL : A Case Study with OpenDwarfs.
    Anshuman Verma, Wu-chun Feng.
    In International Symposium on Code Generation and Optimization (CGO), Austin, TX, February 2017.
       
  • Parallel Programming with Pictures is a Snap!.
    Annette Feng, Mark Gardner, Wu-chun Feng.
    In Journal of Parallel and Distributed Computing, 105 150-162, January 2017.
       
  • Telescoping Architectures: Evaluating Next-Generation Heterogeneous Computing.
    Konstantinos Krommydas, Wu-chun Feng.
    In Proceedings of the 23rd IEEE International Conference on High Performance Computing, Hyderabad, India, December 2016.
       
  • MetaMorph: A Library Framework for Interoperable Kernels on Multi- and Many-core Clusters.
    Ahmed E. Helal, Paul Sathre, Wu-chun Feng.
    In Proceedings of the IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (Supercomputing), Salt Lake City, Utah, USA, November 2016.
       
  • Characterizing Performance and Power Towards Efficient Synchronization of GPU Kernels.
    Islam Harb, Wu-chun Feng.
    In Proceedings of the IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS), London, England, September 2016.
       
  • Measuring and Modeling On-chip Interconnect Power on Real Hardware.
    Vignesh Adhinarayanan, Indrani Paul, Joseph Greathouse, Wei Huang, Ashutosh Pattnaik, Wu-chun Feng.
    In Proceedings of the IEEE International Symposium on Workload Characterization (IISWC), Providence, Rhode Island, USA, September 2016.
    Best Paper Award
       
  • Parallel Transposition of Sparse Data Structures.
    Hao Wang, Weifeng Liu, Kaixi Hou, Wu-chun Feng.
    In Proceedings of the 30th International Conference on Supercomputing (ICS), Istanbul, Turkey, June 2016.
       
  • Online Power Estimation of Graphics Processing Units.
    Vignesh Adhinarayanan, Balaji Subramaniam, Wu-chun Feng.
    In Proceedings of the 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Cartagena, Colombia, May 2016.
       
  • AAlign: A SIMD Framework for Pairwise Sequence Alignment on x86-based Multi- and Many-core Processors.
    Kaixi Hou, Hao Wang, Wu-chun Feng.
    In Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS), Chicago, IL, USA, May 2016.
       
  • cuART: Fine-Grained Algebraic Reconstruction Technique for Computed Tomography Images on GPUs.
    Xiaodong Yu, Hao Wang, Wu-chun Feng, Hao Gong, Guohua Cao.
    In Proceedings of the 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Cartagena, Colombia, May 2016.
       
  • Parallel Programming with Pictures in a Snap!.
    Annette Feng, Wu-chun Feng.
    In Proceedings of the 6th NSF/TCPP Workshop on Parallel and Distributed Computing Education (EduPar), Chicago, Illinois, USA, May 2016.
       
  • An Automated Framework for Characterizing and Subsetting GPGPU Workloads.
    Vignesh Adhinarayanan, Wu-chun Feng.
    In Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Uppsala, Sweden, April 2016.
       
  • O3FA: A Scalable Finite Automata-based Pattern-Matching Engine for Out-of-Order Deep Packet Inspection.
    Xiaodong Yu, Wu-chun Feng, Danfeng (Daphne) Yao, Michela Becchi.
    In Proceedings of the ACM/IEEE Symposium on Architectures for Networking and Communications Systems, Santa Clara, CA, March 2016.
       
  • Fast Detection of Transformed Data Leaks.
    Xiaokui Shu, Jing Zhang, Danfeng (Daphne) Yao, Wu-chun Feng.
    In IEEE Transactions on Information Forensics and Security (TIFS), PP (99): November 2015.
       
  • Performance, Power, and Energy of In-Situ and Post-Processing Visualization: A Case Study in Climate Simulation.
    Vignesh Adhinarayanan, Scott Pakin, David Rogers, Wu-chun Feng, James Ahrens.
    In ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Austin, TX, USA, November 2015.
    Best Poster Finalist
       
  • CoreTSAR: Core Task-Size Adapting Runtime.
    Thomas R. W. Scogland, Wu-chun Feng, Barry Rountree, Bronis R. de Supinski.
    In IEEE Transactions on Parallel and Distributed Systems, 26 (11): 2970-2983, November 2015.
       
  • OpenDwarfs: Characterization of Dwarf-Based Benchmarks on Fixed and Reconfigurable Architectures.
    Konstantinos Krommydas, Wu-chun Feng, Christos D. Antonopoulos, Nikolaos Bellas.
    In Journal of Signal Processing Systems, 1--20, October 2015.
       
  • cuBLASTP: Fine-Grained Parallelization of Protein Sequence Search on CPU+GPU.
    Jing Zhang, Hao Wang, Wu-chun Feng.
    In IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB), PP (99): October 2015.
       
  • pDindel: Accelerating InDel Detection on a Multicore CPU Architecture with SIMD.
    Da Zhang, Hao Wang, Kaixi Hou, Jing Zhang, Wu-chun Feng.
    In Proceedings of the 5th IEEE International Conference on Computational Advances in Bio and Medical Sciences (ICCABS), Miami, FL, USA, October 2015.
       
  • Block-Based Programming Abstractions for Explicit Parallel Computing.
    Annette Feng, Eli Tilevich, Wu-chun Feng.
    In Proceedings of the Blocks and Beyond: Lessons and Directions for First Programming Environments, Atlanta, GA, USA, October 2015. A VL/HCC 2015 Workshop.
       
  • GLAF: A Visual Programming and Auto-Tuning Framework for Parallel Computing.
    Konstantinos Krommydas, Ruchira Sasanka, Wu-chun Feng.
    In Proceedings of the International Conference on Parallel Processing, Beijing, China, September 2015.
       
  • Automatic Command Queue Scheduling for Task-Parallel Workloads in OpenCL.
    Ashwin M. Aji, Antonio J. Pena, Pavan Balaji, Wu-chun Feng.
    In Proceedings of the IEEE Cluster, Chicago, Illinois, September 2015.
       
  • ASPaS: A Framework for Automatic SIMDization of Parallel Sorting on x86-based Many-core Processors.
    Kaixi Hou, Hao Wang, Wu-chun Feng.
    In Proceedings of the 29th ACM International Conference on Supercomputing, Newport Beach, California, USA, June 2015.
       
  • On the Performance, Energy, and Power of Data-Access Methods in Heterogeneous Computing Systems.
    Rubasri Kalidas, Mayank Daga, Konstantinos Krommydas, Wu-chun Feng.
    In Proceedings of the 11th Workshop on High-Performance, Power-Aware Computing (HPPAC), Hyderabad, India, May 2015.
       
  • On the Greenness of In-Situ and Post-Processing Visualization Pipelines.
    Vignesh Adhinarayanan, Wu-chun Feng, Jonathan Woodring, David Rogers, James Ahrens.
    In Proceedings of the 11th Workshop on High-Performance, Power-Aware Computing (HPPAC), Hyderabad, India, May 2015.
       
  • Rapid Screening of Transformed Data Leaks with Efficient Algorithms and Parallel Computing.
    Xiaokui Shu, Jing Zhang, Danfeng (Daphne) Yao, Wu-chun Feng.
    In ACM Conference on Data and Application Security and Privacy (CODASPY), San Antonio, TX, USA, March 2015.
    Best Poster Award
       
  • Design and Evaluation of Scalable Concurrent Queues for Many-Core Architectures.
    Thomas R. W. Scogland, Wu-chun Feng.
    In Proceedings of the International Conference on Performance Engineering (ICPE), Austin, TX, USA, January 2015.
       
  • On the Performance and Energy Efficiency of FPGAs and GPUs for Polyphase Channelization.
    Vignesh Adhinarayanan, Thaddeus Koehn, Krzysztof Kepa, Wu-chun Feng, Peter Athanas.
    In Proceedings of the International Conference on ReConFigurable Computing and FPGAs, Cancun, Mexico, December 2014.
       
  • On the Energy Proportionality of Distributed NoSQL Data Stores.
    Balaji Subramaniam, Wu-chun Feng.
    In Proceedings of the Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS14), New Orleans, LA, USA, November 2014. Held as part of SC14.
       
  • MetaMorph: A Modular Library for Democratizing the Acceleration of Parallel Computing across Heterogeneous Devices.
    Paul Sathre, Wu-chun Feng.
    In ACM/IEEE International Conference on High-Performance Computing, Networking, Storage, and Analysis (SC|14), New Orleans, LA, November 2014.
       
  • Delivering Parallel Programmability to the Masses via the Intel MIC Ecosystem: A Case Study.
    Kaixi Hou, Hao Wang, Wu-chun Feng.
    In Proceedings of the 7th International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2), Minneapolis, Minnesota, September 2014.
       
  • Aeromancer: A Workflow Manager for Large-Scale MapReduce-Based Scientific Workflows.
    Nabeel Mohamed, Nabanita Maji, Jing Zhang, Nataliya Timoshevskaya, Wu-chun Feng.
    In Proceedings of the 3rd IEEE International Conference on Big Data Science and Engineering (BDSE), Beijing, China, September 2014.
       
  • SDAFT: A Novel Scalable Data Access Framework for Parallel BLAST.
    Jiangling Yin, Junyao Zhang, Jun Wang, Wu-chun Feng.
    In Parallel Computing, 40 (10): 697–709, August 2014.
       
  • Locality-Aware Memory Association for Multi-Target Worksharing in OpenMP.
    Thomas R. W. Scogland, Wu-chun Feng.
    In 23rd International Conference on Parallel Architectures and Compilation Techniques (PACT), Alberta, Canada, August 2014.
       
  • CoreTSAR: Adaptive Worksharing for Heterogeneous Systems.
    Thomas R. W. Scogland, Wu-chun Feng, Barry Rountree, Bronis R. de Supinski.
    In Proceedings of the International Supercomputing Conference, Leipzig, Germany, June 2014.
       
  • On the Characterization of OpenCL Dwarfs on Fixed and Reconfigurable Platforms.
    Konstantinos Krommydas, Wu-chun Feng, Muhsen Owaida, Christos D. Antonopoulos, Nikolaos Bellas.
    In Proceedings of the IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), Zurich, Switzerland, June 2014.
    Best Paper Finalist
       
  • SAIS-OPT: On the Characterization and Optimization of the SA-IS Algorithm for Suffix Array Construction.
    Nataliya Timoshevskaya, Wu-chun Feng.
    In Proceedings of the 4th IEEE International Conference on Computational Advanced in Bio and Medical Sciences, Miami Beach, Florida, June 2014.
       
  • Towards a Performance-Portable FFT Library for Heterogeneous Computing.
    Carlo del Mundo, Wu-chun Feng.
    In Proceedings of the ACM International Conference on Computing Frontiers (CF), Cagliari, Italy, May 2014.
       
  • Petascale Application of a Coupled CPU-GPU Algorithm for Simulation and Analysis of Multiphase Flow Solutions in Porous Medium Systems.
    James E. McClure, Hao Wang, Jan F. Prins, Cass T. Miller, Wu-chun Feng.
    In Proceedings of the IEEE International Parallel and Distributed Processing Symposium, Phoenix, Arizona, May 2014.
       
  • Enabling Efficient Power Provisioning for Enterprise Applications.
    Balaji Subramaniam, Wu-chun Feng.
    In Proceedings of the 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Chicago, IL, May 2014.
       
  • cuBLASTP: Fine-Grained Parallelization of Protein Sequence Search on a GPU.
    Jing Zhang, Hao Wang, Heshan Lin, Wu-chun Feng.
    In GPU Technology Conference (GTC), San Jose, CA, USA, March 2014.
       
  • A Power-Measurement Methodology for Large-Scale, High-Performance Computing.
    Thomas Scogland, Craig Steffen, Torsten Wilde, Florent Parent, Susan Coghlan, Natalie Bates, Wu-chun Feng, Erich Strohmaier.
    In Proceedings of the 5th ACM/SPEC International Conference on Performance Engineering (ICPE), Dublin, Ireland, March 2014.
    Nominated for Best Industrial Paper Award
       
  • Wideband Channelization for Software-Defined Radio via Mobile Graphics Processors.
    Vignesh Adhinarayanan, Wu-chun Feng.
    In Proceedings of the 19th IEEE International Conference on Parallel and Distributed Systems (ICPADS 2013), Seoul, Korea, December 2013.
       
  • Characterizing the Challenges and Evaluating the Efficacy of a CUDA-to-OpenCL Translator.
    Mark Gardner, Paul Sathre, Wu-chun Feng, Gabriel Martinez.
    In Parallel Computing, 39 (12): 769-786, December 2013.
       
  • Cascaded TCP: Applying Pipelining to TCP for Efficient Communication over Wide-Area Networks.
    Umar Kalim, Mark Gardner, Eric Brown, Wu-chun Feng.
    In Proceedings of the IEEE Global Communications Conference (GLOBECOM), Atlanta, GA, USA, December 2013.
       
  • On the Programmability and Performance of Heterogeneous Platforms.
    Konstantinos Krommydas, Thomas R.W. Scogland, Wu-chun Feng.
    In Proceedings of the 19th IEEE International Conference on Parallel and Distributed Systems (ICPADS 2013), Seoul, Korea, December 2013.
       
  • On the Portability of the OpenCL Dwarfs on Fixed and Reconfigurable Parallel Platforms.
    Konstantinos Krommydas, Muhsen Owaida, Christos D. Antonopoulos, Nikolaos Bellas, Wu-chun Feng.
    In 19th IEEE International Conference on Parallel and Distributed Systems (ICPADS 2013), Seoul, Korea, December 2013.
       
  • Online Performance Projection for Clusters with Heterogeneous GPUs.
    Lokendra S. Panwar, Ashwin M. Aji, Jiayuan Meng, Pavan Balaji, Wu-chun Feng.
    In Proceedings of the 19th IEEE International Conference on Parallel and Distributed Systems (ICPADS 2013), Seoul, Korea, December 2013.
       
  • Enabling Efficient Intra-Warp Communication for Fourier Transforms in a Many-Core Architecture.
    Carlo del Mundo, Wu-chun Feng.
    In ACM/IEEE International Conference on High-Performance Computing, Networking, Storage, and Analysis (SC|13), Denver, CO, November 2013.
       
  • Consolidating Applications for Energy Efficiency in Heterogeneous Computing Systems.
    Jing Zhang, Hao Wang, Heshan Lin, Wu-chun Feng.
    In Proceedings of the 15th IEEE International Conference on High Performance Computing and Communications (HPCC 2013), Zhangjiajie, China, November 2013.
       
  • pVOCL: Power-Aware Dynamic Placement and Migration in Virtualized GPU Environments.
    Palden Lama, Yan Li, Ashwin M. Aji, Pavan Balaji, James Dinan, Shucai Xiao, Yunquan Zhang, Wu-chun Feng, Rajeev Thakur, Xiaobo Zhou.
    In Proceedings of the 33rd International Conference on Distributed Computing Systems, Philadelphia, USA, July 2013.
       
  • Seamless Migration of Virtual Machines Across Networks.
    Umar Kalim, Mark K. Gardner, Eric Brown, Wu-chun Feng.
    In Proceedings of the International Conference on Computer Communications and Networks (ICCCN), Nassau, Bahamas, July 2013.
       
  • On the Efficacy of GPU-Integrated MPI for Scientific Applications.
    Ashwin M. Aji, Lokendra S. Panwar, Feng Ji, Milind Chabbi, Karthik Murthy, Pavan Balaji, Keith R. Bisset, James Dinan, Wu-chun Feng, John Mellor-Crummy, Xiaosong Ma, Rajeev Thakur.
    In Proceedings of the ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC), New York, USA, June 2013.
       
  • Trends in Energy-Efficient Computing: A Perspective from the Green500.
    Balaji Subramaniam, Winston Saunders, Tom Scogland, Wu-chun Feng.
    In Proceedings of the 4th International Green Computing Conference, Arlington, VA, June 2013.
       
  • Accelerating Fast Fourier Transform for Wideband Channelization.
    Carlo del Mundo, Vignesh Adhinarayanan, Wu-chun Feng.
    In Proceedings of the International Conference on Communications (ICC), Budapest, Hungary, June 2013.
       
  • Optimizing Burrows-Wheeler Transform-Based Sequence Alignment on Multicore Architectures.
    Jing Zhang, Heshan Lin, Pavan Balaji, Wu-chun Feng.
    In Proceedings of the 13th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Delft, Netherlands, May 2013.
       
  • Towards Energy-Proportional Computing for Enterprise-Class Server Workloads.
    Balaji Subramaniam, Wu-chun Feng.
    In Proceedings of the 3rd ACM/SPEC International Conference on Performance Engineering (ICPE), Prague, Czech Republic, April 2013.
    Best Paper Award
       
  • Accelerating Data-Intensive Genome Analysis in the Cloud.
    Nabeel M. Mohamed, Heshan Lin, Wu-chun Feng.
    In Proceedings of the 5th International Conference on Bioinformatics and Computational Biology (BICoB), Honolulu, Hawaii, USA, March 2013.
       
  • Cascaded TCP: Big Throughput for Big Data Applications in Distributed HPC.
    Umar Kalim, Mark Gardner, Eric Brown, Wu-chun Feng.
    In ACM/IEEE International Conference for High-Performance Computing, Networking, Storage and Analysis (SC), Salt Lake City, Utah, November 2012.
       
  • Lost in Translation: Challenges in Automating CUDA-to-OpenCL Translation.
    Paul Sathre, Mark Gardner, Wu-chun Feng.
    In Proceedings of the 5th International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2), Pittsburgh, PA, September 2012.
       
  • Performance Characterization of Data-Intensive Kernels on AMD Fusion Architectures.
    Kenneth Lee, Heshan Lin, Wu-chun Feng.
    In Proceedings of the International Supercomputing Conference (ISC), Hamburg, Germany, June 2012.
       
  • GBench: Benchmarking Methodology for Evaluating the Energy Efficiency of Supercomputers.
    Balaji Subramaniam, Wu-chun Feng.
    In Proceedings of the International Supercomputing Conference (ISC), Hamburg, Germany, June 2012.
       
  • The Green500 List: Escapades to Exascale.
    Tom Scogland, Balaji Subramaniam, Wu-chun Feng.
    In Proceedings of the International Supercomputing Conference (ISC), Hamburg, Germany, June 2012.
       
  • DMA-Assisted, Intranode Communication in GPU Accelerated Systems.
    Feng Ji, Ashwin M. Aji, James Dinan, Darius Buntinas, Pavan Balaji, Rajeev Thakur, Wu-chun Feng, Xiaosong Ma.
    In Proceedings of the 14th IEEE International Conference on High Performance Computing and Communications, Liverpool, UK, June 2012.
       
  • MPI-ACC: An Integrated and Extensible Approach to Data Movement in Accelerator-Based Systems.
    Ashwin M. Aji, James Dinan, Darius Buntinas, Pavan Balaji, Wu-chun Feng, Keith R. Bisset, Rajeev Thakur.
    In Proceedings of the 14th IEEE International Conference on High Performance Computing and Communications, Liverpool, UK, June 2012.
       
  • The Green Index: A Metric for Evaluating System-Wide Energy Efficiency in HPC Systems.
    Balaji Subramaniam, Wu-chun Feng.
    In Proceedings of the 8th IEEE Workshop on High-Performance, Power-Aware Computing (HPPAC), Shanghai, China, May 2012.
       
  • VOCL: An Optimized Environment for Transparent Virtualization of Graphics Processing Units.
    Shucai Xiao, Pavan Balaji, Qian Zhu, Rajeev Thakur, Susan Coghlan, Heshan Lin, Gaojin Wen, Jue Hong, Wu-chun Feng.
    In Proceedings of the IEEE Innovative Parallel Computing (InPar2012), San Jose, CA, May 2012.
       
  • Generalizing the Utility of GPUs in Large-Scale Heterogeneous Computing Systems.
    Shucai Xiao, Wu-chun Feng.
    In Proceedings of the PhD Forum at the 26th IEEE International Parallel and Distributed Processing Symposium (IPDPS), Shanghai, China, May 2012.
       
  • Heterogeneous Task Scheduling for Accelerated OpenMP.
    Thomas R. W. Scogland, Barry Rountree, Wu-chun Feng, Bronis R. de Supinski.
    In Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, Shanghai, China, May 2012.
       
  • Efficient Intranode Communication in GPU-Accelerated Systems.
    Feng Ji, Ashwin Aji, James Dinan, Darius Buntinas, Pavan Balaji, Wu-chun Feng, Xiaosong Ma.
    In Proceedings of the 2nd IEEE International Workshop on Accelerators and Hybrid Exascale Systems (in conjunction with the 26th IEEE International Parallel and Distributed Processing Symposium), Shanghai, China, May 2012.
       
  • Transparent Accelerator Migration in a Virtualized GPU Environment.
    Shucai Xiao, Pavan Balaji, James Dinan, Rajeev Thakur, Susan Coghlan, Heshan Lin, Gaojin Wen, Jue Hong, Wu-chun Feng.
    In Proceedings of the IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Ottawa, Canada, May 2012.
       
  • Multi-Dimensional Characterization of Electrostatic Surface Potential Computation on Graphics Processors.
    Mayank Daga, Wu-chun Feng.
    In BMC Bioinformatics, 13 (S5): S4, April 2012.
       
  • Automatic NUMA Characterization using Cbench.
    Ryan Braithwaite, Wu-chun Feng, Patrick McCormick.
    In Proceedings of the 3rd ACM/SPEC International Conference on Performance Engineering (ICPE), Boston, MA, April 2012.
       
  • High-Performance Biocomputing for Simulating the Spread of Contagion over Large Contact Networks.
    Keith R. Bisset, Ashwin M. Aji, Madhav V. Marathe, Wu-chun Feng.
    In BMC Genomics, 13 (S2): S3, April 2012.
       
  • StreamMR: An Optimized MapReduce Framework for AMD GPUs.
    Marwa Elteir, Heshan Lin, Wu-chun Feng, Tom Scogland.
    In Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, Tainan, Taiwan, December 2011.
       
  • Architecture-Aware Mapping and Optimization on a 1600-Core GPU.
    Mayank Daga, Tom Scogland, Wu-chun Feng.
    In Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, Tainan, Taiwan, December 2011.
       
  • CU2CL: A CUDA-to-OpenCL Translator for Multi- and Many-Core Architectures.
    Gabriel Martinez, Mark Gardner, Wu-chun Feng.
    In Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, Tainan, Taiwan, December 2011.
       
  • Characterizing the Impact of Memory-Access Techniques on AMD Fusion.
    Kenneth Lee, Heshan Lin, Wu-chun Feng.
    In ACM/IEEE SC|11: The International Conference on High-Performance Computing, Networking, Storage, and Analysis, Seattle, Washington, USA, November 2011.
       
  • Spectral Method Characterization on FPGA and GPU Accelerators.
    Karl Pereira, Peter Athanas, Heshan Lin, Wu-chun Feng.
    In Proceedings of the International Conference on Reconfigurable Computing and FPGAs (ReConFig), Cancun, Mexico, November 2011.
       
  • Performance Characterization and Optimization of Atomic Operations on AMD GPUs.
    Marwa Elteir, Heshan Lin, Wu-chun Feng.
    In Proceedings of the IEEE Cluster 2011, Austin, TX, USA, September 2011.
       
  • Restoring End-to-End Resilience in the Presence of Middleboxes.
    Eric Brown, Mark Gardner, Umar Kalim, Wu-chun Feng.
    In Proceedings of the 20th IEEE International Conference on Computer Communications and Networking (ICCCN), Maui, Hawaii, August 2011.
       
  • AVS Video Decoder on Multicore Systems: Optimizations and Tradeoffs.
    Konstantinos Krommydas, Christos D. Antonopoulos, Nikolaos Bellas, Wu-chun Feng.
    In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), Barcelona, Spain, July 2011.
       
  • On the Efficacy of a Fused CPU+GPU Processor for Parallel Computing.
    Mayank Daga, Ashwin Aji, Wu-chun Feng.
    In Proceedings of the Symposium on Application Accelerators in High-Performance Computing, Knoxville, Tennessee, USA, July 2011.
       
  • Bounding the Effect of Partition Camping in GPU Kernels.
    Ashwin M. Aji, Mayank Daga, Wu-chun Feng.
    In Proceedings of the ACM International Conference on Computing Frontiers, Ischia, Italy, May 2011.
       
  • Accelerating Protein Sequence Search in a Heterogeneous Computing System.
    Shucai Xiao, Heshan Lin, Wu-chun Feng.
    In Proceedings of the 25th International Parallel and Distributed Processing Symposium, Anchorage, Alaska, USA, May 2011.
       
  • Emerging Trends on the Evolving Green500: Year Three.
    Tom Scogland, Balaji Subramaniam, Wu-chun Feng.
    In Proceedings of the 7th Workshop on High-Performance, Power-Aware Computing, Anchorage, Alaska, USA, May 2011.
       
  • Communication and Data-Intensive Science in the Beginning of the 21st Century.
    Jack Faris, Evelyne Kolker, Alex Szalay, Leon Bradlow, Ewa Deelman, Wu-chun Feng, Judy Qiu, Donna Russell, Elizabeth Stewart, Eugene Kolker.
    In OMICS A Journal of Integrative Biology, 15 (4): 213--215, April 2011.
       
  • Reliable MapReduce Computing on Opportunistic Resources.
    Heshan Lin, Xiaosong Ma, Wu-chun Feng.
    In Cluster Computing, 1-17, February 2011.
       
  • Towards Accelerating Molecular Modeling via Multi-Scale Approximation on a GPU.
    Mayank Daga, Wu-chun Feng, Thomas Scogland.
    In Proceedings of the 1st IEEE International Conference on Computational Advances in Bio and Medical Sciences (ICCABS), Orlando, Florida, USA, February 2011.
       
  • High-Performance Biocomputing for Simulating the Spread of Contagion over Large Contact Networks.
    Keith Bisset, Ashwin M. Aji, Madhav Marathe, Wu-chun Feng.
    In Proceedings of the IEEE International Conference on Computational Advances in Bio and medical Sciences (ICCABS), Orlando, Florida, USA, February 2011.
       
  • GPU-RMAP: Accelerating Short-Read Mapping on Graphics Processors.
    Ashwin M. Aji, Liqing Zhang, Wu-chun Feng.
    In Proceedings of the 13th IEEE International Conference on Computational Science and Engineering, Hong Kong, China, December 2010.
       
  • Enhancing MapReduce via Asynchronous Data Processing.
    Marwa Elteir, Heshan Lin, Wu-chun Feng.
    In Proceedings of the 16th International Conference on Parallel and Distributed Systems (ICPADS), Shanghai, China, December 2010.
       
  • The Terabase Metagenomics Workshop and the Vision of an Earth Microbiome Project.
    Jack Gilbert, Folker Meyer, Dion Antonopoulos, Pavan Balaji, C. Titus Brown, Christopher T. Brown, Narayan Desai, Jonathan A. Eisen, Dirk Evers, Dawn Field, Wu-chun Feng, Daniel Huson, Janet Jansson, Rob Knight, James Knight, Eugene Kolker, Kostas Konstantindis, Joel Kostka, Nikos Kyrpides, Rachel Mackelprang, Alice McHardy, Christopher Quince, Jeroen Raes, Alexander Sczyrba, Ashley Shade, Rick Stevens.
    In Standards in Genomic Sciences, 3 (3): 243-248, December 2010.
       
  • Global-Scale Distributed I/O with ParaMEDIC.
    Pavan Balaji, Wu-chun Feng, Heshan Lin, Jeremy Archuleta, S. Matsuoka, A. Warren, J. Setubal, E. Lusk, R. Thakur, Ian Foster, D. Katz, S. Jha, K. Shinpaugh, S. Coghlan, D. Reed.
    In Concurrency and Computation: Practice and Experience, 22 (16): 2266-2281, November 2010.
       
  • Enabling Renewed Innovation in TCP by Establishing an Isolation Boundary.
    Umar Kalim, Eric Brown, Mark Gardner, Wu-chun Feng.
    In Proceedings of the 8th International Workshop on Protocols for Future, Large-Scale and Diverse Network Transports (PFLDNeT), Lancaster, Pennsylvania, November 2010.
       
  • Enhancing Early Childhood Education with Computer Science Curriculum.
    Michelle Datoc, Wu-chun Feng.
    In Grace Hopper Conference, Atlanta, GA, September 2010.
       
  • A First Look at Integrated GPUs for Green High-Performance Computing.
    Tom Scogland, Heshan Lin, Wu-chun Feng.
    In Proceedings of the International Conference on Energy-Aware High Performance Computing, Hamburg, Germany, September 2010.
       
  • Accelerating Molecular Modeling using GPUs.
    Mayank Daga, Wu-chun Feng.
    In GPU Technology Conference, San Jose, California, September 2010.
       
  • On the Goodput of TCP NewReno in Mobile Networks.
    Sushant Sharma, Donald Gillies, Wu-chun Feng.
    In Proceedings of the 19th International Conference on Computer Communications and Networks (ICCCN), Zurich, Switzerland, pp. 1-8, August 2010.
       
  • MOON: MapReduce On Opportunistic eNvironments.
    Heshan Lin, Xiaosong Ma, Jeremy Archuleta, Wu-chun Feng, Mark Gardner, Zhe Zhang.
    In Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, Chicago, Illinois, pp. 95--106, June 2010.
    Selected as one of the 20 best papers from HPDC 1992--2012.
       
  • Broadening Accessibility to Computer Science for K-12 Education.
    Mark Gardner, Wu-chun Feng.
    In Proceedings of the 15th Annual Conference on Innovation and Technology in Computer Science Education, Ankara, Turkey, 2010.
       
  • Towards Chip-on-Chip Neuroscience: Fast Mining of Neuronal Spike Streams Using Graphics Hardware.
    Yong Cao, Debprakash Patnaik, Sean Ponce, Jeremy Archuleta, Patrick Butler, Wu-chun Feng, Naren Ramakrishnan.
    In Proceedings of the 7th ACM International Conference on Computing Frontiers, Bertinoro, Italy, pp. 1--10, May 2010.
       
  • Coordinating Computation and I/O in Massively Parallel Sequence Search.
    Heshan Lin, Xiaosong Ma, Wu-chun Feng, Nagiza Samatova.
    In IEEE Transactions on Parallel and Distributed Systems, PP (99): 1-14, May 2010.
       
  • Towards Chip-on-Chip Neuroscience: Fast Mining of Neuronal Spike Streams Using Graphics Hardware.
    Yong Cao, D Patnaik, Sean Ponce, Wu-chun Feng, Naren Ramakrishnan.
    In 7th ACM International Conference on Computing Frontiers, ACM, New York, NY, USA, May 2010.
       
  • Accelerating Electrostatic Surface Potential Calculation with Multi-Scale Approximation on Graphics Processing Units.
    Ramu Anandakrishnan, Tom R.W. Scogland, Andrew T. Fenley, John C. Gordon, Wu-chun Feng, Alexey V. Onufriev.
    In Journal of Molecular Graphics and Modelling, 28 904-910, April 2010.
       
  • Inter-Block GPU Communication via Fast Barrier Synchronization.
    Shucai Xiao, Wu-chun Feng.
    In Proceedings of the 24th IEEE International Parallel and Distributed Processing Symposium (IPDPS), Atlanta, Georgia, USA, April 2010.
       
  • Missing Genes in the Annotation of Prokaryotic Genomes.
    Andrew S. Warren, Jeremy Archuleta, Wu-chun Feng, Joao Carlos Setubal.
    In BMC Bioinformatics 2010, 11 (131): 1-12, March 2010.
       
  • On the Robust Mapping of Dynamic Programming onto a Graphics Processing Unit.
    Shucai Xiao, Ashwin M. Aji, Wu-chun Feng.
    In Proceedings of the 15th International Conference on Parallel and Distributed Systems (ICPADS), Shenzhen, China, December 2009.
       
  • A Flexible and Extensible Framework for Delivering Designer Images for K-12 Pedagogy.
    Mark K. Gardner, Adam Herr, David Mazary, Heshan Lin, Thomas Scogland, Wu-chun Feng.
    In Proceedings of the 3rd International Conference on the Virtual Computing Initiative (ICVCI), Research Triangle Park, NC, October 2009.
       
  • GePSeA: A General-Purpose Software Acceleration Framework for Lightweight Task Offloading.
    Ajeet Singh, Pavan Balaji, Wu-chun Feng.
    In Proceedings of the 38th International Conference on Parallel Processing (ICPP), Vienna, Austria, September 2009.
       
  • On the Energy Efficiency of Graphics Processing Units for Scientific Computing.
    Song Huang, Shucai Xiao, Wu-chun Feng.
    In Proceedings of the 5th IEEE Workshop on High-Performance, Power-Aware Computing (in conjunction with the 23rd International Parallel and Distributed Processing Symposium (IPDPS)), Rome, Italy, 3801-3804, June 2009.
       
  • Multi-Dimensional Characterization of Temporal Data Mining on Graphics Processors.
    Jeremy Archuleta, Yong Cao, Tom Scogland, Wu-chun Feng.
    In Proceedings of the 23rd IEEE International Parallel and Distributed Processing Symposium (IPDPS), Rome, Italy, May 2009.
       
  • Energy-Efficient Cluster Computing via Accurate Workload Characterization.
    Song Huang, Wu-chun Feng.
    In Proceedings of the 9th IEEE International Symposium on Cluster Computing and the Grid (CCGrid), Shanghai, China, May 2009.
       
  • Making a Case for Proactive Flow Control in Optical Circuit-Switched Networks.
    Mithilesh Kumar, Vineeta Chaube, Pavan Balaji, Wu-chun Feng, Hyun-Wook Jin.
    In Proceedings of the 15th International Conference on High-Performance Computing (HiPC), Bangalore, India, December 2008.
       
  • Asymmetric Interactions in Symmetric Multi-core Systems: Analysis, Enhancements and Evaluation.
    Thomas Scogland, Pavan Balaji, Wu-chun Feng, Ganesh Narayanaswamy.
    In Proceedings of the ACM/IEEE SC|08: The International Conference on High-Performance Computing, Networking, Storage, and Analysis, Austin, Texas, USA, November 2008.
       
  • Massively Parallel Genomic Sequence Search on the Blue Gene/P Architecture.
    Heshan Lin, Pavan Balaji, Ruth Poole, Carlos Sosa, Xiaosong Ma, Wu-chun Feng.
    In Proceedings of the ACM/IEEE SC|08: The International Conference on High-Performance Computing, Networking, Storage, and Analysis, Austin, Texas, November 2008.
       
  • Optimizing Performance, Cost, and Sensitivity in Pairwise Sequence Search on a Cluster of PlayStations.
    Ashwin M. Aji, Wu-chun Feng.
    In Proceedings of the IEEE International Conference on BioInformatics and BioEngineering, Athens, Greece, October 2008.
       
  • Impact of Network Sharing in Multi-core Architectures.
    Ganesh, Narayanaswamy, Pavan Balaji, Wu-chun Feng.
    In Proceedings of the 17th International Conference on Computer Communications and Networks (ICCCN), St. Thomas, U.S. Virgin Islands, August 2008.
       
  • Semantic-based Distributed I/O with the ParaMEDIC Framework.
    Pavan Balaji, Wu-chun Feng, Heshan Lin.
    In Proceedings of the 17th ACM/IEEE International Symposium on High-Performance Distributed Computing, Boston, Massachusetts, USA, June 2008.
       
  • Cell-SWat: Modeling and Scheduling Wavefront Computations on the Cell Broadband Engine.
    Ashwin M. Aji, Wu-chun Feng, Filip Blagojevic, Dimitrios S. Nikolopoulos.
    In Proceedings of the 5th ACM International Conference on Computing Frontiers, Ischia, Italy, May 2008.
       
  • Towards a Virtual Ecosystem for K-8 Education.
    Mark Gardner, Wu-chun Feng.
    In Proceedings of the International Conference on the Virtual Computing Initiative (ICVCI), Research Triangle Park, NC, May 2008.
       
  • Modeling and Analysis of Power in Multicore Network Processors.
    Song Huang, Yan Luo, Wu-chun Feng.
    In Proceedings of the 4th IEEE Workshop on High-Performance, Power-Aware Computing (in conjunction with the 22nd International Parallel and Distributed Processing Symposium (IPDPS)), Miami, Florida, USA, April 2008.
       
  • Semantics-based Distributed I/O for mpiBLAST (Short Paper).
    Pavan Balaji, Wu-chun Feng, Jeremy Archuleta, Heshan Lin, Rajkumar Kettimuthu, Rajeev Thakur, Xiaosong Ma.
    In Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Salt Lake City, Utah, USA, February 2008.
       
  • Achieving Edge-Based Fairness in a Multi-Hop Environment.
    Mustafa Arisoylu, Wu-chun Feng.
    In Proceedings of the 5th IEEE Consumer Communications and Networking Conference, Las Vegas, Nevada, USA, January 2008.
       
  • Algorithms for Integrated Routing and Scheduling for Aggregating Data from Distributed Resources on a Lambda Grid.
    Amitabha Banerjee, Wu-chun Feng, Dipak Ghosal, Biswanath Mukherjee.
    In IEEE Transactions on Parallel and Distributed Systems, 19 (1): January 2008.
       
  • Analyzing the Impact of Supporting Out-of-Order Communication on In- Order Performance with iWARP.
    Pavan Balaji, Wu-chun Feng, Sitha Bhagvat, Dhabaleswar Panda, Rajeev Thakur, William Gropp.
    In Proceedings of the ACM/IEEE SC|07: The International Conference on High-Performance Computing, Networking, Storage, and Analysis, Reno, Nevada, USA, November 2007.
       
  • A Maintainable Software Architecture for Fast and Modular Bioinformatics Sequence Search.
    Jeremy S. Archuleta, Eli Tilevich, Wu-chun Feng.
    In Proceedings of the 23rd IEEE International Conference on Software Maintenance, Paris, France, October 2007.
       
  • An Analysis of 10-Gigabit Ethernet Protocol Stacks in Multicore Environments.
    Ganesh Narayanaswamy, Pavan Balaji, Wu-chun Feng.
    In Proceedings of the 15th International Symposium on High-Performance Interconnects (HotI 2007), Palo Alto, California, August 2007.
       
  • A Pluggable Framework for Parallel Pairwise Sequence Search.
    Jeremy S. Archuleta, Wu-chun Feng, Eli Tilevich.
    In Proceedings of the International Conference of the IEEE Engineering in Medicine and Biology Society, Lyon, France, August 2007.
       
  • Parallel Genomic Sequence-Search on a Massively Parallel System.
    Oystein Thorsen, Brian Smith, Carlos P. Sosa, Karl Jiang, Heshan Lin, Amanda Peters, Wu-chun Feng.
    In Proceedings of the ACM International Conference on Computing Frontiers, Ischia, Italy, May 2007.
       
  • Parallel Genomic Sequence-Searching on an Ad-Hoc Grid: Experiences, Lessons Learned, and Implications.
    Mark K. Gardner, Wu-chun Feng, Jeremy S. Archuleta, Heshan Lin, Xiaosong Ma.
    In Proceedings of the ACM/IEEE SC|06: The International Conference on High-Performance Computing, Networking, Storage, and Analysis, Tampa, FL, November 2006.
    Best Paper Nominee
       
  • End-System Aware, Rate-Adaptive Protocol for Network Transport in LambdaGrid Environments.
    Pallab Datta, Wu-chun Feng, Sushant Sharma.
    In Proceedings of the ACM/IEEE SC|06: The International Conference on High-Performance Computing, Networking, Storage, and Analysis, Tampa, FL, November 2006.
       
  • Exploring I/O Strategies for Parallel Sequence Database Search Tools with S3aSim.
    Avery Ching, Wu-chun Feng, Heshan Lin, Xiaosong Ma, Alok Choudhary.
    In Proceedings of the 15th IEEE International Symposium on High-Performance Distributed Computing (HPDC’06), Paris, France, June 2006.
       
  • Bridging the Ethernet-Ethernot Performance Gap.
    Pavan Balaji, Wu-chun Feng, Dhabaleswar K. Panda.
    In IEEE Micro (Special Issue: High-Performance Interconnects), 26 (3): 24-40, May 2006.
       
  • A Feedback Mechanism for Network Scheduling in LambdaGrids.
    Pallab Datta, Sushant Sharma, Wu-chun Feng.
    In Proceedings of the 6th IEEE/ACM Symposium on Cluster Computing and the Grid (CCGrid’06), Singapore, May 2006.
       
  • RAPID: An End-System Aware Protocol for Intelligent Data-Transfer over LambdaGrids.
    Amitabha Banerjee, Wu-chun Feng, Biswanath Mukherjee, Dipak Ghosal.
    In Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS), Rhodes, Greece, April 2006.
       
  • End-system Performance Aware Transport over Optical Circuit-Switched Connections.
    Amitabha Banerjee, Wu-chun Feng, Biswanath Mukherjee, Dipak Ghosal.
    In Proceedings of the IEEE INFOCOM High-Speed Networking Workshop: The Terabits Challenge (in conjunction with the 25th IEEE INFOCOM), Barcelona, Spain, April 2006.
       
  • Making a Case for a Green500 List.
    Sushant Sharma, Chung-hsing Hsu, Wu-chun Feng.
    In Proceedings of the 2nd IEEE Workshop on High-Performance, Power-Aware Computing (in conjunction with the 20th International Parallel and Distributed Processing Symposium (IPDPS)), Rhodes, Greece, April 2006.
       
  • Efficient Data Handling in Comparative Genome Analysis Applications.
    Heshan Lin, Xiaosong Ma, Wu-chun Feng, Al Geist, Nagiza Samatova.
    In Proceedings of the 12th SIAM Conference on Parallel Processing for Scientific Computing, San Francisco, California, February 2006. (electronic version unavailable)
       
  • Whole Genome, Physics-Based Sequence Alignment for Pathogen Signature Design.
    Jason D. Gans, Wu-chun Feng, Murray Wolinsky.
    In Proceedings of the 12th SIAM Conference on Parallel Processing for Scientific Computing, San Francisco, California, February 2006. (electronic version unavailable)
       
  • A Case for UDP Offload Engines in LambdaGrids.
    Venkatram Vishwanath, Pavan Balaji, Wu-chun Feng, Jason Leigh, Dhabaleswar K. Panda.
    In Proceedings of the 4th International Workshop on Protocols for Fast Long-Distance Networks (PFLDnet’06), Nara, Japan, February 2006.
       
  • Efficient Data Handling in Comparative Genome Analysis Applications.
    Heshan Lin, Xiaosong Ma, Wu-chun Feng, A. Geist, Nagiza Samatova.
    In 12th SIAM Conference on Parallel Processing for Scientific Computing, February 2006.
       
  • Whole Genome, Physics-Based Sequence Alignment for Pathogen Signature Design.
    J. Gans, Wu-chun Feng, M. Wolinsky.
    In 12th SIAM Conference on Parallel Processing for Scientific Computing, February 2006.
       
  • A Power-Aware Run-Time System for High-Performance Computing.
    Chung-hsing Hsu, Wu-chun Feng.
    In Proceedings of the ACM/IEEE SC2005: The International Conference on High-Performance Computing, Networking, and Storage, Seattle, Washington, November 2005.
       
  • Analyzing MPI Performance over 10-Gigabit Ethernet.
    Justin (Gus) Hurwitz, Wu-chun Feng.
    In Journal of Parallel and Distributed Computing, Special Issue: Design and Performance of Networks for Super-, Cluster-, and Grid-Computing, 65 (10): 1253-1260, October 2005.
    LA-UR 05-2010.  
       
  • Anatomy of UDP and M-VIA for Cluster Communication.
    Xiao Zhang, Laxmi N. Bhuyan, Wu-chun Feng.
    In Journal of Parallel and Distributed Computing, Special Issue: Design and Performance of Networks for Super-, Cluster-, and Grid-Computing, 65 (10): 1290-1298, October 2005.
    LA-UR 05-2009.  
       
  • Head-to-TOE Evaluation of High-Performance Sockets over Protocol Offload Engines.
    Pavan Balaji, Wu-chun Feng, Qi Gao, Ranjit Noronha, Weikuan Yu, Dhabaleswar K. Panda.
    In Proceedings of the 7th IEEE International Conference on Cluster Computing (CLUSTER'05), Boston, Massachusetts, September 2005.
    LA-UR 05-4148.  
       
  • A Feasibility Analysis of Power Awareness in Commodity-Based High-Performance Clusters.
    Chung-hsing Hsu, Wu-chun Feng.
    In Proceedings of the 7th IEEE International Conference on Cluster Computing (CLUSTER, Boston, Massachusetts, September 2005.
       
  • Towards Efficient Supercomputing: A Quest for the Right Metric.
    Chung-hsing Hsu, Wu-chun Feng, Jeremy S. Archuleta.
    In Proceedings of the 1st IEEE Workshop on High-Performance, Power-Aware Computing (in conjunction with the 19th International Parallel and Distributed Processing Symposium (IPDPS)), Denver, Colorado, April 2005. Presentation
       
  • Reducing Overheating-Induced Failures via Performance-Aware CPU Power Management.
    Chung-hsing Hsu, Wu-chun Feng.
    In Proceedings of the 6th International Conference on Linux Clusters: The HPC Revolution 2005, April 2005.
    LA-UR 05-1581.  
       
  • Q-Composer and CpR: A Probabilistic Synthesizer and Regulator of Traffic.
    Sami Ayyorgun, Sarut Vanichpun, Wu-chun Feng.
    In Proceedings of the 24th IEEE Conference on Computer Communications (INFOCOM'05), Miami, Florida, March 2005.
       
  • Routing and Scheduling Large File Transfers over Lambda Grids.
    Amitabha Banerjee, Wu-chun Feng, Dipak Ghosal, Biswanath Mukherjee.
    In Proceedings of the 3rd International Workshop on Protocols for Fast Long-Distance Networks (PFLDnet'05), Lyon, France, February 2005.
    LA-UR 05-7911.  
       
  • Effective Dynamic Voltage Scaling through CPU-Boundedness Detection.
    Chung-hsing Hsu, Wu-chun Feng.
    In Lecture Notes in Computer Science, February 2005.
    LA-UR 04-7195.  (This paper is an extended version of a paper that appeared in 4th IEEE/ACM Workshop on Power-Aware Computer Systems in conjunction with the 37th IEEE/ACM International Symposium on Microarchitecture, Portland, OR, December 2004.)
       
  • When Discreteness Meets Continuity: Energy-Optimal DVS Scheduling Revisited.
    Chung-hsing Hsu, Wu-chun Feng.
    Los Alamos technical report LA-UR 05-3104, February 2005.
       
  • FAST TCP: From Theory to Experiments.
    Cheng Jin, David Wei, Steven H. Low, Gary Buhrmaster, Julian Bunn, Hyojeong D. Choe, Les R. Cottrell, John C. Doyle, Wu-chun Feng, Oliver Martin, Harvey Newman, Fernando Paganini, Sylvian Ravot, Suresh Singh.
    In IEEE Network, 19 (1): 4-11, January 2005.
    LA-UR 05-4539.  
       
  • A Systematic Approach for Providing End-to-end Probabilistic QoS Guarantee.
    Sami Ayyorgun, Wu-chun Feng.
    In Proceedings of the 13th IEEE International Conference on Computer Communications and Networks (ICCCN'04), Chicago, Illinois, October 2004.
    LA-UR 03-7267.  
       
  • User-Space Auto-Tuning for TCP Flow Control in Computational Grids.
    Mark K. Gardner, Sunil Thulasidasan, Wu-chun Feng.
    In Computer Communications, Special Issue on Network Support for Grid Computing,, 27 (14): 1364-1374, September 2004.
    LA-UR 03-1807.  
       
  • Scheduling and Transport for File Transfers on High-Speed Optical Circuits.
    Malathi Veeraraghavan, Xuan Zheng, Wu-chun Feng, Hojun Lee, Edwin Chong, Hua Li.
    In Journal of Grid Computing, 1 (4): 395-405, June 2004.
    LA-UR 04-2008.  
       
  • Re-Architecting Flow-Control Adaptation for Grid Environments.
    Adam Engelhart, Mark K. Gardner, Wu-chun Feng.
    In Proceedings of the 18th IEEE International Parallel and Distributed Processing Symposium, (IPDPS'04), Santa Fe, New Mexico, April 2004.
    LA-UR 04-0494.  
       
  • A Deterministic Characterization of Network Traffic for Average Performance Guarantee.
    Sami Ayyorgun, Wu-chun Feng.
    In Proceedings of the 38th Annual Conference on Information Sciences and Systems (CISS'04), Princeton, New Jersey, March 2004.
    LA-UR 03-4477.  
       
  • A High-Fidelity Software Oscilloscope for Globus.
    Mark K. Gardner, Wei Deng, Stephen T. Markham, Celso L. Mendes, Wu-chun Feng, Daniel A. Reed.
    In Proceedings of the GlobusWorld 2004, San Francisco, California, January 2004.
    LA-UR 03-7681.  
       
  • End-to-End Performance of 10-Gigabit Ethernet on Commodity Systems.
    Justin (Gus) Hurwitz, Wu-chun Feng.
    In IEEE Micro, 24 (1): 10-22, January 2004.
    LA-UR 04-0743.  
       
  • GREEN: A Practical Solution for Ensuring Fairness in a Best-Effort Network.
    Apu C. Kapadia, Wu-chun Feng, R. Campbell.
    Los Alamos technical report LA-UR 03-2372, 2003.
       
  • CHEETAH: Circuit-Switched High-Speed End-to-End Transport ArcHitecture.
    Malathi Veeraraghavan, Xuan Zheng, Hojun Lee, Mark K. Gardner, Wu-chun Feng.
    In Proceedings of the SPIE/IEEE Optical Networking and Computer Communications Conference (OptiComm), Dallas TX, October 2003.
    Best Paper Award
       
  • A Restricted Matching Problem on Bipartite Graphs.
    Sami Ayyorgun, Wu-chun Feng.
    Los Alamos technical report LA-UR 03-7269, September 2003.
       
  • A Systematic Approach to Probablistic Quality of Service Guarantees in Communication Networks.
    Sami Ayyorgun, Wu-chun Feng.
    Los Alamos technical report LA-UR 03-4477, September 2003.
       
  • A New Traffic Model and Its Implications over Multiplexers and Switches.
    Sami Ayyorgun, Wu-chun Feng.
    Los Alamos technical report LA-UR 03-7268, September 2003.
       
  • Initial End-to-End Performance Evaluation of 10-Gigabit Ethernet.
    Justin (Gus) Hurwitz, Wu-chun Feng.
    In Proceedings of the IEEE Hot Interconnects: A Symposium on High-Performance Interconnects, Palo Alto, California, August 2003.
    LA-UR 03-2713.  
       
  • Optimizing GridFTP Through Dynamic Right-Sizing.
    Sunil Thulasidasan, Wu-chun Feng, Mark K. Gardner.
    In Proceedings of the IEEE Symposium on High-Performance Distributed Computing. (HPDC-2003), Seattle, Washington, June 2003.
    LA-UR 03-2486.  
       
  • A Deterministic Definition of Burstiness for Network Traffic Characterization.
    Sami Ayyorgun, Wu-chun Feng.
    Los Alamos technical report LA-UR 03-4477, June 2003.
       
  • The Design, Implementation, and Evaluation of mpiBLAST.
    Aaron E. Darling, Lucas Carey, Wu-chun Feng.
    In Proceedings of the ClusterWorld Conference, San Jose, California, June 2003.
    LA-UR 03-2862.  
    Best Paper: Applications Track
       
  • MAGNET: A Tool for Debugging, Analysis and Adaptation in Computing Systems.
    Mark K. Gardner, Wu-chun Feng, Michael Broxton, Adam Engelhart, Justin (Gus) Hurwitz.
    In Proceedings of the 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid'2003), Tokyo, Japan, May 2003.
    LA-UR 02-7170.  
       
  • A Probabilistic Definition of Burstiness Characterization.
    Sami Ayyorgun, Wu-chun Feng.
    Los Alamos technical report LA-UR 03-3668, May 2003.
       
  • MUSE: A Software Oscilloscope for Clusters and Grids.
    Mark K. Gardner, Michael Broxton, Adam Engelhart, Wu-chun Feng.
    In Proceedings of the 17th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2003), Nice, France, April 2003.
    LA-UR 02-7169.  
       
  • GREEN+IDMAPS: A Practical Solution for Ensuring Fairness in a Biased Internet.
    Apu C. Kapadia, Sunil Thulasidasan, Wu-chun Feng.
    Los Alamos technical report LA-UR 02-6126, 2002. (electronic version unavailable)
       
  • High-Density Computing: A 240-Node Beowulf in One Cubic Meter.
    Michael Warren, Eric Weigle, Wu-chun Feng.
    In Proceedings of the SC 2002: High-Performance Networking and Computing Conference (SC2002), Baltimore, Maryland, November 2002.
    LA-UR 02-5194.  
       
  • mpiBLAST: Parallelization of BLAST for Computational Clusters.
    Aaron E. Darling, Wu-chun Feng.
    In SC 2002: High-Performance Networking and Computing Conference (SC2002), Baltimore, Maryland, November 2002.
    LA-UR 02-7222.  
       
  • On the Transient Behavior of TCP Vegas.
    Sarut Vanichpun, Wu-chun Feng.
    In Proceedings of the 11th IEEE International Conference on Computer Communications and Networks (IC3N'02), Miami, Florida, October 2002.
    LA-UR 02-5522.  
       
  • BLASTing Off with Green Destiny.
    Aaron E. Darling, Wu-chun Feng.
    In IEEE Computer Society Bioinformatics Conference, (CSB, Palo Alto, California, August 2002.
    LA-UR 02-5598.  
       
  • Performance Tradeoffs of TCP Adaptation Methods.
    Nageswara S. V. Rao, Wu-chun Feng.
    In Proceedings of the IEEE International Conference on Networks (ICN'02), Atlanta, Georgia, August 2002.
    LA-UR 02-3589.  
       
  • Packet Spacing: An Enabling Mechanism for the Delivery of Multimedia Content in Computational Grids.
    Annette C. Feng, Apu C. Kapadia, Wu-chun Feng, Geneva G. Belford.
    In Journal of Supercomputing, 23 (1): August 2002.
    LA-UR 01-4904.  (extended version)
       
  • Dynamic Right-Sizing in FTP (drsFTP): An Automatic Technique for Enhancing Grid Performance.
    Mark K. Gardner, Wu-chun Feng, Michael E. Fisk.
    In Proceedings of the IEEE Symposium on High-Performance Distributed Computing. (HPDC-2002), Edinburgh, Scotland, July 2002.
    LA-UR 02-2799.  
       
  • A Comparison of TCP Automatic-Tuning Techniques for Distributed Computing.
    Eric Weigle, Wu-chun Feng.
    In Proceedings of the IEEE Symposium on High-Performance Distributed Computing (HPDC'02), Edinburgh, Scotland, July 2002.
    LA-UR 02-2492.  
       
  • TICKETing High-Speed Traffic with Commodity Hardware and Software.
    Eric Weigle, Wu-chun Feng.
    In Proceedings of the Passive & Active Measurement Workshop (PAM2002), Fort Collins, Colorado, March 2002.
    LA-UR 02-0809.  
       
  • Monitoring Protocol Traffic with a MAGNeT.
    Mark K. Gardner, Wu-chun Feng, Jeffrey R. Hay.
    In Proceedings of the Passive & Active Measurement Workshop (PAM2002), Fort Collins, Colorado, March 2002.
    LA-UR 02-0808.  
       
  • The Quadrics Network : High-Performance Clustering Technology.
    Fabrizio Petrini, Wu-chun Feng, Adolfy Hoisie, Salvador Coll, Eitan Frachtenberg.
    In IEEE Micro, 22 (1): 46-57, January 2002.
    LA-UR 01-4100.  (extended version)
       
  • Performance Evaluation of the Quadrics Interconnection Network.
    Fabrizio Petrini, Adolfy Hoisie, Wu-chun Feng, Richard Graham.
    Los Alamos technical report LA-UR 00-5205, 2001.
       
  • Capturing Network Traffic with a MAGNeT.
    Jeffrey R. Hay, Wu-chun Feng, Mark K. Gardner.
    In Proceedings of the 5th Annual Linux Showcase and Conference (ALS'01), Oakland, California, November 2001.
    LA-UR 01-5065.  SC2001 Pamphlet LA-UR 01-5656: PDF
       
  • Dynamic Right-Sizing: TCP Flow-Control Adaptation.
    Michael E. Fisk, Wu-chun Feng.
    In SC 2001: High-Performance Networking and Computing Conference (SC 2001), Denver, Colorado, November 2001.
    LA-UR 01-2823.  
       
  • Packet Spacing: An Enabling Mechanism for the Delivery of Multimedia Content in Computational Grids.
    Annette C. Feng, Wu-chun Feng, Geneva G. Belford.
    In Proceedings of the 2nd Los Alamos Computer Science Institute Symposium (LACSI 2001), Santa Fe, New Mexico, October 2001.
    LA-UR 01-4859.  (Invited for publication in Journal of Supercomputing)
       
  • An Inter-Realm, Cyber-Security Infrastructure for Virtual Supercomputing.
    Jalal Al-Muhtadi, Wu-chun Feng, Michael E. Fisk.
    In Proceedings of the 2nd Annual Los Alamos Computer Science Institute Symposium. (LACSI 2001), Santa Fe, New Mexico, October 2001.
    LA-UR 01-4860, LA-UR 01-5757, LA-UR 02-3313.  
       
  • Dynamic Right-Sizing in TCP.
    Michael E. Fisk, Wu-chun Feng.
    In Proceedings of the 2nd Annual Los Alamos Computer Science Institute Symposium (LACSI 2001), Santa Fe, New Mexico, October 2001.
    LA-UR 01-5460.  
       
  • Dynamic Right-Sizing: A Simulation Study.
    Eric Weigle, Wu-chun Feng.
    In Proceedings of the 10th International Conference on Computer Communication and Networking (ICCCN 2001), Scottsdale, Arizona, October 2001.
    LA-UR 01-5062.  
       
  • Gang Scheduling with Lightweight User-Level Communication.
    Eitan Frachtenberg, Fabrizio Petrini, Salvador Coll, Wu-chun Feng.
    In Proceedings of the Workshop on Scheduling and Resource Management for Cluster Computing (in conjunction with the ICPP01) (SRMCC 2001), Valencia, Spain, September 2001.
    LA-UR 00-0894.  
       
  • The Quadrics Network (QsNet): High-Performance Clustering Technology.
    Fabrizio Petrini, Wu-chun Feng, Adolfy Hoisie, Salvador Coll, Eitan Frachtenberg.
    In Proceedings of the 9th IEEE Hot Interconnects. (HotI'01), Palo Alto, California, August 2001.
    LA-UR 01-4100.  (Selected for publication in IEEE Micro)
       
  • A Case for TCP Vegas in High-Performance Computational Grids.
    Eric Weigle, Wu-chun Feng.
    In Proceedings of the 9th IEEE International Symposium on High-Performance Distributed Computing (HPDC'01), San Francisco, California, August 2001.
    LA-UR 01-3420.  
       
  • The Effects of Inter-Packet Spacing on the Delivery of Multimedia Content.
    Apu C. Kapadia, Annette C. Feng, Wu-chun Feng.
    In Proceedings of the 21st International Conference on Distributed Computing Systems (ICDCS'01), Scottsdale, Arizona, April 2001.
    LA-UR 01-0904.  
       
  • Performance Evaluation of the Quadrics Interconnection Network.
    Fabrizio Petrini, Adolfy Hoisie, Wu-chun Feng, Richard Graham.
    In Proceedings of the IEEE Workshop on Communication Architectures for Clusters (CAC'01), held in conjunction with IPDPS 2001, San Francisco, California, April 2001.
    LA-UR 00-5205.  
       
  • Why TCP Will Not Scale for the Next-Generation Internet.
    Eric Weigle, Wu-chun Feng, Mark K. Gardner.
    In Proceedings of the 11th IEEE Workshop on Local and Metropolitan Area Networks. (LANMAN 2001), Boulder, Colorado, March 2001.
    LA-UR 01-1039.  
       
  • Rate-Adjustment Algorithm for Aggregate TCP Congestion Control.
    Peerapol Tinnakornsrisuphap, R. Agrawal, Wu-chun Feng.
    Los Alamos technical report LA-UR 00-4220, 2000.
       
  • Improved Resource Utilization with Buffered Coscheduling.
    Fabrizio Petrini, Wu-chun Feng.
    In Journal of Parallel Algorithms and Applications (Special Issue), 16 (2-3): 123-144, 2000.
    LA-UR 00-3476.  
       
  • Dynamic Adjustment of TCP Window Sizes.
    Michael E. Fisk, Wu-chun Feng.
    Los Alamos technical report LA-UR 00-3321, 2000.
       
  • Buffered Coscheduling: A New Methodology for Multitasking Parallel Jobs on Distributed Systems.
    Fabrizio Petrini, Wu-chun Feng.
    In Proceedings of the 14th International Parallel and Distributed Processing Symposium (IPDPS), Cancun, Mexico, May 2000.
    LA-UR 00-0892.  
       
  • Time-Sharing Parallel Jobs in the Presence of Multiple Resource Requirements.
    Fabrizio Petrini, Wu-chun Feng.
    In Proceedings of the 6th International Workshop on Job Scheduling Strategies for Parallel Processing, May 2000.
    LA-UR 00-6023.  Also in Lecture Notes in Computer Science Vol. 1911
       
  • Scheduling with Global Information in Distributed Systems.
    Fabrizio Petrini, Wu-chun Feng.
    In Proceedings of the 20th International Conference on Distributed Computing Systems (ICDCS'00), Taipei, Taiwan, April 2000.
    LA-UR 00-0894.  
       
  • On the Burstiness of the TCP Congestion-Control Mechanism in a Distributed Computing System.
    Peerapol Tinnakornsrisuphap, Wu-chun Feng, Ian R. Philp.
    In Proceedings of the 20th International Conference on Distributed Computing Systems (ICDCS'00), Taipei, Taiwan, April 2000.
    LA-UR 00-0481.  
       
  • Efficient Resource Utilization on a Massively Parallel Systems.
    Fabrizio Petrini, Wu-chun Feng.
    In Proceedings of the 7th International Conference on Advanced Computing & Communications . (ADCOM'99), Roorkee, India, December 1999.
    LA-UR 99-4930.  
       
  • From HiPPI-800 to HiPPI-6400: A Changing of the Guard and Gateway to the Future.
    Don E. Tolmie, Mike T. Boorman, Andy DuBois, Dave DuBois, Wu-chun Feng, Ian R. Philp.
    In Proceedings of the 6th International Conference on Parallel Interconnects (PI'99), Anchorage, Alaska, October 1999.
    LA-UR 99-4450.