Publications

2014
 
"Adaptive Cache Management for Energy-efficient GPU Computing", Chen, Xuhao; Chang, Li-Wen; Rodrigues, Christopher I.; Lv, Jie; Wang, Zhiying; Hwu, Wen-mei, Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, December 2014 . [more...]
 
"Supporting High-Level, High-Performance Parallel Programming with Library-Driven Optimization", Rodrigues, Christopher I., University of Illinois Doctoral Disertation, May 2014. [more...]
 
"Automatic execution of single-GPU computations across multiple GPUs", Cabezas, Javier; Vilanova, Lluis; Gelado, Isaac; Jablin, Tom; Navarro, Nacho; Hwu, Wen-mei, Proceedings of the 23rd international conference on Parallel architectures and compilation (PACT '14). [more...]
 
"High Performance Histogramming on Massively Parallel Processors", Ross, Greg, University of Illinois Masters Disertation, August 2014. [more...]
 
"Scalable Parallel Tridiagonal Algorithms with Diagonal Pivoting and Their Optimization for Many-core Architectures", Chang, Li-Wen, University of Illinois Master Thesis, July 2014? . [more...]
 
"Adaptive Cache Bypass and Insertion for Many-core Accelerators", Chen, Xuhao; Wu, Shengzhao; Chang, Li-Wen; Huang, Wei-Sheng; Pearson, Carl; Hwu, Wen-mei, Proceedings of International Workshop on Manycore Embedded Systems, 2014. [more...]
 
"Dynamic Loop Vectorization for Executing OpenCL Kernels on CPUs", El Hajj, Izzat, University of Illinois Masters Disertation, May 2014. [more...]
 
"Runtime and Architecture Support for Efficient Data Exchange in Multi-Accelerator Applications", Cabezas, Javier; Gelado, Isaac; Stone, John E.; Navarro, Nacho; Kirk, David; Hwu, Wen-mei, IEEE Transactions on Parallel and Distributed Systems, Issue:99. [more...]
 
"Triolet: A Programming System that Unifies Algorithmic Skeleton Interfaces for High-Performance Cluster Computing", Rodrigues, Christopher I.; Dakkak, Abdul; Jablin, Tom; Hwu, Wen-mei, Proceedings of the 2014 ACM SIGPLAN Conference on Principles and Practice of Parallel Programing, February 2014. [more...]
 
"Multi-tier Dynamic Vectorization for Translating GPU Optimizations into CPU Performance", Kim, Hee-Seok; El Hajj, Izzat; Stratton, John A.; Hwu, Wen-mei, IMPACT Technical Report, IMPACT-14-01, University of Illinois at Urbana-Champaign, Center for Reliable and High-Performance Computing, February 5, 2014. [more...]
 
"In-place transposition of rectangular matrices on accelerators", Sung, Ray; Gómez-Luna, Juan; González-Linares, José María; Guil, Nicolás; Hwu, Wen-mei, PPoPP '14 Proceedings of the 19th ACM SIGPLAN symposium on Principles and practice of parallel programming. [more...]
 
"BLESS: Bloom filter-based error correction solution for high-throughput sequencing reads", Heo, Yun; Wu, Xiao-Long; Chen, Deming; Ma, Jian; Hwu, Wen-mei; Aarts, Baastian, Advance Access Publication. [more...]
 
2013
 
"Mapping Tridiagonal Solvers to Linear Recurrences", Chang, Li-Wen; Hwu, Wen-mei, IMPACT Technical Report, IMPACT-13-01, University of Illinois at Urbana-Champaign, Center for Reliable and High-Performance Computing, Sept. 8, 2013 . [more...]
 
"Throughput-Oriented Kernel Porting onto FPGAs", Papakonstantinou, Alexandros; Chen, Deming; Hwu, Wen-mei; Cong, Jason; Liang, Yun, Proceedings of the 50th Annual Design Automation Conference, May 2013 . [more...]
 
"Performance Portability in Accelerated Parallel Kernels", Stratton, John A.; Kim, Hee-Seok; Jablin, Tom; Hwu, Wen-mei, IMPACT Technical Report, IMPACT-13-01, University of Illinois at Urbana-Champaign, Center for Reliable and High-Performance Computing, May 18, 2012. [more...]
 
"Real-time in vivo computed optical interferometric tomography", Ahmad, Adeel; Shemonski, Nathan; Adie, Steven; Kim, Hee-Seok; Hwu, Wen-mei; Carney, Scott; Boppart, Stephen, Nature Photonics. [more...]
 
"Comparison Based Sorting for Systems with Multiple GPUs", Tanasic, Ivan; Vilanova, Lluis; Jordà, Marc; Cabezas, Javier; Gelado, Isaac; Navarro, Nacho; Hwu, Wen-mei, In GPGPU-6 , Six Workshop on General Purpose Processing Using GPUs, Mar 2013, ISBN: 978-1-4503-2017-7. [more...]
 
"Rapid Computation of Sodium Bioscales Using GPU-Accelerated Image Reconstruction", Atkinson, Ian C. ; Liu, Geng; Obeid, Nady; Thulborn, Keith R.; Hwu, Wen-mei, International Journal of Imaging Systems and Technology, Volume 23, Issue 1, pages 29-35, March 2013. [more...]
 
"More IMPATIENT: A gridding-accelerated Toeplitz-based strategy for non-Cartesian high-resolution 3D MRI on GPUs", Gai, Jiading; Obeid, Nady; Holtrop, Joseph L. ; Wu, Xiao-Long; Lam, Fan; Fu, Maojing; Haldar, Justin P.; Hwu, Wen-mei; Liang, zhi-Pei; Sutton, Bradley P., Journal of Parallel and Distributed Computing, 16 January 2013. [more...]
 
2012
 
"TIGER: Tiled iterative genome assembler", Wu, Xiao-Long; Heo, Yun; El Hajj, Izzat; Hwu, Wen-mei; Chen, Deming; Ma, Jian, Journal of BMC Bioinformatics, 2012 . [more...]
 
"Design evaluation of OpenCL compiler framework for Coarse-Grained Reconfigurable Arrays", Kim, Hee-Seok; Ahn, Minwook; Stratton, John A.; Hwu, Wen-mei, Proceedings of the Field-Programmable Technology (FPT) International Conference, Dec 2012 . [more...]
 
"A Scalable, Numerically Stable, High-performance Tridiagonal Solver using GPUs", Chang, Li-Wen; Stratton, John A.; Kim, Hee-Seok; Hwu, Wen-mei, Proceedings of the International Conference for High Performance Computing, Networking Storage and Analysis, 2012. [more...]
 
"Performance Analysis andTuning for General Purpose Graphics Processing Units (GPGPU)", Kim, Hyesoon; Vudue, Richard; Baghsorkhi, Sara Sadeghi; Choi, Jee; Hwu, Wen-mei, Synthesis Lectures on Computer Architecture, Morgan & Claypool Publishers 2012 . [more...]
 
"Unboxed Polymorphic Objects for Functional Numerical Programming", Rodrigues, Christopher I.; Hwu, Wen-mei, IMPACT Technical Report IMPACT-12-02, University of Illinois at Urbana-Champaign, 2012. [more...]
 
"Algorithm and Data Optimization Techniques for Scaling to Massively Threaded Systems", Stratton, John A.; Rodrigues, Christopher I.; Sung, Ray; Chang, Li-Wen; Anssari, Nasser; Liu, Daniel; Hwu, Wen-mei; Obeid, Nady, IEEE Computer, vol. 45, no. 8, pp. 26-32, Aug. 2012 . [more...]
 
"Optimization and Architecure Effects on GPU Computing Workload Performance", Stratton, John A.; Anssari, Nasser; Rodrigues, Christopher I.; Sung, Ray; Obeid, Nady; Chang, Li-Wen; Liu, Daniel; Hwu, Wen-mei, Proceedings of the IEEE Conference on Innovative Parallel Computing, May 2012. [more...]
 
"DL: Data Layout Transformation System for Heterogeneous Computing", Sung, Ray; Liu, Daniel; Hwu, Wen-mei, IEEE Innovative Parallel Computing (InPar 2012), San Jose, CA, May 13--14, 2012. [more...]
 
"More IMPATIENT : A Gridding - Accelerated Toeplitz - based S trategy for Non - Cartesian High - Resolution 3D MRI on GPU", Gai, Jiading; Holtrop, Joseph L. ; Wu, Xiao-Long; Lam, Fan; Fu, Maojing; Haldar, Justin P.; Hwu, Wen-mei; Liang, zhi-Pei; Sutton, Bradley P., Proceedings of the International Society for Magnetic Resonance in Medicine (ISMRM), May 2012 . [more...]
 
"Parboil: A Revised Benchmark Suite for Scientific and Commercial Throughput Computing", Stratton, John A.; Rodrigues, Christopher I.; Sung, Ray; Obeid, Nady; Chang, Li-Wen; Anssari, Nasser; Liu, Daniel; Hwu, Wen-mei, IMPACT Technical Report, IMPACT-12-01, University of Illinois at Urbana-Champaign, Center for Reliable and High-Performance Computing, March 2, 2012. [more...]
 
"Efficient Performance Evaluation of Memory Hierarchy for Highly Multithreaded Graphics Processors", Baghsorkhi, Sara Sadeghi; Gelado, Isaac; Delahaye, Matthieu; Hwu, Wen-mei, Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming, February, 2012. [more...]
 
2011
 
"A Tiling-Scheme Viterbi Decoder in Software-Defined Radio for GPUs", Lin, Chih-Sheng; Liu, Wei-Lun; Yeh, Wei-Ting; Chang, Li-Wen; Hwu, Wen-mei; Chen, Sao-Jie; Hsiung, Pao-Ann, Proceedings of the 7th International Conference on Wireless Communications, Networking and Mobile Computing (WiCOM), 2011 . [more...]
 
"Scalable SIMD-parallel memory allocation for many-core machines", Huang, Victor; Rodrigues, Christopher I.; Jones, Stephen; Buck, Ian; Hwu, Wen-mei, The Journal of Supercomputing, 9 Sep 2011. [more...]
 
"Scalable Tridiagonal Solver for GPUs", Kim, Hee-Seok; Chang, Li-Wen; Hwu, Wen-mei, Proceedings of the International Conference on Parallel Processing, September 2011. [more...]
 
"IMPATIENT MRI: Illinois Massively Parallel Acceleration Toolkit for Image Reconstruction with ENhanced Throughput in MRI", Wu, Xiao-Long; Gai, Jiading; Lam, Fan; Fu, Maojing; Haldar, Justin P.; Zhuo, Yue; Liang, zhi-Pei; Hwu, Wen-mei; Sutton, Bradley P., Proceedings of the International Society for Magnetic Resonance in Medicine (ISMRM), May 2011. [more...]
 
"Parallel Implementation of Multi-Dimensional Ensemble Empirical Mode Decomposition", Chang, Li-Wen; Lo, Men-Tzung; Anssari, Nasser; Hsu, Liang-Chuan; Huang, Norden E.; Hwu, Wen-mei, Proceedings of the IEEE Conference on Acoustics, Speech and Signal Processing, May 2011. [more...]
 
"Multilevel Granularity Parallelism Synthesis on FPGAs", Papakonstantinou, Alexandros; Liang, Yun; Stratton, John A.; Gururaj, Karthik; Chen, Deming; Hwu, Wen-mei; Cong, Jason, Proceedings of the 2011 International Symposium on Field-Programmable Custom Computing Machines (FCCM), May 2011. (Best Paper Award from FCCM 2011) [more...]
 
"Auto-tuning of Fast Fourier Transform on Graphics Processors", Dotsenko, Yuri; Baghsorkhi, Sara Sadeghi; Lloyd, Brandon; Govindaraju, Naga, Proceedings of the 16th ACM SIGPLAN Symposium on Principles and Practice of Paral lel Programming (PPoPP), Feb. 2011. [more...]
 
"Advanced MRI Reconstruction Toolbox with Accelerating on GPUs", Wu, Xiao-Long; Zhuo, Yue; Gai, Jiading; Lam, Fan; Fu, Maojing; Haldar, Justin P.; Hwu, Wen-mei; Liang, zhi-Pei; Sutton, Bradley P., Proceedings of the IS&T/SPIE Electronic Imaging 2011 Conference on "Parallel Processing for Imaging Applications", January 2011. [more...]
 
2010
 
"Sparse Regularization in MRI Iterative Reconstruction using GPUs", Zhuo, Yue; Sutton, Bradley P. ; Wu, Xiao-Long; Haldar, Justin P.; Hwu, Wen-mei; Liang, zhi-Pei, Proceedings of the 3rd International Conference on BioMedical Engineering and Informatics (BMEI'10), October 2010. [more...]
 
"Exploiting More Parallelism from Applications Having Generalized Reductions on GPU Architectures", Wu, Xiao-Long; Obeid, Nady; Hwu, Wen-mei, Proceedings of the 10th IEEE International Conference on Computer and Information Technology (CIT 2010), pp.1175-1180, June 2010. [more...]
 
"Data Layout Transformation Exploiting Memory-Level Parallelism in Structured Grid Many-Core Applications", Sung, Ray; Stratton, John A.; Hwu, Wen-mei, Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques (PACT), Vienna, Austria, September 11-15, 2010. [more...]
 
"An Effective GPU Implementation of Breadth-First Search", Luo, Lijuan ; Wong, Martin ; Hwu, Wen-mei, Proceedings of the 47th Design Automation Conference, 2010. [more...]
 
"Implementing a GPU Programming Model on a non-GPU Accelerator Architecture", Kofsky, Stephen M.; Johnson, Daniel R.; Stratton, John A.; Hwu, Wen-mei; Patel, sanjay J.; Lumetta, Steve S, Proceedings of the Workshop on Applications for Multi- and Many-cores, June 2010. [more...]
 
"XMalloc: A Scalable Lock-free Dynamic Memory Allocator for Many-core Machines", Huang, Victor; Rodrigues, Christopher I.; Jones, Stephen; Buck, Ian; Hwu, Wen-mei, Proceedings of the 10th IEEE International Conference on Computer and Information Technology (CIT 2010), pp.1134-1139, June 2010. (Best Paper Award from CIT 2010) [more...]
 
"Multi-GPU Implementation for Iterative MR Image Reconstruction with Field Correction", Zhuo, Yue; Wu, Xiao-Long; Haldar, Justin P.; Hwu, Wen-mei; Liang, zhi-Pei; Sutton, Bradley P., Proceedings of International Society for Magnetic Resonance in Medicine (ISMRM) 2010. [more...]
 
"Accelerating Iterative Field-Compensated MR Image Reconstruction on GPUs", Zhuo, Yue; Wu, Xiao-Long; Haldar, Justin P.; Hwu, Wen-mei; Liang, zhi-Pei; Sutton, Bradley P., Proceedings of the IEEE International Symposium on Biomedical Imaging(ISBI), April, 2010. [more...]
 
"Efficient Compilation of Fine-grained SPMD-threaded Programs for Multicore CPUs", Stratton, John A.; Grover, Vinod; Marathe, Jaydeep; Aarts, Baastian; Murphy, Mike; Hu, Ziang; Hwu, Wen-mei, Proceedings of the International Symposium on Code Generation and Optimization, April 2010 . [more...]
 
"An Asymmetric Distributed Shared Memory Model for Heterogeneous Parallel Systems", Gelado, Isaac; Stone, John E.; Cabezas, Javier; Patel, sanjay J.; Navarro, Nacho; Hwu, Wen-mei, The ACM/IEEE 15th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'10), Pittsburgh, PA., March 13 - 17, 2010. [more...]
 
"Data Layout Transformation for Structured-Grid Codes on GPU", Sung, Ray; Hwu, Wen-mei, Workshop on Language, Compiler, and Architecture Support for GPGPU, in conjunction with PPoPP 2010. [more...]
 
"An Adaptive Performance Modeling Tool for GPU Architectures", Baghsorkhi, Sara Sadeghi; Delahaye, Matthieu; Patel, sanjay J.; Gropp, William D.; Hwu, Wen-mei, Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), Jan. 2010. (Paper of IMPACT - Cited Greater Than 100 Times) [more...]
 
2009
 
"The parallelization of video processing", Lin, Dennis; Huang, Victor; Nguyen, Quang; Blackburn, Joshua; Rodrigues, Christopher I.; Huang, Thomas; Do, Minh N.; Patel, sanjay J.; Hwu, Wen-mei, IEEE Signal Processing Magazine 26(6), 103--112, 2009. [more...]
 
"GPU clusters for high-performance computing", Kindratenko, Volodymyr V.; Enos, Jeremy J. ; Shi, Guochun; Showerman, Michael T. ; Arnold, Galen W. ; Stone, John E.; Phillips, James C. ; Hwu, Wen-mei, Cluster Computing and Workshops, 2009. CLUSTER'09. IEEE International Conference on, 2009. [more...]
 
"FCUDA: Enabling Efficient Compilation of CUDA Kernels onto FPGAs", Pattabiraman, Karthik; Gururaj, Karthik; Stratton, John A.; Chen, Deming; Cong, Jason; Hwu, Wen-mei, (Best Paper Award) Alexander Papakonstantinou, Karthik Gururaj, John A. Stratton, Deming Chen, Jason Cong and Wen-mei W. Hwu Symposium on Application Specific Processors, July 2009. (Best Paper Award) [more...]
 
"Compute Unified Device Architecture Application Suitability", Hwu, Wen-mei; Rodrigues, Christopher I.; Ryoo, Shane; Stratton, John A., Computing in Science and Engineering Vol. 11 No. 3, May 2009. [more...]
 
"Analytical Performance Prediction for Evaluation and Tuning of GPGPU Applications.", Baghsorkhi, Sara Sadeghi; Delahaye, Matthieu; Gropp, William D.; Hwu, Wen-mei, In the Workshop on Exploiting Parallelism Using GPUs and Other Hardware-Assisted Methods (associated with CGO '09), March 2009. [more...]
 
"QP: a heterogeneous multi-accelerator cluster", Showerman, Michael T. ; Enos, Jeremy J. ; Pant, Avneesh; Kindratenko, Volodymyr V.; Steffen, Craig ; Pennington, Robert ; Hwu, Wen-mei, Proc. 10th LCI International Conference on High-Performance Clustered Computing, 2009. [more...]
 
"High performance computation and interactive display of molecular orbitals on GPUs and multi-core CPUs", Stone, John E.; Saam, Jan ; Hardy, David J; Hwu, Wen-mei; Schulten, Klaus, Proceeding GPGPU-2 Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units, 2009. [more...]
 
2008
 
"Program Optimization Strategies for Data-Parallel Many-Core Processors.", Ryoo, Shane, PhD Dissertation, Department of Electrical and Computer Engineering, University of Illinois, Urbana, IL, 2008. [more...]
 
"The Concurrency Challenge", Hwu, Wen-mei; Keutzer, Kurt ; Mattson, Timothy G., Design & Test of Computers, IEEE, 2008 . [more...]
 
"MCUDA: An Efficient Implementation of CUDA Kernels for Multi-Core CPUs", Stratton, John A.; Stone, Sam S.; Hwu, Wen-mei, 21st International Workshop on Languages and Compilers for Parallel Computing, LNCS 5335, pp. 16-30, 2008. (Paper of IMPACT - Cited Greater Than 150 Times) [more...]
 
"CUDA-lite: Reducing GPU Programming Complexity.", Ueng, Sain-Zee; Lathara, Melvin; Baghsorkhi, Sara Sadeghi; Hwu, Wen-mei, The 21st International Workshop on Languages and Compilers for Parallel Computing, LNCS 5335, pp. 1-15, 2008. (Paper of IMPACT - Cited Greater Than 100 Times) [more...]
 
"Analyses for Extensive Parallelization of Video Applications in C.", Ryoo, Shane; Hwu, Wen-mei, IMPACT Technical Report, IMPACT-08-02, University of Illinois at Urbana-Champaign, Urbana, IL, June 2008. [more...]
 
"CUBA: An Architecture for Efficient CPU/Co-processor Data Communication.", Gelado, Isaac; Kelm, John H.; Ryoo, Shane; Lumetta, Steve S; Navarro, Nacho; Hwu, Wen-mei, Proceedings of the 22nd ACM International Conference on Supercomputing, June 2008. [more...]
 
"Accelerating Advanced MRI Reconstructions on GPUs.", Stone, Sam S.; Haldar, Justin P.; Tsao, Stephanie; Hwu, Wen-mei; Liang, zhi-Pei; Sutton, Bradley P., Proceedings of the 2008 International Conference on Computing Frontiers, May 2008. (Paper of IMPACT - Cited Greater Than 150 Times) [more...]
 
"GPU Acceleration of Cutoff Pair Potential for Molecular Modeling Applications", Rodrigues, Christopher I.; Hardy, David J; Stone, John E.; Schulten, Klaus; Hwu, Wen-mei, Proceedings of the 2008 International Conference on Computing Frontiers, May 2008. [more...]
 
"Program Optimization Space Pruning for a Multithreaded GPU", Ryoo, Shane; Rodrigues, Christopher I.; Stone, Sam S.; Baghsorkhi, Sara Sadeghi; Ueng, Sain-Zee; Stratton, John A.; Hwu, Wen-mei, Proceedings of the 2008 International Symposium on Code Generation and Optimization, April 2008 . (Paper of IMPACT - Cited Greater Than 150 Times) [more...]
 
"MCUDA: An Efficient Implementation of CUDA Kernels on Multi-cores.", Stratton, John A.; Stone, Sam S.; Hwu, Wen-mei, IMPACT Technical Report, IMPACT-08-01, University of Illinois, Urbana, IL, 2008. [more...]
 
"Program Optimization Carving for GPU Computing.", Ryoo, Shane; Rodrigues, Christopher I.; Stratton, John A.; Stone, Sam S.; Ueng, Sain-Zee; Baghsorkhi, Sara Sadeghi; Hwu, Wen-mei, The Special Issue of the Journal of Parallel and Distributed Computing on General Purpose Parallel Processing Using GPUs, 2008. [more...]
 
"Optimization Principles and Application Performance Evaluation of a Multithreaded GPU Using CUDA", Ryoo, Shane; Rodrigues, Christopher I.; Baghsorkhi, Sara Sadeghi; Stone, Sam S.; Kirk, David; Hwu, Wen-mei, Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, February 2008. (Paper of IMPACT - Cited Greater Than 600 Times) [more...]
 
2007
 
"Iteration Disambiguation for Parallelism Identification in Time-Sliced Applications", Ryoo, Shane; Rodrigues, Christopher I.; Hwu, Wen-mei, The 20th International Workshop on Languages and Compilers for Parallel Computing, LNCS 5234, October 2007. [more...]
 
"How GPUs Can Improve the Quality of Magnetic Resonance Imaging.", Stone, Sam S.; Yi, Haoran; Haldar, Justin P.; Hwu, Wen-mei; Sutton, Bradley P. ; Liang, zhi-Pei, The First Workshop on General Purpose Processing on Graphics Processing Units, October 2007. [more...]
 
"Program Optimization Study on a 128-Core GPU.", Ryoo, Shane; Rodrigues, Christopher I.; Stone, Sam S.; Baghsorkhi, Sara Sadeghi; Ueng, Sain-Zee; Hwu, Wen-mei; Yu, Xuan, The First Workshop on General Purpose Processing on Graphics Processing Units, October 2007. [more...]
 
"CIGAR: Application Partitioning for a CPU/Coprocessor Architecture", Kelm, John H.; Gelado, Isaac; Murphy, Mark J.; Navarro, Nacho; Lumetta, Steve S; Hwu, Wen-mei, In The 2007 International Conference on Parallel Architectures and Compilation Techniques (PACT 2007), pp. 317-326, Sep 2007, ISBN 978-0-7695-2944-8. [more...]
 
"Performance Insights on Executing Non-Graphics Applications on CUDA on the NVIDIA GeForce 8800 GTX.", Hwu, Wen-mei; Kirk, David; Ryoo, Shane; Rodrigues, Christopher I.; Stratton, John A.; Huang, KuangWei, Presentation at Hot Chips 19, August 2007. [more...]
 
"Automatic Discovery of Coarse-Grained Parallelism in Media Applications", Ryoo, Shane; Ueng, Sain-Zee; Rodrigues, Christopher I.; Kidd, Robert E.; Frank, Matthew I; Hwu, Wen-mei, Transactions on HiPEAC I, LNCS 4050, pp. 194-213, 2007. [more...]
 
"Implicit Parallel Programming Models for Thousand-Core Microprocessors.", Hwu, Wen-mei; Ryoo, Shane; Ueng, Sain-Zee; Kelm, John H.; Gelado, Isaac; Stone, Sam S.; Kidd, Robert E.; Baghsorkhi, Sara Sadeghi; Mahesri, Aqeel A.; Tsao, Stephanie; Navarro, Nacho; Lumetta, Steve S; Frank, Matthew I; Patel, sanjay J., Proceedings of the 44th Annual Design Automation Conference, June 2007. [more...]
 
"Dynamic Tracking of Information-Flow Signatures for Security Checking", Healey, William; Pattabiraman, Karthik; Ryoo, Shane; Iyer, Ravishanker; Hwu, Wen-mei, Technical Report UILU-ENG-02-2002, University of Illinois at Urbana-Champaign, January 2007. [more...]
 
2006
 
"Improved Superblock Optimization in GCC.", Kidd, Robert E.; Hwu, Wen-mei, Proceedings of the GCC Developer's Summit, pp. 85-96, June 2006. [more...]
 
"P3DE: Profile-Directed Predicated Partial Dead Code Elimination.", Ryoo, Shane; Ueng, Sain-Zee; Hwu, Wen-mei, The 5th Workshop on EPIC Architectures and Compiler Technology, March 2006. [more...]
 
"Tolerating Cache-Miss Latency With Multipass Pipelines.", Barnes, Ronald D.; Ryoo, Shane; Hwu, Wen-mei, IEEE Micro, Vol. 26, No. 1, January-February 2006. [more...]
 
2005
 
"Fulcra Pointer Analysis Framework.", Nystrom, Erik M., PhD thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, 2005. [more...]
 
"A Systematic Approach to Delivering Instruction-Level Parallelism in EPIC Systems.", Sias, John W., PhD. Dissertation, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, 2005. [more...]
 
"Multiple-Pass Pipelining: Enhancing in-order Microarchitectures to Out-Of-Order Performance.", Barnes, Ronald D., PhD thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, 2005. [more...]
 
""Flea-flicker" Multipass Pipelining: An Alternative to the High-Power Out-of-Order Offense.", Barnes, Ronald D.; Ryoo, Shane; Hwu, Wen-mei, Proceedings of the 38th Annual IEEE/ACM International Symposium on Microarchitecture, November 2005. [more...]
 
"Trimaran: An infrastructure for research in instruction-level parallelism", Chakrapani, Lakshmi N. ; Gyllenhaal, John C.; Hwu, Wen-mei; Palem, Krishna V. ; Rabbah, Rodric M., .. [more...]
 
2004
 
"Matching On-Chip Data Storage To Telecommunication And Media Application Properties.", Hunter, Hillery C., PhD thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, 2004. [more...]
 
"Partial Code Elimination in the IMPACT Compiler Framework.", Ryoo, Shane, MS Thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana, IL, 2004. [more...]
 
"Template Bundling for EPIC Architectures.", Ueng, Sain-Zee, MS thesis, Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, 2004. [more...]
 
"Applying Scalable Interprocedural Pointer Analysis to Embedded Applications.", Hunter, Hillery C.; Nystrom, Erik M.; Hwu, Wen-mei, Workshop on Compilers and Tools for Constrained Embedded Systems, September 2004. [more...]
 
"Bottom-up and Top-down Context-Sensitive Summary-based Pointer Analysis.", Nystrom, Erik M.; Kim, Hong-Seok; Hwu, Wen-mei, Proceedings of the 11th Static Analysis Symposium, August 2004. [more...]
 
"Exploiting Load Flexibility for Embedded Power Savings.", Hunter, Hillery C.; Ryoo, Shane; Player, James; Connors, Daniel A.; Hwu, Wen-mei, IMPACT Technical Report, IMPACT-04-01, University of Illinois, at Urbana-Champaign, June 2004. [more...]
 
"Field-testing IMPACT EPIC Research Results in Itanium 2.", Sias, John W.; Ueng, Sain-Zee; Kent, Geoffrey A.; Steiner, Ian; Nystrom, Erik M.; Hwu, Wen-mei, Proceedings of the 31st Annual International Symposium on Computer Architecture, pp. 26-37, July 2004. [more...]
 
"Importance of Heap Specialization in Pointer Analysis.", Nystrom, Erik M.; Kim, Hong-Seok; Hwu, Wen-mei, Proceedings of Program Analysis for Software Tools and Engineering, June 2004. [more...]
 
"Extracting Data Flow Model from von Neumann Program for Synthesis", Li, Chien-wei; Kim, Hong-Seok; Hwu, Wen-mei, Proceedings of the 13th International Workshop on Logic and Synthesis, June 2004. [more...]
 
2003
 
"A Dynamic Application Analysis Framework.", Conte, Marie T., PhD thesis, Department of Electrical and Computer Engineering, University of Illinois, at Urbana-Champaign, 2003. [more...]
 
"Beating in-order stalls with "flea-flicker" two-pass pipelining", Barnes, Ronald D.; Nystrom, Erik M.; Sias, John W.; Patel, sanjay J.; Navarro, Nacho; Hwu, Wen-mei, Proceeding MICRO 36 Proceedings of the 36th annual IEEE/ACM International Symposium on Microarchitecture, 2003. [more...]
 
"A New Look at Exploiting Data Parallelism in Embedded Systems.", Hunter, Hillery C.; Moreno, Jaime H., Proceedings of the International Conference on Compilers, Architecture, and Synthesis for Embedded Systems, October 2003. [more...]
 
"Memory Profiling: Expanding the 3G Developer's Bag of Tricks.", Hunter, Hillery C.; Hwu, Wen-mei, Workshop on Compilers and Tools for Constrained Embedded Systems, October 2003. [more...]
 
"Motivating use of Memory Profiling in the 3G Domain.", Hunter, Hillery C.; Li, Chien-wei; Hwu, Wen-mei, Proceedings of the SRC TECHCON 2003, August 2003. [more...]
 
"An Innovative Low-Power High-Performance Programmable Signal Processor for Digital Communications.", Moreno, Jaime H.; Zyuban, Victor V.; Shvadron, Uzi; Neeser, Fredy D.; Derby, Jeffrey H; Ware, Malcolm S.; Kailas, Krishnan; Zaks, Ayal; Geva, A.; Ben-David, Shay; Asaad, Sameh W.; Fox, Thomas W.; Littrell, D.; Biberstein, Marina; Naishlos, Dorit; Hunter, Hillery C., IBM Journal of Research and Development, vol. 47, no 2/3, March/May 2003. [more...]
 
"Compaction algorithm for precise modular context-sensitive pointer analysis.", Kim, Hong-Seok; Nystrom, Erik M.; Barnes, Ronald D.; Hwu, Wen-mei, IMPACT Technical Report, IMPACT-03-03, University of Illinois, Urbana, IL, 2003. [more...]
 
"Scalable, precise context-sensitive top-down process for modular points-to analysis.", Nystrom, Erik M.; Kim, Hong-Seok; Hwu, Wen-mei, IMPACT Technical Report, IMPACT-03-03, University of Illinois, Urbana, IL, 2003. [more...]
 
2002
 
"The IMPACT SC140 Code Generator.", Shannon, Christopher J., MS thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, April 2002. [more...]
 
"Vacuum Packing: Extracting Hardware-Detected Program Phases for Post-link Optimization.", Barnes, Ronald D.; Nystrom, Erik M.; Merten, Matthew C.; Hwu, Wen-mei, Proceedings of the 35th International Symposium on Microarchitecture, November 2002. [more...]
 
"Code Coverage and Input Variability: Effects on Architecture and Compiler Research.", Hunter, Hillery C.; Hwu, Wen-mei, Proceedings of the International Conference on Compilers, Architecture, and Synthesis for Embedded Systems, October, 2002. [more...]
 
2001
 
"Enhancing Loop Buffering of Media and Telecommunications Applications Using Low-overhead Predication.", Sias, John W.; Hunter, Hillery C.; Hwu, Wen-mei, Proceedings of the 34th International Symposium on Microarchitecture, December, 2001. [more...]
 
"Program Decision Logic Optimization Using Predication and Control Speculation", Hwu, Wen-mei; August, David I.; Sias, John W., Proceedings of the IEEE, November, 2001. [more...]
 
"A study of the energy saving and capacity improvement potential of power control in multi-hop wireless networks", Monks, Jeffrey P.; Ebert, J.-P.; Wolisz, Adam ; Hwu, Wen-mei, Proceedings. LCN 2001. 26th Annual IEEE Conference on Local Computer Networks, 2001. [more...]
 
"Itanium Performance Insights.", Hwu, Wen-mei; Sias, John W.; Merten, Matthew C.; Nystrom, Erik M.; Barnes, Ronald D.; Shannon, Christopher J.; Ryoo, Shane; Olivier, Jeff V., Presentation at Microprocessor Forum, October 2001. [more...]
 
"Modulo Schedule Buffers.", Merten, Matthew C.; Hwu, Wen-mei, Proceedings of the 34th International Symposium on Microarchitecture, December, 2001. [more...]
 
"Code Reordering and Speculation Support for Dynamic Optimization Systems.", Nystrom, Erik M.; Barnes, Ronald D.; Merten, Matthew C.; Hwu, Wen-mei, Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, September 8-12, 2001. [more...]
 
"Itanium Performance Insights from the IMPACT Compiler.", Sias, John W.; Merten, Matthew C.; Nystrom, Erik M.; Barnes, Ronald D.; Shannon, Christopher J.; Matarazzo, Joseph D.; Ryoo, Shane; Olivier, Jeff V.; Hwu, Wen-mei, Presentation at Hot Chips 13, August 2001. [more...]
 
"Characterization of Repeating Data Access Patterns in Integer Benchmarks.", Nystrom, Erik M.; Hwu, Wen-mei, Memory Performance Issues Workshop at the 28th International Symposium on Computer Architecture, July 2001. [more...]
 
"An Architectural Framework for Run-Time Optimization.", Merten, Matthew C.; Trick, Andrew R.; Barnes, Ronald D.; Nystrom, Erik M.; George, Christopher N.; Gyllenhaal, John C.; Hwu, Wen-mei, IEEE Transactions on Computers, Vol. 50, No. 6, pp. 567-589, June 2001. [more...]
 
2000
 
"Accurate and Efficient Predicate Analysis with Binary Decision Diagrams.", Sias, John W.; August, David I.; Hwu, Wen-mei, Proceedings of the 33rd International Symposium on Microarchitecture, December, 2000. [more...]
 
"Hardware Support for Dynamic Activation of Compiler-Directed Computation Reuse.", Connors, Daniel A.; Hunter, Hillery C.; Cheng, Ben-Chung; Hwu, Wen-mei, Proceedings of the 9th International Conference on Architecture Support for Programming Languages and Operating Systems, November 2000. [more...]
 
"Modular Interprocedural Pointer Analysis Using Access Paths: Design, Implementation, and Evaluation.", Cheng, Ben-Chung; Hwu, Wen-mei, Proceedings of the 2000 ACM SIGPLAN Conference on Programming Language Design and Implementation, Vancouver, British Columbia, Canada, June, 2000. (Paper of IMPACT - Cited Greater Than 100 Times) [more...]
 
"A Hardware Mechanism for Dynamic Extraction and Relayout of Program Hot Spots.", Merten, Matthew C.; Trick, Andrew R.; Nystrom, Erik M.; Barnes, Ronald D.; Hwu, Wen-mei, Proceedings of the 27th International Symposium on Computer Architecture, pp. 59-70, June 2000. [more...]
 
"Compile-Time Memory Disambiguation for C Programs.", Cheng, Ben-Chung, PhD thesis, Department of Computer Science, University of Illinois, Urbana, IL, May 2000. [more...]
 
"Eliminating Dynamic Computation Redundancy", Connors, Daniel A., Ph.D. dissertation, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, May 2000. [more...]
 
"Systematic Compilation for Predicated Execution.", August, David I., Ph.D. dissertation, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, Feb. 2000. [more...]
 
1999
 
"Condition Awareness Support for Predicate Analysis and Optimization.", Sias, John W., MS thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, 1999. [more...]
 
"Run-Time Cache Bypassing.", Johnson, Teresa L.; Barnes, Ronald D.; Merten, Matthew C.; Hwu, Wen-mei, IEEE Transactions on Computers, Vol. 48, No. 12, pp. 1338-1354, December 1999. [more...]
 
"Compiler-Directed Dynamic Computation Reuse: Rationale and Initial Results.", Connors, Daniel A.; Hwu, Wen-mei, Proceedings of the 32nd International Symposium on Microarchitecture, November, 1999. [more...]
 
"Feedback-Directed Data Cache Optimizations for the x86.", Barnes, Ronald D., Proceedings of the 2nd ACM Workshop on Feedback-Directed Optimization, November 1999. [more...]
 
"A Framework for Profile-Driven Optimization in the IMPACT Binary Reoptimization System.", Merten, Matthew C., MS thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, June 1999. [more...]
 
"A Framework for Install-Time Optimization of Binary Dynamic-Link Libraries.", George, Christopher N., MS thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, May 1999. [more...]
 
"An Empirical Study of Function Pointers Using Spec Benchmarks.", Cheng, Ben-Chung; Hwu, Wen-mei, IMPACT Technical Report, IMPACT-99-02, University of Illinois, Urbana, IL 1999. [more...]
 
"A Hardware-Driven Profiling Scheme for Identifying Program Hot Spots to Support Runtime Optimization.", Merten, Matthew C.; Trick, Andrew R.; George, Christopher N.; Gyllenhaal, John C.; Hwu, Wen-mei, Proceedings of the 26th International Symposium on Computer Architecture, pp. 136-147, May, 1999 . (Paper of IMPACT - Cited Greater Than 100 Times) [more...]
 
"The Program Decision Logic Approach to Predicated Execution.", August, David I.; Sias, John W.; Puiatti, Jean-Michel; Mahlke, Scott A.; Connors, Daniel A.; Crozier, Kevin M.; Hwu, Wen-mei, Proceedings of the 26th International Symposium on Computer Architecture, May, 1999. [more...]
 
"A New Framework for Debugging Globally Optimized code.", Wu, Le-Chun; Miran, Rajiv; Patil, Harish; Olsen, Brnce; Hwu, Wen-mei, Proceedings of the 1999 ACM SIGPLAN Conference on Programming Language Design and Implementation, Atlanta, Georgia, May, 1999. [more...]
 
"A Practical Interprocedural Pointer Analysis Framework.", Cheng, Ben-Chung; Hwu, Wen-mei, IMPACT Technical Report, IMPACT-99-01, University of Illinois, Urbana, IL 1999. [more...]
 
"Optimizing Memory Accesses Using Advanced Compile-Time Memory Disambiguation Techniques.", Cheng, Ben-Chung; Connors, Daniel A.; Hwu, Wen-mei, IMPACT Technical Report, IMPACT-99-03, University of Illinois, Urbana, IL 1999. [more...]
 
1998
 
"A Software-Oriented Floating-Point Format for Enhancing Automotive Control Systems.", Connors, Daniel A.; Yamada, Yoji; Hwu, Wen-mei, Workshop on Compiler and Architecture Support for Embedded Computing Systems (CASES98), December, 1998. [more...]
 
"Compiler-Directed Early Load-Address Generation.", Cheng, Ben-Chung; Connors, Daniel A.; Hwu, Wen-mei, Proceedings of the 31st International Symposium on Microarchitecture, December, 1998. [more...]
 
"Effective Modulo Scheduling for Clustered Architectures.", Nystrom, Erik M.; Eichenberger, Alexandre E., Proceedings of the 31th International Symposium on Microarchitecture, Dec, 1998. (Paper of IMPACT - Cited Greater Than 100 Times) [more...]
 
"Improving Static Branch Prediction in a Compiler", Beitrich, Brian L.; Cheng, Ben-Chung; Hwu, Wen-mei, Proceedings of International Parallel Architecture and Compilation Techniques, October 12-18, 1998. [more...]
 
"New Data-Location Tracking Scheme for the Recovery of Expected Variable Values", Wu, Le-Chun; Hwu, Wen-mei, IMPACT Technical Report, IMPACT-98-07, University of Illinois, Urbana, IL 1998. [more...]
 
"Optimization and Executable Regeneration in the IMPACT Binary Reoptimization Framework.", Thiems, Michael S., MS thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, August 1998. [more...]
 
"Optimization of Machine Descriptions for Efficient Use.", Gyllenhaal, John C.; Hwu, Wen-mei; Rau, Ramakrishna.B, International Journal of Parallel Programming, vol. 26, No. 4, pp. 417-447, August 1998. [more...]
 
"Integrated Predicated and Speculative Execution in the IMPACT EPIC Architecture.", August, David I.; Connors, Daniel A.; Mahlke, Scott A.; Sias, John W.; Crozier, Kevin M.; Cheng, Ben-Chung; Eaton, Patrick R.; Olaniran, Qudus B.; Hwu, Wen-mei, Proceedings of the 25th International Symposium on Computer Architecture, July, 1998. (Paper of IMPACT - Cited Greater Than 150 Times) [more...]
 
"An Overview of the IMPACT X86 Binary Reoptimization Framework", Merten, Matthew C.; Thiems, Michael S., IMPACT Technical Report, IMPACT-98-05, University of Illinois, Urbana, IL 1998. [more...]
 
"A New Breakpoint Implementation Scheme for Debugging Globally Optimized Code", Wu, Le-Chun; Hwu, Wen-mei, IMPACT Technical Report, IMPACT-98-06, University of Illinois, Urbana, IL 1998. [more...]
 
"A Novel Breakpoint Implementation Scheme for Debugging Optimized Code", Wu, Le-Chun; Hwu, Wen-mei, IMPACT Technical Report, IMPACT-98-01, University of Illinois, Urbana, IL 1998. [more...]
 
1997
 
"A Study of the Cache and Branch Performance Issues with Running Java on Current Hardware Platforms.", Hsieh, Cheng-Hsueh Andrew; Conte, Marie T.; Johnson, Teresa L.; Gyllenhaal, John C.; Hwu, Wen-mei, Proceedings of COMPCON, pp. 211-216, February 1997. [more...]
 
"Run-time Spatial Locality Detection and Optimization.", Johnson, Teresa L.; Merten, Matthew C.; Hwu, Wen-mei, Proceedings of the 30th International Symposium on Microarchitecture, December 1-3, 1997. [more...]
 
"An Efficient Framework For Performing Execution-Constraint-Sensitive Transformations That Increase Instruction-Level Parallelism.", Gyllenhaal, John C., PhD thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, September 1997. [more...]
 
" Using NET to Capture Performance in Java-Based Software.", Hsieh, Cheng-Hsueh Andrew; Conte, Marie T.; Johnson, Teresa L.; Gyllenhaal, John C.; Hwu, Wen-mei, IEEE Computer, pp. 67-75, June 1997. [more...]
 
"Run-time adaptive cache hierarchy management via reference analysis", Johnson, Teresa L.; Hwu, Wen-mei, Proceedings of the 24th annual international symposium on Computer architecture, ISCA 1997. (Paper of IMPACT - Cited Greater than 150 Times) [more...]
 
"Region-Based Compilation: An Introduction and Motivation.", Hank, Richard E.; Hwu, Wen-mei; Rau, Ramakrishna.B, International Journal of Parallel Programming, vol. 25, no. 2, pp. 113-146, April 1997. (Paper of IMPACT - Cited Greater Than 100 Times) [more...]
 
"A Framework for Balancing Control Flow and Predication.", August, David I.; Hwu, Wen-mei; Mahlke, Scott A., Proceedings of the 30th International Symposium on Microarchitecture, December 1997. (Paper of IMPACT - Cited Greater Than 100 Times) [more...]
 
"A Study of the Cache and Branch Performance Issues with Running Java on Current Hardware Platforms.", Hsieh, Cheng-Hsueh Andrew; Conte, Marie T.; Johnson, Teresa L.; Gyllenhaal, John C.; Hwu, Wen-mei, Proceedings of COMPCON, pp. 211-216, February 1997. [more...]
 
"Architectural Support Compiler-Synthesized Dynamic Branch Prediction Strategies: Rationale and Initial Results.", August, David I.; Connors, Daniel A.; Gyllenhaal, John C.; Hwu, Wen-mei, Proceedings of the 3rd International Symposium on High-Performance Computer Architecture, Feb. 1997. [more...]
 
1996
 
"Java Bytecode to Native Code Translation: The Caffeine Prototype and Preliminary Results.", Hsieh, Cheng-Hsueh Andrew; Gyllenhaal, John C.; Hwu, Wen-mei, Proceedings of the 29th International Symposium on Microarchitecture, pp. 90-99, December 1996. (Paper of IMPACT - Cited Greater Than 100 Times) [more...]
 
"Modulo Scheduling of Loops in Control-Intensive Non-Numeric Programs.", Lavery, Daniel M.; Hwu, Wen-mei, Proceedings of the 29th Annual International Symposium on Microarchitecture, pp. 126-141, Dec. 1996. [more...]
 
"Optimization of Machine Descriptions for Efficient Use.", Gyllenhaal, John C.; Hwu, Wen-mei; Rau, Ramakrishna.B, Proceedings of the 29th International Symposium on Microarchitecture, pp. 349-358, December 1996. [more...]
 
"Speculative Hedge: Regulating Compile-Time Speculation Against Profile Variations.", Deitrich, Brian L.; Hwu, Wen-mei, Proceedings of the 29th International Symposium on Microarchitecture, pp.70-79, December 2-4, 1996. [more...]
 
"Region-Based Compilation.", Hank, Richard E., PhD thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, May 1996. [more...]
 
"Supporting Predicated Execution: Techniques and Tradeoffs.", McCormick, Jim E., MS thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, May 1996. [more...]
 
"HMDES Version 2.0 Specification.", Gyllenhaal, John C.; Hwu, Wen-mei, IMPACT Technical Report, IMPACT-96-03, University of Illinois, Urbana, IL, 1996. [more...]
 
1995
 
"Compiler Technology for Future Microprocessors.", Hwu, Wen-mei; Hank, Richard E.; Lavery, Daniel M.; Haab, Grant E.; Gyllenhaal, John C.; August, David I., Proceedings of the IEEE, Vol. 83, No. 12, pp. 1625-1640, December 1995. (Paper of IMPACT - Cited Greater Than 100 Times) [more...]
 
"Region-Based Compilation: An Introduction and Motivation.", Hank, Richard E.; Hwu, Wen-mei; Rau, Ramakrishna.B, Proceedings of the 28th Annual International Symposium on Microarchitecture, pp. 158-168, Dec. 1995. (Paper of IMPACT - Cited Greater Than 100 Times) [more...]
 
"Unrolling-Based Optimizations for Modulo Scheduling.", Lavery, Daniel M.; Hwu, Wen-mei, Proceedings of the 28th Annual International Symposium on Microarchitecture, pp. 327-337, Dec. 1995. [more...]
 
"A Comparison of Full and Partial Predicated Execution Support for ILP Processors.", Mahlke, Scott A.; Hank, Richard E.; McCormick, Jim E.; August, David I.; Hwu, Wen-mei, Proceedings of the 22nd International Symposium on Computer Architecture, pp. 138-150, June 19955 . (Paper of IMPACT - Cited Greater Than 150 Times) [more...]
 
"Code Scheduling and Optimization for a Superscalar X86 Microprocessor.", Dugal, Wayne F., MS thesis, Department of Computer Science, University of Illinois, Urbana IL, May, 1995. [more...]
 
"Enhancing Instruction Level Parallelism Through Complier-Controlled Speculation.", Bringmann, Roger A., PhD thesis, Department of Computer Science, University of Illinois, Urbana IL, May 1995. [more...]
 
"Performance and Cost Analysis of the Execution Stage of Superscalar Microprocessors.", Argyres, Dimitri C., MS thesis, Department of Computer Science, University of Illinois, Urbana IL, May 1995. [more...]
 
"Three Architectural Models for Compiler-Controlled Speculative Execution.", Chang, Pohua P.; Warter, Nancy J.; Mahlke, Scott A.; Chen, William Y.; Hwu, Wen-mei, IEEE Transactions on Computers, Vol. 44, No. 4, pp. 481-494, April 1995. [more...]
 
"The Importance of Prepass Code Scheduling for Superscalar and Superpipelined Processors.", Chang, Pohua P.; Lavery, Daniel M.; Mahlke, Scott A.; Chen, William Y.; Hwu, Wen-mei, IEEE Transactions on Computers, Vol. 44, No. 3, pp. 353-370, March 1995. [more...]
 
"Sentinel Scheduling with Recovery Blocks.", August, David I.; Deitrich, Brian L.; Mahlke, Scott A., Technical Report CRHC-95-05, 1995, Center for Reliable and High-Performance Computing, University of Illinois, Urbana, IL, Feb, 1995. [more...]
 
"Data Relocation and Prefetching for Programs with Large Data Sets.", Yamada, Yoji, PhD thesis, Department of Computer Science, University of Illinois, Urbana IL, 1995. [more...]
 
"Compiler-Assisted Multiple Instruction Retry.", Li, Chung-Chi Jim; Chen, Shyh-Kwei; Fuchs, W. Kent; Hwu, Wen-mei, IEEE Transactions on Computers, Vol.44, No.1, Jan. 1995. [more...]
 
1994
 
"Characterizing the Impact of Predicated Execution on Branch Prediction.", Mahlke, Scott A.; Hank, Richard E.; Bringmann, Roger A.; Gyllenhaal, John C.; Gallagher, David M.; Hwu, Wen-mei, Proceedings of the 27th International Symposium on Microarchitecture, pp. 217-227, December 1994. (Paper of IMPACT - Cited Greater Than 100 Times) [more...]
 
"Data Relocation and Prefetching for Large Data Sets.", Yamada, Yoji; Gyllenhaal, John C.; Haab, Grant E.; Hwu, Wen-mei, Proceedings of the 27th Annual ACM/IEEE International Symposium on Microarchitecture, pp. 118-127, December, 1994. [more...]
 
"Dynamic Memory Disambiguation Using the Memory Conflict Buffer.", Gallagher, David M.; Chen, William Y.; Mahlke, Scott A.; Gyllenhaal, John C.; Hwu, Wen-mei, Proceedings of the 6th International Conference on Architecture Support for Programming Languages and Operating Systems, San Jose, California, pp.183-195, October, 1994 . (Paper of IMPACT - Cited Greater Than 150 Times) [more...]
 
"A Machine Description Language for Compilation.", Gyllenhaal, John C., MS thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, Sept. 1994. [more...]
 
"Compiler Support for SPARC Architecture Processors.", Ouellette, Roland G., MS thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1994. [more...]
 
"Modulo Scheduling with Isomorphic Control Transformations.", Warter, Nancy J., PhD thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1994. [more...]
 
"The Susceptibility of Programs to Context Switching Effects.", Hwu, Wen-mei; Conte, Marie T., IEEE Transactions on Computers, Vol. 43, No. 9, Sept. 1994. [more...]
 
"Compiler-Assisted Multiple Instruction Rollback Recovery Using A Read Buffer.", Alewine, Neal J.; Chen, Shyh-Kwei; Fuchs, W. Kent; Hwu, Wen-mei, IEEE Transactions on Computers, 1994. [more...]
 
"Incremental Compiler Transformations for Multiple Instruction Retry.", Chen, Shyh-Kwei; Alewine, Neal J.; Fuchs, W. Kent; Hwu, Wen-mei, Software, Practice & Experience, John Wiley & Sons Ltd., Vol. 24(9), pp. 1-20, Sept. 1994. [more...]
 
"Performance Implications of Synchronization Support for Parallel FORTRAN Programs.", Anik, Sadun; Hwu, Wen-mei, Journal of Parallel and Distributed Computing, Vol. 22, pp. 202-215, 1994. [more...]
 
"Profile-Assisted Instruction Scheduling.", Chen, William Y.; Mahlke, Scott A.; Warter, Nancy J.; Anik, Sadun; Hwu, Wen-mei, International Journal for parallel Programming, Vol. 22, No. 2, pp. 151-181, April 1994. [more...]
 
1993
 
"Machine Independent Register Allocation for the IMPACT-I C Compiler.", Hank, Richard E., MS thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, 1993. [more...]
 
"Speculative Execution Exception Recovery using Write-back Suppression.", Bringmann, Roger A.; Mahlke, Scott A.; Hank, Richard E.; Gyllenhaal, John C.; Hwu, Wen-mei, Proceedings of the 26th Annual ACM/IEEE Int'l Symposium on Microarchitecture, Austin, Texas, pp. 214-223, Dec. 1993. [more...]
 
"Superblock Formation Using Static Program Analysis.", Hank, Richard E.; Mahlke, Scott A.; Bringmann, Roger A.; Gyllenhaal, John C.; Hwu, Wen-mei, Proceedings of the 26th Annual ACM/IEEE Int'l Symposium on Microarchitecture, Austin, Texas, pp. 247-256, Dec. 1993. [more...]
 
"Architectural and Software Support for Executing Numerical Applications on High Performance Computers.", Anik, Sadun, PhD thesis, Department of Computer Science, University of Illinois, Urbana IL, CRHC-93-19, Sept. 1993. [more...]
 
"Data Preload for Superscalar and VLIW Processors.", Chen, William Y., PhD thesis, Department of Computer Science, University of Illinois, Urbana, IL, Sept. 1993. [more...]
 
"Sentinel Scheduling: A Model for Compiler-Controlled Speculative Execution.", Mahlke, Scott A.; Chen, William Y.; Bringmann, Roger A.; Hank, Richard E.; Hwu, Wen-mei; Rau, Ramakrishna.B ; Schlansker, Michael S., ACM Transactions on Computer Systems, Vol. 11, No. 4, Nov. 1993. [more...]
 
"Reverse If-Conversion", Warter, Nancy J.; Mahlke, Scott A.; Hwu, Wen-mei; Rau, Ramakrishna.B, Proceeding PLDI '93 Proceedings of the ACM SIGPLAN 1993 conference on Programming language design and implementation, 1993. (Paper of IMPACT - Cited Greater than 100 Times) [more...]
 
"Register Connection: A New Approach to Adding Registers into Instruction Set Architectures.", Tokuzo, Kiyohara; Mahlke, Scott A.; Chen, William Y.; Bringmann, Roger A.; Hank, Richard E.; Anik, Sadun; Hwu, Wen-mei, Proceedings of the 20th Annual International Symposium on Computer Architecture, pp. 247-256, San Diego, CA, May 17-19, 1993. [more...]
 
"XPROF: An Execution Profiler for Window-oriented Applications.", Gupta, Aloke; Hwu, Wen-mei, Software, Practice & Experience, John Wiley & Sons Ltd., Vol. 23 (5), pp. 487-510, May 1993. [more...]
 
"The Superblock: An Effective Technique for VLIW and Superscalar Compilation", Hwu, Wen-mei; Mahlke, Scott A.; Chen, William Y.; Chang, Pohua P.; Warter, Nancy J.; Bringmann, Roger A.; Ouellette, Roland G.; Hank, Richard E.; Tokuzo, Kiyohara; Haab, Grant E.; Holm, John C.; Lavery, Daniel M., Journal of Supercomputing, 1993. (Paper of IMPACT - Cited Greater Than 650 Times) [more...]
 
"Performance Aspects of Computers with Graphical User Interfaces.", Gupta, Aloke, PhD thesis, Department of Computer Science, University of Illinois, Urbana IL, CRHC-93-09, April 1993. [more...]
 
"The Benefit of Predicated Execution for Software Pipelining.", Warter, Nancy J.; Lavery, Daniel M.; Hwu, Wen-mei, Proceedings of the 26th Annual Hawaii Int'l Conference on system Sciences, Wailea, pp 497-506, Hawaii, Jan. 5-8, 1993. [more...]
 
1992
 
"Efficient Instruction Sequencing with Inline Target Insertion.", Hwu, Wen-mei; Chang, Pohua P., IEEE Transactions on Computers, Vol. 41, No.12, pp. 1537-1551, Dec. 1992. [more...]
 
"Code Scheduling for VLIW/Superscalar Processors with Limited Register Files.", Tokuzo, Kiyohara; Gyllenhaal, John C., Proceedings of the 25th International Symposium on Microarchitecture, pp. 197-201, Dec. 1992. [more...]
 
"Effective Compiler Support for Predicated Execution Using the Hyperblock.", Mahlke, Scott A.; Lin, David C.; Chen, William Y.; Hank, Richard E.; Bringmann, Roger A., Proceedings of the 25th International Symposium on Microarchitecture, pp. 45-54, Dec. 1992. (Paper of IMPACT - Cited Greater Than 600 Times) [more...]
 
"Enhanced Modulo Scheduling for Loops with Conditional Branches.", Warter, Nancy J.; Haab, Grant E.; Subramanian, Krishna; Bockhaus, John W., Proceedings of 25th Annual ACM/IEEE Int'l Symposium on Microarchitecture, pp. 170-179, Dec. 1992. (Paper of IMPACT - Cited Greater Than 100 Times) [more...]
 
"Compiler Code Transformations for Superscalar-Based High-Performance Systems.", Mahlke, Scott A.; Chen, William Y.; Gyllenhaal, John C.; Hwu, Wen-mei; Chang, Pohua P.; Tokuzo, Kiyohara, Proceedings of Supercomputing 1992, Minneapolis, Minnesota, pp. 808-817, Nov. 16-20, 1992. [more...]
 
"Sentinel Scheduling for VLIW and Superscalar Processors.", Mahlke, Scott A.; Chen, William Y.; Hwu, Wen-mei; Rau, Ramakrishna.B ; Schlansker, Michael S., Proceedings of the Fifth Int'l Conference on Architecture Support for Programming Languages and Operating Systems (ASPLOS-V), Boston, MA, pp.238-247, Oct. 12-15, 1992. (Paper of IMPACT - Cited Greater Than 100 Times) [more...]
 
"A Template for Code Generator Development Using the IMPACT-I C Compile.", Bringmann, Roger A., MS thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1992. [more...]
 
"Systematic Computer Architecture Prototyping.", Conte, Thomas M., PhD thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1992. [more...]
 
"Design and Implementation of a Portable Global Code Optimizer.", Mahlke, Scott A., MS thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1992. [more...]
 
"Evaluation of Some Superscalar and VLIW Processor Designs.", Holm, John C., MS thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1992. [more...]
 
"Executing Nested Parallel Loops On Shared-Memory Multiprocessors.", Anik, Sadun; Hwu, Wen-mei, Proceedings of the 21st Annual Int'l Conference on Parallel Processing, pp.(III) 241-244, St. Charles, IL, Aug. 1992. [more...]
 
"Tolerating First Level Memory Access Latency In High-Performance Systems.", Chen, William Y.; Mahlke, Scott A.; Hwu, Wen-mei, Proceedings of the 21st Annual Int'l Conference on Parallel Processing, pp.(I) 36-43, St Charles, IL, Aug. 1992. [more...]
 
"Tolerating Data Access Latency with Register Preloading.", Chen, William Y.; Mahlke, Scott A.; Hwu, Wen-mei; Tokuzo, Kiyohara; Chang, Pohua P., Proceedings of the 1992 Int'l Conf. on Supercomputing, pp. 104-113, Washington D.C., July, 1992. [more...]
 
"Branch Recovery with Compiler-Assisted Multiple Instruction Retry.", Alewine, Neal J.; Chen, Shyh-Kwei; Li, Chung-Chi Jim; Fuchs, W. Kent; Hwu, Wen-mei, Proceedings of the 22nd Annual International Symposium on Fault-Tolerant Computing, pp. 66-73, Boston, MA, July 8-10, 1992. [more...]
 
"Profile-Guided Automatic Inline Expansion for C Programs.", Chang, Pohua P.; Mahlke, Scott A.; Chen, William Y.; Hwu, Wen-mei, Software Practice and Experience, May 1992, Vol. 22, No. 5, pp. 349-369. (Paper of IMPACT - Cited Greater Than 150 Times) [more...]
 
"An Execution Profiler for Window-Oriented Applications", Gupta, Aloke; Hwu, Wen-mei, Coordinated Science Lab, University of Illinois, Urbana, IL, Technical Report CRHC-92-02, 1992. [more...]
 
"Scalar Program Performance on Multiple-Instruction-Issue Processors with a Limited Number of Registers.", Mahlke, Scott A.; Chen, William Y.; Chang, Pohua P.; Hwu, Wen-mei, Proceedings of the 25th Annual Hawaii Int'l Conference on System Sciences, pp. 34-44, Jan. 6-9, 1992. [more...]
 
1991
 
"Three Superblock Scheduling Models for Superscalar and Superpipelined Processors.", Chang, Pohua P.; Warter, Nancy J.; Mahlke, Scott A.; Chen, William Y.; Hwu, Wen-mei, Technical Report CRHC-91-29, Center for Reliable and High-Performance Computing, University of Illinois, Urbana, IL, Dec. 1991. [more...]
 
"Using Profile Information to Assist Classic Compiler Code Optimizations.", Chang, Pohua P.; Mahlke, Scott A.; Hwu, Wen-mei, Software Practice and Experience, Vol. 21, No. 12, pp. 1301-1321, Dec. 1991. (Paper of IMPACT - Cited Greater Than 250 Times) [more...]
 
"Comparing Static And Dynamic Code Scheduling for Multiple-Instruction-Issue Processors.", Chang, Pohua P.; Chen, William Y.; Mahlke, Scott A.; Hwu, Wen-mei, Proceedings of the 24th Annual ACM/IEEE Int'l Symposium on Microarchitecture, pp. 69-73, Albuquerque, New Mexico, Nov. 18-20,1991. [more...]
 
"Data Access Microarchitectures for Superscalar Processor with Compiler-Assisted Data Prefetching.", Chen, William Y.; Mahlke, Scott A.; Chang, Pohua P.; Hwu, Wen-mei, Proceedings of the 24th Annual ACM/IEEE Int'l Symposium on Microarchitecture, pp. 69-73, Albuquerque, New Mexico, Nov. 1991. (Paper of IMPACT - Cited Greater Than 100 Times) [more...]
 
"An Optimizing Compiler Code Generator: A platform for RISC Performance Analysis.", Chen, William Y., MS thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1991. [more...]
 
"The Effect of Compiler Optimizations On Available Parallelism In Scalar Programs.", Mahlke, Scott A.; Warter, Nancy J.; Chen, William Y.; Chang, Pohua P.; Hwu, Wen-mei, Proceedings of the 20th Annual Int'l Conference on Parallel Processing, St. pp. 142-145, Charles, IL, Aug. 12-16, 1991. [more...]
 
"Performance Implications of Synchronization Support for Parallel Fortran Programs.", Anik, Sadun; Hwu, Wen-mei, Technical Report CRHC-91-21, Center for Reliable and High-Performance Computing, University of Illinois, Urbana, IL, Jun. 1991. [more...]
 
"IMPACT: An Architectural Framework for Multiple-Instruction-Issue Processors.", Chang, Pohua P.; Mahlke, Scott A.; Chen, William Y.; Warter, Nancy J.; Hwu, Wen-mei, Proceedings of the 18th Annual Int'l Symposium on Computer Architecture, Toronto, Canada, pp. 266-275, May 28, 1991. (Paper of IMPACT - Cited Greater Than 400 Times) [more...]
 
"The Effect of Code Expanding Optimizations of Instruction Cache Design.", Chen, William Y.; Chang, Pohua P.; Conte, Thomas M.; Hwu, Wen-mei, Technical Report CRHC-91-17, Center for Reliable and High-Performance, university of Illinois, Urbana, IL, May 1991. [more...]
 
"Benchmark Characterization.", Conte, Thomas M.; Hwu, Wen-mei, Proceedings of the 24th Annual Hawaii International Conference on System Sciences, pp. 364-372, Jan. 8-11, 1991 . [more...]
 
1990
 
"A Multiported Nonblocking Cache for a Superscalar Uniprocessor.", Sicolo, Jim E., MS thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1990. [more...]
 
"Compiler Support for Predicated Execution in Superscalar Processors.", Lin, David C., MS thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1990. [more...]
 
"Benchmark characterization for experimental system evaluation", Conte, Thomas M.; Hwu, Wen-mei, Proceedings of the Twenty-Third Annual Hawaii International Conference on System Sciences , 1990. [more...]
 
1989
 
"Inline function expansion for compiling C programs", Chang, Pohua P.; Hwu, Wen-mei, Proceeding PLDI '89 Proceedings of the ACM SIGPLAN 1989 Conference on Programming language design and implementation, 1989. (Paper of IMPACT - Cited Greater than 100 Times) [more...]
 
"Control Flow Optimization for Supercomputer Scalar Processing.", Chang, Pohua P.; Hwu, Wen-mei, Proceedings of the 1989 Int'l Conf. on Supercomputing, Crete, Greece, Jun. 5-9, 1989. [more...]
 
"Comparing Software and Hardware Schemes For Reducing the Cost of Branches.", Hwu, Wen-mei; Conte, Thomas M.; Chang, Pohua P., Proceedings of the 16th Annual International Symposium on Computer Architecture, Jerusalem, Israel, pp. 224-233, May 28- June 1, 1989 . [more...]
 
1988
 
"Exploiting parallel microprocessor microarchitectures with a compiler code generator", Hwu, Wen-mei; Chang, Pohua P., Proceeding ISCA '88 Proceedings of the 15th Annual International Symposium on Computer architecture, 1988. [more...]
 
"Trace selection for compiling large C application programs to microcode", Chang, Pohua P.; Hwu, Wen-mei, Proceeding MICRO 21 Proceedings of the 21st annual workshop on Microprogramming and microarchitecture, 1988. [more...]
 
1987
 
"Checkpoint Repair for High-Performance Out-of-Order Execution Machines", Hwu, Wen-mei; Patt, Yale N., Computers, IEEE Transactions on, 1987. (Paper of IMPACT - Cited Greater than 100 Times) [more...]
 
"Checkpoint repair for out-of-order execution machines", Hwu, Wen-mei; Patt, Yale N., Proceeding ISCA '87 Proceedings of the 14th annual international symposium on Computer architecture, 1987. (Paper of IMPACT - Cited Greater than 100 Times) [more...]
 
1986
 
"HPSm, a high performance restricted data flow architecture having minimal functionality", Hwu, Wen-mei; Patt, Yale N., Proceeding ISCA '86 Proceedings of the 13th annual international symposium on Computer architecture, June, 1986. (Paper of IMPACT - Cited Greater than 100 Times) [more...]
 
1985
 
"Critical issues regarding HPS, a high performance microarchitecture", Patt, Yale N.; Melvin, Stephen W.; Hwu, Wen-mei; Shebanow, Michael, Proceeding MICRO 18 Proceedings of the 18th annual workshop on Microprogramming, 1985. [more...]
 
"HPS, a new microarchitecture: rationale and introduction", Patt, Yale N.; Hwu, Wen-mei; Shebanow, Michael, Proceedings of the 18th annual workshop on Microprogramming, MICRO 18, 1985. (Paper of IMPACT - Cited Greater than 150 Times) [more...]