Recent & Highlighted Items

Columbia University - Invited Talk: Moving Towards Exascale with Lessons Learned from GPU Computing (October 14, 2014)

The rise of GPU computing has significantly boosted the pace of progress in numeric methods, algorithm design, and programming techniques for developing scalable applications. Much has been learned about algorithms, languages, compilers, and hardware architecture in this movement. I will discuss some insights gained and a vision for moving applications into exascale computing.

University of British Columbia: Invited Distinguished Lecture (October 6, 2014)

PCI Distinguished Lecture: Runtime Aware Architectures (September 22, 2014)

You are invited to attend the next talk in the PCI Distinguished Lectures. On Monday, September 22nd, Mateo Valero of the Barcelona Supercomputing Center will speak on Runtime Aware Architectures at 10am in CSL B02.

The traditional ways of increasing hardware performance predicted by Moore’s Law
have vanished. When uni-cores were the norm, hardware design was decoupled
from the software stack, thanks to a well-defined Instruction Set Architecture.
This simple interface allowed developers to design applications without much
concern for the hardware, while hardware designers were able to exploit parallelism
in superscalar processors. With the emergence of multi-cores and parallel
applications, this approach no longer worked. As a result, the role of decoupling
applications from the hardware was moved to the runtime system. Efficiently using
the underlying hardware from this runtime without exposing its complexities
to the application has been the target of research in recent years.

It is our position that the runtime has to drive the design of future multi-cores
to overcome the restrictions in terms of power, memory, programmability and
resilience that multi-cores have. In this talk, we introduce an approach towards a
Runtime-Aware Architecture, a massively parallel architecture designed from the
runtime’s perspective.

Mateo Valero is a full professor in the Computer Architecture Department at UPC, in Barcelona. His research interests focus on high-performance architectures. He has published approximately 600 papers, served in the organization of more than 300 international conferences, and given more than 400 invited talks. He is the director of the Barcelona Supercomputing Center, the National Center of Supercomputing in Spain.

Attachment #1: Flyer (PDF)

Christopher Rodrigues Doctoral Dissertation: Supporting High-Level, High-Performance Parallel Programming with Library-Driven Optimization

This dissertation presents Triolet, a programming language and compiler for high-level programming of parallel loops for high-performance execution on clusters of multicore computers. Triolet’s design demonstrates that it is possible to decouple the design of a compiler from the implementation of parallelism without sacrificing performance or ease of use. This programming approach opens the potential for future research into parallel programming frameworks.

DoD ACS Productivity Workshop (July 16, 2014)

Wen-mei Hwu gave a presentation at the DoD ACS Productivity Workshop. He first discussed the current status of performance-portable programming using OpenCL and MxPA. He then gave a vision for drastically improving productivity in performance-portable applications using new research breakthroughs from UIUC: Triolet and Tangram.

Attachment #1: Triolet C++/Python – Productive Programming in Heterogeneous Parallel Systems (PDF)

Wen-mei Hwu Awarded the Collins Award for Innovative Teaching

Hwu earned the Collins Award for Innovative Teaching, an award that recognizes outstanding development or use of new and innovative teaching methods. Hwu’s major avenue of innovative teaching is his development of a parallel programming course, both in the ECE ILLINOIS classroom and across the world online. His massive open online course on parallel programming was first offered through Coursera in 2012, and a revised version of the class was offered earlier this year.

The course content has been so popular that it has also been translated into Chinese, Japanese, Russian, and other languages. Considering his pioneering efforts in teaching an increasingly relevant topic, and his commitment to using online platforms to reach a wide pool of students, it is no wonder that he was awarded the Collins Award for Innovative Teaching.

Cornell Departmental Lecture - Scalability, Portability, and Productivity in GPU Computing (April 7, 2014)

The IMPACT group at the University of Illinois has been working on the co-design of scalable algorithms and programming tools for massively threaded heterogeneous computing. A major challenge that we are addressing is to simultaneously achieve scalability, performance, numerical stability, portability, and productivity in GPU computing. In this talk, I will give a brief overview of the NSF Blue Waters petascale heterogeneous parallel computing system at the University of Illinois. I will show experimental results of our achievements to date in applications, libraries, and MxPA. I will then discuss our current work on the Tangram and Triolet projects, which aim to drastically reduce the development and maintenance cost of heterogeneous parallel computing applications.

Attachment #1: Cornell Departmental Lecture Slides (PDF)

Upcoming Items

Recent & Highlighted Papers

"Supporting High-Level, High-Performance Parallel Programming with Library-Driven Optimization", Rodrigues, Christopher I., University of Illinois Doctoral Dissertation, May 2014.
"Scalable Parallel Tridiagonal Algorithms with Diagonal Pivoting and Their Optimization for Many-core Architectures", Chang, Li-Wen, University of Illinois Master's Thesis, July 2014?.
"Adaptive Cache Bypass and Insertion for Many-core Accelerators", Chen, Xuhao; Wu, Shengzhao; Chang, Li-Wen; Huang, Wei-Sheng; Pearson, Carl; Hwu, Wen-mei, Proceedings of International Workshop on Manycore Embedded Systems, 2014.