2013

Topic 4: High-Performance Architectures and Compilers - (Introduction).

[DOI]

Denis Barthou

Wolfgang Karl

Ramon Doallo

Evelyn Duesterwald

Sami Yehia

Proceedings of the Euro-Par 2013 Parallel Processing, 2013

On the convergence of mainstream and mission-critical markets.

[DOI]

Proceedings of the 50th Annual Design Automation Conference 2013, 2013

2012

On the evaluation of the impact of shared resources in multithreaded COTS processors in time-critical environments.

[DOI]

ACM Trans. Archit. Code Optim., 2012

2011

The MORPHEUS Heterogeneous Dynamically Reconfigurable Platform.

[DOI]

Wolfram Putzke-Röming

Int. J. Parallel Program., 2011

Towards improved survivability in safety-critical systems.

[DOI]

Proceedings of the 17th IEEE International On-Line Testing Symposium (IOLTS 2011), 2011

Exploiting structural redundancy of SIMD accelerators for their built-in self-testing/diagnosis and reconfiguration.

[DOI]

Proceedings of the 22nd IEEE International Conference on Application-specific Systems, 2011

2010

CMA: Chip multi-accelerator.

[DOI]

Proceedings of the IEEE 8th Symposium on Application Specific Processors, 2010

A hypercube-based NoC routing algorithm for efficient all-to-all communications in embedded image and signal processing applications.

[DOI]

Proceedings of the 2010 International Conference on High Performance Computing & Simulation, 2010

A memory interface for multi-purpose multi-stream accelerators.

[DOI]

Proceedings of the 2010 International Conference on Compilers, 2010

2009

Reconciling specialization and flexibility through compound circuits.

[DOI]

Proceedings of the 15th International Conference on High-Performance Computer Architecture (HPCA-15 2009), 2009

2007

Liquid SIMD: Abstracting SIMD Hardware using Lightweight Dynamic Mapping.

[DOI]

Proceedings of the 13st International Conference on High-Performance Computer Architecture (HPCA-13 2007), 2007

2006

Load squared: Adding logic close to memory to reduce the latency of indirect loads in embedded and general systems.

[DOI]

Sami Yehia

Jean-Francois Collard

Olivier Temam

J. Embed. Comput., 2006

Scalable subgraph mapping for acyclic computation accelerators.

[DOI]

Proceedings of the 2006 International Conference on Compilers, 2006

2005

System-wide performance monitors and their application to the optimization of coherent memory accesses.

[DOI]

Jean-Francois Collard

Norman P. Jouppi

Sami Yehia

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2005

Exploring the design space of LUT-based transparent accelerators.

[DOI]

Proceedings of the 2005 International Conference on Compilers, 2005

2004

Load squared: adding logic close to memory to reduce the latency of indirect loads with high miss ratios.

[DOI]

Sami Yehia

Jean-Francois Collard

Olivier Temam

Proceedings of the 2004 workshop on MEmory performance, 2004

From Sequences of Dependent Instructions to Functions: An Approach for Improving Performance without ILP or Speculation.

[DOI]

Sami Yehia

Olivier Temam

Proceedings of the 31st International Symposium on Computer Architecture (ISCA 2004), 2004