Nicolas Vasilache

Oleksandr Zinenko

Theodoros Theodoridis

ACM Trans. Archit. Code Optim., 2020

MLIR: A Compiler Infrastructure for the End of Moore's Law.

[BibT_eX]

[DOI]

CoRR, 2020

2019

Openly revisiting derivative-free optimization.

[BibT_eX]

[DOI]

Proceedings of the Genetic and Evolutionary Computation Conference Companion, 2019

2018

Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions.

[BibT_eX]

[DOI]

Oleksandr Zinenko

Theodoros Theodoridis

CoRR, 2018

2017

Training Language Models Using Target-Propagation.

[BibT_eX]

[DOI]

CoRR, 2017

Diagonal Rescaling For Neural Networks.

[BibT_eX]

[DOI]

Jean Lafond

Léon Bottou

CoRR, 2017

2016

Learning Visual Features from Large Weakly Supervised Data.

[BibT_eX]

[DOI]

Armand Joulin

Laurens van der Maaten

Allan Jabri

Proceedings of the Computer Vision - ECCV 2016, 2016

2015

Fast Convolutional Nets With fbfft: A GPU Performance Evaluation.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Learning Representations, 2015

2014

A Tale of Three Runtimes.

[BibT_eX]

[DOI]

CoRR, 2014

Tiling and optimizing time-iterated computations on periodic domains.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Parallel Architectures and Compilation, 2014

2013

Re-Introduction of communication-avoiding FMM-accelerated FFTs with GPU acceleration.

[BibT_eX]

[DOI]

Harper Langston

Proceedings of the IEEE High Performance Extreme Computing Conference, 2013

Runnemede: An architecture for Ubiquitous High-Performance Computing.

[BibT_eX]

[DOI]

Proceedings of the 19th IEEE International Symposium on High Performance Computer Architecture, 2013

Memory reuse optimizations in the R-Stream compiler.

[BibT_eX]

[DOI]

Proceedings of the 6th Workshop on General Purpose Processor Using Graphics Processing Units, 2013

2012

Automatic communication optimizations through memory reuse strategies.

[BibT_eX]

[DOI]

Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2012

Efficient and scalable computations with sparse tensors.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on High Performance Extreme Computing, 2012

2011

R-Stream Compiler.

[BibT_eX]

[DOI]

David Wohlford

Allen Leung

Proceedings of the Encyclopedia of Parallel Computing, 2011

Loop transformations: convexity, pruning and optimization.

[BibT_eX]

[DOI]

Proceedings of the 38th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, 2011

2010

A mapping path for multi-GPGPU accelerated computers from a portable high level programming abstraction.

[BibT_eX]

[DOI]

Allen Leung

David Wohlford

Cédric Bastoul

Proceedings of 3rd Workshop on General Purpose Processing on Graphics Processing Units, 2010

2007

Iterative Optimization in the Polyhedral Model: Part I, One-Dimensional Time.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Symposium on Code Generation and Optimization (CGO 2007), 2007

Automatic Correction of Loop Transformations.

[BibT_eX]

[DOI]

Albert Cohen

Louis-Noël Pouchet

Proceedings of the 16th International Conference on Parallel Architectures and Compilation Techniques (PACT 2007), 2007

2006

Semi-Automatic Composition of Loop Transformations for Deep Parallelism and Memory Hierarchies.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 2006

Violated dependence analysis.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual International Conference on Supercomputing, 2006

Polyhedral Code Generation in the Real World.

[BibT_eX]

[DOI]