Steven Derrien

Proceedings of the 33rd ACM SIGPLAN International Conference on Compiler Construction, 2024

2023

Increasing FPGA Accelerators Memory Bandwidth With a Burst-Friendly Memory Layout.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., May, 2023

An Irredundant Decomposition of Data Flow with Affine Dependences.

Corentin Ferry

CoRR, 2023

Rapid Prototyping of Complex Micro-architectures Through High-Level Synthesis.

Sara Sadat Hoseininasab

Caroline Collange

Proceedings of the Applied Reconfigurable Computing. Architectures, Tools, and Applications, 2023

Automatic Algorithm-Based Fault Tolerance (AABFT) of Stencil Computations.

Louis Narmour

Proceedings of the 32nd International Conference on Parallel Architectures and Compilation Techniques, 2023

2022

Special Issue on Applied Reconfigurable Computing.

Frank Hannig

J. Signal Process. Syst., 2022

SpecHLS: Speculative Accelerator Design Using High-Level Synthesis.

Jean-Michel Gorius

IEEE Micro, 2022

Maximal Atomic irRedundant Sets: a Usage-based Dataflow Partitioning Algorithm.

Corentin Ferry

CoRR, 2022

Design Exploration of RISC-V Soft-Cores through Speculative High-Level Synthesis.

Jean-Michel Gorius

Proceedings of the International Conference on Field-Programmable Technology, 2022

2020

Safe Overclocking for CNN Accelerators Through Algorithm-Level Error Detection.

Thibaut Marty

Tomofumi Yuki

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

Toward Speculative Loop Pipelining for High-Level Synthesis.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

Application-Specific Arithmetic in High-Level Synthesis Tools.

ACM Trans. Archit. Code Optim., 2020

2019

Worst-Case Execution-Time-Aware Parallelization of Model-Based Avionics Applications.

J. Aerosp. Inf. Syst., November, 2019

Hybrid-DBT: Hardware/Software Dynamic Binary Translation Targeting VLIW.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2019

Reconciling Compiler Optimizations and WCET Estimation Using Iterative Compilation.

Proceedings of the IEEE Real-Time Systems Symposium, 2019

Hiding Communication Delays in Contention-Free Execution for SPM-Based Multi-Core Architectures.

Proceedings of the 31st Euromicro Conference on Real-Time Systems, 2019

Aggressive Memory Speculation in HW/SW Co-Designed Machines.

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2019

2018

Fine-Grain Iterative Compilation for WCET Estimation.

Proceedings of the 18th International Workshop on Worst-Case Execution Time Analysis, 2018

Enabling Overclocking Through Algorithm-Level Error Detection.

Thibaut Marty

Tomofumi Yuki

Proceedings of the International Conference on Field-Programmable Technology, 2018

Supporting runtime reconfigurable VLIWs cores through dynamic binary translation.

Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

Using polyhedral techniques to tighten WCET estimates of optimized code: A case study with array contraction.

Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

2017

Foreword to the Special Section on Reconfigurable Computing.

J. Signal Process. Syst., 2017

Tightening Contention Delays While Scheduling Parallel Applications on Multi-core Architectures.

Benjamin Rouxel

Isabelle Puaut

ACM Trans. Embed. Comput. Syst., 2017

Bridging high-level synthesis and application-specific arithmetic: The case study of floating-point summations.

Yohann Uguen

Florent de Dinechin

Proceedings of the 27th International Conference on Field Programmable Logic and Applications, 2017

One size does not fit all: Implementation trade-offs for iterative stencil computations on FPGAs.

Proceedings of the 27th International Conference on Field Programmable Logic and Applications, 2017

A High-Level Synthesis Approach Optimizing Accumulations in Floating-Point Programs Using Custom Formats and Operators.

Yohann Uguen

Florent de Dinechin

Proceedings of the 25th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2017

Hardware-accelerated dynamic binary translation.

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2017

Superword level parallelism aware word length optimization.

Ali Hassan El Moussawi

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2017

WCET-aware parallelization of model-based applications for multi-cores: The ARGO approach.

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2017

2016

Communication-Based Power Modelling for Heterogeneous Multiprocessor Architectures.

Proceedings of the 10th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2016

System level synthesis for virtual memory enabled hardware threads.

Nicolas Estibals

Gaël Deest

Ali Hassan El Moussawi

Proceedings of the 2016 Design, Automation & Test in Europe Conference & Exhibition, 2016

Demo: SLP-aware word length optimization.

Ali Hassan El Moussawi

Proceedings of the 2016 Conference on Design and Architectures for Signal and Image Processing (DASIP), 2016

2015

Combining execution pipelines to improve parallel implementation of HMMER on FPGA.

Microprocess. Microsystems, 2015

2014

Component reuse methodology for multi-clock Data-Flow parallel embedded Systems.

Anne-Marie Chana

ARIMA J., 2014

Toward scalable source level accuracy analysis for floating-point to fixed-point conversion.

Proceedings of the IEEE/ACM International Conference on Computer-Aided Design, 2014

Low Power Reconfigurable Controllers for Wireless Sensor Network Nodes.

Proceedings of the 22nd IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2014

2013

Polyhedral Bubble Insertion: A Method to Improve Nested Loop Pipelining for High-Level Synthesis.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2013

Compiling Scilab to high performance embedded multicore systems.

Grigoris Dimitroulakos

Kostas Masselos

Dimitrios Kritharidis

Nikolaos Mitas

Thomas Perschke

Microprocess. Microsystems, 2013

GeCoS: A framework for prototyping custom hardware design flows.

Proceedings of the 13th IEEE International Working Conference on Source Code Analysis and Manipulation, 2013

Derivation of efficient FSM from loop nests.

Tomofumi Yuki

Proceedings of the 2013 International Conference on Field-Programmable Technology, 2013

Using Model Types to Support Contract-Aware Model Substitutability.

Proceedings of the Modelling Foundations and Applications - 9th European Conference, 2013

Component-Level Datapath Merging in System-Level Design of Wireless Sensor Node Controllers for FPGA-Based Implementations.

Proceedings of the 2013 Euromicro Conference on Digital System Design, 2013

Runtime dependency analysis for loop pipelining in high-level synthesis.

Mythri Alle

Proceedings of the 50th Annual Design Automation Conference 2013, 2013

2012

System-Level Synthesis for Wireless Sensor Node Controllers: A Complete Design Flow.

ACM Trans. Design Autom. Electr. Syst., 2012

Bridging the chasm between MDE and the world of compilation.

Softw. Syst. Model., 2012

Efficient hardware implementation of data-flow parallel embedded systems.

Anne-Marie Chana

Proceedings of the 2012 International Conference on Embedded Computer Systems: Architectures, 2012

From Scilab to multicore embedded systems: Algorithms and methodologies.

Proceedings of the 2012 International Conference on Embedded Computer Systems: Architectures, 2012

A flexible approach for compiling scilab to reconfigurable multi-core embedded systems.

Proceedings of the 7th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC), 2012

On Model Subtyping.

Proceedings of the Modelling Foundations and Applications - 8th European Conference, 2012

From Scilab to High Performance Embedded Multicore Systems: The ALMA Approach.

Dimitrios Kritharidis

Nikolaos Mitas

Diana Göhringer

Proceedings of the 15th Euromicro Conference on Digital System Design, 2012

A semiempirical model for wakeup time estimation in power-gated logic clusters.

Vivek D. Tovinakere

Proceedings of the 49th Annual Design Automation Conference 2012, 2012

A Compilation- and Simulation-Oriented Architecture Description Language for Multicore Systems.

Proceedings of the 15th IEEE International Conference on Computational Science and Engineering, 2012

2011

A Polynomial Based Approach to Wakeup Time and Energy Estimation in Power-Gated Logic Clusters.

Vivek D. Tovinakere

J. Low Power Electron., 2011

Wakeup Time and Wakeup Energy Estimation in Power-Gated Logic Clusters.

Vivek D. Tovinakere

Proceedings of the VLSI Design 2011: 24th International Conference on VLSI Design, 2011

Model-Driven Engineering and Optimizing Compilers: A Bridge Too Far?

Proceedings of the Model Driven Engineering Languages and Systems, 2011

ompVerify: Polyhedral Analysis for the OpenMP Programmer.

Proceedings of the OpenMP in the Petascale Era - 7th International Workshop on OpenMP, 2011

Efficient nested loop pipelining in high level synthesis using polyhedral bubble insertion.

Proceedings of the 2011 International Conference on Field-Programmable Technology, 2011

HLS Tools for FPGA: Faster Development with Better Performance.

Alexandre Cornu

Dominique Lavenier

Proceedings of the Reconfigurable Computing: Architectures, Tools and Applications, 2011

Contributions à la conception d'architectures matérielles dédiées.

, 2011

2010

Hardware Acceleration of HMMER on FPGAs.

J. Signal Process. Syst., 2010

Accelerating HMMER on FPGA using parallel prefixes and reductions.

Proceedings of the International Conference on Field-Programmable Technology, 2010

System Level Synthesis for Ultra Low-Power Wireless Sensor Nodes.

Proceedings of the 13th Euromicro Conference on Digital System Design, 2010

A complete design-flow for the generation of ultra low-power WSN node architectures based on micro-tasking.

Proceedings of the 47th Design Automation Conference, 2010

2009

Ultra Low-power FSM for Control Oriented Applications.

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009

2008

Deriving efficient control in Process Networks with Compaan/Laura.

Int. J. Embed. Syst., 2008

2007

Parallelizing HMMER for Hardware Acceleration on FPGAs.

Proceedings of the IEEE International Conference on Application-Specific Systems, 2007

Combining Flash Memory and FPGAs to Efficiently Implement a Massively Parallel Algorithm for Content-Based Image Retrieval.

Proceedings of the Reconfigurable Computing: Architectures, 2007

2006

Acceleration of a content-based image-retrieval application on the RDISK cluster.

Auguste Noumsi

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

2005

Cluster of re-configurable nodes for scanning large genomic banks.

Parallel Comput., 2005

Hardware/Software Interface for Multi-Dimensional Processor Arrays.

Alain Darte

Tanguy Risset

Proceedings of the 16th IEEE International Conference on Application-Specific Systems, 2005

2003

A Reconfigurable Parallel Disk System for Filtering Genomic Banks.

Proceedings of the International Conference on Engineering of Reconfigurable Systems and Algorithms, June 23, 2003

2002

Energy/Power Estimation of Regular Processor Arrays.

Proceedings of the 15th International Symposium on System Synthesis (ISSS 2002), 2002

2001

Combined instruction and loop parallelism in array synthesis for FPGAs.

Susmita Sur-Kolay

Proceedings of the 14th International Symposium on Systems Synthesis, 2001

Loop Tiling for Reconfigurable Accelerators.

Proceedings of the Field-Programmable Logic and Applications, 2001

Combining Instruction and Loop Level Parallelism for FPGAs.

Susmita Sur-Kolay

Proceedings of the 9th Annual IEEE Symposium on Field-Programmable Custom Computing Machines, 2001

2000

Interfacing compiled FPGA programs: the MMAlpha approach.

Tanguy Risset

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2000

Optimal Partitioning for FPGA Based Regular Array Implementations.

Susmita Sur-Kolay

Proceedings of the 2000 International Conference on Parallel Computing in Electrical Engineering (PARELEC 2000), 2000

Approximating a Single Viewpoint in Panoramic Imaging Devices.

Kurt Konolige

Proceedings of the 2000 IEEE International Conference on Robotics and Automation, 2000

FCCMs and the Memory Wall.
