Arindam Mallik

Francky Catthoor

CoRR, 2024

Accelerating Large Language Model Training with In-Package Optical Links for Scale-Out Systems.

[BibT_eX]

[DOI]

Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2024

Performance Modeling and Workload Analysis of Distributed Large Language Model Training and Inference.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Workload Characterization, 2024

2023

DIANA: An End-to-End Hybrid DIgital and ANAlog Neural Network SoC for the Edge.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, 2023

AIMC Modeling and Parameter Tuning for Layer-Wise Optimal Operating Point in DNN Inference.

[BibT_eX]

[DOI]

Iman Dadras

Giuseppe Maria Sarda

Nathan Laubeuf

IEEE Access, 2023

Evaluating the Effects of FeFET Device Variability on Charge Sharing Based AiMC Accelerator.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2023

2022

Dynamic Quantization Range Control for Analog-in-Memory Neural Networks Acceleration.

[BibT_eX]

[DOI]

ACM Trans. Design Autom. Electr. Syst., 2022

AERO: Design Space Exploration Framework for Resource-Constrained CNN Mapping on Tile-Based Accelerators.

[BibT_eX]

[DOI]

Simei Yang

Juan Sebastian Piedrahita Giraldo

IEEE J. Emerg. Sel. Topics Circuits Syst., 2022

DIANA: An End-to-End Energy-Efficient Digital and ANAlog Hybrid Neural Network SoC.

[BibT_eX]

[DOI]

Peter Vrancx

Jonas Doevenspeck

Proceedings of the IEEE International Solid-State Circuits Conference, 2022

Write-Verify Scheme for IGZO DRAM in Analog in-Memory Computing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2022

Tiny ci-SAR A/D Converter for Deep Neural Networks in Analog in-Memory Computation.

[BibT_eX]

[DOI]

Michele Caselli

Peter Debacker

Diederik Verkest

Proceedings of the IEEE International Symposium on Circuits and Systems, 2022

2021

Analog In-memory Computing in FeFET-based 1T1R Array for Edge AI Applications.

[BibT_eX]

[DOI]

Proceedings of the 2021 Symposium on VLSI Circuits, Kyoto, Japan, June 13-19, 2021, 2021

Charge Sharing and Charge Injection A/D Converters for Analog In-Memory Computing.

[BibT_eX]

[DOI]

Proceedings of the 19th IEEE International New Circuits and Systems Conference, 2021

Design-Technology Space Exploration for Energy Efficient AiMC-Based Inference Acceleration.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2021

Noise tolerant ternary weight deep neural networks for analog in-memory inference.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2021

A 22 nm, 1540 TOP/s/W, 12.1 TOP/s/mm<sup>2</sup> in-Memory Analog Matrix-Vector-Multiplier for DNN Acceleration.

[BibT_eX]

[DOI]

Proceedings of the IEEE Custom Integrated Circuits Conference, 2021

2019

FQ-Conv: Fully Quantized Convolution for Efficient and Accurate Inference.

[BibT_eX]

[DOI]

CoRR, 2019

2018

Sequential 3D: Key integration challenges and opportunities for advanced semiconductor scaling.

[BibT_eX]

[DOI]

Proceedings of the 2018 International Conference on IC Design & Technology, 2018

2015

Lateral NWFET optimization for beyond 7nm nodes.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference on IC Design & Technology, 2015

2014

Design Technology co-optimization for N10.

[BibT_eX]

[DOI]

Proceedings of the IEEE 2014 Custom Integrated Circuits Conference, 2014

2013

TEASE: a systematic analysis framework for early evaluation of FinFET-based advanced technology nodes.

[BibT_eX]

[DOI]

Pablo Royer Del Bario

Proceedings of the 50th Annual Design Automation Conference 2013, 2013

2011

Automatic Extraction of Pipeline Parallelism for Embedded Software Using Linear Programming.

[BibT_eX]

[DOI]

Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011

2010

Mapping Embedded Applications on MPSoCs: The MNEMEE Approach.

[BibT_eX]

[DOI]

Dimitrios Kritharidis

Proceedings of the VLSI 2010 Annual Symposium - Selected papers, 2010

Mapping Embedded Applications on MPSoCs: The MNEMEE Approach.

[BibT_eX]

[DOI]

Dimitrios Kritharidis

Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2010

A framework for automatic parallelization, static and dynamic memory optimization in MPSoC platforms.

[BibT_eX]

[DOI]

Proceedings of the 47th Design Automation Conference, 2010

Automatic parallelization of embedded software using hierarchical task graphs and integer linear programming.

[BibT_eX]

[DOI]

Daniel Cordes

Peter Marwedel

Proceedings of the 8th International Conference on Hardware/Software Codesign and System Synthesis, 2010

MNEMEE: a framework for memory management and optimization of static and dynamic data in MPSoCs.

[BibT_eX]

[DOI]

Proceedings of the 2010 International Conference on Compilers, 2010

2009

User- and process-driven dynamic voltage and frequency scaling.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2009

2008

Learning and Leveraging the Relationship between Architecture-Level Measurements and Individual User Satisfaction.

[BibT_eX]

[DOI]

Proceedings of the 35th International Symposium on Computer Architecture (ISCA 2008), 2008

PICSEL: measuring user-perceived performance to control dynamic frequency scaling.

[BibT_eX]

[DOI]

Proceedings of the 13th International Conference on Architectural Support for Programming Languages and Operating Systems, 2008

2007

Low-Power Optimization by Smart Bit-Width Allocation in a SystemC-Based ASIC Design Environment.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2007

Power reduction through measurement and modeling of users and CPUs: summary.

[BibT_eX]

[DOI]

Proceedings of the 2007 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 2007

Variable latency caches for nanoscale processor.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Conference on High Performance Networking and Computing, 2007

The user in experimental computer systems research.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Experimental Computer Science, 2007

Automated task distribution in multicore network processors using statistical analysis.

[BibT_eX]

[DOI]

Yu Zhang

Proceedings of the 2007 ACM/IEEE Symposium on Architecture for Networking and Communications Systems, 2007

2006

User-Driven Frequency Scaling.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2006

Smart bit-width allocation for low power optimization in a systemc based ASIC design environment.

[BibT_eX]

[DOI]

Proceedings of the Conference on Design, Automation and Test in Europe, 2006

2005

Low Power Correlating Caches for Network Processors.

[BibT_eX]

[DOI]

J. Low Power Electron., 2005

Application-Level Error Measurements for Network Processors.

[BibT_eX]

[DOI]

Matthew C. Wildrick

IEICE Trans. Inf. Syst., 2005

Load elimination for low-power embedded processors.

[BibT_eX]

[DOI]

Mahmut T. Kandemir

Proceedings of the 15th ACM Great Lakes Symposium on VLSI 2005, 2005

Engineering Over-Clocking: Reliability-Performance Trade-Offs for High-Performance Register Files.

[BibT_eX]

[DOI]

Proceedings of the 2005 International Conference on Dependable Systems and Networks (DSN 2005), 28 June, 2005

2004

A Case for Clumsy Packet Processors.

[BibT_eX]

[DOI]

Proceedings of the 37th Annual International Symposium on Microarchitecture (MICRO-37 2004), 2004

Design and implementation of correlating caches.

[BibT_eX]

[DOI]

Matthew C. Wildrick