Juan L. Aragón

Juan Luis Aragón

CoRR, 2024

LIBRA: Memory Bandwidth- and Locality-Aware Parallel Tile Rendering.

[BibT_eX]

[DOI]

Aurora Tomás

Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024

2023

Graphfire: Synergizing Fetch, Insertion, and Replacement Policies for Graph Analytics.

[BibT_eX]

[DOI]

Aninda Manocha

IEEE Trans. Computers, 2023

Architectural Support for Optimizing Huge Page Selection Within the OS.

[BibT_eX]

[DOI]

Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, 2023

Boustrophedonic Frames: Quasi-Optimal L2 Caching for Textures in GPUs.

[BibT_eX]

[DOI]

Proceedings of the 32nd International Conference on Parallel Architectures and Compilation Techniques, 2023

2022

The Implications of Page Size Management on Graph Analytics.

[BibT_eX]

[DOI]

Dataset, September, 2022

Omega-Test: A Predictive Early-Z Culling to Improve the Graphics Pipeline Energy-Efficiency.

[BibT_eX]

[DOI]

IEEE Trans. Vis. Comput. Graph., 2022

Energy-efficient design of a presbyopia correction wearable powered by mobile GPUs and FPGAs.

[BibT_eX]

[DOI]

Juan Mompean

Pablo Artal

J. Supercomput., 2022

Dynamic sampling rate: harnessing frame coherence in graphics applications for energy-efficient GPUs.

[BibT_eX]

[DOI]

J. Supercomput., 2022

Triangle Dropping: An Occluded-geometry Predictor for Energy-efficient Mobile GPUs.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2022

DTM-NUCA: Dynamic Texture Mapping-NUCA for Energy-Efficient Graphics Rendering.

[BibT_eX]

[DOI]

Proceedings of the 30th Euromicro International Conference on Parallel, 2022

DTexL: Decoupled Raster Pipeline for Texture Locality.

[BibT_eX]

[DOI]

Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture, 2022

MEGsim: A Novel Methodology for Efficient Simulation of Graphics Workloads in GPUs.

[BibT_eX]

[DOI]

Jorge Ortiz

Proceedings of the International IEEE Symposium on Performance Analysis of Systems and Software, 2022

Tiny but mighty: designing and realizing scalable latency tolerance for manycore SoCs.

[BibT_eX]

[DOI]

Proceedings of the ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022

The Implications of Page Size Management on Graph Analytics.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Workload Characterization, 2022

TCOR: A Tile Cache with Optimal Replacement.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022

2021

Fast and Accurate SER Estimation for Large Combinational Blocks in Early Stages of the Design.

[BibT_eX]

[DOI]

IEEE Trans. Sustain. Comput., 2021

GraphAttack: Optimizing Data Supply for Graph Applications on In-Order Multicore Architectures.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2021

2020

The MosaicSim Simulator (Full Technical Report).

[BibT_eX]

[DOI]

CoRR, 2020

MosaicSim: A Lightweight, Modular Simulator for Heterogeneous Systems.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2020

A Simulator and Compiler Framework for Agile Hardware-Software Co-design Evaluation and Exploration.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2020

2019

Efficient Data Supply for Parallel Heterogeneous Architectures.

[BibT_eX]

[DOI]

Tae Jun Ham

ACM Trans. Archit. Code Optim., 2019

GPU-based processing of Hartmann-Shack images for accurate and high-speed ocular wavefront sensing.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2019

Rendering Elimination: Early Discard of Redundant Tiles in the Graphics Pipeline.

[BibT_eX]

[DOI]

Pedro Marcuello

Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019

Early Visibility Resolution for Removing Ineffectual Computations in the Graphics Pipeline.

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019

2018

Design of an accurate and high-speed binocular pupil tracking system based on GPGPUs.

[BibT_eX]

[DOI]

J. Supercomput., 2018

2017

Decoupling Data Supply from Computation for Latency-Tolerant Communication in Heterogeneous Architectures.

[BibT_eX]

[DOI]

Tae Jun Ham

ACM Trans. Archit. Code Optim., 2017

2016

MASkIt: Soft error rate estimation for combinational circuits.

[BibT_eX]

[DOI]

Proceedings of the 34th IEEE International Conference on Computer Design, 2016

2015

Soft-error mitigation by means of decoupled transactional memory threads.

[BibT_eX]

[DOI]

Distributed Comput., 2015

GPU-Accelerated High-Speed Eye Pupil Tracking System.

[BibT_eX]

[DOI]

Proceedings of the 27th International Symposium on Computer Architecture and High Performance Computing, 2015

DeSC: decoupled supply-compute communication management for heterogeneous architectures.

[BibT_eX]

[DOI]

Tae Jun Ham

Proceedings of the 48th International Symposium on Microarchitecture, 2015

2014

Managing power constraints in a single-core scenario through power tokens.

[BibT_eX]

[DOI]

J. Supercomput., 2014

2013

Modeling the impact of permanent faults in caches.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2013

Efficient inter-core power and thermal balancing for multicore processors.

[BibT_eX]

[DOI]

Computing, 2013

2012

A fault-tolerant architecture for parallel applications in tiled-CMPs.

[BibT_eX]

[DOI]

J. Supercomput., 2012

2011

Leakage-efficient design of value predictors through state and non-state preserving techniques.

[BibT_eX]

[DOI]

J. Supercomput., 2011

Power Token Balancing: Adapting CMPs to Power Constraints for Parallel Multithreaded Workloads.

[BibT_eX]

[DOI]

Juan M. Cebrian

Stefanos Kaxiras

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

An analytical model for the calculation of the Expected Miss Ratio in faulty caches.

[BibT_eX]

[DOI]

Proceedings of the 17th IEEE International On-Line Testing Symposium (IOLTS 2011), 2011

Token3D: Reducing Temperature in 3D Die-Stacked CMPs through Cycle-Level Power Control Mechanisms.

[BibT_eX]

[DOI]

Juan M. Cebrian

Stefanos Kaxiras

Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011

2010

Heterogeneous Interconnects for Energy-Efficient Message Management in CMPs.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2010

Exploiting address compression and heterogeneous interconnects for efficient message management in tiled CMPs.

[BibT_eX]

[DOI]

J. Syst. Archit., 2010

Energy-Efficient Hardware Prefetching for CMPs Using Heterogeneous Interconnects.

[BibT_eX]

[DOI]

Proceedings of the 18th Euromicro Conference on Parallel, 2010

A log-based redundant architecture for reliable parallel computation.

[BibT_eX]

[DOI]

Juan Manuel Cebrian Gonzalez

Proceedings of the 2010 International Conference on High Performance Computing, 2010

MLP-Aware Instruction Queue Resizing: The Key to Power-Efficient Performance.

[BibT_eX]

[DOI]

Pavlos Petoumenos

Georgia Psychou

Stefanos Kaxiras

Proceedings of the Architecture of Computing Systems, 2010

2009

Extending SRT for parallel applications in tiled-CMP architectures.

[BibT_eX]

[DOI]

Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Efficient microarchitecture policies for accurately adapting to power constraints.

[BibT_eX]

[DOI]

Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

REPAS: Reliable Execution for Parallel ApplicationS in Tiled-CMPs.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2009 Parallel Processing, 2009

2008

An energy consumption characterization of on-chip interconnection networks for tiled CMP architectures.

[BibT_eX]

[DOI]

J. Supercomput., 2008

Optimizing CAM-based instruction cache designs for low-power embedded systems.

[BibT_eX]

[DOI]

Alexander V. Veidenbaum

J. Syst. Archit., 2008

Address Compression and Heterogeneous Interconnects for Energy-Efficient High-Performance in Tiled CMPs.

[BibT_eX]

[DOI]

Proceedings of the 2008 International Conference on Parallel Processing, 2008

2007

Leakage Energy Reduction in Value Predictors through Static Decay.

[BibT_eX]

[DOI]

Juan M. Cebrian

Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Efficient Message Management in Tiled CMP Architectures Using a Heterogeneous Interconnection Network.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing, 2007

Adaptive VP decay: making value predictors leakage-efficient designs for high performance processors.

[BibT_eX]

[DOI]

Proceedings of the 4th Conference on Computing Frontiers, 2007

Sim-PowerCMP: A Detailed Simulator for Energy Consumption Analysis in Future Embedded CMP Architectures.

[BibT_eX]

[DOI]

Proceedings of the 21st International Conference on Advanced Information Networking and Applications (AINA 2007), 2007

2006

Control Speculation for Energy-Efficient Next-Generation Superscalar Processors.

[BibT_eX]

[DOI]

José M. González

IEEE Trans. Computers, 2006

2005

Energy-Effective Instruction Fetch Unit for Wide Issue Processors.

[BibT_eX]

[DOI]

Alexander V. Veidenbaum

Proceedings of the Advances in Computer Systems Architecture, 10th Asia-Pacific Conference, 2005

2004

Energy-Efficient Design for Highly Associative Instruction Caches in Next-Generation Embedded Processors.

[BibT_eX]

[DOI]

Dan Nicolaescu

Alexander V. Veidenbaum

Ana-Maria Badulescu

Proceedings of the 2004 Design, 2004

2003

Power-Aware Control Speculation through Selective Throttling.

[BibT_eX]

[DOI]

José González