Juan L. Aragón

Orcid: 0000-0002-4955-7235

According to our database1, Juan L. Aragón authored at least 61 papers between 2001 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
WaSP: Warp Scheduling to Mimic Prefetching in Graphics Workloads.
CoRR, 2024

LIBRA: Memory Bandwidth- and Locality-Aware Parallel Tile Rendering.
Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024

2023
Graphfire: Synergizing Fetch, Insertion, and Replacement Policies for Graph Analytics.
IEEE Trans. Computers, 2023

Architectural Support for Optimizing Huge Page Selection Within the OS.
Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, 2023

Boustrophedonic Frames: Quasi-Optimal L2 Caching for Textures in GPUs.
Proceedings of the 32nd International Conference on Parallel Architectures and Compilation Techniques, 2023

2022
The Implications of Page Size Management on Graph Analytics.
Dataset, September, 2022

Omega-Test: A Predictive Early-Z Culling to Improve the Graphics Pipeline Energy-Efficiency.
IEEE Trans. Vis. Comput. Graph., 2022

Energy-efficient design of a presbyopia correction wearable powered by mobile GPUs and FPGAs.
J. Supercomput., 2022

Dynamic sampling rate: harnessing frame coherence in graphics applications for energy-efficient GPUs.
J. Supercomput., 2022

Triangle Dropping: An Occluded-geometry Predictor for Energy-efficient Mobile GPUs.
ACM Trans. Archit. Code Optim., 2022

DTM-NUCA: Dynamic Texture Mapping-NUCA for Energy-Efficient Graphics Rendering.
Proceedings of the 30th Euromicro International Conference on Parallel, 2022

DTexL: Decoupled Raster Pipeline for Texture Locality.
Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture, 2022

MEGsim: A Novel Methodology for Efficient Simulation of Graphics Workloads in GPUs.
Proceedings of the International IEEE Symposium on Performance Analysis of Systems and Software, 2022

Tiny but mighty: designing and realizing scalable latency tolerance for manycore SoCs.
Proceedings of the ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022

The Implications of Page Size Management on Graph Analytics.
Proceedings of the IEEE International Symposium on Workload Characterization, 2022

TCOR: A Tile Cache with Optimal Replacement.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022

2021
Fast and Accurate SER Estimation for Large Combinational Blocks in Early Stages of the Design.
IEEE Trans. Sustain. Comput., 2021

GraphAttack: Optimizing Data Supply for Graph Applications on In-Order Multicore Architectures.
ACM Trans. Archit. Code Optim., 2021

2020
The MosaicSim Simulator (Full Technical Report).
CoRR, 2020

MosaicSim: A Lightweight, Modular Simulator for Heterogeneous Systems.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2020

A Simulator and Compiler Framework for Agile Hardware-Software Co-design Evaluation and Exploration.
Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2020

2019
Efficient Data Supply for Parallel Heterogeneous Architectures.
ACM Trans. Archit. Code Optim., 2019

GPU-based processing of Hartmann-Shack images for accurate and high-speed ocular wavefront sensing.
Future Gener. Comput. Syst., 2019

Rendering Elimination: Early Discard of Redundant Tiles in the Graphics Pipeline.
Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019

Early Visibility Resolution for Removing Ineffectual Computations in the Graphics Pipeline.
Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019

2018
Design of an accurate and high-speed binocular pupil tracking system based on GPGPUs.
J. Supercomput., 2018

2017
Decoupling Data Supply from Computation for Latency-Tolerant Communication in Heterogeneous Architectures.
ACM Trans. Archit. Code Optim., 2017

2016
MASkIt: Soft error rate estimation for combinational circuits.
Proceedings of the 34th IEEE International Conference on Computer Design, 2016

2015
Soft-error mitigation by means of decoupled transactional memory threads.
Distributed Comput., 2015

GPU-Accelerated High-Speed Eye Pupil Tracking System.
Proceedings of the 27th International Symposium on Computer Architecture and High Performance Computing, 2015

DeSC: decoupled supply-compute communication management for heterogeneous architectures.
Proceedings of the 48th International Symposium on Microarchitecture, 2015

2014
Managing power constraints in a single-core scenario through power tokens.
J. Supercomput., 2014

2013
Modeling the impact of permanent faults in caches.
ACM Trans. Archit. Code Optim., 2013

Efficient inter-core power and thermal balancing for multicore processors.
Computing, 2013

2012
A fault-tolerant architecture for parallel applications in tiled-CMPs.
J. Supercomput., 2012

2011
Leakage-efficient design of value predictors through state and non-state preserving techniques.
J. Supercomput., 2011

Power Token Balancing: Adapting CMPs to Power Constraints for Parallel Multithreaded Workloads.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

An analytical model for the calculation of the Expected Miss Ratio in faulty caches.
Proceedings of the 17th IEEE International On-Line Testing Symposium (IOLTS 2011), 2011

Token3D: Reducing Temperature in 3D Die-Stacked CMPs through Cycle-Level Power Control Mechanisms.
Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011

2010
Heterogeneous Interconnects for Energy-Efficient Message Management in CMPs.
IEEE Trans. Computers, 2010

Exploiting address compression and heterogeneous interconnects for efficient message management in tiled CMPs.
J. Syst. Archit., 2010

Energy-Efficient Hardware Prefetching for CMPs Using Heterogeneous Interconnects.
Proceedings of the 18th Euromicro Conference on Parallel, 2010

A log-based redundant architecture for reliable parallel computation.
Proceedings of the 2010 International Conference on High Performance Computing, 2010

MLP-Aware Instruction Queue Resizing: The Key to Power-Efficient Performance.
Proceedings of the Architecture of Computing Systems, 2010

2009
Extending SRT for parallel applications in tiled-CMP architectures.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Efficient microarchitecture policies for accurately adapting to power constraints.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

REPAS: Reliable Execution for Parallel ApplicationS in Tiled-CMPs.
Proceedings of the Euro-Par 2009 Parallel Processing, 2009

2008
An energy consumption characterization of on-chip interconnection networks for tiled CMP architectures.
J. Supercomput., 2008

Optimizing CAM-based instruction cache designs for low-power embedded systems.
J. Syst. Archit., 2008

Address Compression and Heterogeneous Interconnects for Energy-Efficient High-Performance in Tiled CMPs.
Proceedings of the 2008 International Conference on Parallel Processing, 2008

2007
Leakage Energy Reduction in Value Predictors through Static Decay.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Efficient Message Management in Tiled CMP Architectures Using a Heterogeneous Interconnection Network.
Proceedings of the High Performance Computing, 2007

Adaptive VP decay: making value predictors leakage-efficient designs for high performance processors.
Proceedings of the 4th Conference on Computing Frontiers, 2007

Sim-PowerCMP: A Detailed Simulator for Energy Consumption Analysis in Future Embedded CMP Architectures.
Proceedings of the 21st International Conference on Advanced Information Networking and Applications (AINA 2007), 2007

2006
Control Speculation for Energy-Efficient Next-Generation Superscalar Processors.
IEEE Trans. Computers, 2006

2005
Energy-Effective Instruction Fetch Unit for Wide Issue Processors.
Proceedings of the Advances in Computer Systems Architecture, 10th Asia-Pacific Conference, 2005

2004
Energy-Efficient Design for Highly Associative Instruction Caches in Next-Generation Embedded Processors.
Proceedings of the 2004 Design, 2004

2003
Power-Aware Control Speculation through Selective Throttling.
Proceedings of the Ninth International Symposium on High-Performance Computer Architecture (HPCA'03), 2003

2002
Dual path instruction processing.
Proceedings of the 16th international conference on Supercomputing, 2002

2001
Selective Branch Prediction Reversal By Correlating with Data Values and Control Flow.
Proceedings of the 19th International Conference on Computer Design (ICCD 2001), 2001

Confidence Estimation for Branch Prediction Reversal.
Proceedings of the High Performance Computing - HiPC 2001, 8th International Conference, 2001


  Loading...