Eduard Ayguadé
Orcid: 0000-0002-5146-103XAffiliations:
- Polytechnic University of Catalonia, Barcelona, Spain
According to our database1,
Eduard Ayguadé
authored at least 410 papers
between 1989 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on zbmath.org
-
on orcid.org
On csauthors.net:
Bibliography
2024
IEEE Trans. Computers, May, 2024
CoRR, 2024
Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024
Reinforcement Learning-based Adaptive Mitigation of Uncorrected DRAM Errors in the Field.
Proceedings of the 33rd International Symposium on High-Performance Parallel and Distributed Computing, 2024
2023
J. Supercomput., September, 2023
Future Gener. Comput. Syst., 2023
Proceedings of the 33rd International Conference on Field-Programmable Logic and Applications, 2023
Proceedings of the 31st IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2023
2022
J. Comput. Sci., 2022
The MAMe dataset: on the relevance of high resolution and variable shape image properties.
Appl. Intell., 2022
Proceedings of the 2022 IEEE 34th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), 2022
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022
Proceedings of the 51st International Conference on Parallel Processing, 2022
Proceedings of the Euro-Par 2022: Parallel Processing, 2022
Proceedings of the IEEE/ACM International Workshop on Education for High Performance Computing, 2022
Proceedings of the IEEE International Conference on Cluster Computing, 2022
2021
Implementation of a high-accuracy phase unwrapping algorithm using parallel-hybrid programming approach for displacement sensing using self-mixing interferometry.
J. Supercomput., 2021
IEEE Trans. Computers, 2021
Size & Shape Matters: The Need of HPC Benchmarks of High Resolution Image Training for Deep Learning.
Supercomput. Front. Innov., 2021
Combining Dynamic Concurrency Throttling with Voltage and Frequency Scaling on Task-based Programming Models.
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021
Proceedings of the Applied Reconfigurable Computing. Architectures, Tools, and Applications, 2021
2020
Parallel Comput., 2020
Future Gener. Comput. Syst., 2020
Proceedings of the High Performance Computing, 2020
Proceedings of the International Conference for High Performance Computing, 2020
Proceedings of the PPoPP '20: 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2020
Proceedings of the Euro-Par 2020: Parallel Processing, 2020
Proceedings of the Euro-Par 2020: Parallel Processing, 2020
Proceedings of the IEEE International Conference on Cluster Computing, 2020
2019
IEEE Trans. Parallel Distributed Syst., 2019
The Abstract Streaming Machine: Compile-Time Performance Modelling of Stream Programs on Heterogeneous Multiprocessors.
Trans. High Perform. Embed. Archit. Compil., 2019
Semantic Web, 2019
Proc. ACM Meas. Anal. Comput. Syst., 2019
J. Parallel Distributed Comput., 2019
J. Parallel Distributed Comput., 2019
J. Parallel Distributed Comput., 2019
Proceedings of the Platform for Advanced Scientific Computing Conference, 2019
Proceedings of the International Symposium on Memory Systems, 2019
Worksharing Tasks: An Efficient Way to Exploit Irregular and Fine-Grained Loop Parallelism.
Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019
Proceedings of the Artificial Intelligence Research and Development, 2019
Proceedings of the Artificial Intelligence Research and Development, 2019
2018
IEEE Trans. Parallel Distributed Syst., 2018
Multim. Tools Appl., 2018
J. Parallel Distributed Comput., 2018
J. Artif. Intell. Res., 2018
Formalization of Block Pruning: Reducing the Number of Cells Computed in Exact Biological Sequence Comparison Algorithms.
Comput. J., 2018
Proceedings of the 2018 IEEE/ACM Workshop on Education for High-Performance Computing, 2018
Proceedings of the 2018 IEEE/ACM Workshop on Education for High-Performance Computing, 2018
Proceedings of the 30th International Symposium on Computer Architecture and High Performance Computing, 2018
Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2018
Proceedings of the International Symposium on Memory Systems, 2018
Reducing Data Movement on Large Shared Memory Systems by Exploiting Computation Dependencies.
Proceedings of the 32nd International Conference on Supercomputing, 2018
Proceedings of the 32nd International Conference on Supercomputing, 2018
Proceedings of the 2018 IEEE International Conference on Big Knowledge, 2018
Proceedings of the International Conference on Field-Programmable Technology, 2018
Proceedings of the Euro-Par 2018: Parallel Processing, 2018
Proceedings of the Real World Domain Specific Languages Workshop, 2018
Proceedings of the Artificial Intelligence Research and Development, 2018
2017
IEEE Trans. Parallel Distributed Syst., 2017
ACM Trans. Archit. Code Optim., 2017
Supercomput. Front. Innov., 2017
Microprocess. Microsystems, 2017
CoRR, 2017
Cogn. Syst. Res., 2017
Proceedings of the 2nd Workshop on Semantic Deep Learning, 2017
Proceedings of the 2nd Workshop on Semantic Deep Learning, 2017
Proceedings of the 29th International Symposium on Computer Architecture and High Performance Computing, 2017
Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, 2017
Proceedings of the International Symposium on Memory Systems, 2017
Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017
General Purpose Task-Dependence Management Hardware for Task-Based Dataflow Programming Models.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017
Characterizing and Improving the Performance of Many-Core Task-Based Parallel Programming Runtimes.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017
Picos, A Hardware Task-Dependence Manager for Task-Based Dataflow Programming Models.
Proceedings of the 2017 International Conference on High Performance Computing & Simulation, 2017
Proceedings of the 2017 International Conference on High Performance Computing & Simulation, 2017
Proceedings of the 46th International Conference on Parallel Processing Workshops, 2017
Proceedings of the 46th International Conference on Parallel Processing, 2017
ParaView + Alya + D8tree: Integrating High Performance Computing and High Performance Data Analytics.
Proceedings of the International Conference on Computational Science, 2017
Fluid Communities: A Competitive, Scalable and Diverse Community Detection Algorithm.
Proceedings of the Complex Networks & Their Applications VI, 2017
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017
2016
CUDAlign 4.0: Incremental Speculative Traceback for Exact Chromosome-Wide Alignment in GPU Clusters.
IEEE Trans. Parallel Distributed Syst., 2016
ACM Trans. Parallel Comput., 2016
ACM Trans. Archit. Code Optim., 2016
CoRR, 2016
Architectural Impact on Performance of In-memory Data Analytics: Apache Spark Case Study.
CoRR, 2016
Proceedings of the International Conference for High Performance Computing, 2016
Proceedings of the International Conference for High Performance Computing, 2016
Proceedings of the Second International Symposium on Memory Systems, 2016
Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016
The Secrets of the Accelerators Unveiled: Tracing Heterogeneous Executions Through OMPT.
Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016
Supporting Adaptive Privatization Techniques for Irregular Array Reductions in Task-Parallel Programming Models.
Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016
Performance analysis of a hardware accelerator of dependence management for task-based dataflow programming models.
Proceedings of the 2016 IEEE International Symposium on Performance Analysis of Systems and Software, 2016
Proceedings of the 2016 IEEE International Symposium on Performance Analysis of Systems and Software, 2016
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016
Evaluating the effect of last-level cache sharing on integrated GPU-CPU systems with heterogeneous applications.
Proceedings of the 2016 IEEE International Symposium on Workload Characterization, 2016
Runtime-Guided Mitigation of Manufacturing Variability in Power-Constrained Multi-Socket NUMA Nodes.
Proceedings of the 2016 International Conference on Supercomputing, 2016
D8-tree: a de-normalized approach for multidimensional data analysis on key-value databases.
Proceedings of the 17th International Conference on Distributed Computing and Networking, 2016
Proceedings of the 2016 Euromicro Conference on Digital System Design, 2016
Proceedings of the Artificial Intelligence Research and Development, 2016
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016
Micro-Architectural Characterization of Apache Spark on Batch and Stream Processing Workloads.
Proceedings of the 2016 IEEE International Conferences on Big Data and Cloud Computing (BDCloud), 2016
Proceedings of the 3rd IEEE/ACM International Conference on Big Data Computing, 2016
Proceedings of The 8th Asian Conference on Machine Learning, 2016
POSTER: Collective Dynamic Parallelism for Directive Based GPU Programming Languages and Compilers.
Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016
Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016
Reducing Cache Coherence Traffic with Hierarchical Directory Cache and NUMA-Aware Runtime Scheduling.
Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016
2015
Hardware-Software Coherence Protocol for the Coexistence of Caches and Local Memories.
IEEE Trans. Computers, 2015
Parallel Comput., 2015
Proceedings of the Workshop on Computer Architecture Education, 2015
Proceedings of the Second Workshop on Accelerator Programming using Directives, 2015
Proceedings of the Second Workshop on Accelerator Programming using Directives, 2015
Proceedings of the International Conference for High Performance Computing, 2015
Proceedings of the 2015 International Conference on Embedded Computer Systems: Architectures, 2015
Proceedings of the 23rd Euromicro International Conference on Parallel, 2015
Proceedings of the 2015 International Symposium on Memory Systems, 2015
Proceedings of the OpenMP: Heterogenous Execution and Data Movements, 2015
Proceedings of the OpenMP: Heterogenous Execution and Data Movements, 2015
Coherence protocol for transparent management of scratchpad memories in shared memory manycore architectures.
Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015
Proceedings of the 29th ACM on International Conference on Supercomputing, 2015
Proceedings of the 21st IEEE International Conference on Parallel and Distributed Systems, 2015
Proceedings of the International Conference on Computational Science, 2015
Proceedings of the International Conference on Computational Science, 2015
Proceedings of the 6th Workshop on Parallel Programming and Run-Time Management Techniques for Many-core Architectures and the 4th Workshop on Design Tools and Architectures for Multicore Embedded Computing Platforms, 2015
Proceedings of the Euro-Par 2015: Parallel Processing, 2015
Proceedings of the 2015 Euromicro Conference on Digital System Design, 2015
Proceedings of the Artificial Intelligence Research and Development, 2015
Proceedings of the Big Data Benchmarks, Performance Optimization, and Emerging Hardware, 2015
Proceedings of the 8th International Conference on Biomedical Engineering and Informatics, 2015
Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015
Proceedings of the Fifth IEEE International Conference on Big Data and Cloud Computing, 2015
Proceedings of the 2015 International Conference on Parallel Architectures and Compilation, 2015
2014
J. Parallel Distributed Comput., 2014
A methodology for the evaluation of high response time on E-commerce users and sales.
Inf. Syst. Frontiers, 2014
Proceedings of the Supercomputing - 29th International Conference, 2014
Scalability and Parallel Execution of OmpSs-OpenCL Tasks on Heterogeneous CPU-GPU Environment.
Proceedings of the Supercomputing - 29th International Conference, 2014
Proceedings of the Fourth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing, 2014
Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014
A Case Study of Hybrid Dataflow and Shared-Memory Programming Models: Dependency-Based Parallel Game Engine.
Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014
Analyzing Performance Improvements and Energy Savings in Infiniband Architecture using Network Compression.
Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014
Proceedings of the 2014 International Conference on ReConFigurable Computing and FPGAs, 2014
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2014
Proceedings of the Modeling Decisions for Artificial Intelligence, 2014
Proceedings of the Using and Improving OpenMP for Devices, Tasks, and More, 2014
On the Roles of the Programmer, the Compiler and the Runtime System When Programming Accelerators in OpenMP.
Proceedings of the Using and Improving OpenMP for Devices, Tasks, and More, 2014
Proceedings of the Using and Improving OpenMP for Devices, Tasks, and More, 2014
Proceedings of the 43rd International Conference on Parallel Processing, 2014
Proceedings of the International Conference on High Performance Computing & Simulation, 2014
Proceedings of the 2014 International Conference on Field-Programmable Technology, 2014
Proceedings of the 24th International Conference on Field Programmable Logic and Applications, 2014
Proceedings of the 2014 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2014
Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014
Proceedings of the 2014 IEEE International Conference on Cluster Computing, 2014
DaSH: a benchmark suite for hybrid dataflow and shared memory programming models: with comparative evaluation of three hybrid dataflow models.
Proceedings of the Computing Frontiers Conference, CF'14, 2014
Proceedings of the 14th IEEE/ACM International Symposium on Cluster, 2014
Proceedings of the 14th IEEE/ACM International Symposium on Cluster, 2014
ALOJA: A systematic study of Hadoop deployment variables to enable automated characterization of cost-effectiveness.
Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014
Proceedings of the IEEE 25th International Conference on Application-Specific Systems, 2014
Proceedings of the Reconfigurable Computing: Architectures, Tools, and Applications, 2014
2013
A Systematic Methodology to Generate Decomposable and Responsive Power Models for CMPs.
IEEE Trans. Computers, 2013
A template system for the efficient compilation of domain abstractions onto reconfigurable computers.
J. Syst. Archit., 2013
Programmability and portability for exascale: Top down programming methodology and tools with StarSs.
J. Comput. Sci., 2013
Proceedings of the 2013 IEEE 12th International Symposium on Network Computing and Applications, 2013
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013
Implementing OmpSs support for regions of data in architectures with multiple address spaces.
Proceedings of the International Conference on Supercomputing, 2013
Proceedings of the International Conference on Computational Science, 2013
Proceedings of the 20th Annual International Conference on High Performance Computing, 2013
2012
IEEE Trans. Parallel Distributed Syst., 2012
Energy accounting for shared virtualized environments under DVFS using PMC-based power models.
Future Gener. Comput. Syst., 2012
POTRA: a framework for building power models for next generation multicore architectures.
Proceedings of the ACM SIGMETRICS/PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems, 2012
Proceedings of the IEEE 24th International Symposium on Computer Architecture and High Performance Computing, 2012
Proceedings of the Languages and Compilers for Parallel Computing, 2012
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012
Assessing the Impact of Network Compression on Molecular Dynamics and Finite Element Methods.
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012
Proceedings of the 19th International Conference on High Performance Computing, 2012
Proceedings of the 19th International Conference on High Performance Computing, 2012
Proceedings of the 22nd International Conference on Field Programmable Logic and Applications (FPL), 2012
Proceedings of the Euro-Par 2012: Parallel Processing Workshops, 2012
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012
DMA-circular: an enhanced high level programmable DMA controller for optimized management of on-chip local memories.
Proceedings of the Computing Frontiers Conference, CF'12, 2012
Proceedings of the Computing Frontiers Conference, CF'12, 2012
Proceedings of the Reconfigurable Computing: Architectures, Tools and Applications, 2012
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012
2011
IEEE Trans. Parallel Distributed Syst., 2011
Parallel Process. Lett., 2011
Int. J. Parallel Program., 2011
Comput. J., 2011
Proceedings of the IEEE 9th Symposium on Application Specific Processors, 2011
Proceedings of the Applications, Tools and Techniques on the Road to Exascale Computing, Proceedings of the conference ParCo 2011, 31 August, 2011
Proceedings of The Tenth IEEE International Symposium on Networking Computing and Applications, 2011
Proceedings of the Middleware 2011, 2011
Proceedings of the 25th International Conference on Supercomputing, 2011, Tucson, AZ, USA, May 31, 2011
Proceedings of the 20th ACM International Symposium on High Performance Distributed Computing, 2011
Implementation of a Reverse Time Migration kernel using the HCE High Level Synthesis tool.
Proceedings of the 2011 International Conference on Field-Programmable Technology, 2011
Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011
Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011
2010
Automatic Prefetch and Modulo Scheduling Transformations for the Cell BE Architecture.
IEEE Trans. Parallel Distributed Syst., 2010
Int. J. Parallel Program., 2010
Concurr. Comput. Pract. Exp., 2010
Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2010
Proceedings of the 2010 International Conference on Parallel and Distributed Computing, 2010
Proceedings of the 43rd Annual IEEE/ACM International Symposium on Microarchitecture, 2010
Proceedings of the Languages and Compilers for Parallel Computing, 2010
Proceedings of the Beyond Loop Level Parallelism in OpenMP: Accelerators, 2010
Proceedings of the Beyond Loop Level Parallelism in OpenMP: Accelerators, 2010
Characterization of workload and resource consumption for an online travel and booking site.
Proceedings of the 2010 IEEE International Symposium on Workload Characterization, 2010
Proceedings of the 24th International Conference on Supercomputing, 2010
Decomposable and responsive power models for multicore processors using performance counters.
Proceedings of the 24th International Conference on Supercomputing, 2010
Proceedings of the 39th International Conference on Parallel Processing, 2010
A CellBE-based HPC Application for the Analysis of Vulnerabilities in Cryptographic Hash Functions.
Proceedings of the 12th IEEE International Conference on High Performance Computing and Communications, 2010
Proceedings of the High Performance Embedded Architectures and Compilers, 2010
Buffer Sizing for Self-timed Stream Programs on Heterogeneous Distributed Memory Multiprocessors.
Proceedings of the High Performance Embedded Architectures and Compilers, 2010
Accurate energy accounting for shared virtualized environments using PMC-based power modeling techniques.
Proceedings of the 2010 11th IEEE/ACM International Conference on Grid Computing, 2010
Proceedings of the International Conference on Field Programmable Logic and Applications, 2010
Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010
Proceedings of the 2010 conference of the Centre for Advanced Studies on Collaborative Research, 2010
2009
Int. J. Parallel Program., 2009
Int. J. High Perform. Comput. Appl., 2009
Proceedings of the 2009 International Conference on Embedded Computer Systems: Architectures, 2009
Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009
Turbocharging boosted transactions or: how i learnt to stop worrying and love longer transactions.
Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009
Proceedings of the 17th Euromicro International Conference on Parallel, 2009
Impact of the Memory Hierarchy on Shared Memory Architectures in Multicore Programming Models.
Proceedings of the 17th Euromicro International Conference on Parallel, 2009
Achieving high memory performance from heterogeneous architectures with the SARC programming model.
Proceedings of the 10th workshop on MEmory performance, 2009
Adaptive and Speculative Memory Consistency Support for Multi-core Architectures with On-Chip Local Memories.
Proceedings of the Languages and Compilers for Parallel Computing, 2009
Proceedings of the Languages and Compilers for Parallel Computing, 2009
Proceedings of the Evolving OpenMP in an Age of Extreme Parallelism, 2009
Proceedings of the 23rd international conference on Supercomputing, 2009
Barcelona OpenMP Tasks Suite: A Set of Benchmarks Targeting the Exploitation of Task Parallelism in OpenMP.
Proceedings of the ICPP 2009, 2009
Proceedings of the ICPP 2009, 2009
Proceedings of the 16th International Conference on High Performance Computing, 2009
Proceedings of the 2009 International Conference on Field-Programmable Technology, 2009
Proceedings of the Euro-Par 2009 Parallel Processing, 2009
Proceedings of the 2009 International Conference on Compilers, 2009
Proceedings of the 2009 conference of the Centre for Advanced Studies on Collaborative Research, 2009
2008
Int. J. Parallel Program., 2008
Int. J. High Perform. Comput. Netw., 2008
Int. J. Embed. Syst., 2008
Dynamic CPU provisioning for self-managed secure web applications in SMP hosting platforms.
Comput. Networks, 2008
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008
Proceedings of the IEEE/IFIP Network Operations and Management Symposium: Pervasive Management for Ubioquitous Networks and Services, 2008
Enabling Resource Sharing between Transactional and Batch Workloads Using Dynamic Application Placement.
Proceedings of the Middleware 2008, 2008
Proceedings of the 9th workshop on MEmory performance, 2008
Proceedings of the 9th workshop on MEmory performance, 2008
Automatic Pre-Fetch and Modulo Scheduling Transformations for the Cell BE Architecture.
Proceedings of the Languages and Compilers for Parallel Computing, 2008
Proceedings of the OpenMP in a New Era of Parallelism, 4th International Workshop, 2008
Proceedings of the OpenMP in a New Era of Parallelism, 4th International Workshop, 2008
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008
Proceedings of the 14th International Conference on Parallel and Distributed Systems, 2008
Tailoring Resources: The Energy Efficient Consolidation Strategy Goes Beyond Virtualization.
Proceedings of the 2008 International Conference on Autonomic Computing, 2008
Proceedings of the 17th International Symposium on High-Performance Distributed Computing (HPDC-17 2008), 2008
Proceedings of the 2008 conference of the Centre for Advanced Studies on Collaborative Research, 2008
Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques, 2008
2007
Int. J. Parallel Program., 2007
Comput. Networks, 2007
Proceedings of the Embedded Computer Systems: Architectures, 2007
Proceedings of the 2007 workshop on MEmory performance, 2007
Proceedings of the 2007 workshop on MEmory performance, 2007
Proceedings of the Languages and Compilers for Parallel Computing, 2007
Proceedings of the Languages and Compilers for Parallel Computing, 2007
Proceedings of the A Practical Programming Model for the Multi-Core Era, 2007
Proceedings of the A Practical Programming Model for the Multi-Core Era, 2007
Proceedings of the 2007 conference of the Centre for Advanced Studies on Collaborative Research, 2007
2006
J. Parallel Distributed Comput., 2006
Employing nested OpenMP for the parallelization of multi-zone computational fluid dynamics applications.
J. Parallel Distributed Comput., 2006
Exploiting multilevel parallelism using OpenMP on a massive multithreaded architecture.
J. Embed. Comput., 2006
Performance, power efficiency and scalability of asymmetric cluster chip multiprocessors.
IEEE Comput. Archit. Lett., 2006
Proceedings of the Languages and Compilers for Parallel Computing, 2006
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006
Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006
2005
Proceedings of the 13th Euromicro Workshop on Parallel, 2005
WAS Control Center: An Autonomic Performance-Triggered Tracing Environment for WebSphere.
Proceedings of the 13th Euromicro Workshop on Parallel, 2005
Proceedings of the OpenMP Shared Memory Parallel Programming - International Workshops, 2005
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005
Proceedings of the 34th International Conference on Parallel Processing (ICPP 2005), 2005
Proceedings of the 11th International Conference on Parallel and Distributed Systems, 2005
Proceedings of the High Performance Computing and Communications, 2005
2004
Software and Hardware Techniques to Optimize Register File Utilization in VLIW Architectures.
Int. J. Parallel Program., 2004
Int. J. High Perform. Comput. Netw., 2004
Performance and Power Evaluation of Clustered VLIW Processors with Wide Functional Units.
Proceedings of the Computer Systems: Architectures, 2004
Proceedings of the 33rd International Conference on Parallel Processing (ICPP 2004), 2004
2003
Sci. Program., 2003
Proceedings of the OpenMP Shared Memory Parallel Programming, 2003
Proceedings of the OpenMP Shared Memory Parallel Programming, 2003
Complete instrumentation requirements for performance analysis of Web based technologies.
Proceedings of the 2003 IEEE International Symposium on Performance Analysis of Systems and Software, 2003
Proceedings of the High Performance Computing, 5th International Symposium, 2003
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003
Application/Kernel Cooperation Towards the Efficient Execution of Shared-Memory Parallel Java Codes.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003
2002
J. Parallel Distributed Comput., 2002
Runtime vs. Manual Data Distribution for Architecture-Agnostic Shared-Memory Programming Models.
Int. J. Parallel Program., 2002
Dual-Level Parallelism Exploitation with OpenMP in Coastal Ocean Circulation Modeling.
Proceedings of the High Performance Computing, 4th International Symposium, 2002
Proceedings of the 2002 International Conference on Parallel Architectures and Compilation Techniques (PACT 2002), 2002
2001
IEEE Trans. Parallel Distributed Syst., 2001
A Framework for Integrating Data Alignment, Distribution, and Redistribution in Distributed Memory Multiprocessors.
IEEE Trans. Parallel Distributed Syst., 2001
Cost-Conscious Strategies to Increase Performance of Numerical Programs on Aggressive VLIW Architectures.
IEEE Trans. Computers, 2001
IEEE Trans. Computers, 2001
SIGARCH Comput. Archit. News, 2001
Concurr. Comput. Pract. Exp., 2001
Proceedings of the OpenMP Shared Memory Parallel Programming, 2001
Proceedings of the OpenMP Shared Memory Parallel Programming, 2001
Proceedings of the 2001 ACM/IEEE conference on Supercomputing, 2001
Modulo scheduling with integrated register spilling for clustered VLIW architectures.
Proceedings of the 34th Annual International Symposium on Microarchitecture, 2001
Proceedings of the Languages and Compilers for Parallel Computing, 2001
Proceedings of the 15th international conference on Supercomputing, 2001
The trade-off between implicit and explicit data distribution in shared-memory programming paradigms.
Proceedings of the 15th international conference on Supercomputing, 2001
Proceedings of the 2001 International Conference on Parallel Processing, 2001
Proceedings of the 2001 International Conference on Parallel Processing, 2001
Proceedings of the Euro-Par 2001: Parallel Processing, 2001
2000
Concurr. Pract. Exp., 2000
Proceedings of the Proceedings Supercomputing 2000, 2000
Proceedings of the 2000 ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI), 2000
Proceedings of the 33rd Annual IEEE/ACM International Symposium on Microarchitecture, 2000
UPMLIB: A Runtime System for Tuning the Memory Performance of OpenMP Programs on Scalable Shared-Memory Multiprocessors.
Proceedings of the Languages, 2000
Proceedings of the Languages and Compilers for Parallel Computing, 2000
Proceedings of the ACM 2000 Java Grande Conference, San Francisco, CA, USA, 2000
Leveraging Transparent Data Distribution in OpenMP via User-Level Dynamic Page Migration.
Proceedings of the High Performance Computing, Third International Symposium, 2000
Applying Interposition Techniques for Performance Analysis of OpenMP Parallel Applications.
Proceedings of the 14th International Parallel & Distributed Processing Symposium (IPDPS'00), 2000
Proceedings of the 14th international conference on Supercomputing, 2000
Proceedings of the 2000 International Conference on Parallel Processing, 2000
1999
Thread fork/join techniques for multi-level parallelism exploitation in NUMA multiprocessors.
Proceedings of the 13th international conference on Supercomputing, 1999
Proceedings of the 13th international conference on Supercomputing, 1999
Proceedings of the 13th international conference on Supercomputing, 1999
Proceedings of the International Conference on Parallel Processing 1999, 1999
Proceedings of the International Conference on Parallel Processing 1999, 1999
Quantifying the Benefits of SPECint Distant Parallelism in Simultaneous Multi-Threading Architectures.
Proceedings of the 1999 International Conference on Parallel Architectures and Compilation Techniques, 1999
1998
Int. J. Parallel Program., 1998
Proceedings of the 31st Annual IEEE/ACM International Symposium on Microarchitecture, 1998
Proceedings of the 12th international conference on Supercomputing, 1998
1997
Sci. Program., 1997
Proceedings of the Languages and Compilers for Parallel Computing, 1997
Proceedings of the 11th International Parallel Processing Symposium (IPPS '97), 1997
Increasing Memory Bandwidth with Wide Buses: Compiler, Hardware and Performance Trade-Offs.
Proceedings of the 11th international conference on Supercomputing, 1997
1996
Parallel Process. Lett., 1996
Proceedings of the Eighth IEEE Symposium on Parallel and Distributed Processing, 1996
Proceedings of the 4th Euromicro Workshop on Parallel and Distributed Processing (PDP '96), 1996
Proceedings of the 29th Annual IEEE/ACM International Symposium on Microarchitecture, 1996
Proceedings of the Languages and Compilers for Parallel Computing, 1996
Proceedings of the Euro-Par '96 Parallel Processing, 1996
Proceedings of the Fifth International Conference on Parallel Architectures and Compilation Techniques, 1996
1995
IEEE Trans. Computers, 1995
Int. J. Parallel Program., 1995
Proceedings of the Proceedings Supercomputing '95, San Diego, CA, USA, December 4-8, 1995, 1995
Proceedings of the 3rd Euromicro Workshop on Parallel and Distributed Processing (PDP '95), 1995
Proceedings of the 28th Annual International Symposium on Microarchitecture, Ann Arbor, Michigan, USA, November 29, 1995
Proceedings of the Languages and Compilers for Parallel Computing, 1995
Proceedings of the 22nd Annual International Symposium on Computer Architecture, 1995
Proceedings of the 1st IEEE Symposium on High-Performance Computer Architecture (HPCA 1995), 1995
Proceedings of the IFIP WG10.3 working conference on Parallel architectures and compilation techniques, 1995
1994
Parallel Process. Lett., 1994
Proceedings of the Second Euromicro Workshop on Parallel and Distributed Processing, 1994
Proceedings of the Languages and Compilers for Parallel Computing, 1994
Proceedings of the 8th international conference on Supercomputing, 1994
Proceedings of the Parallel Processing: CONPAR 94, 1994
Proceedings of the Parallel Processing: CONPAR 94, 1994
1993
Microprocess. Microprogramming, 1993
Proceedings of the 1993 Euromicro Workshop on Parallel and Distributed Processing, 1993
Proceedings of the Languages and Compilers for Parallel Computing, 1993
Proceedings of the 7th international conference on Supercomputing, 1993
1992
Proceedings of the 19th Annual International Symposium on Computer Architecture. Gold Coast, 1992
Proceedings of the 6th international conference on Supercomputing, 1992
1991
Microprocessing and Microprogramming, 1991
Proceedings of the Languages and Compilers for Parallel Computing, 1991
Proceedings of the Distributed Memory Computing, 2nd European Conference, 1991
1989
PhD thesis, 1989
Proceedings of the Proceedings Supercomputing '89, Reno, NV, USA, November 12-17, 1989, 1989
Proceedings of the PARLE '89: Parallel Architectures and Languages Europe, 1989