2018
The Mensa Correctional Institute.
Am. Math. Mon., 2018
2016
An approach for code generation in the Sparse Polyhedral Framework.
Parallel Comput., 2016
2008
Centralized versus Distributed Schedulers for Bag-of-Tasks Applications.
IEEE Trans. Parallel Distributed Syst., 2008
2006
Interference-Aware Scheduling.
Int. J. High Perform. Comput. Appl., 2006
Int. J. High Perform. Comput. Appl., 2006
Centralized versus distributed schedulers for multiple bag-of-task applications.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006
Path Grammar Guided Trace Compression and Trace Approximation.
Proceedings of the 15th IEEE International Symposium on High Performance Distributed Computing, 2006
2005
GUARD: Gossip Used for Autonomous Resource Detection.
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005
2004
Scheduling Strategies for Master-Slave Tasking on Heterogeneous Processor Platforms.
IEEE Trans. Parallel Distributed Syst., 2004
Sparse Tiling for Stationary Iterative Methods.
Int. J. High Perform. Comput. Appl., 2004
On the Interference of Communication on Computation in Java.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004
A-FAST: Autonomous Flow Approach to Scheduling Tasks.
Proceedings of the High Performance Computing, 2004
2003
On the Parallel Execution Time of Tiled Loops.
IEEE Trans. Parallel Distributed Syst., 2003
Folklore confirmed: reducible flow graphs are exponentially larger.
Proceedings of the Conference Record of POPL 2003: The 30th SIGPLAN-SIGACT Symposium on Principles of Programming Languages, 2003
Compile-time composition of run-time data and iteration reorderings.
Proceedings of the ACM SIGPLAN 2003 Conference on Programming Language Design and Implementation 2003, 2003
Autonomous Protocols for Bandwidth-Centric Scheduling of Independent-Task Applications.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003
2002
Combining Performance Aspects of Irregular Gauss-Seidel Via Sparse Tiling.
Proceedings of the Languages and Compilers for Parallel Computing, 15th Workshop, 2002
Bandwidth-Centric Allocation of Independent Tasks on Heterogeneous Platforms.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002
2001
Rescheduling for Locality in Sparse Matrix Computations.
Proceedings of the Computational Science - ICCS 2001, 2001
Proceedings of the Computational Science - ICCS 2001, 2001
2000
CROPS: coordinated restructuring of programs and storage.
ACM SIGSOFT Softw. Eng. Notes, 2000
Path Analysis and Renaming for Predicated Instruction Scheduling.
Int. J. Parallel Program., 2000
Faster FFTs via Architecture-Cognizance.
Proceedings of the 2000 International Conference on Parallel Architectures and Compilation Techniques (PACT'00), 2000
1999
Selecting Tile Shape for Minimal Execution Time.
Proceedings of the Eleventh Annual ACM Symposium on Parallel Algorithms and Architectures, 1999
Proceedings of the ACM/IEEE Conference on Supercomputing, 1999
Architecture-Cognizant Divide and Conquer Algorithms.
Proceedings of the ACM/IEEE Conference on Supercomputing, 1999
Asynchronous Dynamic Load Balancing of Tiles.
Proceedings of the Ninth SIAM Conference on Parallel Processing for Scientific Computing, 1999
Performance and Programming Experience on the Tera MTA.
Proceedings of the Ninth SIAM Conference on Parallel Processing for Scientific Computing, 1999
Performance optimisations of the NPB FT kernel by special-purpose unroller.
Proceedings of the Seventh Euromicro Workshop on Parallel and Distributed Processing. PDP'99, 1999
Memory Hierarchy Considerations for Fast Transpose and Bit-Reversals.
Proceedings of the Fifth International Symposium on High-Performance Computer Architecture, 1999
Localizing Non-Affine Array References.
Proceedings of the 1999 International Conference on Parallel Architectures and Compilation Techniques, 1999
Predicated Static Single Assignment.
Proceedings of the 1999 International Conference on Parallel Architectures and Compilation Techniques, 1999
1998
Quantifying the Multi-Level Nature of Tiling Interactions.
Int. J. Parallel Program., 1998
Multi-processor Performance on the Tera MTA.
Proceedings of the ACM/IEEE Conference on Supercomputing, 1998
Towards an Optimal Bit-Reversal Permutation Program.
Proceedings of the 39th Annual Symposium on Foundations of Computer Science, 1998
Schedule-Independent Storage Mapping for Loops.
Proceedings of the ASPLOS-VIII Proceedings of the 8th International Conference on Architectural Support for Programming Languages and Operating Systems, 1998
1997
Determining the Idle Time of a Tiling.
Proceedings of the Conference Record of POPL'97: The 24th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, 1997
1995
High-Performance Parallel Implementations of the NAS Kernel Benchmarks on the IBM SP2.
IBM Syst. J., 1995
Microparallelism and High-Performance Protein Matching.
Proceedings of the Proceedings Supercomputing '95, San Diego, CA, USA, December 4-8, 1995, 1995
Efficient Parallelism via Hierarchical Tiling.
Proceedings of the Seventh SIAM Conference on Parallel Processing for Scientific Computing, 1995
The Myth of Scalable High Performance.
Proceedings of the Seventh SIAM Conference on Parallel Processing for Scientific Computing, 1995
Is Scalability Relevant? A Look at Sparse Matrix-Vector Product.
Proceedings of the Seventh SIAM Conference on Parallel Processing for Scientific Computing, 1995
Message Compression for High Performance.
Proceedings of the Seventh SIAM Conference on Parallel Processing for Scientific Computing, 1995
Hierarchical tiling for improved superscalar performance.
Proceedings of IPPS '95, 1995
1994
XDP: A compiler intermediate language extension for the representation and optimization of data movement.
Int. J. Parallel Program., 1994
The Uniform Memory Hierarchy Model of Computation.
Algorithmica, 1994
1993
Orientation Maps: Techniques for Visualizing Rotations.
Proceedings of the 4th IEEE Visualization Conference, 1993
Explicit Data Placement (XDP): A Methodology for Explicit Compile-Time Representation and Optimization.
Proceedings of the Fourth ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPOPP), 1993
1992
Rectilinear Steiner Tree Minimization on a Workstation.
Proceedings of the Computational Support for Discrete Mathematics, 1992
1991
Proceedings of the 2nd IEEE Visualization Conference, 1991
1990
Visualizing Computer Memory Architectures.
Proceedings of the 1st IEEE Visualization Conference, 1990
Uniform Memory Hierarchies
Proceedings of the 31st Annual Symposium on Foundations of Computer Science, 1990
1988
TRIM: testability range by ignoring the memory.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 1988
1987
Distribution and Abstract Types in Emerald.
IEEE Trans. Software Eng., 1987
1985
The Complexity of Backtrack Searches (Preliminary Version)
Proceedings of the 17th Annual ACM Symposium on Theory of Computing, 1985
1981
New Hash Functions and Their Use in Authentication and Set Equality.
J. Comput. Syst. Sci., 1981
1979
Universal Classes of Hash Functions.
J. Comput. Syst. Sci., 1979
New Classes and Applications of Hash Functions
Proceedings of the 20th Annual Symposium on Foundations of Computer Science, 1979
1978
Exact and Approximate Membership Testers
Proceedings of the 10th Annual ACM Symposium on Theory of Computing, 1978
Analysis of a Universal Class of Hash Functions.
Proceedings of the Mathematical Foundations of Computer Science 1978, 1978
1977
A Case Study of a New Code Generation Technique for Compilers.
Commun. ACM, 1977
Universal Classes of Hash Functions (Extended Abstract)
Proceedings of the 9th Annual ACM Symposium on Theory of Computing, 1977
1974
Conjectures on uniquely decipherable codes (Corresp.).
IEEE Trans. Inf. Theory, 1974