Olli Saarikivi

ORCID: 0000-0001-7596-4734

According to our database, Olli Saarikivi authored at least 40 papers between 2012 and 2024.

Bibliography

2024
Efficient Schedule Construction for Distributed Execution of Large DNN Models.
IEEE Trans. Parallel Distributed Syst., December, 2024

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone.
CoRR, 2024

Tessel: Boosting Distributed Execution of Large DNN Models via Flexible Schedule Search.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2024

A Framework for Fine-Grained Synchronization of Dependent GPU Kernels.
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2024

2023
Derivative Based Nonbacktracking Real-World Regex Matching with Backtracking Semantics.
Proc. ACM Program. Lang., 2023

Symbolic Automata: ω-Regularity Modulo Theories.
CoRR, 2023

Textbooks Are All You Need.
CoRR, 2023

TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches.
Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

MSCCLang: Microsoft Collective Communication Language.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
MSCCL: Microsoft Collective Communication Library.
CoRR, 2022

Breaking the computation and communication abstraction barrier in distributed machine learning workloads.
Proceedings of ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022

2021
EVA Improved: Compiler and Extension Library for CKKS.
IACR Cryptol. ePrint Arch., 2021

Synthesizing Collective Communication Algorithms for Heterogeneous Networks with TACCL.
CoRR, 2021

CoCoNet: Co-Optimizing Computation and Communication for Distributed Machine Learning.
CoRR, 2021

Synthesizing optimal collective algorithms.
Proceedings of PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

Scaling Distributed Training with Adaptive Summation.
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

Distributed Training of Embeddings using Graph Analytics.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

2020
Artifact for the OOPSLA'20 paper "Regex Matching with Counting-Set Automata".
Dataset, August, 2020

Dataset for the OOPSLA'20 paper "Regex Matching with Counting-Set Automata".
Dataset, August, 2020

Dataset for the OOPSLA'20 paper "Regex Matching with Counting-Set Automata".
Dataset, August, 2020

Regex matching with counting-set automata.
Proc. ACM Program. Lang., 2020

EVA: an encrypted vector arithmetic language and compiler for efficient homomorphic computation.
Proceedings of the 41st ACM SIGPLAN International Conference on Programming Language Design and Implementation, 2020

2019
Succinct Determinisation of Counting Automata via Sphere Construction (Technical Report).
CoRR, 2019

Distributed Word2Vec using Graph Analytics Frameworks.
CoRR, 2019

Symbolic Regex Matcher.
Proceedings of the Tools and Algorithms for the Construction and Analysis of Systems, 2019

CHET: an optimizing compiler for fully-homomorphic neural-network inferencing.
Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2019

Succinct Determinisation of Counting Automata via Sphere Construction.
Proceedings of the Programming Languages and Systems - 17th Asian Symposium, 2019

2018
CHET: Compiler and Runtime for Homomorphic Evaluation of Tensor Programs.
CoRR, 2018

2017
Symbolic Methods for Transducers and Testing (Symboliset metodit muuntimille ja testaukseen).
PhD thesis, 2017

Minimizing Test Suites with Unfoldings of Multithreaded Programs.
ACM Trans. Embed. Comput. Syst., 2017

Fusing effectful comprehensions.
Proceedings of the 38th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2017

Translating C# to Branching Symbolic Transducers.
Proceedings of the IWIL@LPAR 2017 Workshop and LPAR-21 Short Presentations, 2017

Minimization of Symbolic Transducers.
Proceedings of the Computer Aided Verification - 29th International Conference, 2017

2016
LCTD: Test-guided proofs for C programs on LLVM.
J. Log. Algebraic Methods Program., 2016

LCTD: Tests-Guided Proofs for C Programs on LLVM - (Competition Contribution).
Proceedings of the Tools and Algorithms for the Construction and Analysis of Systems, 2016

2015
Unfolding based automated testing of multithreaded programs.
Autom. Softw. Eng., 2015

Reporting Races in Dynamic Partial Order Reduction.
Proceedings of the NASA Formal Methods - 7th International Symposium, 2015

Unfolding Based Minimal Test Suites for Testing Multithreaded Programs.
Proceedings of the 15th International Conference on Application of Concurrency to System Design, 2015

2012
Using unfoldings in automated testing of multithreaded programs.
Proceedings of the IEEE/ACM International Conference on Automated Software Engineering, 2012

Improving Dynamic Partial Order Reductions for Concolic Testing.
Proceedings of the 12th International Conference on Application of Concurrency to System Design, 2012

