Daniel Lo

Orcid: 0009-0002-6504-9078

According to our database, Daniel Lo authored at least 19 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.


Bibliography

2024
Wavefront Threading Enables Effective High-Level Synthesis.
Proc. ACM Program. Lang., 2024

2023
PQC Cloudization: Rapid Prototyping of Scalable NTT/INTT Architecture to Accelerate Kyber.
IACR Cryptol. ePrint Arch., 2023

Dynamic Stashing Quantization for Efficient Transformer Training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2020
Pushing the Limits of Narrow Precision Inferencing at Cloud Scale with Microsoft Floating Point.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
Inside Project Brainwave's Cloud-Scale, Real-Time AI Processor.
IEEE Micro, 2019

2018
Serving DNNs in Real Time at Datacenter Scale with Project Brainwave.
IEEE Micro, 2018

A Configurable Cloud-Scale DNN Processor for Real-Time AI.
Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

2017
Configurable Clouds.
IEEE Micro, 2017

2016
A cloud-scale acceleration architecture.
Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

Prediction-Guided Performance-Energy Trade-off with Continuous Run-Time Adaptation.
Proceedings of the 2016 International Symposium on Low Power Electronics and Design, 2016


2015
Prediction-guided performance-energy trade-off for interactive applications.
Proceedings of the 48th International Symposium on Microarchitecture, 2015

Run-time monitoring with adjustable overhead using dataflow-guided filtering.
Proceedings of the 21st IEEE International Symposium on High Performance Computer Architecture, 2015

Improving worst-case cache performance through selective bypassing and register-indexed cache.
Proceedings of the 52nd Annual Design Automation Conference, 2015

2014
Slack-aware opportunistic monitoring for real-time systems.
Proceedings of the 20th IEEE Real-Time and Embedded Technology and Applications Symposium, 2014

2012
Worst-case execution time analysis for parallel run-time monitoring.
Proceedings of the 49th Annual Design Automation Conference 2012, 2012

2011
FlexCache: Field Extensible Cache Controller Architecture Using On-chip Reconfigurable Fabric.
Proceedings of the International Conference on Field Programmable Logic and Applications, 2011

2010
Flexible and Efficient Instruction-Grained Run-Time Monitoring Using On-Chip Reconfigurable Fabric.
Proceedings of the 43rd Annual IEEE/ACM International Symposium on Microarchitecture, 2010

Implementing dynamic information flow tracking on microprocessors with integrated FPGA fabric (abstract only).
Proceedings of the ACM/SIGDA 18th International Symposium on Field Programmable Gate Arrays, 2010

