Hui Guan
ORCID: 0000-0001-9128-2231
Affiliations:
- University of Massachusetts Amherst, USA
- North Carolina State University, Raleigh, USA (former)
According to our database, Hui Guan authored at least 60 papers between 2016 and 2024.
Bibliography
2024
IEEE Trans. Neural Networks Learn. Syst., November, 2024
DiffServe: Efficiently Serving Text-to-Image Diffusion Models with Query-Aware Model Scaling.
CoRR, 2024
CoRR, 2024
Integrating Graph Neural Networks and Many-Body Expansion Theory for Potential Energy Surfaces.
CoRR, 2024
In-Situ Fine-Tuning of Wildlife Models in IoT-Enabled Camera Traps for Efficient Adaptation.
CoRR, 2024
Graph Neural Network Training Systems: A Performance Comparison of Full-Graph and Mini-Batch.
CoRR, 2024
CoRR, 2024
IEEE Access, 2024
CACTUS: Dynamically Switchable Context-aware micro-Classifiers for Efficient IoT Inference.
Proceedings of the 22nd Annual International Conference on Mobile Systems, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 7th IEEE International Conference on Multimedia Information Processing and Retrieval, 2024
Loki: A System for Serving ML Inference Pipelines with Hardware and Accuracy Scaling.
Proceedings of the 33rd International Symposium on High-Performance Parallel and Distributed Computing, 2024
Proceedings of the Nineteenth European Conference on Computer Systems, 2024
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024
2023
CoRR, 2023
CoRR, 2023
IEEE Access, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Re-thinking computation offload for efficient inference on IoT devices with duty-cycled radios.
Proceedings of the 29th Annual International Conference on Mobile Computing and Networking, 2023
Proceedings of the 2023 ACM SIGPLAN International Symposium on Memory Management, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the 32nd International Conference on Parallel Architectures and Compilation Techniques, 2023
2022
CoRR, 2022
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022
Enabling Near Real-Time NLU-Driven Natural Language Programming through Dynamic Grammar Graph-Based Translation.
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2022
Proceedings of the International Conference on Automated Machine Learning, 2022
2021
IEEE Trans. Parallel Distributed Syst., 2021
COMET: A Novel Memory-Efficient Deep Learning Training Framework by Using Error-Bounded Lossy Compression.
Proc. VLDB Endow., 2021
Toward Compact Parameter Representations for Architecture-Agnostic Neural Network Compression.
CoRR, 2021
CoCoPIE: enabling real-time AI on off-the-shelf mobile devices via compression-compilation co-design.
Commun. ACM, 2021
FreeLunch: Compression-based GPU Memory Management for Convolutional Neural Networks.
Proceedings of the IEEE/ACM Workshop on Memory Centric High Performance Computing, 2021
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021
Proceedings of the IEEE International Conference on Data Mining, 2021
Proceedings of the CC '21: 30th ACM SIGPLAN International Conference on Compiler Construction, 2021
2020
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020
Proceedings of the Third Conference on Machine Learning and Systems, 2020
2019
Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019
2018
Proceedings of the International Conference for High Performance Computing, 2018
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018
2017
Egeria: a framework for automatic synthesis of HPC advising tools through multi-layered natural language processing.
Proceedings of the International Conference for High Performance Computing, 2017
Generalizations of the theory and deployment of triangular inequality for compiler-based strength reduction.
Proceedings of the 38th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2017
2016
Proceedings of the 17th IEEE International Workshop on Signal Processing Advances in Wireless Communications, 2016