Sai Qian Zhang

CoRR, 2024

GazeGen: Gaze-Driven User Interaction for Visual Content Generation.

[BibT_eX]

[DOI]

CoRR, 2024

PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a resource-limited Context.

[BibT_eX]

[DOI]

CoRR, 2024

Neural Architecture Search of Hybrid Models for NPU-CIM Heterogeneous AR/VR Devices.

[BibT_eX]

[DOI]

Kleber Hugo Stangherlin

CoRR, 2024

DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing.

[BibT_eX]

[DOI]

Zhenyuan Dong

CoRR, 2024

HQ-DiT: Efficient Diffusion Transformer with FP4 Hybrid Quantization.

[BibT_eX]

[DOI]

Wenxuan Liu

CoRR, 2024

Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey.

[BibT_eX]

[DOI]

CoRR, 2024

T3M: Text Guided 3D Human Motion Synthesis from Speech.

[BibT_eX]

[DOI]

Wenshuo Peng

Kaipeng Zhang

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Rome was Not Built in a Single Step: Hierarchical Prompting for LLM-based Chip Design.

[BibT_eX]

[DOI]

Proceedings of the 2024 ACM/IEEE International Symposium on Machine Learning for CAD, 2024

Learning Client Selection Strategy for Federated Learning across Heterogeneous Mobile Devices.

[BibT_eX]

[DOI]

Proceedings of the 25th International Symposium on Quality Electronic Design, 2024

sLLM: Accelerating LLM Inference using Semantic Load Balancing with Shared Memory Data Structures.

[BibT_eX]

[DOI]

Alberto Leon-Garcia

Proceedings of the 25th International Symposium on Quality Electronic Design, 2024

JointNF: Enhancing DNN Performance through Adaptive N: M Pruning across both Weight and Activation.

[BibT_eX]

[DOI]

Proceedings of the 29th ACM/IEEE International Symposium on Low Power Electronics and Design, 2024

Hyft: A Reconfigurable Softmax Accelerator with Hybrid Numeric Format for both Training and Inference.

[BibT_eX]

[DOI]

Tianhua Xia

Proceedings of the 29th ACM/IEEE International Symposium on Low Power Electronics and Design, 2024

Murmuration: On-the-fly DNN Adaptation for SLO-Aware Distributed Inference in Dynamic Edge Environments.

[BibT_eX]

[DOI]

Proceedings of the 53rd International Conference on Parallel Processing, 2024

CAMEL: Co-Designing AI Models and eDRAMs for Efficient On-Device Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2024

DLoRA: Distributed Parameter-Efficient Fine-Tuning Solution for Large Language Model.

[BibT_eX]

[DOI]

Chao Gao

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023

BIM: Block-Wise Self-Supervised Learning with Masked Image Modeling.

[BibT_eX]

[DOI]

Yixuan Luo

Mengye Ren

CoRR, 2023

Softmax Acceleration with Adaptive Numeric Format for both Training and Inference.

[BibT_eX]

[DOI]

Tianhua Xia

CoRR, 2023

CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning.

[BibT_eX]

[DOI]

CoRR, 2023

2022

FAST: DNN Training Under Variable Precision Block Floating Point with Stochastic Rounding.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022

SphereFed: Hyperspherical Federated Learning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

A Multi-Agent Reinforcement Learning Approach for Efficient Client Selection in Federated Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Saturation RRAM Leveraging Bit-Level Sparsity Resulting from Term Quantization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2021

Training for multi-resolution inference using reusable quantization terms.

[BibT_eX]

[DOI]

Proceedings of the ASPLOS '21: 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2021

2020

Term Revealing: Furthering Quantization at Run Time on Quantized DNNs.

[BibT_eX]

[DOI]

CoRR, 2020

On the Robustness of Cooperative Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE Security and Privacy Workshops, 2020

Term quantization: furthering quantization at run time.

[BibT_eX]

[DOI]

Hsiang-Tsung Kung

Proceedings of the International Conference for High Performance Computing, 2020

Succinct and Robust Multi-Agent Communication With Temporal Message Control.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Adaptive Distributed Convolutional Neural Network Inference at the Network Edge with ADCNN.

[BibT_eX]

[DOI]

Proceedings of the ICPP 2020: 49th International Conference on Parallel Processing, 2020

RTN: Reparameterized Ternary Network.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Full-stack Optimization for Accelerating CNNs with FPGA Validation.

[BibT_eX]

[DOI]

CoRR, 2019

Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Systolic Building Block for Logic-on-Logic 3D-IC Implementations of Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2019

Full-stack optimization for accelerating CNNs using powers-of-two weights with FPGA validation.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Supercomputing, 2019

Packing Sparse Convolutional Neural Networks for Efficient Systolic Array Implementations: Column Combining Under Joint Optimization.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 2019

Maestro: A Memory-on-Logic Architecture for Coordinated Parallel Use of Many Systolic Arrays.

[BibT_eX]

[DOI]

Proceedings of the 30th IEEE International Conference on Application-specific Systems, 2019

2018

InferBeam: A Fast Beam Alignment Protocol for Millimeter-wave Networking.

[BibT_eX]

[DOI]

Youngjune Gwon

CoRR, 2018

A Machine Learning Assisted Cell Selection Method for Drones in Cellular Networks.

[BibT_eX]

[DOI]

Proceedings of the 19th IEEE International Workshop on Signal Processing Advances in Wireless Communications, 2018

Mapping Systolic Arrays onto 3D Circuit Structures: Accelerating Convolutional Neural Network Inference.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Workshop on Signal Processing Systems, 2018

Adaptive Tiling: Applying Fixed-size Systolic Arrays To Sparse Convolutional Neural Networks.

[BibT_eX]

[DOI]