Ruihao Gong

Orcid: 0000-0002-6024-7086

According to our database1, Ruihao Gong authored at least 50 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Rectify representation bias in vision-language models for long-tailed recognition.
Neural Networks, 2024

HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration.
CoRR, 2024

A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms.
CoRR, 2024

OmniBal: Towards Fast Instruct-tuning for Vision-Language Models via Omniverse Computation Balance.
CoRR, 2024

Temporal Feature Matters: A Framework for Diffusion Model Quantization.
CoRR, 2024

ME-Switch: A Memory-Efficient Expert Switching Framework for Large Language Models.
CoRR, 2024

LLM-QBench: A Benchmark Towards the Best Practice for Post-training Quantization of Large Language Models.
CoRR, 2024

2023 Low-Power Computer Vision Challenge (LPCVC) Summary.
CoRR, 2024

ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding.
CoRR, 2024

PTSBench: A Comprehensive Post-Training Sparsity Benchmark Towards Algorithms and Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

PRoof: A Comprehensive Hierarchical Profiling Framework for Deep Neural Networks with Roofline Analysis.
Proceedings of the 53rd International Conference on Parallel Processing, 2024

Compressing Large Language Models by Joint Sparsification and Quantization.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Fast and Controllable Post-training Sparsity: Learning Optimal Sparsity Allocation with Global Constraint in Minutes.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Distribution-Sensitive Information Retention for Accurate Binary Neural Network.
Int. J. Comput. Vis., 2023

Exploring the Potential of Flexible 8-bit Format: Design and Algorithm.
CoRR, 2023

Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling.
CoRR, 2023

SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

Exploiting Subgraph Similarities for Efficient Auto-tuning of Tensor Programs.
Proceedings of the 52nd International Conference on Parallel Processing, 2023

Lossy and Lossless (L<sup>2</sup>) Post-training Model Size Compression.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Outlier Suppression+: Accurate quantization of large language models by equivalent and effective shifting and scaling.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Annealing-based Label-Transfer Learning for Open World Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Exploring the Relationship Between Architectural Design and Adversarially Robust Generalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Adaptive Contrastive Knowledge Distillation for BERT Compression.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Exploring the Relationship between Architecture and Adversarially Robust Generalization.
CoRR, 2022

Outlier Suppression: Pushing the Limit of Low-bit Transformer Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Generating Transferable Adversarial Examples against Vision Transformers.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

NNLQP: A Multi-Platform Neural Network Latency Query and Prediction System with An Evolving Database.
Proceedings of the 51st International Conference on Parallel Processing, 2022

QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
RobustART: Benchmarking Robustness on Architecture Design and Training Techniques.
CoRR, 2021

Real World Robustness from Systematic Noise.
CoRR, 2021

MQBench: Towards Reproducible and Deployable Model Quantization Benchmark.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Real World Robustness from Systematic Noise.
Proceedings of the ADVM '21: Proceedings of the 1st International Workshop on Adversarial Learning for Multimedia, 2021

A Free Lunch From ANN: Towards Efficient, Accurate Spiking Neural Networks Calibration.
Proceedings of the 38th International Conference on Machine Learning, 2021

BRECQ: Pushing the Limit of Post-Training Quantization by Block Reconstruction.
Proceedings of the 9th International Conference on Learning Representations, 2021

Once Quantization-Aware Training: High Performance Extremely Low-bit Architecture Search.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

MixMix: All You Need for Data-Free Compression Are Feature and Data Mixing.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Diversifying Sample Generation for Accurate Data-Free Quantization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Binary neural networks: A survey.
Pattern Recognit., 2020

Learning in School: Multi-teacher Knowledge Inversion for Data-Free Quantization.
CoRR, 2020

Efficient Bitwidth Search for Practical Mixed Precision Neural Network.
CoRR, 2020

Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures.
Proceedings of the ICPP 2020: 49th International Conference on Parallel Processing, 2020

Balanced Binary Neural Networks with Gated Residual.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Towards Unified INT8 Training for Convolutional Neural Network.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Rotation Consistent Margin Loss for Efficient Low-Bit Face Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Forward and Backward Information Retention for Accurate Binary Neural Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Balanced Binary Neural Networks with Gated Residual.
CoRR, 2019

Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019


  Loading...