Yuqing Yang

Orcid: 0000-0003-3518-5212

Affiliations:
  • Microsoft Research Asia, China
  • Fudan University, Shanghai, China (former)


According to our database1, Yuqing Yang authored at least 58 papers between 2010 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Online Streaming Video Super-Resolution With Convolutional Look-Up Table.
IEEE Trans. Image Process., 2024

NERVE: Real-Time Neural Video Recovery and Enhancement on Mobile Devices.
Proc. ACM Netw., 2024

Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely.
CoRR, 2024

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval.
CoRR, 2024

MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention.
CoRR, 2024

Expressive and Generalizable Low-rank Adaptation for Large Models via Slow Cascaded Learning.
CoRR, 2024

VisEval: A Benchmark for Data Visualization in the Era of Large Language Models.
CoRR, 2024

Mitigate Position Bias in Large Language Models via Scaling a Single Dimension.
CoRR, 2024

LLM-ABR: Designing Adaptive Bitrate Algorithms via Large Language Models.
CoRR, 2024

Benchmarking Data Science Agents.
CoRR, 2024

Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

Parrot: Efficient Serving of LLM-based Applications with Semantic Variable.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

Designing Network Algorithms via Large Language Models.
Proceedings of the 23rd ACM Workshop on Hot Topics in Networks, 2024

LoRASC: Expressive and Generalizable Low-rank Adaptation for Large Models via Slow Cascaded Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Position Engineering: Boosting Large Language Models through Positional Information Manipulation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Unified Medical Image Pre-training in Language-Guided Common Semantic Space.
Proceedings of the Computer Vision - ECCV 2024, 2024

MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Automated Contrastive Learning Strategy Search for Time Series.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Benchmarking Data Science Agents.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Online Video Super-Resolution With Convolutional Kernel Bypass Grafts.
IEEE Trans. Multim., 2023

Online Video Streaming Super-Resolution with Adaptive Look-Up Table Fusion.
CoRR, 2023

SparDA: Accelerating Dynamic Sparse Deep Neural Networks via Sparse-Dense Transformation.
CoRR, 2023

PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

SPFresh: Incremental In-Place Update for Billion-Scale Vector Search.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

VBASE: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

Optimizing Dynamic Neural Networks with Brainstorm.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Efficient GPU Kernels for N: M-Sparse Weights in Deep Learning.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

End-to-End Word-Level Pronunciation Assessment with MASK Pre-training.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Accurate and Structured Pruning for Efficient Automatic Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Attentive Mask CLIP.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Unsupervised Video Anomaly Detection For Stereotypical Behaviours in Autism.
Proceedings of the IEEE International Conference on Acoustics, 2023

SiloD: A Co-design of Caching and Scheduling for Deep Learning Clusters.
Proceedings of the Eighteenth European Conference on Computer Systems, 2023

LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

EfficientViT: Memory Efficient Vision Transformer with Cascaded Group Attention.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Towards Inference Efficient Deep Ensemble Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Online Video Super-Resolution with Convolutional Kernel Bypass Graft.
CoRR, 2022

Game of Privacy: Towards Better Federated Platform Collaboration under Privacy Restriction.
CoRR, 2022

PilotFish: Harvesting Free Cycles of Cloud Gaming with Deep Learning Training.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute.
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022

Reinforcement Learning with Automated Auxiliary Loss Search.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Variational oracle guiding for reinforcement learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Privacy-preserving Online AutoML for Domain-Specific Face Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Full-Cycle Energy Consumption Benchmark for Low-Carbon Computer Vision.
CoRR, 2021

AceNAS: Learning to Rank Ace Neural Architectures with Weak Supervision of Weight Sharing.
CoRR, 2021

nn-Meter: towards accurate latency prediction of deep-learning model inference on diverse edge devices.
Proceedings of the MobiSys '21: The 19th Annual International Conference on Mobile Systems, Applications, and Services, Virtual Event, Wisconsin, USA, 24 June, 2021

Towards Generating Real-World Time Series Data.
Proceedings of the IEEE International Conference on Data Mining, 2021

2020
Fast Hardware-Aware Neural Architecture Search.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Hardware-aware One-Shot Neural Architecture Search in Coordinate Ascent Framework.
CoRR, 2019

2010
A low-cost UHF RFID tag baseband with an IDEA cryptography engine.
Proceedings of the 2010 Internet of Things (IOT), IoT for a green Planet, Tokyo, Japan, November 29, 2010

Synchronization with timing recovery loop in UHF RFID reader receivers.
Proceedings of the 17th IEEE International Conference on Electronics, 2010

A semi-passive UHF RFID tag with on-chip temperature sensor.
Proceedings of the IEEE Custom Integrated Circuits Conference, 2010


  Loading...