Huiqiang Jiang

Orcid: 0000-0002-1327-4882

According to our database, Huiqiang Jiang authored at least 23 papers between 2017 and 2024.

Bibliography

2024
TACO-RL: Task Aware Prompt Compression Optimization with Reinforcement Learning.
CoRR, 2024

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval.
CoRR, 2024

MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention.
CoRR, 2024

Mitigate Position Bias in Large Language Models via Scaling a Single Dimension.
CoRR, 2024

Poster: Design of Elastic Deep Neural Network Candidate Spaces for Inference on Diverse Devices.
Proceedings of the 22nd Annual International Conference on Mobile Systems, 2024

Position Engineering: Boosting Large Language Models through Positional Information Manipulation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Hybrid SLM and LLM for Edge-Cloud Collaborative Inference.
Proceedings of the Workshop on Edge and Mobile Foundation Models, 2024

LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
SparDA: Accelerating Dynamic Sparse Deep Neural Networks via Sparse-Dense Transformation.
CoRR, 2023

PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

End-to-End Word-Level Pronunciation Assessment with MASK Pre-training.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Accurate and Structured Pruning for Efficient Automatic Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Attentive Mask CLIP.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Multi-Level Knowledge Distillation for Out-of-Distribution Detection in Text.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

CoLaDa: A Collaborative Label Denoising Framework for Cross-lingual Named Entity Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Decomposed Meta-Learning for Few-Shot Named Entity Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
BoningKnife: Joint Entity Mention Detection and Typing for Nested NER via prior Boundary Knowledge.
CoRR, 2021

AdvPicker: Effectively Leveraging Unlabeled Data via Adversarial Discriminator for Cross-Lingual NER.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2017
Conversion Rate Estimation in Online Advertising via Exploring Potential Impact of Creative.
Proceedings of the Database and Expert Systems Applications, 2017

