Qi Chen

Orcid: 0009-0006-7394-0185

Affiliations:
  • Microsoft Research Asia, Beijing, China
  • Peking University, Department of Computer Science, Beijing, China (former)


According to our database1, Qi Chen authored at least 36 papers between 2013 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval.
CoRR, 2024

Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance.
CoRR, 2024

OneSparse: A Unified System for Multi-index Vector Search.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024


Understanding the Weakness of Large Language Model Agents within a Complex Android Environment.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

2023
IRGen: Generative Modeling for Image Retrieval.
CoRR, 2023

LUT-NN: Towards Unified Neural Network Inference by Table Lookup.
CoRR, 2023

SPFresh: Incremental In-Place Update for Billion-Scale Vector Search.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

VBASE: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

Model-enhanced Vector Index.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LUT-NN: Empower Efficient Neural Network Inference with Centroid Learning and Table Lookup.
Proceedings of the 29th Annual International Conference on Mobile Computing and Networking, 2023

FrozenHot Cache: Rethinking Cache Management for Modern Hardware.
Proceedings of the Eighteenth European Conference on Computer Systems, 2023

2022
A Neural Corpus Indexer for Document Retrieval.
CoRR, 2022

Distill-VQ: Learning Retrieval Oriented Vector Quantization By Distilling Knowledge from Dense Embeddings.
CoRR, 2022

Distill-VQ: Learning Retrieval Oriented Vector Quantization By Distilling Knowledge from Dense Embeddings.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

A Neural Corpus Indexer for Document Retrieval.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Heat-RL: Online Model Selection for Streaming Time-Series Anomaly Detection.
Proceedings of the Conference on Lifelong Learning Agents, 2022

2021
SPANN: Highly-efficient Billion-scale Approximate Nearest Neighbor Search.
CoRR, 2021

Match Plan Generation in Web Search with Parameterized Action Reinforcement Learning.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Results of the NeurIPS'21 Challenge on Billion-Scale Approximate Nearest Neighbor Search.
Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021

SPANN: Highly-efficient Billion-scale Approximate Nearest Neighborhood Search.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Byzantine Ordered Consensus without Byzantine Oligarchy.
IACR Cryptol. ePrint Arch., 2020

AutoSys: The Design and Operation of Learning-Augmented Systems.
Proceedings of the 2020 USENIX Annual Technical Conference, 2020

Estimating Minimum Operation Steps via Memory-based Recurrent Calculation Network.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

2019
PISCES: Optimizing Multi-Job Application Execution in MapReduce.
IEEE Trans. Cloud Comput., 2019

We Know What You Will Ask: A Dialogue System for Multi-intent Switch and Prediction.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

Collaborative learning between cloud and end devices: an empirical study on location prediction.
Proceedings of the 4th ACM/IEEE Symposium on Edge Computing, 2019

2018
Auto-Dialabel: Labeling Dialogue Data with Unsupervised Learning.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2016
The Freeze-Frame File System.
Proceedings of the Seventh ACM Symposium on Cloud Computing, 2016

2015
LIBRA: Lightweight Data Skew Mitigation in MapReduce.
IEEE Trans. Parallel Distributed Syst., 2015

Spartan: A Distributed Array Framework with Smart Tiling.
Proceedings of the 2015 USENIX Annual Technical Conference, 2015

2014
Automatic Scaling of Internet Applications for Cloud Computing Services.
IEEE Trans. Computers, 2014

Adaptive Resource Provisioning for the Cloud Using Online Bin Packing.
IEEE Trans. Computers, 2014

Improving MapReduce Performance Using Smart Speculative Execution Strategy.
IEEE Trans. Computers, 2014

Building Scalable Multimedia Search Engine Using Infiniband.
Proceedings of the 6th USENIX Workshop on Hot Topics in Cloud Computing, 2014

2013
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.
IEEE Trans. Parallel Distributed Syst., 2013


  Loading...