Shuming Shi
ORCID: 0009-0003-1712-5619
Affiliations:
- Tencent AI Lab, China
- Microsoft Research Asia (former)
- Tsinghua University, China (PhD 2004)
According to our database, Shuming Shi authored at least 211 papers between 2004 and 2024.
Bibliography
2024
Int. J. Mach. Learn. Cybern., September, 2024
Trans. Assoc. Comput. Linguistics, 2024
Trans. Assoc. Comput. Linguistics, 2024
CoRR, 2024
Mitigating Hallucinations of Large Language Models via Knowledge Consistent Alignment.
CoRR, 2024
Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models.
CoRR, 2024
Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models.
CoRR, 2024
DoG-Instruct: Towards Premium Instruction-Tuning Data via Text-Grounded Instruction Wrapping.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
The Reasonableness Behind Unreasonable Translation Capability of Large Language Model.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Selection-p: Self-Supervised Task-Agnostic Prompt Compression for Faithfulness and Transferability.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Not All Preference Pairs Are Created Equal: A Recipe for Annotation-Efficient Iterative Preference Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Addressing Entity Translation Problem via Translation Difficulty and Context Diversity.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Disperse-Then-Merge: Pushing the Limits of Instruction Tuning via Alignment Tax Reduction.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Advancement in Graph Understanding: A Multimodal Benchmark and Fine-Tuning of Vision-Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Lang. Resour. Evaluation, September, 2023
IEEE Trans. Games, June, 2023
CoRR, 2023
When Graph Data Meets Multimodal: A New Paradigm for Graph Understanding and Reasoning.
CoRR, 2023
StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving.
CoRR, 2023
CoRR, 2023
TeGit: Generating High-Quality Instruction-Tuning Data with Text-Grounded Task Design.
CoRR, 2023
CoRR, 2023
CoRR, 2023
Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration.
CoRR, 2023
A Simple and Plug-and-play Method for Unsupervised Sentence Representation Enhancement.
CoRR, 2023
Exploring the Use of Large Language Models for Reference-Free Text Quality Evaluation: A Preliminary Empirical Study.
CoRR, 2023
Findings of the WMT 2023 Shared Task on Discourse-Level Literary Translation: A Fresh Orb in the Cosmos of LLMs.
Proceedings of the Eighth Conference on Machine Translation, 2023
Proceedings of the Eighth Conference on Machine Translation, 2023
Sen2Pro: A Probabilistic Perspective to Sentence Embedding from Pre-trained Language Model.
Proceedings of the 8th Workshop on Representation Learning for NLP, 2023
Proceedings of the Natural Language Processing and Chinese Computing, 2023
Exploring the Use of Large Language Models for Reference-Free Text Quality Evaluation: An Empirical Study.
Proceedings of the Findings of the Association for Computational Linguistics: IJCNLP-AACL 2023, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Skillnet-NLG: General-Purpose Natural Language Generation with a Sparsely Activated Approach.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
ParroT: Translating during Chat using Large Language Models tuned with Human Translation and Feedback.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
IMTLab: An Open-Source Platform for Building, Evaluating, and Diagnosing Interactive Machine Translation Systems.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Making Better Use of Training Corpus: Retrieval-based Aspect Sentiment Triplet Extraction via Label Interpolation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Interpretable Real-Time Win Prediction for Honor of Kings - A Popular Mobile MOBA Esport.
IEEE Trans. Games, 2022
One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code.
CoRR, 2022
CoRR, 2022
Tencent's Multilingual Machine Translation System for WMT22 Large-Scale African Languages.
Proceedings of the Seventh Conference on Machine Translation, 2022
Tencent AI Lab - Shanghai Jiao Tong University Low-Resource Translation System for the WMT22 Translation Task.
Proceedings of the Seventh Conference on Machine Translation, 2022
Proceedings of the Seventh Conference on Machine Translation, 2022
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Towards Efficient Dialogue Pre-training with Transferable and Interpretable Latent Structure.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
MCPG: A Flexible Multi-Level Controllable Framework for Unsupervised Paraphrase Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
Understanding and Improving Sequence-to-Sequence Pretraining for Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Learning from Sibling Mentions with Scalable Graph Inference in Fine-Grained Entity Typing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Redistributing Low-Frequency Words: Making the Most of Monolingual Data in Non-Autoregressive Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Attending From Foresight: A Novel Attention Mechanism for Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Neurocomputing, 2021
Rethinking Negative Sampling for Unlabeled Entity Problem in Named Entity Recognition.
CoRR, 2021
REAM#: An Enhancement Approach to Reference-based Evaluation Metrics for Open-domain Dialog Generation.
CoRR, 2021
Tencent AI Lab Machine Translation Systems for the WMT21 Biomedical Translation Task.
Proceedings of the Sixth Conference on Machine Translation, 2021
Proceedings of the Sixth Conference on Machine Translation, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2021
An Empirical Study on Multiple Information Sources for Zero-Shot Fine-Grained Entity Typing.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Tail-to-Tail Non-Autoregressive Sequence Prediction for Chinese Grammatical Error Correction.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Self-Training Sampling with Monolingual Data Uncertainty for Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
REAM♯: An Enhancement Approach to Reference-based Evaluation Metrics for Open-domain Dialog Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
2020
Neurocomputing, 2020
TexSmart: A Text Understanding System for Fine-Grained NER and Enhanced Semantic Analysis.
CoRR, 2020
Interpretable Real-Time Win Prediction for Honor of Kings, a Popular Mobile MOBA Esport.
CoRR, 2020
CoRR, 2020
Grayscale Data Construction and Multi-Level Ranking Objective for Dialogue Response Selection.
CoRR, 2020
CoRR, 2020
Proceedings of the Fifth Conference on Machine Translation, 2020
Tencent AI Lab Machine Translation Systems for the WMT20 Biomedical Translation Task.
Proceedings of the Fifth Conference on Machine Translation, 2020
Proceedings of the Fifth Conference on Machine Translation, 2020
When Hearst Is not Enough: Improving Hypernymy Detection from Corpus with Distributional Models.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
The World is Not Binary: Learning to Rank with Grayscale Data for Dialogue Response Selection.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Balancing Quality and Human Involvement: An Effective Approach to Interactive Neural Machine Translation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Go From the General to the Particular: Multi-Domain Translation with Domain Transformation Networks.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Towards Better Modeling Hierarchical Structure for Self-Attention with Ordered Neurons.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Retrieval-guided Dialogue Response Generation via a Matching-to-Generation Framework.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Trans. Assoc. Comput. Linguistics, 2018
CoRR, 2018
Directional Skip-Gram: Explicitly Distinguishing Left and Right Context for Word Embeddings.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Joint Learning Embeddings for Chinese Words and their Components via Ladder Structured Networks.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Towards Less Generic Responses in Neural Conversation Models: A Statistical Re-weighting Method.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
Generating Classical Chinese Poems via Conditional Variational Autoencoder and Adversarial Training.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
2016
How well do Computers Solve Math Word Problems? Large-Scale Dataset Construction and Evaluation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
2015
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
2014
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014
2013
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013
2012
Int. J. Semantic Web Inf. Syst., 2012
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012
2011
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011
2010
Proceedings of the Third International Conference on Web Search and Web Data Mining, 2010
Proceedings of the COLING 2010, 2010
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010
2009
Proceedings of The Eighteenth Text REtrieval Conference, 2009
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009
Proceedings of the ACL 2009, 2009
2008
Proceedings of the 17th International Conference on World Wide Web, 2008
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008
2007
Proceedings of the 16th International Conference on World Wide Web, 2007
Proceedings of the Advances in Information Retrieval, 2007
Effective top-k computation in retrieving structured documents with term-proximity support.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007
2006
Proceedings of the Advances in Information Retrieval, 2006
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006
2005
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005
Title extraction from bodies of HTML documents and its application to web page retrieval.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005
2004
Proceedings of the Thirteenth Text REtrieval Conference, 2004