Chenglei Si

According to our database1, Chenglei Si authored at least 27 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers.
CoRR, 2024

Configurable Foundation Models: Building LLMs from a Modular Perspective.
CoRR, 2024

Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions.
CoRR, 2024

The Prompt Report: A Systematic Survey of Prompting Techniques.
CoRR, 2024

Best Practices and Lessons Learned on Synthetic Data for Language Models.
CoRR, 2024

Design2Code: How Far Are We From Automating Front-End Engineering?
CoRR, 2024

Large Language Models Help Humans Verify Truthfulness - Except When They Are Convincingly Wrong.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

2023
Sub-Character Tokenization for Chinese Pretrained Language Models.
Trans. Assoc. Comput. Linguistics, 2023

Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition.
CoRR, 2023

Mixture of Prompt Experts for Generalizable and Interpretable Question Answering.
CoRR, 2023

Prompting GPT-3 To Be Reliable.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Getting MoRE out of Mixture of Language Model Reasoning Experts.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs Through a Global Prompt Hacking Competition.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Measuring Inductive Biases of In-Context Learning with Underspecified Demonstrations.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Revisiting Calibration for Question Answering.
CoRR, 2022

Re-Examining Calibration: The Case of Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP.
CoRR, 2021

SHUOWEN-JIEZI: Linguistically Informed Tokenizers For Chinese Language Model Pretraining.
CoRR, 2021

Adversarial Training for Machine Reading Comprehension with Virtual Embeddings.
Proceedings of *SEM 2021: The Tenth Joint Conference on Lexical and Computational Semantics, 2021

What's in a Name? Answer Equivalence For Open-Domain Question Answering.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust Finetuning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Benchmarking Robustness of Machine Reading Comprehension Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Better Robustness by More Coverage: Adversarial Training with Mixup Augmentation for Robust Fine-tuning.
CoRR, 2020

CharBERT: Character-aware Pre-trained Language Model.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
What does BERT Learn from Multiple-Choice Reading Comprehension Datasets?
CoRR, 2019

Sentiment Aware Neural Machine Translation.
Proceedings of the 6th Workshop on Asian Translation, 2019


  Loading...