Shuohang Wang

According to our database1, Shuohang Wang authored at least 77 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Knowledge-augmented Methods for Natural Language Processing
Springer Briefs in Computer Science, Springer, ISBN: 978-981-97-0749-2, 2024

LoRC: Low-Rank Compression for LLMs KV Cache with a Progressive Compression Strategy.
CoRR, 2024

GRIN: GRadient-INformed MoE.
CoRR, 2024

Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning.
CoRR, 2024

Self-Exploring Language Models: Active Preference Elicitation for Online Alignment.
CoRR, 2024

Multi-LoRA Composition for Image Generation.
CoRR, 2024

SciAgent: Tool-augmented Language Models for Scientific Reasoning.
CoRR, 2024

Temperature-Centric Investigation of Speculative Decoding with Knowledge Distillation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

SciAgent: Tool-augmented Language Models for Scientific Reasoning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

PEARL: Prompting Large Language Models to Plan and Execute Actions Over Long Documents.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Small Models are Valuable Plug-ins for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions.
CoRR, 2023

PEARL: Prompting Large Language Models to Plan and Execute Actions Over Long Documents.
CoRR, 2023

LMGQS: A Large-scale Dataset for Query-focused Summarization.
CoRR, 2023

G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment.
CoRR, 2023

Sparse Modular Activation for Efficient Sequence Modeling.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Prompting GPT-3 To Be Reliable.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Generate rather than Retrieve: Large Language Models are Strong Context Generators.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

InheritSumm: A General, Versatile and Compact Summarizer by Distilling from GPT.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

LMGQS: A Large-scale Dataset for Query-focused Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

G-Eval: NLG Evaluation using Gpt-4 with Better Human Alignment.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

In-Context Demonstration Selection with Cross Entropy Difference.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

APOLLO: A Simple Approach for Adaptive Pretraining of Language Models for Logical Reasoning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Empowering Language Models with Knowledge Graph Reasoning for Question Answering.
CoRR, 2022

Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

ParaTag: A Dataset of Paraphrase Tagging for Fine-Grained Labels, NLG Evaluation, and Data Augmentation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Empowering Language Models with Knowledge Graph Reasoning for Open-Domain Question Answering.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Retrieval Augmentation for Commonsense Reasoning: A Unified Approach.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Task Compass: Scaling Multi-task Pre-training with Task Prefix.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

AdaPrompt: Adaptive Model Training for Prompt-based NLP.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

CLIP-Event: Connecting Text and Images with Event Structures.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

An Empirical Study of Training End-to-End Vision-and-Language Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

KG-FiD: Infusing Knowledge Graph in Fusion-in-Decoder for Open-Domain Question Answering.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Training Data is More Valuable than You Think: A Simple and Effective Method by Retrieving from Training Data.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Leveraging Knowledge in Multilingual Commonsense Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Dict-BERT: Enhancing Language Model Pre-training with Dictionary.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Playing Lottery Tickets with Vision and Language.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
MLP Architectures for Vision-and-Language Modeling: An Empirical Study.
CoRR, 2021

Playing Lottery Tickets with Vision and Language.
CoRR, 2021

Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

The Elastic Lottery Ticket Hypothesis.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective.
Proceedings of the 9th International Conference on Learning Representations, 2021

NOAHQA: Numerical Reasoning with Interpretable Graph Question Answering Dataset.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Want To Reduce Labeling Cost? GPT-3 Can Help.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

UC2: Universal Cross-Lingual Cross-Modal Vision-and-Language Pre-Training.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

On Orthogonality Constraints for Transformers.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Cluster-Former: Clustering-based Sparse Transformer for Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

FILTER: An Enhanced Fusion Method for Cross-lingual Language Understanding.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Counterfactual Variable Control for Robust and Interpretable Question Answering.
CoRR, 2020

Cluster-Former: Clustering-based Sparse Transformer for Long-Range Dependency Encoding.
CoRR, 2020

Accelerating Real-Time Question Answering via Question Generation.
CoRR, 2020

T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Cross-Thought for Sentence Encoder Pre-training.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Contrastive Distillation on Intermediate Representations for Language Model Compression.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Hierarchical Graph Network for Multi-hop Question Answering.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Multi-Fact Correction in Abstractive Text Summarization.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Multi-Level Head-Wise Match and Aggregation in Transformer for Textual Sequence Matching.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Knowledge Base Question Answering With a Matching-Aggregation Model and Question-Specific Contextual Relations.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

What does BERT Learn from Multiple-Choice Reading Comprehension Datasets?
CoRR, 2019

Compositional De-Attention Networks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Unsupervised Deep Structured Semantic Models for Commonsense Reasoning.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Knowledge Base Question Answering with Topic Units.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Multi-hop Knowledge Base Question Answering with an Iterative Sequence Matching Model.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Lightweight and Efficient Neural Natural Language Processing with Quaternion Networks.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Simple and Effective Curriculum Pointer-Generator Networks for Reading Comprehension over Long Narratives.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Evidence Aggregation for Answer Re-Ranking in Open-Domain Question Answering.
Proceedings of the 6th International Conference on Learning Representations, 2018

A Co-Matching Model for Multi-choice Reading Comprehension.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

R<sup>3</sup>: Reinforced Ranker-Reader for Open-Domain Question Answering.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
R<sup>3</sup>: Reinforced Reader-Ranker for Open-Domain Question Answering.
CoRR, 2017

Machine Comprehension Using Match-LSTM and Answer Pointer.
Proceedings of the 5th International Conference on Learning Representations, 2017

A Compare-Aggregate Model for Matching Text Sequences.
Proceedings of the 5th International Conference on Learning Representations, 2017

2016
Learning Natural Language Inference with LSTM.
Proceedings of the NAACL HLT 2016, 2016


  Loading...