Weizhu Chen
According to our database1,
Weizhu Chen
authored at least 158 papers
between 2007 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena.
CoRR, 2024
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling.
CoRR, 2024
Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts.
CoRR, 2024
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track, 2024
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
Enhancing Large Language Models in Coding Through Multi-Perspective Self-Consistency.
CoRR, 2023
CoRR, 2023
RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation.
CoRR, 2023
Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback.
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation.
Proceedings of the International Conference on Machine Learning, 2023
Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Auto-Encoders.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
GENIUS: Sketch-based Language Model Pre-training via Extreme and Selective Masking for Text Generation and Augmentation.
CoRR, 2022
Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer.
CoRR, 2022
CoRR, 2022
Virtual information core optimization for collaborative filtering recommendation based on clustering and evolutionary algorithms.
Appl. Soft Comput., 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022
OmniTab: Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance.
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models.
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
A Token-level Reference-free Hallucination Detection Benchmark for Free-form Text Generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Proceedings of Deep Learning Inside Out: The 3rd Workshop on Knowledge Extraction and Integration for Deep Learning Architectures, 2022
XLM-K: Improving Cross-Lingual Language Model Pre-training with Multilingual Knowledge.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better Generalization.
CoRR, 2021
CoRR, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale Pretraining.
Proceedings of the 38th International Conference on Machine Learning, 2021
CoDA: Contrast-enhanced and Diversity-promoting Data Augmentation for Natural Language Understanding.
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better Generalizability.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2020
CoRR, 2020
A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation.
CoRR, 2020
Proceedings of the NeurIPS 2020 Competition and Demonstration Track, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020
SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
2019
DSCOVR: Randomized Primal-Dual Block Coordinate Algorithms for Asynchronous Distributed Optimization.
J. Mach. Learn. Res., 2019
Improving Multi-Task Deep Neural Networks via Knowledge Distillation for Natural Language Understanding.
CoRR, 2019
Learning to Attend On Essential Terms: An Enhanced Retriever-Reader Model for Open-domain Question Answering.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
2018
CoRR, 2018
Learning to Attend On Essential Terms: An Enhanced Retriever-Reader Model for Scientific Question Answering.
CoRR, 2018
FusionNet: Fusing via Fully-aware Attention with Application to Machine Comprehension.
Proceedings of the 6th International Conference on Learning Representations, 2018
2017
Limited-memory Common-directions Method for Distributed Optimization and its Application on Empirical Risk Minimization.
Proceedings of the 2017 SIAM International Conference on Data Mining, 2017
2016
Proceedings of the Workshop on Cognitive Computation: Integrating neural and symbolic approaches 2016 co-located with the 30th Annual Conference on Neural Information Processing Systems (NIPS 2016), 2016
2014
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014
2012
Proceedings of the Fifth International Conference on Web Search and Web Data Mining, 2012
Proceedings of the Fifth International Conference on Web Search and Web Data Mining, 2012
Proceedings of the Fifth International Conference on Web Search and Web Data Mining, 2012
2011
Proceedings of the 20th International Conference on World Wide Web, 2011
Proceedings of the Forth International Conference on Web Search and Web Data Mining, 2011
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011
Proceedings of the IJCAI 2011, 2011
Proceedings of the 11th IEEE International Conference on Data Mining, 2011
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011
2010
Proceedings of the 19th International Conference on World Wide Web, 2010
Proceedings of the Third International Conference on Web Search and Web Data Mining, 2010
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010
2009
Proceedings of the ICDM 2009, 2009
Proceedings of the ICDM 2009, 2009
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009
2008
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008
2007
Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007