Sachin Kumar
Affiliation: Carnegie Mellon University, Pittsburgh, PA, USA
According to our database, Sachin Kumar authored at least 34 papers between 2017 and 2024.
Bibliography
2024
CoRR, 2024
MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization.
CoRR, 2024
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models.
CoRR, 2024
P³Sum: Preserving Author's Perspective in News Summarization with Diffusion Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
David helps Goliath: Inference-Time Collaboration Between Small Specialized and Large General Diffusion LMs.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Gen-Z: Generative Zero-Shot Text Classification with Contextualized Label Descriptions.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization.
CoRR, 2023
Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023
Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
CoRR, 2022
Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022
Referee: Reference-Free Sentence Summarization with Sharper Controllability through Symbolic Knowledge Distillation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
2021
CoRR, 2021
An Exploration of Data Augmentation Techniques for Improving English to Tigrinya Translation.
Proceedings of the 2nd AfricaNLP Workshop Proceedings, AfricaNLP@EACL 2021, Virtual Event, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2020
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Proceedings of the "I Can't Believe It's Not Better!" at NeurIPS Workshops, 2020
A Deep Reinforced Model for Zero-Shot Cross-Lingual Summarization with Bilingual Semantic Similarity Rewards.
Proceedings of the Fourth Workshop on Neural Generation and Translation, 2020
2019
Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs.
Proceedings of the 7th International Conference on Learning Representations, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
A Margin-based Loss with Synthetic Negative Samples for Continuous-output Machine Translation.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019
2018
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018
2017
Earth Mover's Distance Pooling over Siamese LSTMs for Automatic Short Answer Grading.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017