2025

From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models.

Chejian Xu

Wei Ping

CoRR, April, 2025

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models.

Aaron Blakeman

Aarti Basant

Abhinav Khattar

Adithya Renduchintala

Amala Sanjay Deshmukh

Ameya Sunil Mahabaleshwarkar

Maer Rodrigues de Melo

Makesh Narsimhan Sreedhar

Marcin Chochowski

Markus Kliegl

CoRR, April, 2025

Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning.

CoRR, March, 2025

2024

NVLM: Open Frontier-Class Multimodal LLMs.

CoRR, 2024

RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs.

Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Can Public Large Language Models Help Private Cross-device Federated Learning?

Findings of the Association for Computational Linguistics: NAACL 2024, 2024

InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

UNICORN: A Unified Causal Video-Oriented Language-Modeling Framework for Temporal Video-Language Tasks.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

POSTER: Identifying and Mitigating Vulnerabilities in LLM-Integrated Applications.

Proceedings of the 19th ACM Asia Conference on Computer and Communications Security, 2024

2023

Identifying and Mitigating Vulnerabilities in LLM-Integrated Applications.

CoRR, 2023

Can Public Large Language Models Help Private Cross-device Federated Learning?

CoRR, 2023

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models.

Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022

FOCUS: Fairness via Agent-Awareness for Federated Learning on Heterogeneous Data.

CoRR, 2022

Improving Certified Robustness via Statistical Learning with Logical Reasoning.

Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models.

Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

SemAttack: Natural Textual Attacks via Different Semantic Spaces.

Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Certifying Out-of-Domain Generalization for Blackbox Functions.

Proceedings of the International Conference on Machine Learning, 2022

Revisit Systematic Generalization via Meaningful Learning.

Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2022

2021

Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models.

Ahmed Hassan Awadallah

Bo Li

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

G-PATE: Scalable Differentially Private Data Generator via Private Aggregation of Teacher Discriminators.

Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Incorporating External POS Tagger for Punctuation Restoration.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Uncovering the Connections Between Adversarial Transferability and Knowledge Transferability.

Proceedings of the 38th International Conference on Machine Learning, 2021

InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective.

Proceedings of the 9th International Conference on Learning Representations, 2021

Counterfactual Adversarial Learning with Representation Interpolation.

Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

DataLens: Scalable Privacy Preserving Training via Gradient Compression and Aggregation.

CCS '21: 2021 ACM SIGSAC Conference on Computer and Communications Security, Virtual Event, Republic of Korea, November 15, 2021

2020

Towards Evaluating the Robustness of Chinese BERT Classifiers.

CoRR, 2020

End-to-end Robustness for Sensing-Reasoning Machine Learning Pipelines.

CoRR, 2020

Reinforcement-Learning based Portfolio Management with Augmented Asset Movement Prediction States.

CoRR, 2020

T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack.

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Reinforcement-Learning Based Portfolio Management with Augmented Asset Movement Prediction States.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Efficient Task-Specific Data Valuation for Nearest Neighbor Algorithms.

Proc. VLDB Endow., 2019

AdvCodec: Towards A Unified Framework for Adversarial Text Generation.

CoRR, 2019

Towards Efficient Data Valuation Based on the Shapley Value.

Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019