From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models.
CoRR, April, 2025
NVLM: Open Frontier-Class Multimodal LLMs.
CoRR, 2024
RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Can Public Large Language Models Help Private Cross-device Federated Learning?
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
UNICORN: A Unified Causal Video-Oriented Language-Modeling Framework for Temporal Video-Language Tasks.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
POSTER: Identifying and Mitigating Vulnerabilities in LLM-Integrated Applications.
Proceedings of the 19th ACM Asia Conference on Computer and Communications Security, 2024
Identifying and Mitigating Vulnerabilities in LLM-Integrated Applications.
CoRR, 2023
Can Public Large Language Models Help Private Cross-device Federated Learning?
CoRR, 2023
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
FOCUS: Fairness via Agent-Awareness for Federated Learning on Heterogeneous Data.
CoRR, 2022
Improving Certified Robustness via Statistical Learning with Logical Reasoning.
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
SemAttack: Natural Textual Attacks via Different Semantic Spaces.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022
Certifying Out-of-Domain Generalization for Blackbox Functions.
Proceedings of the International Conference on Machine Learning, 2022
Revisit Systematic Generalization via Meaningful Learning.
Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2022
Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021
G-PATE: Scalable Differentially Private Data Generator via Private Aggregation of Teacher Discriminators.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Incorporating External POS Tagger for Punctuation Restoration.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Uncovering the Connections Between Adversarial Transferability and Knowledge Transferability.
Proceedings of the 38th International Conference on Machine Learning, 2021
InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective.
Proceedings of the 9th International Conference on Learning Representations, 2021
Counterfactual Adversarial Learning with Representation Interpolation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
DataLens: Scalable Privacy Preserving Training via Gradient Compression and Aggregation.
Proceedings of the CCS '21: 2021 ACM SIGSAC Conference on Computer and Communications Security, Virtual Event, Republic of Korea, November 15, 2021
Towards Evaluating the Robustness of Chinese BERT Classifiers.
CoRR, 2020
End-to-end Robustness for Sensing-Reasoning Machine Learning Pipelines.
CoRR, 2020
Reinforcement-Learning based Portfolio Management with Augmented Asset Movement Prediction States.
CoRR, 2020
T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Reinforcement-Learning Based Portfolio Management with Augmented Asset Movement Prediction States.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Efficient Task-Specific Data Valuation for Nearest Neighbor Algorithms.
Proc. VLDB Endow., 2019
AdvCodec: Towards A Unified Framework for Adversarial Text Generation.
CoRR, 2019
Towards Efficient Data Valuation Based on the Shapley Value.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019