Sequence Transferability and Task Order Selection in Continual Learning.
CoRR, February, 2025
CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding & Reasoning Capabilities of CodeLLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Continual Learning, Fast and Slow.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2024
LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models.
CoRR, 2024
CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding Capabilities of CodeLLMs.
CoRR, 2024
CompeteSMoE - Effective Training of Sparse Mixture of Experts via Competition.
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Class-incremental Learning for Time Series: Benchmark and Evaluation.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024
Who's Who: Large Language Models Meet Knowledge Conflicts in Practice.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts.
CoRR, 2023
On the Out of Distribution Robustness of Foundation Models in Medical Image Segmentation.
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
Learning Fast and Slow for Online Time Series Forecasting.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Adaptive-saturated RNN: Remember more with less instability.
Proceedings of the First Tiny Papers Track at ICLR 2023, 2023
HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
TATL: Task agnostic transfer learning for skin attributes detection.
Medical Image Anal., 2022
Continual Normalization: Rethinking Batch Normalization for Online Continual Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022
DualNet: Continual Learning, Fast and Slow.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
An Efficient Transformer-Based Model for Vietnamese Punctuation Prediction.
Proceedings of the Advances and Trends in Artificial Intelligence. From Theory to Practice, 2021
Contextual Transformation Networks for Online Continual Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021
Bilevel Continual Learning.
CoRR, 2020
Extracting Entities and Topics from News and Connecting Criminal Records.
CoRR, 2020
Vietnamese Punctuation Prediction Using Deep Neural Networks.
Proceedings of the SOFSEM 2020: Theory and Practice of Computer Science, 2020
URLNet: Learning a URL Representation with Deep Learning for Malicious URL Detection.
CoRR, 2018
Online Deep Learning: Learning Deep Neural Networks on the Fly.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Vietnamese food recognition using convolutional neural networks.
Proceedings of the 9th International Conference on Knowledge and Systems Engineering, 2017
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2010