TileLang: A Composable Tiled Programming Model for AI Systems.
,
,
,
,
,
,
,
,
,
,
CoRR, April, 2025
AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms.
,
,
,
,
,
,
,
,
,
,
CoRR, February, 2025
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, January, 2025
Differential Transformer.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Retentive Network: A Successor to Transformer for Large Language Models.
CoRR, 2023
Welder: Scheduling Deep Learning Memory Access via Tile-graph.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023
A Neural Corpus Indexer for Document Retrieval.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2022
ROLLER: Fast and Efficient Tensor Compilation for Deep Learning.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022
A Neural Corpus Indexer for Document Retrieval.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Spatio-temporal Model Checking for 3D Individual-Based Biofilm Simulations.
Proceedings of the From Data to Models and Back, 2021
Scalable Feedback of Spectral Projection for Subspace Learning.
IEEE Signal Process. Lett., 2019
Minimal Sample Subspace Learning: Theory and Algorithms.
J. Mach. Learn. Res., 2019
Manifold Fitting under Unbounded Noise.
CoRR, 2019
Rank-sparsity balanced representation for subspace clustering.
Mach. Vis. Appl., 2018
An Adaptive S-Method to Analyze Micro-Doppler Signals for Human Activity Classification.
Sensors, 2017