2025
Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model.
CoRR, June, 2025

Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction.
CoRR, February, 2025

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model.
CoRR, February, 2025

2024
Simultaneous Identification of Sparse Structures and Communities in Heterogeneous Graphical Models.
CoRR, 2024

2020
Fine-Grained Lung Cancer Classification from PET and CT Images Based on Multidimensional Attention Mechanism.
Complex., 2020

Multicenter Computer-Aided Diagnosis for Lymph Nodes Using Unsupervised Domain-Adaptation Networks Based on Cross-Domain Confounding Representations.
Comput. Math. Methods Medicine, 2020

Multi-Type Interdependent Feature Analysis Based on Hybrid Neural Networks for Computer-Aided Diagnosis of Epidermal Growth Factor Receptor Mutations.
IEEE Access, 2020

2019
Multi-level features combined end-to-end learning for automated pathological grading of breast cancer on digital mammograms.
Comput. Medical Imaging Graph., 2019

2017
Eyes Understand the Sketch!: Gaze-Aided Stroke Grouping of Hand-Drawn Flowcharts.
Proceedings of the 22nd International Conference on Intelligent User Interfaces, 2017

2015
On-line Handwritten Flowchart Recognition Based on Grammar Description Language (基于语法描述语言的在线手绘流程图识别).
计算机科学 (Computer Science), 2015

2011
A resting-state fMRI study of patients with HIV infection based on regional homogeneity method.
Proceedings of the Seventh International Conference on Natural Computation, 2011