GEXMERT: Geometrically enhanced cross-modality encoder representations from transformers inspired by higher-order visual percepts.
Pattern Recognit., 2025
Dtsr: detail-enhanced transformer for image super-resolution.
Vis. Comput., November, 2024
MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis.
CoRR, 2024
Werewolf Arena: A Case Study in LLM Evaluation via Social Deduction.
CoRR, 2024
Comprehensive Survey of Model Compression and Speed up for Vision Transformers.
CoRR, 2024
A novel Transformer-based model with large kernel temporal convolution for chemical process fault detection.
Comput. Chem. Eng., 2024
DCD-FPI: A Deformable Convolution-Based Fusion Network for Unmanned Aerial Vehicle Localization.
IEEE Access, 2024
TCSR: Lightweight Transformer and CNN Interaction Network for Image Super-Resolution.
IEEE Access, 2024
TextrolSpeech: A Text Style Control Speech Corpus with Codec Language Text-to-Speech Models.
Proceedings of the IEEE International Conference on Acoustics, 2024
Landslide Hazard Evaluation Based on SSA-BP.
Proceedings of the 2024 4th International Conference on Artificial Intelligence, 2024
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models.
CoRR, 2023
A Dynamic Pricing Strategy for Load Balancing Across Multiple Edge Servers.
Proceedings of the IEEE International Conference on Web Services, 2023
VarietySound: Timbre-Controllable Video to Sound Generation Via Unsupervised Information Disentanglement.
Proceedings of the IEEE International Conference on Acoustics, 2023
LSTM Based Short-Term Data Center Electrical Consumption Forecasting.
Proceedings of the Adjunct Proceedings of the 2023 ACM International Joint Conference on Pervasive and Ubiquitous Computing & the 2023 ACM International Symposium on Wearable Computing, 2023
Predicting unseen antibodies' neutralizability via adaptive graph neural networks.
,
,
,
,
,
,
,
,
,
,
,
,
Nat. Mac. Intell., November, 2022
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation.
CoRR, 2021
DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis.
CoRR, 2021
BridgeDPI: A Novel Graph Neural Network for Predicting Drug-Protein Interactions.
CoRR, 2021
Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Deep Reinforcement Learning Based Task Offloading Strategy Under Dynamic Pricing in Edge Computing.
Proceedings of the Service-Oriented Computing - 19th International Conference, 2021
A unified framework for packing deformable and non-deformable subcellular structures in crowded cryo-electron tomogram simulation.
,
,
,
,
,
,
,
,
,
,
BMC Bioinform., 2020
PUB-SalNet: A Pre-Trained Unsupervised Self-Aware Backpropagation Network for Biomedical Salient Segmentation.
Algorithms, 2020
Complementary Fusion of Multi-Features and Multi-Modalities in Sentiment Analysis.
Proceedings of the 3rd Workshop on Affective Content Analysis (AffCon 2020) co-located with Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020), 2020
Sentiment Analysis using Deep Robust Complementary Fusion of Multi-Features and Multi-Modalities.
CoRR, 2019
An efficient sorting algorithm - Ultimate Heapsort(UHS).
CoRR, 2019
The Application of Bipartite Matching in Assignment Problem.
CoRR, 2019
Audio Sentiment Analysis by Heterogeneous Signal Features Learned from Utterance-Based Parallel Neural Network.
Proceedings of the 2nd Workshop on Affective Content Analysis (AffCon 2019) co-located with Thirty-Third AAAI Conference on Artificial Intelligence (AAAI 2019), 2019
Assessing four Neural Networks on Handwritten Digit Recognition Dataset (MNIST).
CoRR, 2018
Utterance-Based Audio Sentiment Analysis Learned by a Parallel Combination of CNN and LSTM.
CoRR, 2018
Automatic Identification and Presentation of Twitter Content for Planned Events.
Proceedings of the Fifth International Conference on Weblogs and Social Media, 2011