Zhenye Gan

Orcid: 0000-0002-2431-1159

According to our database¹, Zhenye Gan authored at least 35 papers between 2013 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

2014

2016

2018

2020

2022

2024

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

SoftPatch+: Fully unsupervised anomaly classification and segmentation.

[BibT_eX]

[DOI]

Pattern Recognit., 2025

2024

MobileMamba: Lightweight Multi-Receptive Visual Mamba Network.

[BibT_eX]

[DOI]

CoRR, 2024

LLaVA-KD: A Framework of Distilling Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

PSPU: Enhanced Positive and Unlabeled Learning by Leveraging Pseudo Supervision.

[BibT_eX]

[DOI]

CoRR, 2024

ADer: A Comprehensive Benchmark for Multi-class Visual Anomaly Detection.

[BibT_eX]

[DOI]

CoRR, 2024

Efficient Multimodal Large Language Models: A Survey.

[BibT_eX]

[DOI]

CoRR, 2024

Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile Industrial Anomaly Detection.

[BibT_eX]

[DOI]

CoRR, 2024

DMAD: Dual Memory Bank for Real-World Anomaly Detection.

[BibT_eX]

[DOI]

CoRR, 2024

MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

PSPU: Enhanced Positive and Unlabeled Learning by Leveraging Pseudo Supervision.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Learning Hybrid Negative Probability Model for Weakly-Supervised Whole Slide Image Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

TransAVS: End-to-End Audio-Visual Segmentation with Transformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile Industrial Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Speech Recognition of Noisy Tibetan Based on Parallel Branch Structure.

[BibT_eX]

[DOI]

Zhenye Gan

Feilong Zhai

Proceedings of the 2024 4th International Conference on Artificial Intelligence, 2024

Rethinking Reverse Distillation for Multi-Modal Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Hear to Segment: Unmixing the Audio to Guide the Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

MixTeacher: Mining Promising Labels with Mixed Scale Teacher for Semi-Supervised Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Calibrated Teacher for Sparsely Annotated Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

A tibetan-dependent speaker recognition method based on deep learning.

[BibT_eX]

[DOI]

Zhenye Gan

Yue Yu

Min Luo

Multim. Tools Appl., 2022

CFNet: Learning Correlation Functions for One-Stage Panoptic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2022

Iterative Few-shot Semantic Segmentation from Image Label Text.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Learning Distinctive Margin toward Active Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

ISDNet: Integrating Shallow and Deep Networks for Efficient Ultra-high Resolution Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2019

Study on the Tones Biases of Mandarin Speaker in Amdo Tibetan Areas Based on Statistics.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018

Perception and Production of Mandarin Monosyllabic Tones by Amdo Tibetan College Students.

[BibT_eX]

[DOI]

Zhenye Gan

Jiafang Han

Hongwu Yang

Proceedings of the Natural Language Processing and Chinese Computing, 2018

Mandarin-Tibetan Cross-Lingual Voice Conversion System Based on Deep Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 2018 2nd International Conference on Computer Science and Artificial Intelligence, 2018

A DNN-based Mandarin-Tibetan cross-lingual speech synthesis.

[BibT_eX]

[DOI]

Weitong Guo

Hongwu Yang

Zhenye Gan

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017

Towards Realizing Mandarin-Tibetan Bi-lingual Emotional Speech Synthesis with Mandarin Emotional Training Corpus.

[BibT_eX]

[DOI]

Peiwen Wu

Hongwu Yang

Zhenye Gan

Proceedings of the Data Science, 2017

Improved CNN-based facial landmarks tracking via ridge regression at 150 Fps on mobile devices.

[BibT_eX]

[DOI]

Proceedings of the 10th International Congress on Image and Signal Processing, 2017

2016

Towards Realizing Sign Language-to-Speech Conversion by Combining Deep Learning and Statistical Parametric Speech Synthesis.

[BibT_eX]

[DOI]

Xiaochun An

Hongwu Yang

Zhenye Gan

Proceedings of the Social Computing, 2016

Research on text analysis for Tibetan statistical parametric speech synthesis.

[BibT_eX]

[DOI]

Zhenye Gan

Xinjie Kong

Shuai Zhang

Proceedings of the 9th International Congress on Image and Signal Processing, 2016

2015

Using speaker adaptive training to realize Mandarin-Tibetan cross-lingual speech synthesis.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2015

2014

Realizing speech enhancement by combining EEMD and K-SVD dictionary training algorithm.

[BibT_eX]

[DOI]

Hao Chen

Zhenye Gan

Hongwu Yang

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

2013

Realizing Tibetan speech synthesis by speaker adaptive training.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Zhenye Gan

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...