Long Zhou

Orcid: 0009-0008-6579-4469

According to our database1, Long Zhou authored at least 105 papers between 2009 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
A Novel Tongue Coating Segmentation Method Based on Improved TransUNet.
Sensors, July, 2024

Measuring Villagers' Perceptions of Changes in the Landscape Values of Traditional Villages.
ISPRS Int. J. Geo Inf., February, 2024

VatLM: Visual-Audio-Text Pre-Training With Unified Masked Prediction for Speech Representation Learning.
IEEE Trans. Multim., 2024

SpeechLM: Enhanced Speech Pre-Training With Unpaired Textual Data.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

VioLA: Conditional Language Models for Speech Recognition, Synthesis, and Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Identification of Typhoon-Vulnerable Areas and Countermeasures in High-Density Coastal Cities: The Case of Macau.
ISPRS Int. J. Geo Inf., 2024

ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation.
CoRR, 2024

NDVQ: Robust Neural Audio Codec with Normal Distribution-Based Vector Quantization.
CoRR, 2024

Investigating Neural Audio Codecs for Speech Language Model-Based Speech Generation.
CoRR, 2024

Autoregressive Speech Synthesis without Vector Quantization.
CoRR, 2024

VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment.
CoRR, 2024

VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers.
CoRR, 2024

TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation.
CoRR, 2024

CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations.
CoRR, 2024

WavLLM: Towards Robust and Adaptive Speech Large Language Model.
CoRR, 2024

Boosting Large Language Model for Speech Synthesis: An Empirical Study.
CoRR, 2024

Multi-attention Fusion for Multimodal Sentiment Classification.
Proceedings of 2024 ACM ICMR Workshop on Multimodal Video Retrieval, 2024

WavLLM: Towards Robust and Adaptive Speech Large Language Model.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Zero-Shot Compound Fault Diagnosis Method Based on Semantic Learning and Discriminative Features.
IEEE Trans. Instrum. Meas., 2023

Diffusion Conditional Expectation Model for Efficient and Robust Target Speech Extraction.
CoRR, 2023

VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation.
CoRR, 2023

ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation.
CoRR, 2023

Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling.
CoRR, 2023

Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers.
CoRR, 2023

Vehicle Retarders: A Review.
IEEE Access, 2023

ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LAMASSU: A Streaming Language-Agnostic Multilingual Speech Recognition and Translation Model Using Neural Transducers.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Design and Implementation of a Game Session Plugin based on Unreal Engine.
Proceedings of the 8th International Conference on Cyber Security and Information Engineering, 2023

Robust Data2VEC: Noise-Robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Prosody-Aware Speecht5 for Expressive Neural TTS.
Proceedings of the IEEE International Conference on Acoustics, 2023

Research and analysis of big data based on decision model.
Proceedings of the 2023 7th International Conference on Electronic Information Technology and Computer Engineering, 2023

Fault diagnosis of rolling bearing based on optimized Tunable-Q Wavelet Transform.
Proceedings of the CAA Symposium on Fault Detection, 2023

On Decoder-Only Architecture For Speech-to-Text and Large Language Model Integration.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Building High-Accuracy Multilingual ASR With Gated Language Experts and Curriculum Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing.
IEEE J. Sel. Top. Signal Process., 2022

Zero-shot learning for compound fault diagnosis of bearings.
Expert Syst. Appl., 2022

LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers.
CoRR, 2022

The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task.
CoRR, 2022

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Speech Pre-training with Acoustic Piece.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Configurable Multilingual Model is All You Need to Recognize All Languages.
Proceedings of the IEEE International Conference on Acoustics, 2022

Multi-View Self-Attention Based Transformer for Speaker Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing.
CoRR, 2021

SpeechT5: Unified-Modal Encoder-Decoder Pre-training for Spoken Language Processing.
CoRR, 2021

CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Zero-shot learning compound fault diagnosis of bearings.
Proceedings of the International Joint Conference on Neural Networks, 2021

GraphCodeBERT: Pre-training Code Representations with Data Flow.
Proceedings of the 9th International Conference on Learning Representations, 2021

Jointly Learning to Repair Code and Generate Commit Message.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Study of Microstructure Evolution of Austenitic Steel after Long-term Service.
Proceedings of the AIAM 2021: 3rd International Conference on Artificial Intelligence and Advanced Manufacture, Manchester, United Kingdom, October 23, 2021

Grammar-Based Patches Generation for Automated Program Repair.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

SemFace: Pre-training Encoder and Decoder with a Semantic Interface for Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Deep Neural Network-based Machine Translation System Combination.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2020

Supervised learning with cyclegan for low-dose FDG PET image denoising.
Medical Image Anal., 2020

Visualizing the USA's Maritime Freight Flows Using DM, LP, and AON in GIS.
ISPRS Int. J. Geo Inf., 2020

CodeBLEU: a Method for Automatic Evaluation of Code Synthesis.
CoRR, 2020

Synchronous bidirectional inference for neural sequence generation.
Artif. Intell., 2020

Non-autoregressive Neural Machine Translation with Distortion Model.
Proceedings of the Natural Language Processing and Chinese Computing, 2020

CASIA's System for IWSLT 2020 Open Domain Translation.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020

Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Synchronous Bidirectional Neural Machine Translation.
Trans. Assoc. Comput. Linguistics, 2019

Interference Suppression of Partially Overlapped Signals Using GSVD and Orthogonal Projection.
IEICE Trans. Commun., 2019

Power Grid Enterprise Intelligent Risk Identification Model Considering Multi-Attribute and Low Correlation Data.
IEEE Access, 2019

Sequence Generation: From Both Sides to the Middle.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Synchronously Generating Two Languages with Interactive Decoding.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

A Compact and Language-Sensitive Multilingual Translation Method.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Language-Independent Representor for Neural Machine Translation.
CoRR, 2018

A Comparable Study on Model Averaging, Ensembling and Reranking in NMT.
Proceedings of the Natural Language Processing and Chinese Computing, 2018

2017
The analysis on college students' physical fitness testing data - two cases study.
Proceedings of the International Conference on Security, Pattern Analysis, and Cybernetics, 2017

Augmenting Neural Sentence Summarization Through Extractive Summarization.
Proceedings of the Natural Language Processing and Chinese Computing, 2017

Look-Ahead Attention for Generation in Neural Machine Translation.
Proceedings of the Natural Language Processing and Chinese Computing, 2017

Fault analysis of power transmission line in a generalized state-space model perspective.
Proceedings of the IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society, Beijing, China, October 29, 2017

Word, Subword or Character? An Empirical Study of Granularity in Chinese-English NMT.
Proceedings of the Machine Translation - 13th China Workshop, 2017

Neural System Combination for Machine Translation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Stochastic Petri Net-based performance evaluation of hybrid traffic for social networks system.
Neurocomputing, 2016

Improved classification and regression tree based omni-directional wheelchair control with eye movement.
Proceedings of the IEEE International Conference on Information and Automation, 2016

Design of an eye movement-controlled wheelchair using Kalman filter algorithm.
Proceedings of the IEEE International Conference on Information and Automation, 2016

An End-to-End Chinese Discourse Parser with Adaptation to Explicit and Non-explicit Relation Recognition.
Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning: Shared Task, 2016

The Detecting Method of Building Deformation Based on Terrestrial Laser Point Cloud.
Proceedings of the 12th International Conference on Computational Intelligence and Security, 2016

2015
Monitoring the Deformation of the Facade of a Building Based on Terrestrial Laser Point-Cloud.
Proceedings of the 11th International Conference on Computational Intelligence and Security, 2015

2014
Multi-scale sparse denoising model based on non-separable wavelet.
Proceedings of the Proceedings IEEE International Conference on Security, 2014

A new GPR image de-nosing method based on BEMD.
Proceedings of the Proceedings IEEE International Conference on Security, 2014

Approaches to grey state modeling and modal control of complicated systems.
Proceedings of the Proceedings IEEE International Conference on Security, 2014

Traffic congestion judgment based on linear spatial pyramid matching using sparse coding.
Proceedings of the 10th International Conference on Natural Computation, 2014

Rapid vehicle edge detection based on cellular neural network.
Proceedings of the 10th International Conference on Natural Computation, 2014

2013
Wavelet de-noising techniques with power spectral density to vibration signal.
Kybernetes, 2013

Chaos Multiscale-Synchronization between Two Different fractional-Order hyperchaotic Systems Based on Feedback Control.
Int. J. Bifurc. Chaos, 2013

2012
Research on Smart Grid analysis based on informatization.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2012

The instantaneous frequency extraction of GPR B-scan data based on HHT method.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2012

2011
Clustering Based Image Denoising Using SURE-LET.
Proceedings of the Seventh International Conference on Computational Intelligence and Security, 2011

Cell Nuclei Detection in Histopathological Images by Using Multi-curvature Edge Cue.
Proceedings of the Seventh International Conference on Computational Intelligence and Security, 2011

2010
Writer identification using fractal dimension of wavelet subbands in gabor domain.
Integr. Comput. Aided Eng., 2010

Retinal Blood Vessels Segmentation Using the Radial Projection and Supervised Classification.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Nonseparable wavelet domain BPCA for face recognition.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2010

Impact of smart metering on energy efficiency.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2010

Artificial neural network for load forecasting in smart grid.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2010

A RBF network for short - Term Load forecast on microgrid.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2010

Long digital straight segments for fingerprint matching.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2010

Extracting corner-cue feature to improve minutiae-matching accuracy.
Proceedings of the International Conference on Image Processing, 2010

Image Denoising Using Nonseparable Wavelet and SURE-LET.
Proceedings of the 2010 International Conference on Computational Intelligence and Security, 2010

Fingerprint enhancement based on non-separable wavelet.
Proceedings of the 9th IEEE International Conference on Cognitive Informatics, 2010

2009
Application of Simulated Annealing Algorithm in Pest Image Segmentation.
Proceedings of the 2009 Second International Symposium on Computational Intelligence and Design, 2009


  Loading...