Pengcheng Guo

This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.

Bibliography

2025

OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia.

[BibT_eX]

[DOI]

CoRR, January, 2025

DiffAttack: Diffusion-based Timbre-reserved Adversarial Attack in Speaker Identification.

[BibT_eX]

[DOI]

CoRR, January, 2025

CRMSP: A semi-supervised approach for key information extraction with Class-Rebalancing and Merged Semantic Pseudo-Labeling.

[BibT_eX]

[DOI]

Neurocomputing, 2025

Intelligent Recognition for Operation States of Hydroelectric Generating Units Based on Data Fusion and Visualization Analysis.

[BibT_eX]

[DOI]

Int. J. Intell. Syst., 2025

2024

PLBR: A Semi-Supervised Document Key Information Extraction via Pseudo-Labeling Bias Rectification.

[BibT_eX]

[DOI]

IEEE Trans. Knowl. Data Eng., December, 2024

IoT-ReliableComm: A Self-Supervised Approach to Signal Transmission Reliability in Interconnected Consumer Electronics.

[BibT_eX]

[DOI]

IEEE Trans. Consumer Electron., November, 2024

Coarse-to-Fine Structure and Semantic Learning for Single-Sample SAR Image Generation.

[BibT_eX]

[DOI]

Remote. Sens., September, 2024

Single-Channel Blind Source Separation in Wireless Communications: A Complex-Domain Deep Learning Approach.

[BibT_eX]

[DOI]

IEEE Wirel. Commun. Lett., June, 2024

Swing Trend Prediction of Main Guide Bearing in Hydropower Units Based on MFS-DCGNN.

[BibT_eX]

[DOI]

Xu Li

Zhuofei Xu

Pengcheng Guo

Sensors, June, 2024

Nearshore Ship Detection in PolSAR Images by Integrating Superpixel-Level GP-PNF and Refined Polarimetric Decomposition.

[BibT_eX]

[DOI]

Remote. Sens., March, 2024

DCMAI: A Dynamical Cross-Modal Alignment Interaction Framework for Document Key Information Extraction.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., January, 2024

Distinctive and Natural Speaker Anonymization via Singular Value Transformation-Assisted Matrix.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2024

SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASR.

[BibT_eX]

[DOI]

CoRR, 2024

Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper.

[BibT_eX]

[DOI]

CoRR, 2024

Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023.

[BibT_eX]

[DOI]

CoRR, 2024

Optimizing Dysarthria Wake-Up Word Spotting: an End-to-End Approach For SLT 2024 LRDWWS Challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Mining Relational Similarity in Social Networks for Enhanced Recommendations.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2024

Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets.

[BibT_eX]

[DOI]

Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

MLCA-AVSR: Multi-Layer Cross Attention Fusion Based Audio-Visual Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Automatic Channel Selection and Spatial Feature Integration for Multi-Channel Speech Recognition Across Various Array Topologies.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

An Audio-Quality-Based Multi-Strategy Approach For Target Speaker Extraction in the Misp 2023 Challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Ship Detection with Optical Image Based on Swin-YOLOv5 Network.

[BibT_eX]

[DOI]

Proceedings of the 2024 5th International Conference on Computing, 2024

Ship-YOLOv5: Ship Target Detection Based on Enhanced Feature Fusion.

[BibT_eX]

[DOI]

Proceedings of the 2024 5th International Conference on Computing, 2024

2023

Timbre-Reserved Adversarial Attack in Speaker Identification.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Jointly optimized design of distributed RS-coded spatial modulation by appropriate selection at the relay.

[BibT_eX]

[DOI]

EURASIP J. Wirel. Commun. Netw., 2023

Timbre-reserved Adversarial Attack in Speaker Identification.

[BibT_eX]

[DOI]

CoRR, 2023

TVDO: Tchebycheff Value-Decomposition Optimization for Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2023

TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Pseudo-Siamese Network based Timbre-reserved Black-box Adversarial Attack in Speaker Identification.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Research of STF-CKF-SLAM algorithm using Variational Bayes.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Electronics, 2023

VE-KWS: Visual Modality Enhanced End-to-End Keyword Spotting.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Distinguishable Speaker Anonymization Based on Formant and Fundamental Frequency Scaling.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Preserving Background Sound in Noise-Robust Voice Conversion Via Multi-Task Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

NC-WAMKD: Neighborhood Correction Weight-Adaptive Multi-Teacher Knowledge Distillation for Graph-Based Semi-Supervised Node Classification.

[BibT_eX]

[DOI]

Jiahao Liu

Pengcheng Guo

Yonghong Song

Proceedings of the IEEE International Conference on Acoustics, 2023

The NPU-ASLP System for Audio-Visual Speech Recognition in MISP 2022 Challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Controllability of Windmill Networks.

[BibT_eX]

[DOI]

Proceedings of the Bio-Inspired Computing: Theories and Applications, 2023

Sa-Paraformer: Non-Autoregressive End-To-End Speaker-Attributed ASR.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

Characteristic Analysis of the Outer Sheath Circulating Current in a Single-Core AC Submarine Cable System.

[BibT_eX]

[DOI]

Peng Li

Pengcheng Guo

Symmetry, 2022

TESSP: Text-Enhanced Self-Supervised Speech Pre-training.

[BibT_eX]

[DOI]

CoRR, 2022

MFCCA: Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario.

[BibT_eX]

[DOI]

CoRR, 2022

NWPU-ASLP System for the VoicePrivacy 2022 Challenge.

[BibT_eX]

[DOI]

CoRR, 2022

MFCCA:Multi-Frame Cross-Channel Attention for Multi-Speaker ASR in Multi-Party Meeting Scenario.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism.

[BibT_eX]

[DOI]

Kun Wei

Pengcheng Guo

Ning Jiang

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Adaptive Sparse Self-attention for Object Detection.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2022

Improving Separation Performance of Wireless Communication Signals with Antenna Angle Adjustment.

[BibT_eX]

[DOI]

Miaomiao Gu

Pengcheng Guo

Miao Yu

Proceedings of the 22nd IEEE International Conference on Communication Technology, 2022

WENETSPEECH: A 10000+ Hours Multi-Domain Mandarin Corpus for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

M2Met: The Icassp 2022 Multi-Channel Multi-Party Meeting Transcription Challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Jointly optimized design of distributed Reed-Solomon codes by proper selection in relay.

[BibT_eX]

[DOI]

Telecommun. Syst., 2021

Hybrid Time-Scale Optimal Scheduling Considering Multi-Energy Complementary Characteristic.

[BibT_eX]

[DOI]

IEEE Access, 2021

ESPnet-ST IWSLT 2021 Offline Speech Translation System.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Context-aware RNNLM Rescoring for Conversational Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Adversarial Training for Multi-domain Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

MCDALNet: Multi-scale Contextual Dual Attention Learning Network for Medical Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2021

Recent Developments on Espnet Toolkit Boosted By Conformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Boundary and Context Aware Training for CIF-Based Non-Autoregressive End-to-End ASR.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Aswin Shanmugam Subramanian

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

DSmT-Based Ultrasonic Detection Model for Estimating Indoor Environment Contour.

[BibT_eX]

[DOI]

IEEE Trans. Instrum. Meas., 2020

The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans.

[BibT_eX]

[DOI]

Aswin Shanmugam Subramanian

Wangyou Zhang

CoRR, 2020

Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Inaudible Adversarial Perturbations for Targeted Attack in Speaker Recognition.

[BibT_eX]

[DOI]

Qing Wang

Pengcheng Guo

Lei Xie

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

End-to-End ASR with Adaptive Span Self-Attention.

[BibT_eX]

[DOI]

Xuankai Chang

Aswin Shanmugam Subramanian

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

A Multi-Scaled Receptive Field Learning Approach for Medical Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Adversarial Regularization for Attention Based End-to-End Robust Speech Recognition.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Adversarial Regularization for End-to-End Robust Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Unsupervised Adaptation with Adversarial Dropout Regularization for Robust Speech Recognition.

[BibT_eX]

[DOI]

Pengcheng Guo

Sining Sun

Lei Xie

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Natural Scene Text Extraction Approach Based on Generative Adversarial Learning.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 26th International Conference, 2019

Domain Adversarial Training for Improving Keyword Spotting Performance of ESL Speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017

Surface defects detection for mobilephone panel workpieces based on machine vision and machine learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Information and Automation, 2017

Signal extraction method in electromagnetic coupling system based on LM-ABC algorithm.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Information and Automation, 2017

Generalized Space Shift Keying Modulation Combined with Convolutional Coding.

[BibT_eX]

[DOI]

Pengcheng Guo

Bingyin Ren

Yongxin Zhang

Proceedings of the Communications, Signal Processing, and Systems, 2017

2016

The Impact of Sharing Economy on the Diversification of Tourism Products: Implications for Tourist Experience.

[BibT_eX]

[DOI]

Proceedings of the Information and Communication Technologies in Tourism 2016, 2016

Pengcheng Guo

Bibliography

Loading...