Pengcheng Guo

Orcid: 0000-0003-3237-5088

According to our database1, Pengcheng Guo authored at least 77 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Coarse-to-Fine Structure and Semantic Learning for Single-Sample SAR Image Generation.
Remote. Sens., September, 2024

Single-Channel Blind Source Separation in Wireless Communications: A Complex-Domain Deep Learning Approach.
IEEE Wirel. Commun. Lett., June, 2024

Swing Trend Prediction of Main Guide Bearing in Hydropower Units Based on MFS-DCGNN.
Sensors, June, 2024

Nearshore Ship Detection in PolSAR Images by Integrating Superpixel-Level GP-PNF and Refined Polarimetric Decomposition.
Remote. Sens., March, 2024

DCMAI: A Dynamical Cross-Modal Alignment Interaction Framework for Document Key Information Extraction.
IEEE Trans. Circuits Syst. Video Technol., January, 2024

Distinctive and Natural Speaker Anonymization via Singular Value Transformation-Assisted Matrix.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Optimizing Dysarthria Wake-Up Word Spotting: An End-to-End Approach for SLT 2024 LRDWWS Challenge.
CoRR, 2024

Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models.
CoRR, 2024

Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper.
CoRR, 2024

Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models.
CoRR, 2024

CRMSP: A Semi-supervised Approach for Key Information Extraction with Class-Rebalancing and Merged Semantic Pseudo-Labeling.
CoRR, 2024

Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets.
CoRR, 2024

The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023.
CoRR, 2024

Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

MLCA-AVSR: Multi-Layer Cross Attention Fusion Based Audio-Visual Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2024

Automatic Channel Selection and Spatial Feature Integration for Multi-Channel Speech Recognition Across Various Array Topologies.
Proceedings of the IEEE International Conference on Acoustics, 2024

An Audio-Quality-Based Multi-Strategy Approach For Target Speaker Extraction in the Misp 2023 Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2024

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study.
Proceedings of the IEEE International Conference on Acoustics, 2024

Ship Detection with Optical Image Based on Swin-YOLOv5 Network.
Proceedings of the 2024 5th International Conference on Computing, 2024

Ship-YOLOv5: Ship Target Detection Based on Enhanced Feature Fusion.
Proceedings of the 2024 5th International Conference on Computing, 2024

2023
Timbre-Reserved Adversarial Attack in Speaker Identification.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Jointly optimized design of distributed RS-coded spatial modulation by appropriate selection at the relay.
EURASIP J. Wirel. Commun. Netw., 2023

Timbre-reserved Adversarial Attack in Speaker Identification.
CoRR, 2023

TVDO: Tchebycheff Value-Decomposition Optimization for Multi-Agent Reinforcement Learning.
CoRR, 2023

TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Pseudo-Siamese Network based Timbre-reserved Black-box Adversarial Attack in Speaker Identification.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Research of STF-CKF-SLAM algorithm using Variational Bayes.
Proceedings of the 6th International Conference on Electronics, 2023

VE-KWS: Visual Modality Enhanced End-to-End Keyword Spotting.
Proceedings of the IEEE International Conference on Acoustics, 2023

Distinguishable Speaker Anonymization Based on Formant and Fundamental Frequency Scaling.
Proceedings of the IEEE International Conference on Acoustics, 2023

Preserving Background Sound in Noise-Robust Voice Conversion Via Multi-Task Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

NC-WAMKD: Neighborhood Correction Weight-Adaptive Multi-Teacher Knowledge Distillation for Graph-Based Semi-Supervised Node Classification.
Proceedings of the IEEE International Conference on Acoustics, 2023

The NPU-ASLP System for Audio-Visual Speech Recognition in MISP 2022 Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

Controllability of Windmill Networks.
Proceedings of the Bio-Inspired Computing: Theories and Applications, 2023

Sa-Paraformer: Non-Autoregressive End-To-End Speaker-Attributed ASR.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Characteristic Analysis of the Outer Sheath Circulating Current in a Single-Core AC Submarine Cable System.
Symmetry, 2022

TESSP: Text-Enhanced Self-Supervised Speech Pre-training.
CoRR, 2022

MFCCA: Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario.
CoRR, 2022

NWPU-ASLP System for the VoicePrivacy 2022 Challenge.
CoRR, 2022

MFCCA:Multi-Frame Cross-Channel Attention for Multi-Speaker ASR in Multi-Party Meeting Scenario.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Adaptive Sparse Self-attention for Object Detection.
Proceedings of the International Joint Conference on Neural Networks, 2022

Improving Separation Performance of Wireless Communication Signals with Antenna Angle Adjustment.
Proceedings of the 22nd IEEE International Conference on Communication Technology, 2022

WENETSPEECH: A 10000+ Hours Multi-Domain Mandarin Corpus for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2022

M2Met: The Icassp 2022 Multi-Channel Multi-Party Meeting Transcription Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Jointly optimized design of distributed Reed-Solomon codes by proper selection in relay.
Telecommun. Syst., 2021

Hybrid Time-Scale Optimal Scheduling Considering Multi-Energy Complementary Characteristic.
IEEE Access, 2021

ESPnet-ST IWSLT 2021 Offline Speech Translation System.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Context-aware RNNLM Rescoring for Conversational Speech Recognition.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Adversarial Training for Multi-domain Speaker Recognition.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

MCDALNet: Multi-scale Contextual Dual Attention Learning Network for Medical Image Segmentation.
Proceedings of the International Joint Conference on Neural Networks, 2021

Recent Developments on Espnet Toolkit Boosted By Conformer.
Proceedings of the IEEE International Conference on Acoustics, 2021

Boundary and Context Aware Training for CIF-Based Non-Autoregressive End-to-End ASR.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
DSmT-Based Ultrasonic Detection Model for Estimating Indoor Environment Contour.
IEEE Trans. Instrum. Meas., 2020

The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans.
CoRR, 2020

Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Inaudible Adversarial Perturbations for Targeted Attack in Speaker Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

End-to-End ASR with Adaptive Span Self-Attention.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

A Multi-Scaled Receptive Field Learning Approach for Medical Image Segmentation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Adversarial Regularization for Attention Based End-to-End Robust Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Adversarial Regularization for End-to-End Robust Speaker Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Unsupervised Adaptation with Adversarial Dropout Regularization for Robust Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Natural Scene Text Extraction Approach Based on Generative Adversarial Learning.
Proceedings of the Neural Information Processing - 26th International Conference, 2019

Domain Adversarial Training for Improving Keyword Spotting Performance of ESL Speech.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
Surface defects detection for mobilephone panel workpieces based on machine vision and machine learning.
Proceedings of the IEEE International Conference on Information and Automation, 2017

Signal extraction method in electromagnetic coupling system based on LM-ABC algorithm.
Proceedings of the IEEE International Conference on Information and Automation, 2017

Generalized Space Shift Keying Modulation Combined with Convolutional Coding.
Proceedings of the Communications, Signal Processing, and Systems, 2017

2016
The Impact of Sharing Economy on the Diversification of Tourism Products: Implications for Tourist Experience.
Proceedings of the Information and Communication Technologies in Tourism 2016, 2016


  Loading...