Shixiong Zhang

This is a disambiguation page; it lists papers by multiple people with the same or a similar name.

Bibliography

2024
Kinematics inverse solution of assembly robot based on improved particle swarm optimization.
Robotica, March, 2024

E-detector: Asynchronous Spatio-temporal for Event-based Object Detection in Intelligent Transportation System.
ACM Trans. Multim. Comput. Commun. Appl., February, 2024

Recovery-Based Occluded Face Recognition by Identity-Guided Inpainting.
Sensors, January, 2024

MotifHub: Detection of trans-acting DNA motif group with probabilistic modeling algorithm.
Comput. Biol. Medicine, January, 2024

A Surveillance System for Urban Utility Tunnel Subject to Third-Party Threats Based on Fiber-Optic DAS and FPN-BiLSTM Network.
IEEE Trans. Instrum. Meas., 2024

SECap: Speech Emotion Captioning with Large Language Model.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Bounding convolutional network for refining object locations.
Neural Comput. Appl., September, 2023

Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

RIR-SF: Room Impulse Response Based Spatial Feature for Multi-channel Multi-talker ASR.
CoRR, 2023

M3-AUDIODEC: Multi-channel multi-speaker multi-spatial audio codec.
CoRR, 2023

Survey on Controlable Image Synthesis with Deep Learning.
CoRR, 2023

3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty.
CoRR, 2023

A multi-source signals separation algorithm for identifying the threatening signals applied in Fiber-Optic Distributed Acoustic Sensor.
Proceedings of the Optical Fiber Communications Conference and Exhibition, 2023

Zoneformer: On-device Neural Beamformer For In-car Multi-zone Speech Separation, Enhancement and Echo Cancellation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

Deep Neural Mel-Subband Beamformer for in-Car Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Neuralecho: Hybrid of Full-Band and Sub-Band Recurrent Neural Network For Acoustic Echo Cancellation and Speech Enhancement.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
SwapInpaint: Identity-Specific Face Inpainting With Identity Swapping.
IEEE Trans. Circuits Syst. Video Technol., 2022

EVtracker: An Event-Driven Spatiotemporal Method for Dynamic Object Tracking.
Sensors, 2022

NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement.
CoRR, 2022

scWMC: weighted matrix completion-based imputation of scRNA-seq data via prior subspace information.
Bioinform., 2022

DeepMotifSyn: a deep learning approach to synthesize heterodimeric DNA motifs.
Briefings Bioinform., 2022

High-throughput single-cell RNA-seq data imputation and characterization with surrogate-assisted automated deep learning.
Briefings Bioinform., 2022

EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Joint Neural AEC and Beamforming with Double-Talk Detection.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization.
Proceedings of the IEEE International Conference on Acoustics, 2022

Consistent Training and Decoding for End-to-End Speech Recognition Using Lattice-Free MMI.
Proceedings of the IEEE International Conference on Acoustics, 2022

Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature.
Proceedings of the IEEE International Conference on Acoustics, 2022

Fast-Rir: Fast Neural Diffuse Room Impulse Response Generator.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Nature-Inspired Compressed Sensing for Transcriptomic Profiling From Random Composite Measurements.
IEEE Trans. Cybern., 2021

Multiobjective Genome-Wide RNA-Binding Event Identification From CLIP-Seq Data.
IEEE Trans. Cybern., 2021

Evolving Transcriptomic Profiles From Single-Cell RNA-Seq Data Using Nature-Inspired Multiobjective Optimization.
IEEE ACM Trans. Comput. Biol. Bioinform., 2021

Multi-Channel Multi-Frame ADL-MVDR for Target Speech Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Audio-Visual Multi-Channel Integration and Recognition of Overlapped Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Complex Neural Spatial Filter: Enhancing Multi-Channel Target Speech Separation in Complex Domain.
IEEE Signal Process. Lett., 2021

Joint AEC AND Beamforming with Double-Talk Detection using RNN-Transformer.
CoRR, 2021

Generalized RNN beamformer for target speech separation.
CoRR, 2021

Elucidating transcriptomic profiles from single-cell RNA sequencing data using nature-inspired compressed sensing.
Briefings Bioinform., 2021

Deep embedded clustering with multiple objectives on scRNA-seq data.
Briefings Bioinform., 2021

Identification of pan-cancer Ras pathway activation with deep learning.
Briefings Bioinform., 2021

WPD++: An Improved Neural Beamformer for Simultaneous Speech Separation and Dereverberation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Neural Mask based Multi-channel Convolutional Beamforming for Joint Dereverberation, Echo Cancellation and Denoising.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Feature Selection and Feature Extraction: Highlights.
Proceedings of the ISMSI 2021: 2021 5th International Conference on Intelligent Systems, 2021

MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Generalized Spatio-Temporal RNN Beamformer for Target Speech Separation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

MIMO Self-Attentive RNN Beamformer for Multi-Speaker Speech Separation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Multi-Channel Speaker Verification for Single and Multi-Talker Speech.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

ADL-MVDR: All Deep Learning MVDR Beamformer for Target Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization.
Proceedings of the IEEE International Conference on Acoustics, 2021

Recognition of Third-Party Interference in Cable Monitoring Based on 1-D CNN with Fiber Optic DAS System.
Proceedings of the 13th International Conference on Advanced Infocomm Technology, 2021

3D Spatial Features for Multi-Channel Target Speech Separation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Nature-Inspired Multiobjective Epistasis Elucidation from Genome-Wide Association Studies.
IEEE ACM Trans. Comput. Biol. Bioinform., 2020

Audio-Visual Speech Separation and Dereverberation With a Two-Stage Multimodal Network.
IEEE J. Sel. Top. Signal Process., 2020

Multi-Modal Multi-Channel Target Speech Separation.
IEEE J. Sel. Top. Signal Process., 2020

Review of Single-cell RNA-seq Data Clustering for Cell Type Identification and Characterization.
CoRR, 2020

Audio-Visual Multi-Channel Recognition of Overlapped Speech.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Neural Spatio-Temporal Beamformer for Target Speech Separation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Audio-Visual Recognition of Overlapped Speech for the LRS2 Dataset.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Far-Field Location Guided Target Speech Extraction Using End-to-End Speech Recognition Objectives.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Enhancing End-to-End Multi-Channel Speech Separation Via Spatial Feature Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Self-Supervised Learning for Audio-Visual Speaker Diarization.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
A novel method based on FTS with both GA-FCM and multifactor BPNN for stock forecasting.
Soft Comput., 2019

A Unified Framework for Speech Separation.
CoRR, 2019

End-to-End Multi-Channel Speech Separation.
CoRR, 2019

Synergizing CRISPR/Cas9 off-target predictions for ensemble insights and practical applications.
Bioinform., 2019

Single-cell RNA-seq interpretations using evolutionary multiobjective ensemble pruning.
Bioinform., 2019

Improved Speaker-Dependent Separation for CHiME-5 Challenge.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Encrypted Speech Recognition Using Deep Polynomial Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

Time Domain Audio Visual Speech Separation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Domain and Speaker Adaptation for Cortana Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Exploring Sequential Characteristics in Speaker Bottleneck Feature for Text-Dependent Speaker Verification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
A multi-factor and high-order stock forecast model based on Type-2 FTS using cuckoo search and self-adaptive harmony search.
Neurocomputing, 2017

A novel object tracker designed based on a complementary framework.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

A markerless visual-motor tracking system for behavior monitoring in DCD assessment.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Challenges in and Solutions to Deep Learning Network Acoustic Modeling in Speech Recognition Products at Microsoft.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017

2016
End-to-End attention based text-dependent speaker verification.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Recurrent support vector machines for speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Simplifying long short-term memory acoustic models for fast training and decoding.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Deep neural support vector machines for speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Infinite structured support vector machines for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Structured SVMs for Automatic Speech Recognition.
IEEE Trans. Speech Audio Process., 2013

Kernelized log linear models for continuous speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Investigation of multilingual deep neural networks for spoken term detection.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2011
Optimized Discriminative Kernel for SVM Scoring and Its Application to Speaker Verification.
IEEE Trans. Neural Networks, 2011

Structured Support Vector Machines for Noise Robust Continuous Speech Recognition.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Extending noise robust structured support vector machines to larger vocabulary tasks.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Structured Log Linear Models for Noise Robust Speech Recognition.
IEEE Signal Process. Lett., 2010

Researching on Simulation of Traffic Accidents Scene by OpenGL.
Proceedings of the 10th IEEE International Conference on Computer and Information Technology, 2010

Fuzzy Expert System Based Intelligent Website Assessment System.
Proceedings of the 10th IEEE International Conference on Computer and Information Technology, 2010

2009
A new adaptation approach to high-level speaker-model creation in speaker verification.
Speech Commun., 2009

Optimization of discriminative kernels in SVM speaker verification.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008
High-level speaker verification via articulatory-feature based sequence kernels and SVM.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007
Speaker Verification via High-Level Feature Based Phonetic-Class Pronunciation Modeling.
IEEE Trans. Computers, 2007

A New Adaptation Method for Speaker-Model Creation in High-Level Speaker Verification.
Proceedings of the Advances in Multimedia Information Processing, 2007

High-level feature-based speaker verification via articulatory phonetic-class pronunciation modeling.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

