Jing Xiao

ORCID: 0009-0003-3776-5269

Affiliations:
  • PingAn Technology, Shenzhen, China
  • Epson Research and Development, San Jose, CA, USA (former)
  • Carnegie Mellon University, Robotics Institute, Pittsburgh, PA, USA (PhD 2005)


According to our database, Jing Xiao authored at least 410 papers between 1996 and 2024.

Bibliography

2024
MITER: Medical Image-TExt joint adaptive pretRaining with multi-level contrastive learning.
Expert Syst. Appl., March, 2024

Learn Stable MRI Under-Sampling Pattern With Decoupled Sampling Preference.
IEEE Trans. Computational Imaging, 2024

EfficientTTS 2: Variational End-to-End Text-to-Speech Synthesis and Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

An interpretable two-branch bi-coordinate network based on multi-grained domain knowledge for classification of thyroid nodules in ultrasound images.
Medical Image Anal., 2024

Reverse double auction mechanism: An efficient algorithm for E-commerce platform operations.
Electron. Commer. Res. Appl., 2024

Planning with Large Language Models for Conversational Agents.
CoRR, 2024

Deep Learning Segmentation of Ascites on Abdominal CT Scans for Automatic Volume Quantification.
CoRR, 2024

PFID: Privacy First Inference Delegation Framework for LLMs.
CoRR, 2024

A Single-Step Non-Autoregressive Automatic Speech Recognition Architecture with High Accuracy and Inference Speed.
CoRR, 2024

Harnessing the Power of Prompt Experts: Efficient Knowledge Distillation for Enhanced Language Understanding.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track and Demo Track, 2024

Prior Bilinear-Based Models for Knowledge Graph Completion.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2024

From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Multimodal Prototype-Enhanced Network for Few-Shot Action Recognition.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Retrieval-Augmented Audio Deepfake Detection.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Gecko: Resource-Efficient and Accurate Queries in Real-Time Video Streams at the Edge.
Proceedings of the IEEE INFOCOM 2024, 2024

EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization.
Proceedings of the International Joint Conference on Neural Networks, 2024

ConTuner: Singing Voice Beautifying with Pitch and Expressiveness Condition.
Proceedings of the International Joint Conference on Neural Networks, 2024

Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training.
Proceedings of the International Joint Conference on Neural Networks, 2024

QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering.
Proceedings of the International Joint Conference on Neural Networks, 2024

EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning.
Proceedings of the International Joint Conference on Neural Networks, 2024

MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion.
Proceedings of the International Joint Conference on Neural Networks, 2024

Efficient Multi-Model Fusion with Adversarial Complementary Representation Learning.
Proceedings of the International Joint Conference on Neural Networks, 2024

Learning Expressive Disentangled Speech Representations with Soft Speech Units and Adversarial Style Augmentation.
Proceedings of the International Joint Conference on Neural Networks, 2024

DFlow: A Generative Model Combining Denoising AutoEncoder and Normalizing Flow for High Fidelity Waveform Generation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

RREH: Reconstruction Relations Embedded Hashing for Semi-paired Cross-Modal Retrieval.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024

Enhancing Emotion Recognition in Conversation Through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024

Improving Attention-Based End-to-End Speech Recognition by Monotonic Alignment Attention Matrix Reconstruction.
Proceedings of the IEEE International Conference on Acoustics, 2024

Leveraging Biases in Large Language Models: "bias-kNN" for Effective Few-Shot Learning.
Proceedings of the IEEE International Conference on Acoustics, 2024

EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model.
Proceedings of the IEEE International Conference on Acoustics, 2024

ESVC: Combining Adaptive Style Fusion and Multi-Level Feature Disentanglement for Expressive Singing Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2024

P2DT: Mitigating Forgetting in Task-Incremental Learning with Progressive Prompt Decision Transformer.
Proceedings of the IEEE International Conference on Acoustics, 2024

INCPrompt: Task-Aware Incremental Prompting for Rehearsal-Free Class-Incremental Learning.
Proceedings of the IEEE International Conference on Acoustics, 2024

ED-TTS: Multi-Scale Emotion Modeling Using Cross-Domain Emotion Diarization for Emotional Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2024

Learning Disentangled Speech Representations with Contrastive Learning and Time-Invariant Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2024

IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Value-Driven Mixed-Precision Quantization for Patch-Based Inference on Microcontrollers.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2024

Bidirectional Autoregressive Diffusion Model for Dance Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Medical Speech Symptoms Classification via Disentangled Representation.
Proceedings of the 27th International Conference on Computer Supported Cooperative Work in Design, 2024

RSET: Remapping-Based Sorting Method for Emotion Transfer Speech Synthesis.
Proceedings of the Web and Big Data - 8th International Joint Conference, 2024

2023
What is the limitation of multimodal LLMs? A deeper look into multimodal LLMs through prompt probing.
Inf. Process. Manag., November, 2023

Lumbar Bone Mineral Density Estimation From Chest X-Ray Images: Anatomy-Aware Attentive Multi-ROI Modeling.
IEEE Trans. Medical Imaging, 2023

Kdb-D2CFR: Solving Multiplayer imperfect-information games with knowledge distillation-based DeepCFR.
Knowl. Based Syst., 2023

Active perception network for non-myopic online exploration and visual surface coverage.
CoRR, 2023

DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks.
CoRR, 2023

Machine Unlearning Methodology base on Stochastic Teacher Network.
CoRR, 2023

Symbolic & Acoustic: Multi-domain Music Emotion Modeling for Instrumental Music.
CoRR, 2023

SVDE: Scalable Value-Decomposition Exploration for Cooperative Multi-Agent Reinforcement Learning.
CoRR, 2023

From Rolling Over to Walking: Enabling Humanoid Robots to Develop Complex Motor Skills.
CoRR, 2023

HSTFormer: Hierarchical Spatial-Temporal Transformers for 3D Human Pose Estimation.
CoRR, 2023

A deep local attention network for pre-operative lymph node metastasis prediction in pancreatic cancer via multiphase CT imaging.
CoRR, 2023

Open Domain Response Generation Guided by Retrieved Conversations.
IEEE Access, 2023

Reinforced Vision-and-Language Navigation Based on Historical BERT.
Proceedings of the Advances in Swarm Intelligence - 14th International Conference, 2023

Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

GAIA: Delving into Gradient-based Attribution Abnormality for Out-of-distribution Detection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Relative Boundary Modeling: A High-Resolution Cricket Bowl Release Detection Framework with I3D Features.
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

Exploring Loss Function and Rank Fusion for Enhanced Person Re-identification.
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

Image- and Instance-Level Data Augmentation for Occluded Instance Segmentation.
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

PMVC: Data Augmentation-Based Prosody Modeling for Expressive Voice Conversion.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Improving Automatic Segmentation of lymphoma with Additional Medical Knowledge Priors.
Proceedings of the 20th IEEE International Symposium on Biomedical Imaging, 2023

Prompt Guided Copy Mechanism for Conversational Question Answering.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

P-vectors: A Parallel-coupled TDNN/Transformer Network for Speaker Verification.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Improving End-to-End Modeling For Mandarin-English Code-Switching Using Lightweight Switch-Routing Mixture-of-Experts.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Investigation of Music Emotion Recognition Based on Segmented Semi-Supervised Learning.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

SVVAD: Personal Voice Activity Detection for Speaker Verification.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Exploring multi-task learning and data augmentation in dementia detection with self-supervised pretrained models.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

SAR: Self-Supervised Anti-Distortion Representation for End-To-End Speech Model.
Proceedings of the International Joint Conference on Neural Networks, 2023

Personalized Federated Learning via Gradient Modulation for Heterogeneous Text Summarization.
Proceedings of the International Joint Conference on Neural Networks, 2023

FedET: A Communication-Efficient Federated Class-Incremental Learning Framework Based on Enhanced Transformer.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

FastGraphTTS: An Ultrafast Syntax-Aware Speech Synthesis Framework.
Proceedings of the 35th IEEE International Conference on Tools with Artificial Intelligence, 2023

AOSR-Net: All-in-One Sandstorm Removal Network.
Proceedings of the 35th IEEE International Conference on Tools with Artificial Intelligence, 2023

Contrastive Latent Space Reconstruction Learning for Audio-Text Retrieval.
Proceedings of the 35th IEEE International Conference on Tools with Artificial Intelligence, 2023

EdgeMA: Model Adaptation System for Real-Time Video Analytics on Edge Devices.
Proceedings of the Neural Information Processing - 30th International Conference, 2023

Improving EEG-based Emotion Recognition by Fusing Time-Frequency and Spatial Representations.
Proceedings of the IEEE International Conference on Acoustics, 2023

Dynamic Alignment Mask CTC: Improved Mask CTC With Aligned Cross Entropy.
Proceedings of the IEEE International Conference on Acoustics, 2023

Efficient Uncertainty Estimation with Gaussian Process for Reliable Dialog Response Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2023

QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2023

VQ-CL: Learning Disentangled Speech Representations with Contrastive Learning and Vector Quantization.
Proceedings of the IEEE International Conference on Acoustics, 2023

Learning Speech Representations with Flexible Hidden Feature Dimensions.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improving Music Genre Classification from multi-modal Properties of Music and Genre Correlations Perspective.
Proceedings of the IEEE International Conference on Acoustics, 2023

Feature-Rich Audio Model Inversion for Data-Free Knowledge Distillation Towards General Sound Classification.
Proceedings of the IEEE International Conference on Acoustics, 2023

Detecting Out-of-Distribution Examples Via Class-Conditional Impressions Reappearing.
Proceedings of the IEEE International Conference on Acoustics, 2023

Systematic Literature Review on the User Evaluation of Teleoperation Interfaces for Professional Service Robots.
Proceedings of the HCI in Business, Government and Organizations, 2023

PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Improving Visual-Semantic Embedding with Adaptive Pooling and Optimization Objective.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Shoggoth: Towards Efficient Edge-Cloud Collaborative Real-Time Video Inference via Adaptive Online Learning.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation.
Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2023

CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding.
Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2023

CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation.
Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2023

Voiceextender: Short-Utterance Text-Independent Speaker Verification With Guided Diffusion Model.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Extended Review on the User Evaluation of Teleoperation Interfaces for Professional Service Robots.
Proceedings of the 29th Americas Conference on Information Systems, 2023

Symbolic and Acoustic: Multi-domain Music Emotion Modeling for Instrumental Music.
Proceedings of the Advanced Data Mining and Applications - 19th International Conference, 2023

Voice Conversion with Denoising Diffusion Probabilistic GAN Models.
Proceedings of the Advanced Data Mining and Applications - 19th International Conference, 2023

Machine Unlearning Methodology Based on Stochastic Teacher Network.
Proceedings of the Advanced Data Mining and Applications - 19th International Conference, 2023

On the Calibration and Uncertainty with Pólya-Gamma Augmentation for Dialog Retrieval Models.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
SAM: Self-Supervised Learning of Pixel-Wise Anatomical Embeddings in Radiological Images.
IEEE Trans. Medical Imaging, 2022

GBRM: a graph embedding and blockchain-based resource management framework for 5G MEC.
J. Supercomput., 2022

Parallel learner: A practical deep reinforcement learning framework for multi-scenario games.
Knowl. Based Syst., 2022

Boosting Star-GANs for Voice Conversion with Contrastive Discriminator.
CoRR, 2022

Efficient Distributed Framework for Collaborative Multi-Agent Reinforcement Learning.
CoRR, 2022

Adding Connectionist Temporal Summarization into Conformer to Improve Its Decoder Efficiency For Speech Recognition.
CoRR, 2022

A Study of Different Ways to Use The Conformer Model For Spoken Language Understanding.
CoRR, 2022

Debias the Black-Box: A Fair Ranking Framework via Knowledge Distillation.
Proceedings of the Web Information Systems Engineering - WISE 2022, 2022

Learning Invariant Representation and Risk Minimized for Unsupervised Accent Domain Adaptation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

PINGAN Omini-Sinitic at SemEval-2022 Task 4: Multi-prompt Training for Patronizing and Condescending Language Detection.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

ElasticMVS: Learning elastic part representation for self-supervised multi-view stereopsis.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition.
Proceedings of the 18th International Conference on Mobility, Sensing and Networking, 2022

Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach.
Proceedings of the 18th International Conference on Mobility, Sensing and Networking, 2022

Improving Imbalanced Text Classification with Dynamic Curriculum Learning.
Proceedings of the 18th International Conference on Mobility, Sensing and Networking, 2022

Semi-Supervised Learning Based on Reference Model for Low-resource TTS.
Proceedings of the 18th International Conference on Mobility, Sensing and Networking, 2022

MetaSpeech: Speech Effects Switch Along with Environment for Metaverse.
Proceedings of the 18th International Conference on Mobility, Sensing and Networking, 2022

Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed Data.
Proceedings of the 18th International Conference on Mobility, Sensing and Networking, 2022

ParseMVS: Learning Primitive-aware Surface Representations for Sparse Multi-view Stereopsis.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Accurate and Robust Lesion RECIST Diameter Prediction and Segmentation with Transformers.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Improved Multi-modal Patch Based Lymphoma Segmentation with Negative Sample Augmentation and Label Guidance on PET/CT Scans.
Proceedings of the Multiscale Multimodal Medical Imaging - Third International Workshop, 2022

Asymmetry and Architectural Distortion Detection with Limited Mammography Data.
Proceedings of the Medical Image Learning with Limited and Noisy Data, 2022

A Privacy-Preserving Subgraph-Level Federated Graph Neural Network via Differential Privacy.
Proceedings of the Knowledge Science, Engineering and Management, 2022

Uncertainty Calibration for Deep Audio Classifiers.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Adversarial Knowledge Distillation For Robust Spoken Language Understanding.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

FFM: A Frame Filtering Mechanism To Accelerate Inference Speed For Conformer In Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Towards Efficiently Learning Monotonic Alignments for Attention-based End-to-End Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A compact transformer-based GAN vocoder.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Tiny-Sepformer: A Tiny Time-Domain Transformer Network For Speech Separation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

SpeechEQ: Speech Emotion Recognition based on Multi-scale Unified Datasets and Multitask Learning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Cali3F: Calibrated Fast Fair Federated Recommendation System.
Proceedings of the International Joint Conference on Neural Networks, 2022

Adaptive Few-Shot Learning Algorithm for Rare Sound Event Detection.
Proceedings of the International Joint Conference on Neural Networks, 2022

DT-SV: A Transformer-based Time-domain Approach for Speaker Verification.
Proceedings of the International Joint Conference on Neural Networks, 2022

MetaSID: Singer Identification with Domain Adaptation for Metaverse.
Proceedings of the International Joint Conference on Neural Networks, 2022

Singer Identification for Metaverse with Timbral and Middle-Level Perceptual Features.
Proceedings of the International Joint Conference on Neural Networks, 2022

TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS.
Proceedings of the International Joint Conference on Neural Networks, 2022

MDCNN-SID: Multi-scale Dilated Convolution Network for Singer Identification.
Proceedings of the International Joint Conference on Neural Networks, 2022

SUSing: SU-net for Singing Voice Synthesis.
Proceedings of the International Joint Conference on Neural Networks, 2022

ICAF: Iterative Contrastive Alignment Framework for Multimodal Abstractive Summarization.
Proceedings of the International Joint Conference on Neural Networks, 2022

Improving Human Image Synthesis with Residual Fast Fourier Transformation and Wasserstein Distance.
Proceedings of the International Joint Conference on Neural Networks, 2022

Augmentation-induced Consistency Regularization for Classification.
Proceedings of the International Joint Conference on Neural Networks, 2022

Leveraging Causal Inference for Explainable Automatic Program Repair.
Proceedings of the International Joint Conference on Neural Networks, 2022

A Fair Federated Learning Framework With Reinforcement Learning.
Proceedings of the International Joint Conference on Neural Networks, 2022

Micro-Expression Recognition Based on Attribute Information Embedding and Cross-modal Contrastive Learning.
Proceedings of the International Joint Conference on Neural Networks, 2022

Federated Non-negative Matrix Factorization for Short Texts Topic Modeling with Mutual Information.
Proceedings of the International Joint Conference on Neural Networks, 2022

Adaptive Activation Network for Low Resource Multilingual Speech Recognition.
Proceedings of the International Joint Conference on Neural Networks, 2022

Speech Augmentation Based Unsupervised Learning for Keyword Spotting.
Proceedings of the International Joint Conference on Neural Networks, 2022

Federated Split BERT for Heterogeneous Text Classification.
Proceedings of the International Joint Conference on Neural Networks, 2022

QSpeech: Low-Qubit Quantum Speech Application Toolkit.
Proceedings of the International Joint Conference on Neural Networks, 2022

A Nearest Neighbor Under-sampling Strategy for Vertical Federated Learning in Financial Domain.
Proceedings of the IH&MMSec '22: ACM Workshop on Information Hiding and Multimedia Security, Santa Barbara, CA, USA, June 27, 2022

Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar.
Proceedings of the 34th IEEE International Conference on Tools with Artificial Intelligence, 2022

Blur the Linguistic Boundary: Interpreting Chinese Buddhist Sutra in English via Neural Machine Translation.
Proceedings of the 34th IEEE International Conference on Tools with Artificial Intelligence, 2022

Boosting StarGANs for Voice Conversion with Contrastive Discriminator.
Proceedings of the Neural Information Processing - 29th International Conference, 2022

Dual Feature Fusion Trade Execution Framework with DDQN.
Proceedings of the 4th International Conference on Data Intelligence and Security, 2022

Information Entropy of Uncertainty Control: An Uncertainty Management Method in Imperfect Information Games.
Proceedings of the 4th International Conference on Data Intelligence and Security, 2022

Efficient Private Set Intersection Based on Functional Encryption.
Proceedings of the 4th International Conference on Data Intelligence and Security, 2022

EFDO: Solving Extensive-Form Games Based On Double Oracle.
Proceedings of the 4th International Conference on Data Intelligence and Security, 2022

NFSP-PER: An efficient sampling NFSP-based method with prioritized experience replay.
Proceedings of the 4th International Conference on Data Intelligence and Security, 2022

nnSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-Shot Multi-speaker text-to-speech.
Proceedings of the IEEE International Conference on Acoustics, 2022

r-G2P: Evaluating and Enhancing Robustness of Grapheme to Phoneme Conversion by Controlled Noise Introducing and Contextual Information Incorporation.
Proceedings of the IEEE International Conference on Acoustics, 2022

Self-Attention for Incomplete Utterance Rewriting.
Proceedings of the IEEE International Conference on Acoustics, 2022

VU-BERT: A Unified Framework for Visual Dialog.
Proceedings of the IEEE International Conference on Acoustics, 2022

DRVC: A Framework of Any-to-Any Voice Conversion with Self-Supervised Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

Avqvc: One-Shot Voice Conversion By Vector Quantization With Applying Contrastive Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

Towards Speaker Age Estimation With Label Distribution Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

Supervised Contrastive Meta-learning for Few-Shot Classification.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

zkMLaaS: a Verifiable Scheme for Machine Learning as a Service.
Proceedings of the IEEE Global Communications Conference, 2022

Self-supervised Cross-modal Pretraining for Speech Emotion Recognition and Sentiment Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Adaptive Sparse and Monotonic Attention for Transformer-based Automatic Speech Recognition.
Proceedings of the 9th IEEE International Conference on Data Science and Advanced Analytics, 2022

RL-MD: A Novel Reinforcement Learning Approach for DNA Motif Discovery.
Proceedings of the 9th IEEE International Conference on Data Science and Advanced Analytics, 2022

Machine Unlearning Method Based On Projection Residual.
Proceedings of the 9th IEEE International Conference on Data Science and Advanced Analytics, 2022

Localized Adversarial Domain Generalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Shallow Diffusion Motion Model for Talking Face Generation from Speech.
Proceedings of the Web and Big Data - 6th International Joint Conference, 2022

Pose Guided Human Image Synthesis with Partially Decoupled GAN.
Proceedings of the Asian Conference on Machine Learning, 2022

2021
Learning From Multiple Datasets With Heterogeneous and Partial Labels for Universal Lesion Detection in CT.
IEEE Trans. Medical Imaging, 2021

Contour Transformer Network for One-Shot Segmentation of Anatomical Structures.
IEEE Trans. Medical Imaging, 2021

Lesion-Harvester: Iteratively Mining Unlabeled Lesions and Hard-Negative Examples at Scale.
IEEE Trans. Medical Imaging, 2021

Using BI-RADS Stratifications as Auxiliary Information for Breast Masses Classification in Ultrasound Images.
IEEE J. Biomed. Health Informatics, 2021

Domain Fingerprints for No-Reference Image Quality Assessment.
IEEE Trans. Circuits Syst. Video Technol., 2021

DeepPrognosis: Preoperative prediction of pancreatic cancer survival and surgical margin via comprehensive understanding of dynamic contrast-enhanced CT imaging and tumor-vascular contact parsing.
Medical Image Anal., 2021

MommiNet-v2: Mammographic multi-view mass identification networks.
Medical Image Anal., 2021

A disentangled generative model for disease decomposition in chest X-rays via normal image synthesis.
Medical Image Anal., 2021

DeepTarget: Gross tumor and clinical target volume segmentation in esophageal cancer radiotherapy.
Medical Image Anal., 2021

Coherence Learning using Keypoint-based Pooling Network for Accurately Assessing Radiographic Knee Osteoarthritis.
CoRR, 2021

Comprehensive and Clinically Accurate Head and Neck Organs at Risk Delineation via Stratified Deep Learning: A Large-scale Multi-Institutional Study.
CoRR, 2021

A deep learning pipeline for localization, differentiation, and uncertainty estimation of liver lesions using multi-phasic and multi-sequence MRI.
CoRR, 2021

Accurate and Generalizable Quantitative Scoring of Liver Steatosis from Ultrasound Images via Scalable Deep Learning.
CoRR, 2021

Multi-institutional Validation of Two-Streamed Deep Learning Method for Automated Delineation of Esophageal Gross Tumor Volume using planning-CT and FDG-PETCT.
CoRR, 2021

DeepStationing: Thoracic Lymph Node Station Parsing in CT Scans using Anatomical Context Encoding and Key Organ Auto-Search.
CoRR, 2021

AdaK-NER: An Adaptive Top-K Approach for Named Entity Recognition with Incomplete Annotations.
CoRR, 2021

A Flexible Three-Dimensional Hetero-phase Computed Tomography Hepatocellular Carcinoma (HCC) Detection Algorithm for Generalizable and Practical HCC Screening.
CoRR, 2021

Learning from Subjective Ratings Using Auto-Decoded Deep Latent Embeddings.
CoRR, 2021

Deep Implicit Statistical Shape Models for 3D Medical Image Delineation.
CoRR, 2021

NVAE-GAN Based Approach for Unsupervised Time Series Anomaly Detection.
CoRR, 2021

Case Study of Few-Shot Learning in Text Recognition Models.
Proceedings of the Web Information Systems Engineering - WISE 2021, 2021

Modeling Without Sharing Privacy: Federated Neural Machine Translation.
Proceedings of the Web Information Systems Engineering - WISE 2021, 2021

MelGlow: Efficient Waveform Generative Network Based On Location-Variable Convolution.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

GraphPB: Graphical Representations of Prosody Boundary in Speech Synthesis.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

End-To-End Silent Speech Recognition with Acoustic Sensing.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Multi-Quartznet: Multi-Resolution Convolution for Speech Recognition with Multi-Layer Feature Fusion.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

LS-DST: Long and Sparse Dialogue State Tracking with Smart History Collector in Insurance Marketing.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

PINGAN Omini-Sinitic at SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning.
Proceedings of the 15th International Workshop on Semantic Evaluation, 2021

A Competition of Shape and Texture Bias by Multi-view Image Representation.
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021

Semantic Embedding Graph Convolutional Networks for Multi-label Video Segment Classification.
Proceedings of the 12th International Symposium on Parallel Architectures, 2021

Multi-Grained Knowledge Distillation for Named Entity Recognition.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Residual Recurrent CRNN for End-to-End Optical Music Recognition on Monophonic Scores.
Proceedings of the MMPT@ICMR2021: Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding, 2021

Scalable Semi-supervised Landmark Localization for X-ray Images Using Few-Shot Deep Adaptive Graph.
Proceedings of the Deep Generative Models, and Data Augmentation, Labelling, and Imperfections, 2021

Semi-supervised Learning for Bone Mineral Density Estimation in Hip X-Ray Images.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

BI-RADS Classification of Calcification on Mammograms.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Effective Pancreatic Cancer Screening on Non-contrast CT Scans via Anatomy-Aware Transformers.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Opportunistic Screening of Osteoporosis Using Plain Film Chest X-Ray.
Proceedings of the Predictive Intelligence in Medicine - 4th International Workshop, 2021

Lesion Segmentation and RECIST Diameter Prediction via Click-Driven Attention and Dual-Path Connection.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Weakly-Supervised Universal Lesion Segmentation with Regional Level Set Loss.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

SAME: Deformable Image Registration Based on Self-supervised Anatomical Embeddings.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Learning from Subjective Ratings Using Auto-Decoded Deep Latent Embeddings.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Hetero-Modal Learning and Expansive Consistency Constraints for Semi-supervised Detection from Multi-sequence Data.
Proceedings of the Machine Learning in Medical Imaging - 12th International Workshop, 2021

Liver Tumor Localization and Characterization from Multi-phase MR Volumes Using Key-Slice Prediction: A Physician-Inspired Approach.
Proceedings of the Predictive Intelligence in Medicine - 4th International Workshop, 2021

DeepStationing: Thoracic Lymph Node Station Parsing in CT Scans Using Anatomical Context Encoding and Key Organ Auto-Search.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Sequential Learning on Liver Tumor Boundary Semantics and Prognostic Biomarker Mining.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Supervised Contrastive Pre-training for Mammographic Triage Screening Models.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Knowledge Distillation with Adaptive Asymmetric Label Sharpening for Semi-supervised Fracture Detection in Chest X-Rays.
Proceedings of the Information Processing in Medical Imaging, 2021

Variational Information Bottleneck for Effective Low-Resource Audio Classification.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Speech2Video: Cross-Modal Distillation for Speech to Video Generation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Effective Phase Encoding for End-To-End Speaker Verification.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Dropout Regularization for Self-Supervised Learning of Transformer Encoder Speech Representation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

EfficientSing: A Chinese Singing Voice Synthesis System Using Duration-Free Acoustic Model and HiFi-GAN Vocoder.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Improving Polyphone Disambiguation for Mandarin Chinese by Combining Mix-Pooling Strategy and Window-Based Attention.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Federated Learning with Dynamic Transformer for Text to Speech.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

An Improved Single Step Non-Autoregressive Transformer for Automatic Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Extending Pronunciation Dictionary with Automatically Detected Word Mispronunciations to Improve PAII's System for Interspeech 2021 Non-Native Child English Close Track ASR Challenge.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

CACnet: Cube Attentional CNN for Automatic Speech Recognition.
Proceedings of the International Joint Conference on Neural Networks, 2021

Semantic Extraction for Sentence Representation via Reinforcement Learning.
Proceedings of the International Joint Conference on Neural Networks, 2021

ADKGN: An Attentive Dynamic Knowledge Graph Network for Sequential Recommendation.
Proceedings of the International Joint Conference on Neural Networks, 2021

Diversified Point Cloud Classification Using Personalized Federated Learning.
Proceedings of the International Joint Conference on Neural Networks, 2021

Contrastive Learning for improving End-to-end Speaker Verification.
Proceedings of the International Joint Conference on Neural Networks, 2021

Enhancing Neural Architecture Search by Upgrading Weak Components.
Proceedings of the International Joint Conference on Neural Networks, 2021

Loss Prediction: End-to-End Active Learning Approach For Speech Recognition.
Proceedings of the International Joint Conference on Neural Networks, 2021

Automatic Joint Optimization of Algorithm-Level Compression and Compiler-Based Acceleration with Reinforcement Learning for DNN in Edge Devices.
Proceedings of the International Joint Conference on Neural Networks, 2021

Anomaly Removal for Vehicle Energy Consumption in Federated Learning.
Proceedings of the International Joint Conference on Neural Networks, 2021

Communication-Memory-Efficient Decentralized Learning For Audio Representation.
Proceedings of the International Joint Conference on Neural Networks, 2021

When Hearing the Voice, Who Will Come to Your Mind.
Proceedings of the International Joint Conference on Neural Networks, 2021

Quantum Convolutional Neural Network on Protein Distance Prediction.
Proceedings of the International Joint Conference on Neural Networks, 2021

EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture.
Proceedings of the 38th International Conference on Machine Learning, 2021

Cross-Language Transfer Learning and Domain Adaptation for End-to-End Automatic Speech Recognition.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Towards a Translation-Based Method for Dynamic Heterogeneous Network Embedding.
Proceedings of the ICC 2021, 2021

Efficient Client Contribution Evaluation for Horizontal Federated Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021

LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Enhancing Data-Free Adversarial Distillation with Activation Regularization and Virtual Interpolation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Network Pruning Using Linear Dependency Analysis on Feature Maps.
Proceedings of the IEEE International Conference on Acoustics, 2021

Unidirectional Memory-Self-Attention Transducer for Online Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

A Quantitative Metric for Privacy Leakage in Federated Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021

Unsupervised Learning for Multi-Style Speech Synthesis with Limited Data.
Proceedings of the IEEE International Conference on Acoustics, 2021

Improving Neural Text Normalization with Partial Parameter Generator and Pointer-Generator Network.
Proceedings of the IEEE International Conference on Acoustics, 2021

Joint Intent Detection and Slot Filling Based on Continual Learning Model.
Proceedings of the IEEE International Conference on Acoustics, 2021

CASS-NAT: CTC Alignment-Based Single Step Non-Autoregressive Transformer for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

SEQ-CPC: Sequential Contrastive Predictive Coding for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

An Alignment-Agnostic Model for Chinese Text Error Correction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Enhancing Dual-Encoders with Question and Answer Cross-Embeddings for Answer Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

3D Graph Anatomy Geometry-Integrated Network for Pancreatic Mass Segmentation, Diagnosis, and Quantitative Patient Management.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Automatic Vertebra Localization and Identification in CT by Spine Rectification and Anatomically-Constrained Optimization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Leveraging Large-Scale Weakly Labeled Data for Semi-Supervised Mass Detection in Mammograms.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Deep Lesion Tracker: Monitoring Lesions in 4D Longitudinal Imaging Studies.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Neural Architecture Search as Self-assessor in Semi-supervised Learning.
Proceedings of the Big Data - 9th CCF Conference, 2021

CycleGEAN: Cycle Generative Enhanced Adversarial Network for Voice Conversion.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

TGAVC: Improving Autoencoder Voice Conversion with Text-Guided and Adversarial Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Reconstructing Dual Learning for Neural Voice Conversion Using Relatively Few Samples.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Self-supervised Learning for Semantic Sentence Matching with Dense Transformer Inference Network.
Proceedings of the Web and Big Data - 5th International Joint Conference, 2021

A Novel Capsule Aggregation Framework for Natural Language Inference.
Proceedings of the Web and Big Data - 5th International Joint Conference, 2021

Understanding Gradient Clipping In Incremental Gradient Methods.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

A Neural Transition-based Joint Model for Disease Named Entity Recognition and Normalization.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

PHMOSpell: Phonological and Morphological Knowledge Guided Chinese Spelling Check.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Window Loss for Bone Fracture Detection and Localization in X-ray Images with Point-based Annotation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Automated abnormality classification of chest radiographs using deep convolutional neural networks.
npj Digit. Medicine, 2020

IDRiD: Diabetic Retinopathy - Segmentation and Grading Challenge.
Medical Image Anal., 2020

Knowledge Distillation with Adaptive Asymmetric Label Sharpening for Semi-supervised Fracture Detection in Chest X-rays.
CoRR, 2020

Fully-Automated Liver Tumor Localization and Characterization from Multi-Phase MR Volumes Using Key-Slice ROI Parsing: A Physician-Inspired Approach.
CoRR, 2020

A New Window Loss Function for Bone Fracture Detection and Localization in X-ray Images with Point-based Annotation.
CoRR, 2020

Self-supervised Learning of Pixel-wise Anatomical Embeddings in Radiological Images.
CoRR, 2020

Residual Recurrent CRNN for End-to-End Optical Music Recognition on Monophonic Scores.
CoRR, 2020

Melody Classification based on Performance Event Vector and BRNN.
CoRR, 2020

Learning from Multiple Datasets with Heterogeneous and Partial Labels for Universal Lesion Detection in CT.
CoRR, 2020

DREAM: A Dynamic Relational-Aware Model for Social Recommendation.
CoRR, 2020

UBER-GNN: A User-Based Embeddings Recommendation based on Graph Neural Networks.
CoRR, 2020

Anatomy-Aware Siamese Network: Exploiting Semantic Asymmetry for Accurate Pelvic Fracture Detection in X-ray Images.
CoRR, 2020

Harvesting, Detecting, and Characterizing Liver Lesions from Large-scale Multi-phase CT Data via Deep Dynamic Texture Learning.
CoRR, 2020

Universal Lesion Detection by Learning from Multiple Heterogeneously Labeled Datasets.
CoRR, 2020

Detecting Scatteredly-Distributed, Small, and Critically Important Objects in 3D Oncology Imaging via Decision Stratification.
CoRR, 2020

Organ at Risk Segmentation for Head and Neck Cancer using Stratified Learning and Neural Architecture Search.
CoRR, 2020

BS-NAS: Broadening-and-Shrinking One-Shot NAS with Searchable Numbers of Channels.
CoRR, 2020

A Heterogeneous Information Network based Cross Domain Insurance Recommendation System for Cold Start Users.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

DCDIR: A Deep Cross-Domain Recommendation System for Cold Start Users in Insurance Domain.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

GAN-based AI Drawing Board for Image Generation and Colorization.
Proceedings of the SIGGRAPH 2020: Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2020

Contextualized Emotion Recognition in Conversation as Sequence Tagging.
Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2020

3D Point Cloud Segmentation for Complex Structure Based on PointSIFT.
Proceedings of the Pattern Recognition and Computer Vision, Third Chinese Conference, 2020

Lyrics2Song: An Automatic Song Generator for Lyrics Input.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Lymph Node Gross Tumor Volume Detection and Segmentation via Distance-Based Gating Using 3D CT/PET Imaging in Radiotherapy.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Robust Pancreatic Ductal Adenocarcinoma Segmentation with Multi-institutional Multi-phase Partially-Annotated CT Scans.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

DeepPrognosis: Preoperative Prediction of Pancreatic Cancer Survival and Surgical Margin via Contrast-Enhanced CT Imaging.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

MommiNet: Mammographic Multi-view Mass Identification Networks.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

One Click Lesion RECIST Measurement and Segmentation on CT Scans.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

E²Net: An Edge Enhanced Network for Accurate Liver and Tumor Segmentation on CT Scans.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

User-Guided Domain Adaptation for Rapid Annotation from User Interactions: A Study on Pathological Liver Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Learning to Segment Anatomical Structures Accurately from One Exemplar.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Reliable Liver Fibrosis Assessment from Ultrasound Using Global Hetero-Image Fusion and View-Specific Parameterization.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Lymph Node Gross Tumor Volume Detection in Oncology Imaging via Relationship Learning Using Graph Neural Network.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Deep Volumetric Universal Lesion Detection Using Light-Weight Pseudo 3D Convolution and Surface Point Regression.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Bone suppression on chest radiographs with adversarial learning.
Proceedings of the Medical Imaging 2020: Computer-Aided Diagnosis, 2020

Weakly-supervised lesion segmentation on CT scans using co-segmentation.
Proceedings of the Medical Imaging 2020: Computer-Aided Diagnosis, 2020

Weakly Supervised Lesion Co-Segmentation on CT Scans.
Proceedings of the 17th IEEE International Symposium on Biomedical Imaging, 2020

MLNET: An Adaptive Multiple Receptive-Field Attention Neural Network for Voice Activity Detection.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Improving Replay Detection System with Channel Consistency DenseNeXt for the ASVspoof 2019 Challenge.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

A Real-Time Robot-Based Auxiliary System for Risk Evaluation of COVID-19 Infection.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Evolutionary Algorithm Enhanced Neural Architecture Search for Text-Independent Speaker Verification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Large-Scale Transfer Learning for Low-Resource Spoken Language Understanding.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Non-Parallel Voice Conversion with Fewer Labeled Data by Conditional Generative Adversarial Networks.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Nonparallel Emotional Speech Conversion Using VAE-GAN.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Generating Reasonable Legal Text through the Combination of Language Modeling and Question Answering.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Dual Encoder Fusion U-Net (DEFU-Net) for Cross-manufacturer Chest X-ray Segmentation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Network Coding for Federated Learning Systems.
Proceedings of the Neural Information Processing - 27th International Conference, 2020

An Approach for Neural Machine Translation with Graph Attention Network.
Proceedings of the ICCPR 2020: 9th International Conference on Computing and Pattern Recognition, Xiamen, China, October 30, 2020

AlignTTS: Efficient Feed-Forward Text-to-Speech System Without Explicit Alignment.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

GraphTTS: Graph-to-Sequence Modelling in Neural Text-to-Speech.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Flow-TTS: A Non-Autoregressive Network for Text to Speech Based on Flow.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Robust Speaker Clustering Method Based on Discrete Tied Variational Autoencoder.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Multi-objective Cuckoo Algorithm for Mobile Devices Network Architecture Search.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2020, 2020

Quantization and Knowledge Distillation for Efficient Federated Learning on Edge Devices.
Proceedings of the 22nd IEEE International Conference on High Performance Computing and Communications; 18th IEEE International Conference on Smart City; 6th IEEE International Conference on Data Science and Systems, 2020

ParallelNAS: A Parallel and Distributed System for Neural Architecture Search.
Proceedings of the 22nd IEEE International Conference on High Performance Computing and Communications; 18th IEEE International Conference on Smart City; 6th IEEE International Conference on Data Science and Systems, 2020

MABEL: An AI-Powered Mammographic Breast Lesion Diagnostic System.
Proceedings of the 22nd IEEE International Conference on E-health Networking, 2020

Empirical Studies of Institutional Federated Learning For Natural Language Processing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Co-heterogeneous and Adaptive Segmentation from Multi-source and Multi-phase CT Imaging Data: A Study on Pathological Liver and Lesion Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

JSSR: A Joint Synthesis, Segmentation, and Registration System for 3D Multi-modal Image Alignment of Large-Scale Pathological CT Scans.
Proceedings of the Computer Vision - ECCV 2020, 2020

Structured Landmark Detection via Topology-Adapting Deep Graph Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

Anatomy-Aware Siamese Network: Exploiting Semantic Asymmetry for Accurate Pelvic Fracture Detection in X-Ray Images.
Proceedings of the Computer Vision - ECCV 2020, 2020

Organ at Risk Segmentation for Head and Neck Cancer Using Stratified Learning and Neural Architecture Search.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

DREAM: A Dynamic Relation-Aware Model for Social Recommendation.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Chinese Punctuation Prediction with Adaptive Attention and Dependency Tree.
Proceedings of the Knowledge Graph and Semantic Computing: Knowledge Graph and Cognitive Intelligence, 2020

Image Compressed Sensing Using Neural Architecture Search.
Proceedings of the Big Data - 8th CCF Conference, 2020

Epidemic Guard: A COVID-19 Detection System for Elderly People.
Proceedings of the Web and Big Data - 4th International Joint Conference, 2020

D-GHNAS for Joint Intent Classification and Slot Filling.
Proceedings of the Web and Big Data - 4th International Joint Conference, 2020

FedSmart: An Auto Updating Federated Learning Optimization Mechanism.
Proceedings of the Web and Big Data - 4th International Joint Conference, 2020

An Iterative Polishing Framework Based on Quality Aware Masked Language Model for Chinese Poetry Generation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Who Should Be Invited to My Party: A Size-Constrained k-Core Problem in Social Networks.
J. Comput. Sci. Technol., 2019

Domain-Aware No-Reference Image Quality Assessment.
CoRR, 2019

ULDor: A Universal Lesion Detector for CT Scans with Pseudo Masks and Hard Negative Example Mining.
CoRR, 2019

A Syllable-Structured, Contextually-Based Conditionally Generation of Chinese Lyrics.
Proceedings of the PRICAI 2019: Trends in Artificial Intelligence, 2019

A Hierarchical Attention Based Seq2Seq Model for Chinese Lyrics Generation.
Proceedings of the PRICAI 2019: Trends in Artificial Intelligence, 2019

Automatic Acrostic Couplet Generation with Three-Stage Neural Network Pipelines.
Proceedings of the PRICAI 2019: Trends in Artificial Intelligence, 2019

Audio-Based Music Classification with DenseNet and Data Augmentation.
Proceedings of the PRICAI 2019: Trends in Artificial Intelligence, 2019

Dynamic Student Classiffication on Memory Networks for Knowledge Tracing.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2019

Composer4Everyone: Automatic Music Generation with Audio Motif.
Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

XLSor: A Robust and Accurate Lung Segmentor on Chest X-Rays Using Criss-Cross Attention and Customized Radiorealistic Abnormalities Generation.
Proceedings of the International Conference on Medical Imaging with Deep Learning, 2019

CT Data Curation for Liver Patients: Phase Recognition in Dynamic Contrast-Enhanced CT.
Proceedings of the Domain Adaptation and Representation Transfer and Medical Image Learning with Less Labels and Imperfect Data, 2019

Weakly Supervised Universal Fracture Detection in Pelvic X-Rays.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

TUNA-Net: Task-Oriented UNsupervised Adversarial Network for Disease Recognition in Cross-domain Chest X-rays.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Deep Esophageal Clinical Target Volume Delineation Using Encoded 3D Spatial Context of Tumors, Lymph Nodes, and Organs At Risk.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Accurate Esophageal Gross Tumor Volume Segmentation in PET/CT Using Two-Stream Chained 3D Deep Network Fusion.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Deep adversarial one-class learning for normal and abnormal chest radiograph classification.
Proceedings of the Medical Imaging 2019: Computer-Aided Diagnosis, San Diego, 2019

CT-realistic data augmentation using generative adversarial network for robust lymph node segmentation.
Proceedings of the Medical Imaging 2019: Computer-Aided Diagnosis, San Diego, 2019

Abnormal Chest X-Ray Identification With Generative Adversarial One-Class Classifier.
Proceedings of the 16th IEEE International Symposium on Biomedical Imaging, 2019

Cross-Lingual, Multi-Speaker Text-To-Speech Synthesis Using Neural Speaker Embedding.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Federated Learning of Unsegmented Chinese Text Recognition Model.
Proceedings of the 31st IEEE International Conference on Tools with Artificial Intelligence, 2019

On Probability Calibration of Recurrent Text Recognition Network.
Proceedings of the Neural Information Processing - 26th International Conference, 2019

Performance of Training Sparse Deep Neural Networks on GPUs.
Proceedings of the 2019 IEEE High Performance Extreme Computing Conference, 2019

A Heterogeneous Conversational Recommender System for Financial Products.
Proceedings of the Second Workshop on Knowledge-aware and Conversational Recommender Systems, 2019

Adversarial Discrete Sequence Generation without Explicit Neural Networks as Discriminators.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Social Network Monitoring for Bursty Cascade Detection.
ACM Trans. Knowl. Discov. Data, 2018

Accurate Weakly Supervised Deep Lesion Segmentation on CT Scans: Self-Paced 3D Mask Generation from RECIST.
CoRR, 2018

Fintech: AI powers financial services to improve people's lives.
Commun. ACM, 2018

Attention-Guided Curriculum Learning for Weakly Supervised Classification and Localization of Thoracic Diseases on Chest Radiographs.
Proceedings of the Machine Learning in Medical Imaging - 9th International Workshop, 2018

Semi-automatic RECIST Labeling on CT Scans with Cascaded Convolutional Neural Networks.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2018, 2018

CT Image Enhancement Using Stacked Generative Adversarial Networks and Transfer Learning for Lesion Segmentation Improvement.
Proceedings of the Machine Learning in Medical Imaging - 9th International Workshop, 2018

Automated Full Quantification of Left Ventricle with Deep Neural Networks.
Proceedings of the Statistical Atlases and Computational Models of the Heart. Atrial Segmentation and LV Quantification Challenges, 2018

Accurate Weakly-Supervised Deep Lesion Segmentation Using Large-Scale Clinical Annotations: Slice-Propagated 3D Mask Generation from 2D RECIST.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2018, 2018

A Noise-Robust Self-Adaptive Multitarget Speaker Detection System.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Video-Based Pig Recognition with Feature-Integrated Transfer Learning.
Proceedings of the Biometric Recognition - 13th Chinese Conference, 2018

City-Wide Influenza Forecasting based on Multi-Source Data.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

2017
Assistance of Speech Recognition in Noisy Environment with Sentence Level Lip-Reading.
Proceedings of the Biometric Recognition - 12th Chinese Conference, 2017

Prioritized Grid Highway Long Short-Term Memory-Based Universal Background Model for Speaker Verification.
Proceedings of the Biometric Recognition - 12th Chinese Conference, 2017

2016
User Identity Linkage by Latent User Space Modelling.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

2015
Discriminative and generative vocabulary tree: With application to vein image authentication and recognition.
Image Vis. Comput., 2015

2013
Human behavior segmentation and recognition using Continuous Linear Dynamic System.
Proceedings of the 2013 IEEE Workshop on Applications of Computer Vision, 2013

Detection Evolution with Multi-order Contextual Co-occurrence.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Discriminative and generative vocabulary tree for vein image recognition.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Substructure and boundary modeling for continuous action recognition.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Contextual boost for pedestrian detection.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Non-negative matrix factorization as a feature selection tool for maximum margin classifiers.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

A theory of multi-perspective defocusing.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Importance filtering for image retargeting.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
L1 Projections with Box Constraints.
CoRR, 2010

Bi-affinity Filter: A Bilateral Type Filter for Color Images.
Proceedings of the Trends and Topics in Computer Vision, 2010

2009
Catadioptric projectors.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008
Multi-View AAM Fitting and Construction.
Int. J. Comput. Vis., 2008

2007
2D vs. 3D Deformable Face Models: Representational Power, Construction, and Real-Time Fitting.
Int. J. Comput. Vis., 2007

2006
Meticulously Detailed Eye Region Model and Its Application to Analysis of Facial Images.
IEEE Trans. Pattern Anal. Mach. Intell., 2006

A Closed-Form Solution to Non-Rigid Shape and Motion Recovery.
Int. J. Comput. Vis., 2006

Simultaneous Registration and Modeling of Deformable Shapes.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Robust AAM Fitting by Fusion of Images and Disparity Data.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

2005
Uncalibrated Perspective Reconstruction of Deformable Structures.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Multi-View AAM Fitting and Camera Calibration.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

2004
Meticulously detailed eye model and its application to analysis of facial image.
Proceedings of the IEEE International Conference on Systems, 2004

Automatic analysis and recognition of brow actions and head motion in spontaneous facial behavior.
Proceedings of the IEEE International Conference on Systems, 2004

Multimodal Coordination of Facial Action, Head Rotation, and Eye Motion during Spontaneous Smiles.
Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2004), 2004

Non-Rigid Shape and Motion Recovery: Degenerate Deformations.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

Real-Time Combined 2D+3D Active Appearance Models.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

Fitting a Single Active Appearance Model Simultaneously to Multiple Images.
Proceedings of the British Machine Vision Conference, 2004

2003
Robust full-motion recovery of head by dynamic templates and re-registration techniques.
Int. J. Imaging Syst. Technol., 2003

Vision-based control of 3D facial animation.
Proceedings of the 2003 ACM SIGGRAPH/Eurographics Symposium on Computer Animation, 2003

2002
Automatic Recognition of Eye Blinking in Spontaneously Occurring Behavior.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

Robust Full-Motion Recovery of Head by Dynamic Templates and Re-Registration Techniques.
Proceedings of the 5th IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2002), 2002

2000
Automatic Selection of Visemes for Image-Based Visual Speech Synthesis.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

1997
PASS: a program for automatic structure search.
Proceedings of International Conference on Neural Networks (ICNN'97), 1997

1996
Structure study of feedforward neural networks for approximation of highly nonlinear real-valued functions.
Proceedings of International Conference on Neural Networks (ICNN'96), 1996

