Ye Jia

Orcid: 0000-0002-0457-8083

According to our database¹, Ye Jia authored at least 37 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of three.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Traceable teleportation: Improving spatial learning in virtual locomotion.

[BibT_eX]

[DOI]

Int. J. Hum. Comput. Stud., 2025

2024

Knowledge-Graph-Driven Mind Mapping for Immersive Collaborative Learning: A Pilot Study in Edu-Metaverse.

[BibT_eX]

[DOI]

IEEE Trans. Learn. Technol., 2024

SimulTron: On-Device Simultaneous Speech to Speech Translation.

[BibT_eX]

[DOI]

Michelle Tadmor Ramanovich

CoRR, 2024

illumotion: An Optical-illusion-based VR Locomotion Technique for Long-Distance 3D Movement.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference Virtual Reality and 3D User Interfaces, 2024

NivTA: Towards a Naturally Interactable Edu-Metaverse Teaching Assistant for CAVE.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Metaverse Computing, 2024

2023

Toward an Edu-Metaverse of Knowledge: Immersive Exploration of University Courses.

[BibT_eX]

[DOI]

IEEE Trans. Learn. Technol., December, 2023

Large Scale Foundation Models for Intelligent Manufacturing Applications: A Survey.

[BibT_eX]

[DOI]

Haotian Zhang

Stuart Dereck Semujju

CoRR, 2023

Speech Aware Dialog System Technology Challenge (DSTC11).

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

From Classroom to Metaverse: A Study on Gamified Constructivist Teaching in Higher Education.

[BibT_eX]

[DOI]

Proceedings of the Advances in Web-Based Learning - ICWL 2023, 2023

Textless Direct Speech-to-Speech Translation with Discrete Speech Representation.

[BibT_eX]

[DOI]

Xinjian Li

Ye Jia

Chung-Cheng Chiu

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

mSLAM: Massively multilingual joint pre-training for speech and text.

[BibT_eX]

[DOI]

CoRR, 2022

CVSS Corpus and Massively Multilingual Speech-to-Speech Translation.

[BibT_eX]

[DOI]

Ye Jia

Michelle Tadmor Ramanovich

Quan Wang

Heiga Zen

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

XTREME-S: Evaluating Cross-lingual Speech Representations.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Translatotron 2: High-quality direct speech-to-speech translation with voice preservation.

[BibT_eX]

[DOI]

Ye Jia

Michelle Tadmor Ramanovich

Tal Remez

Roi Pomerantz

Proceedings of the International Conference on Machine Learning, 2022

More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech.

[BibT_eX]

[DOI]

Michael Hassid

Michelle Tadmor Ramanovich

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training.

[BibT_eX]

[DOI]

CoRR, 2021

Translatotron 2: Robust direct speech-to-speech translation.

[BibT_eX]

[DOI]

Ye Jia

Michelle Tadmor Ramanovich

Tal Remez

Roi Pomerantz

CoRR, 2021

PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Parallel Tacotron: Non-Autoregressive and Controllable TTS.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Textual Echo Cancellation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling.

[BibT_eX]

[DOI]

CoRR, 2020

Improved Noisy Student Training for Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019

The ASVspoof 2019 database.

[BibT_eX]

[DOI]

CoRR, 2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.

[BibT_eX]

[DOI]

CoRR, 2019

Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Direct Speech-to-Speech Translation with a Sequence-to-Sequence Model.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Hierarchical Generative Modeling for Controllable Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Leveraging Weakly Supervised Data to Improve End-to-end Speech-to-text Translation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Speech Recognition with Augmented Synthesized Speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Ye Jia

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...