Hongyu Gong

According to our database, Hongyu Gong authored at least 57 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation.
CoRR, 2024

SeamlessExpressiveLM: Speech Language Model for Expressive Speech-to-Speech Translation with Chain-of-Thought.
CoRR, 2024

MSLM-S2ST: A Multitask Speech Language Model for Textless Speech-to-Speech Translation with Speaker Style Preservation.
CoRR, 2024

An Empirical Study of Speech Language Models for Prompt-Conditioned Speech Synthesis.
CoRR, 2024

2023
Seamless: Multilingual Expressive and Streaming Speech Translation.
CoRR, 2023

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation.
CoRR, 2023

Multilingual Speech-to-Speech Translation into Multiple Target Languages.
CoRR, 2023

Exploration on HuBERT with Multiple Resolutions.
CoRR, 2023

Exploration on HuBERT with Multiple Resolution.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Pre-training for Speech Translation: CTC Meets Optimal Transport.
Proceedings of the International Conference on Machine Learning, 2023

Improving Speech-to-Speech Translation Through Unlabeled Text.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Holistic Cascade System, Benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Named Entity Detection and Injection for Direct Speech Translation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Non-compositional Expression Generation Based on Curriculum Learning and Continual Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Speech-to-Speech Translation for a Real-world Unwritten Language.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Efficient Language Modeling with Sparse all-MLP.
CoRR, 2022

Textless Speech-to-Speech Translation on Real Data.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022


From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Contrastive Clustering to Mine Pseudo Parallel Data for Unsupervised Translation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

T-Modules: Translation Modules for Zero-Shot Cross-Modal Machine Translation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Unified Speech-Text Pre-training for Speech Translation and Recognition.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Idiomatic Expression Paraphrasing without Strong Supervision.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Textless Speech-to-Speech Translation on Real Data.
CoRR, 2021

Direct simultaneous speech to speech translation.
CoRR, 2021

Incremental Speech Synthesis For Speech-To-Speech Translation.
CoRR, 2021

LAWDR: Language-Agnostic Weighted Document Representations from Pre-trained Models.
CoRR, 2021

Demystify Optimization Challenges in Multilingual Transformers.
CoRR, 2021

Adaptive Sparse Transformer for Multilingual Translation.
CoRR, 2021

From Solving a Problem Boldly to Cutting the Gordian Knot: Idiomatic Text Generation.
CoRR, 2021

Self-Supervised Euphemism Detection and Identification for Content Moderation.
Proceedings of the 42nd IEEE Symposium on Security and Privacy, 2021

Robust Optimization for Multilingual Translation with Imbalanced Data.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Multimodal and Multilingual Embeddings for Large-Scale Speech Mining.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

PIE: A Parallel Idiomatic Expression Corpus for Idiomatic Sentence Generation and Paraphrasing.
Proceedings of the 17th Workshop on Multiword Expressions, 2021

FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Abusive Language Detection in Heterogeneous Contexts: Dataset Collection and the Role of Supervised Attention.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Representation learning of natural language and its application to language understanding and generation
PhD thesis, 2020

FUSE: Multi-faceted Set Expansion by Coherent Clustering of Skip-Grams.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2020

Rich Syntactic and Semantic Information Helps Unsupervised Text Style Transfer.
Proceedings of the 13th International Conference on Natural Language Generation, 2020

Enriching Word Embeddings with Temporal and Spatial Information.
Proceedings of the 24th Conference on Computational Natural Language Learning, 2020

Recurrent Chunking Mechanisms for Long-Text Machine Reading Comprehension.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

IlliniMet: Illinois System for Metaphor Detection with Contextual and Linguistic Information.
Proceedings of the Second Workshop on Figurative Language Processing, 2020

2019
Context-Sensitive Malicious Spelling Error Correction.
Proceedings of the World Wide Web Conference, 2019

Reinforcement Learning Based Text Style Transfer without Parallel Training Corpus.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

PaRe: A Paper-Reviewer Matching Approach Using a Common Topic Space.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Equipping Educational Applications with Domain Knowledge.
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications, 2019

2018
Embedding Syntax and Semantics of Prepositions via Tensor Decomposition.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Preposition Sense Disambiguation and Representation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Document Similarity for Texts of Varying Lengths via Hidden Topics.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Distributed Multicast Tree Construction in Wireless Sensor Networks.
IEEE Trans. Inf. Theory, 2017

Prepositions in Context.
CoRR, 2017

Geometry of Compositionality.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2015
A Distributed Algorithm to Construct Multicast Trees in WSNs: An Approximate Steiner Tree Approach.
Proceedings of the 16th ACM International Symposium on Mobile Ad Hoc Networking and Computing, 2015

A distributed algorithm to construct multicast trees in wireless multi-hop networks.
Proceedings of the 2015 IEEE International Conference on Communications, 2015

