Chenhui Chu

Orcid: 0000-0001-9848-6384

According to our database, Chenhui Chu authored at least 128 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Voices of the Himalayas: Benchmarking Speech Recognition Systems for the Tibetan Language.
Int. J. Asian Lang. Process., March, 2024

EMS: Efficient and Effective Massively Multilingual Sentence Embedding Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Bilingual Corpus Mining and Multistage Fine-tuning for Improving Machine Translation of Lecture Transcripts.
J. Inf. Process., 2024

Beyond English-Centric LLMs: What Language Do Multilingual Language Models Think in?
CoRR, 2024

Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents.
CoRR, 2024

StyEmp: Stylizing Empathetic Response Generation via Multi-Grained Prefix Encoder and Personality Reinforcement.
Proceedings of the 25th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2024

MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction.
Proceedings of the IEEE International Conference on Acoustics, 2024

Enhancing Two-Stage Finetuning for Speech Emotion Recognition Using Adapters.
Proceedings of the IEEE International Conference on Acoustics, 2024

SubMerge: Merging Equivalent Subword Tokenizations for Subword Regularized Models in Neural Machine Translation.
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), 2024

Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Abstractive Multi-Video Captioning: Benchmark Dataset Construction and Extensive Evaluation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Rapidly Developing High-quality Instruction Data and Evaluation Benchmark for Large Language Models with Minimal Human Effort: A Case Study on Japanese.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Identifying Source Language Expressions for Pre-editing in Machine Translation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

MM-LLMs: Recent Advances in MultiModal Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Flexible Weight Tuning and Weight Fusion Strategies for Continual Named Entity Recognition.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

MELD-ST: An Emotion-aware Speech Translation Dataset.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Finetuning Pretrained Model with Embedding of Domain and Language Information for ASR of Very Low-Resource Settings.
Int. J. Asian Lang. Process., December, 2023

Dual variational generative model and auxiliary retrieval for empathetic response generation by conversational robot.
Adv. Robotics, November, 2023

SCTB-V2: the 2nd version of the Chinese treebank in the scientific domain.
Lang. Resour. Evaluation, September, 2023

SelfSeg: A Self-supervised Sub-word Segmentation Method for Neural Machine Translation.
ACM Trans. Asian Low Resour. Lang. Inf. Process., August, 2023

Spatial Hierarchical Attention Network Based Video-guided Machine Translation.
J. Inf. Process., 2023

Variable-length Neural Interlingua Representations for Zero-shot Neural Machine Translation.
CoRR, 2023

Reasoning before Responding: Integrating Commonsense-based Causality Explanation for Empathetic Response Generation.
Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, 2023

KyotoMOS: An Automatic MOS Scoring System for Speech Synthesis.
Proceedings of the ACM Multimedia Asia Workshops, 2023

The Kyoto Speech-to-Speech Translation System for IWSLT 2023.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Two-stage Finetuning of Wav2vec 2.0 for Speech Emotion Recognition with ASR and Gender Pretraining.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Hierarchical Softmax for End-To-End Low-Resource Multilingual Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Domain and Language Adaptation Using Heterogeneous Datasets for Wav2vec2.0-Based Speech Recognition of Low-Resource Language.
Proceedings of the IEEE International Conference on Acoustics, 2023

DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Video-Helpful Multimodal Machine Translation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

ARKitSceneRefer: Text-based Localization of Small Objects in Diverse Real-World 3D Indoor Scenes.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Investigating the Gap Between Single-Hop and Multi-Hop Questions in Closed-Book Question Answering via Question Decomposition.
Proceedings of the Distributed Computing and Artificial Intelligence, 2023

FedCPC: An Effective Federated Contrastive Learning Method for Privacy Preserving Early-Stage Alzheimer's Speech Detection.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Towards Speech Dialogue Translation Mediating Speakers of Different Languages.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Exploring the Impact of Layer Normalization for Zero-shot Neural Machine Translation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
Word-Region Alignment-Guided Multimodal Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Linguistically Driven Multi-Task Pre-Training for Low-Resource Neural Machine Translation.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2022

Information Extraction from Public Meeting Articles.
SN Comput. Sci., 2022

Corpus Construction for Historical Newspapers: A Case Study on Public Meeting Corpus Construction Using OCR Error Correction.
SN Comput. Sci., 2022

Region-attentive multimodal neural machine translation.
Neurocomputing, 2022

The semantic typology of visually grounded paraphrases.
Comput. Vis. Image Underst., 2022

Learning More May Not Be Better: Knowledge Transferability in Vision and Language Tasks.
CoRR, 2022

EMS: Efficient and Effective Massively Multilingual Sentence Representation Learning.
CoRR, 2022

When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation?
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

VISA: An Ambiguous Subtitles Dataset for Visual Scene-aware Machine Translation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Fusion of Self-supervised Learned Models for MOS Prediction.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Leveraging Simultaneous Translation for Enhancing Transcription of Low-resource Language via Cross Attention Mechanism.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

BERTSeg: BERT Based Unsupervised Subword Segmentation for Neural Machine Translation.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

Overview of the 9th Workshop on Asian Translation.
Proceedings of the 9th Workshop on Asian Translation, 2022

Flexible Visual Grounding.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2022

2021
Preordering Encoding on Transformer for Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Flexibly Focusing on Supporting Facts, Using Bridge Links, and Jointly Training Specialized Modules for Multi-Hop Question Answering.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

TriECCC: Trilingual Corpus of the Extraordinary Chambers in the Courts of Cambodia for Speech Recognition and Translation Studies.
Int. J. Asian Lang. Process., 2021

A comparative study of language transformers for video question answering.
Neurocomputing, 2021

A Survey of Multilingual Neural Machine Translation.
ACM Comput. Surv., 2021

A Picture May Be Worth a Hundred Words for Visual Question Answering.
CoRR, 2021

Understanding the Role of Scene Graphs in Visual Question Answering.
CoRR, 2021

Cross-Lingual Visual Grounding.
IEEE Access, 2021

The Laughing Machine: Predicting Humor in Video.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Khmer Speech Translation Corpus of the Extraordinary Chambers in the Courts of Cambodia (ECCC).
Proceedings of the 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2021

WRIME: A New Dataset for Emotional Intensity Estimation with Subjective and Objective Annotations.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

GCNBoost: Artwork Classification by Label Propagation through a Knowledge Graph.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Visual Question Answering with Textual Representations for Images.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Transferring Domain-Agnostic Knowledge in Video Question Answering.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

On the Use of Speaker Information for Automatic Speech Recognition in Speaker-imbalanced Corpora.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

TMEKU System for the WAT2021 Multimodal Translation Task.
Proceedings of the 8th Workshop on Asian Translation, 2021


Attending Self-Attention: A Case Study of Visually Grounded Supervision in Vision-and-Language Transformers.
Proceedings of the ACL-IJCNLP 2021 Student Research Workshop, 2021

Lightweight Cross-Lingual Sentence Representation Learning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Video-guided Machine Translation with Spatial Hierarchical Attention Network.
Proceedings of the ACL-IJCNLP 2021 Student Research Workshop, 2021

2020
A Survey of Domain Adaptation for Machine Translation.
J. Inf. Process., 2020

Visually grounded paraphrase identification via gating and phrase localization.
Neurocomputing, 2020

A Corpus for English-Japanese Multimodal Neural Machine Translation with Comparable Sentences.
CoRR, 2020

Lexically Cohesive Neural Machine Translation with Copy Mechanism.
CoRR, 2020

Constructing a Visual Relationship Authenticity Dataset.
CoRR, 2020

Knowledge-Based Visual Question Answering in Videos.
CoRR, 2020

A Comprehensive Survey of Multilingual Neural Machine Translation.
CoRR, 2020

Improving topic modeling through homophily for legal documents.
Appl. Netw. Sci., 2020

BERT Representations for Video Question Answering.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Constructing a Public Meeting Corpus.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Annotation of Adverse Drug Reactions in Patients' Weblogs.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

A Dataset and Baselines for Visual Question Answering on Art.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Double Attention-based Multimodal Neural Machine Translation with Semantic Image Regions.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020

Multilingual Neural Machine Translation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Meta Ensemble for Japanese-Chinese Neural Machine Translation: Kyoto-U+ECNU Participation to WAT 2020.
Proceedings of the 7th Workshop on Asian Translation, 2020

IDSOU at WNUT-2020 Task 2: Identification of Informative COVID-19 English Tweets.
Proceedings of the Sixth Workshop on Noisy User-generated Text, 2020

Text Classification with Negative Supervision.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

KnowIT VQA: Answering Knowledge-Based Questions about Videos.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
New Areas of Application of Comparable Corpora.
Proceedings of the Using Comparable Corpora for Under-Resourced Areas of Machine Translation, 2019

Multilingual Multi-Domain Adaptation Approaches for Neural Machine Translation.
CoRR, 2019

Using Natural Language Processing to Develop an Automated Orthodontic Diagnostic System.
CoRR, 2019

Exploiting Multilingualism through Multistage Fine-Tuning for Low-Resource Neural Machine Translation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Legal Information as a Complex Network: Improving Topic Modeling Through Homophily.
Proceedings of the Complex Networks and Their Applications VIII, 2019

2018
A Comprehensive Empirical Comparison of Domain Adaptation Methods for Neural Machine Translation.
J. Inf. Process., 2018

Word Rewarding for Adequate Neural Machine Translation.
Proceedings of the 15th International Conference on Spoken Language Translation, 2018

A Survey of Domain Adaptation for Neural Machine Translation.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

iParaphrasing: Extracting Visually Grounded Paraphrases via an Image.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Osaka University MT Systems for WAT 2018: Rewarding, Preordering, and Domain Adaptation.
Proceedings of the 32nd Pacific Asia Conference on Language, 2018

Recursive Neural Network Based Preordering for English-to-Japanese Machine Translation.
Proceedings of ACL 2018, Melbourne, Australia, July 15-20, 2018, Student Research Workshop, 2018

2017
Constrained Partial Parsing Based Dependency Tree Projection for Tree-to-Tree Machine Translation.
Inf. Media Technol., 2017

An Empirical Comparison of Simple Domain Adaptation Methods for Neural Machine Translation.
CoRR, 2017

An Empirical Comparison of Domain Adaptation Methods for Neural Machine Translation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Integrated Parallel Sentence and Fragment Extraction from Comparable Corpora: A Case Study on Chinese-Japanese Wikipedia.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2016

Parallel Sentence Extraction Based on Unsupervised Bilingual Lexicon Extraction from Comparable Corpora.
Inf. Media Technol., 2016

Supervised Syntax-based Alignment between English Sentences and Abstract Meaning Representation Graphs.
CoRR, 2016

Cross-language Projection of Dependency Trees with Constrained Partial Parsing for Tree-to-Tree Machine Translation.
Proceedings of the First Conference on Machine Translation, 2016

Paraphrasing Out-of-Vocabulary Words with Word Embeddings and Semantic Lexicons for Low Resource Statistical Machine Translation.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Parallel Sentence Extraction from Comparable Corpora with Neural Network Features.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Simultaneous Sentence Boundary Detection and Alignment with Pivot-based Machine Translation Generated Lexicons.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Consistent Word Segmentation, Part-of-Speech Tagging and Dependency Labelling Annotation for Chinese Language.
Proceedings of the COLING 2016, 2016

Kyoto University Participation to WAT 2016.
Proceedings of the 3rd Workshop on Asian Translation, 2016

Dependency Forest based Word Alignment.
Proceedings of the ACL 2016 Student Research Workshop, Berlin, Germany, August 7-12, 2016, 2016

SCTB: A Chinese Treebank in Scientific Domain.
Proceedings of the 12th Workshop on Asian Language Resources, 2016

2015
Integrated Parallel Data Extraction from Comparable Corpora for Statistical Machine Translation.
PhD thesis, 2015

Cross-language Projection of Dependency Trees for Tree-to-tree Machine Translation.
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, 2015

Large-scale Dictionary Construction via Pivot-based Statistical Machine Translation with Significance Pruning and Neural Network Features.
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, 2015

KyotoEBMT System Description for the 2nd Workshop on Asian Translation.
Proceedings of the 2nd Workshop on Asian Translation, 2015

2014
Improving Statistical Machine Translation Accuracy Using Bilingual Lexicon Extraction with Paraphrases.
Proceedings of the 28th Pacific Asia Conference on Language, Information and Computation, 2014

Constructing a Chinese-Japanese Parallel Corpus from Wikipedia.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Iterative Bilingual Lexicon Extraction from Comparable Corpora with Topical and Contextual Knowledge.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2014

2013
Chinese-Japanese Machine Translation Exploiting Chinese Characters.
ACM Trans. Asian Lang. Inf. Process., 2013

Accurate Parallel Fragment Extraction from Quasi-Comparable Corpora using Alignment Model and Translation Lexicon.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

Chinese-Japanese Parallel Sentence Extraction from Quasi-Comparable Corpora.
Proceedings of the Sixth Workshop on Building and Using Comparable Corpora, 2013

2012
Chinese Characters Mapping Table of Japanese, Traditional Chinese and Simplified Chinese.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

EBMT System of Kyoto University in OLYMPICS Task at IWSLT 2012.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Exploiting Shared Chinese Characters in Chinese Word Segmentation Optimization for Chinese-Japanese Machine Translation.
Proceedings of the 16th Annual conference of the European Association for Machine Translation, 2012

2011
Japanese-Chinese Phrase Alignment Using Common Chinese Characters Information.
Proceedings of Machine Translation Summit XIII: Papers, 2011

