2025
Bridging Speech Emotion Recognition and Personality: Dataset and Temporal Interaction Condition Network.
CoRR, May, 2025
When Large Language Models Meet Speech: A Survey on Integration Approaches.
CoRR, February, 2025
Cross-lingual Embedding Clustering for Hierarchical Softmax in Low-Resource Multilingual Speech Recognition.
CoRR, January, 2025
Extending Whisper for Emotion Prediction Using Word-level Pseudo Labels.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
2024
Voices of the Himalayas: Benchmarking Speech Recognition Systems for the Tibetan Language.
Int. J. Asian Lang. Process., March, 2024
EMS: Efficient and Effective Massively Multilingual Sentence Embedding Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Bilingual Corpus Mining and Multistage Fine-tuning for Improving Machine Translation of Lecture Transcripts.
J. Inf. Process., 2024
Beyond English-Centric LLMs: What Language Do Multilingual Language Models Think in?
CoRR, 2024
Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents.
CoRR, 2024
StyEmp: Stylizing Empathetic Response Generation via Multi-Grained Prefix Encoder and Personality Reinforcement.
Proceedings of the 25th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2024
Investigating ASR Error Correction with Large Language Model and Multilingual 1-best Hypotheses.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
Speech Emotion Recognition with Multi-level Acoustic and Semantic Information Extraction and Interaction.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction.
Proceedings of the IEEE International Conference on Acoustics, 2024
Enhancing Two-Stage Finetuning for Speech Emotion Recognition Using Adapters.
Proceedings of the IEEE International Conference on Acoustics, 2024
SubMerge: Merging Equivalent Subword Tokenizations for Subword Regularized Models in Neural Machine Translation.
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), 2024
Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Abstractive Multi-Video Captioning: Benchmark Dataset Construction and Extensive Evaluation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Rapidly Developing High-quality Instruction Data and Evaluation Benchmark for Large Language Models with Minimal Human Effort: A Case Study on Japanese.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Identifying Source Language Expressions for Pre-editing in Machine Translation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Data Selection using Spoken Language Identification for Low-Resource and Zero-Resource Speech Recognition.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2024
MM-LLMs: Recent Advances in MultiModal Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Flexible Weight Tuning and Weight Fusion Strategies for Continual Named Entity Recognition.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
MELD-ST: An Emotion-aware Speech Translation Dataset.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
Finetuning Pretrained Model with Embedding of Domain and Language Information for ASR of Very Low-Resource Settings.
Int. J. Asian Lang. Process., December, 2023
Dual variational generative model and auxiliary retrieval for empathetic response generation by conversational robot.
Adv. Robotics, November, 2023
SCTB-V2: the 2nd version of the Chinese treebank in the scientific domain.
Lang. Resour. Evaluation, September, 2023
SelfSeg: A Self-supervised Sub-word Segmentation Method for Neural Machine Translation.
ACM Trans. Asian Low Resour. Lang. Inf. Process., August, 2023
Spatial Hierarchical Attention Network Based Video-guided Machine Translation.
J. Inf. Process., 2023
Variable-length Neural Interlingua Representations for Zero-shot Neural Machine Translation.
CoRR, 2023
Reasoning before Responding: Integrating Commonsense-based Causality Explanation for Empathetic Response Generation.
Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, 2023
KyotoMOS: An Automatic MOS Scoring System for Speech Synthesis.
Proceedings of the ACM Multimedia Asia Workshops, 2023
The Kyoto Speech-to-Speech Translation System for IWSLT 2023.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023
Two-stage Finetuning of Wav2vec 2.0 for Speech Emotion Recognition with ASR and Gender Pretraining.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Hierarchical Softmax for End-To-End Low-Resource Multilingual Speech Recognition.
,
,
,
,
,
,
,
,
,
,
Proceedings of the IEEE International Conference on Acoustics, 2023
Domain and Language Adaptation Using Heterogeneous Datasets for Wav2vec2.0-Based Speech Recognition of Low-Resource Language.
Proceedings of the IEEE International Conference on Acoustics, 2023
DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Video-Helpful Multimodal Machine Translation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
ARKitSceneRefer: Text-based Localization of Small Objects in Diverse Real-World 3D Indoor Scenes.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Investigating the Gap Between Single-Hop and Multi-Hop Questions in Closed-Book Question Answering via Question Decomposition.
Proceedings of the Distributed Computing and Artificial Intelligence, 2023
FedCPC: An Effective Federated Contrastive Learning Method for Privacy Preserving Early-Stage Alzheimers Speech Detection.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Towards Speech Dialogue Translation Mediating Speakers of Different Languages.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Exploring the Impact of Layer Normalization for Zero-shot Neural Machine Translation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023
2022
Word-Region Alignment-Guided Multimodal Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Linguistically Driven Multi-Task Pre-Training for Low-Resource Neural Machine Translation.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2022
Information Extraction from Public Meeting Articles.
SN Comput. Sci., 2022
Corpus Construction for Historical Newspapers: A Case Study on Public Meeting Corpus Construction Using OCR Error Correction.
SN Comput. Sci., 2022
Region-attentive multimodal neural machine translation.
Neurocomputing, 2022
The semantic typology of visually grounded paraphrases.
Comput. Vis. Image Underst., 2022
Learning More May Not Be Better: Knowledge Transferability in Vision and Language Tasks.
CoRR, 2022
EMS: Efficient and Effective Massively Multilingual Sentence Representation Learning.
CoRR, 2022
When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation?
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022
VISA: An Ambiguous Subtitles Dataset for Visual Scene-aware Machine Translation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Fusion of Self-supervised Learned Models for MOS Prediction.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Leveraging Simultaneous Translation for Enhancing Transcription of Low-resource Language via Cross Attention Mechanism.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
BERTSeg: BERT Based Unsupervised Subword Segmentation for Neural Machine Translation.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022
Overview of the 9th Workshop on Asian Translation.
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 9th Workshop on Asian Translation, 2022
Flexible Visual Grounding.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2022
2021
Preordering Encoding on Transformer for Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Flexibly Focusing on Supporting Facts, Using Bridge Links, and Jointly Training Specialized Modules for Multi-Hop Question Answering.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
TriECCC: Trilingual Corpus of the Extraordinary Chambers in the Courts of Cambodia for Speech Recognition and Translation Studies.
Int. J. Asian Lang. Process., 2021
A comparative study of language transformers for video question answering.
Neurocomputing, 2021
A Survey of Multilingual Neural Machine Translation.
ACM Comput. Surv., 2021
A Picture May Be Worth a Hundred Words for Visual Question Answering.
CoRR, 2021
Understanding the Role of Scene Graphs in Visual Question Answering.
CoRR, 2021
Cross-Lingual Visual Grounding.
IEEE Access, 2021
The Laughing Machine: Predicting Humor in Video.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021
Khmer Speech Translation Corpus of the Extraordinary Chambers in the Courts of Cambodia (ECCC).
Proceedings of the 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2021
WRIME: A New Dataset for Emotional Intensity Estimation with Subjective and Objective Annotations.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
GCNBoost: Artwork Classification by Label Propagation through a Knowledge Graph.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021
Visual Question Answering with Textual Representations for Images.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021
Transferring Domain-Agnostic Knowledge in Video Question Answering.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021
On the Use of Speaker Information for Automatic Speech Recognition in Speaker-imbalanced Corpora.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
TMEKU System for the WAT2021 Multimodal Translation Task.
Proceedings of the 8th Workshop on Asian Translation, 2021
Overview of the 8th Workshop on Asian Translation.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 8th Workshop on Asian Translation, 2021
Attending Self-Attention: A Case Study of Visually Grounded Supervision in Vision-and-Language Transformers.
Proceedings of the ACL-IJCNLP 2021 Student Research Workshop, 2021
Lightweight Cross-Lingual Sentence Representation Learning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Video-guided Machine Translation with Spatial Hierarchical Attention Network.
Proceedings of the ACL-IJCNLP 2021 Student Research Workshop, 2021
2020
A Survey of Domain Adaptation for Machine Translation.
J. Inf. Process., 2020
Visually grounded paraphrase identification via gating and phrase localization.
Neurocomputing, 2020
A Corpus for English-Japanese Multimodal Neural Machine Translation with Comparable Sentences.
CoRR, 2020
Lexically Cohesive Neural Machine Translation with Copy Mechanism.
CoRR, 2020
Constructing a Visual Relationship Authenticity Dataset.
CoRR, 2020
Knowledge-Based Visual Question Answering in Videos.
CoRR, 2020
A Comprehensive Survey of Multilingual Neural Machine Translation.
CoRR, 2020
Improving topic modeling through homophily for legal documents.
Appl. Netw. Sci., 2020
BERT Representations for Video Question Answering.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020
Constructing a Public Meeting Corpus.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Annotation of Adverse Drug Reactions in Patients' Weblogs.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
A Dataset and Baselines for Visual Question Answering on Art.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020
Double Attention-based Multimodal Neural Machine Translation with Semantic Image Regions.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020
Multilingual Neural Machine Translation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020
Meta Ensemble for Japanese-Chinese Neural Machine Translation: Kyoto-U+ECNU Participation to WAT 2020.
Proceedings of the 7th Workshop on Asian Translation, 2020
IDSOU at WNUT-2020 Task 2: Identification of Informative COVID-19 English Tweets.
Proceedings of the Sixth Workshop on Noisy User-generated Text, 2020
Text Classification with Negative Supervision.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
KnowIT VQA: Answering Knowledge-Based Questions about Videos.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
New Areas of Application of Comparable Corpora.
Proceedings of the Using Comparable Corpora for Under-Resourced Areas of Machine Translation, 2019
Multilingual Multi-Domain Adaptation Approaches for Neural Machine Translation.
CoRR, 2019
Using Natural Language Processing to Develop an Automated Orthodontic Diagnostic System.
CoRR, 2019
Exploiting Multilingualism through Multistage Fine-Tuning for Low-Resource Neural Machine Translation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Legal Information as a Complex Network: Improving Topic Modeling Through Homophily.
Proceedings of the Complex Networks and Their Applications VIII, 2019
2018
A Comprehensive Empirical Comparison of Domain Adaptation Methods for Neural Machine Translation.
J. Inf. Process., 2018
Word Rewarding for Adequate Neural Machine Translation.
Proceedings of the 15th International Conference on Spoken Language Translation, 2018
A Survey of Domain Adaptation for Neural Machine Translation.
Proceedings of the 27th International Conference on Computational Linguistics, 2018
iParaphrasing: Extracting Visually Grounded Paraphrases via an Image.
Proceedings of the 27th International Conference on Computational Linguistics, 2018
Osaka University MT Systems for WAT 2018: Rewarding, Preordering, and Domain Adaptation.
Proceedings of the 32nd Pacific Asia Conference on Language, 2018
Recursive Neural Network Based Preordering for English-to-Japanese Machine Translation.
Proceedings of ACL 2018, Melbourne, Australia, July 15-20, 2018, Student Research Workshop, 2018
2017
Constrained Partial Parsing Based Dependency Tree Projection for Tree-to-Tree Machine Translation.
Inf. Media Technol., 2017
An Empirical Comparison of Simple Domain Adaptation Methods for Neural Machine Translation.
CoRR, 2017
An Empirical Comparison of Domain Adaptation Methods for Neural Machine Translation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
2016
Integrated Parallel Sentence and Fragment Extraction from Comparable Corpora: A Case Study on Chinese-Japanese Wikipedia.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2016
Parallel Sentence Extraction Based on Unsupervised Bilingual Lexicon Extraction from Comparable Corpora.
Inf. Media Technol., 2016
Supervised Syntax-based Alignment between English Sentences and Abstract Meaning Representation Graphs.
CoRR, 2016
Cross-language Projection of Dependency Trees with Constrained Partial Parsing for Tree-to-Tree Machine Translation.
Proceedings of the First Conference on Machine Translation, 2016
Paraphrasing Out-of-Vocabulary Words with Word Embeddings and Semantic Lexicons for Low Resource Statistical Machine Translation.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
Parallel Sentence Extraction from Comparable Corpora with Neural Network Features.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
Simultaneous Sentence Boundary Detection and Alignment with Pivot-based Machine Translation Generated Lexicons.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
Consistent Word Segmentation, Part-of-Speech Tagging and Dependency Labelling Annotation for Chinese Language.
Proceedings of the COLING 2016, 2016
Kyoto University Participation to WAT 2016.
Proceedings of the 3rd Workshop on Asian Translation, 2016
Dependency Forest based Word Alignment.
Proceedings of the ACL 2016 Student Research Workshop, Berlin, Germany, August 7-12, 2016, 2016
SCTB: A Chinese Treebank in Scientific Domain.
Proceedings of the 12th Workshop on Asian Language Resources, 2016
2015
Integrated Parallel Data Extraction from Comparable Corpora for Statistical Machine Translation.
PhD thesis, 2015
Cross-language Projection of Dependency Trees for Tree-to-tree Machine Translation.
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, 2015
Large-scale Dictionary Construction via Pivot-based Statistical Machine Translation with Significance Pruning and Neural Network Features.
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, 2015
KyotoEBMT System Description for the 2nd Workshop on Asian Translation.
Proceedings of the 2nd Workshop on Asian Translation, 2015
2014
Improving Statistical Machine Translation Accuracy Using Bilingual Lexicon Extractionwith Paraphrases.
Proceedings of the 28th Pacific Asia Conference on Language, Information and Computation, 2014
Constructing a Chinese―Japanese Parallel Corpus from Wikipedia.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
Iterative Bilingual Lexicon Extraction from Comparable Corpora with Topical and Contextual Knowledge.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2014
2013
Chinese-Japanese Machine Translation Exploiting Chinese Characters.
ACM Trans. Asian Lang. Inf. Process., 2013
Accurate Parallel Fragment Extraction from Quasi-Comparable Corpora using Alignment Model and Translation Lexicon.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013
Chinese-Japanese Parallel Sentence Extraction from Quasi-Comparable Corpora.
Proceedings of the Sixth Workshop on Building and Using Comparable Corpora, 2013
2012
Chinese Characters Mapping Table of Japanese, Traditional Chinese and Simplified Chinese.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012
EBMT system of kyoto university in OLYMPICS task at IWSLT 2012.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012
Exploiting Shared Chinese Characters in Chinese Word Segmentation Optimization for Chinese-Japanese Machine Translation.
Proceedings of the 16th Annual conference of the European Association for Machine Translation, 2012
2011
Japanese-Chinese Phrase Alignment Using Common Chinese Characters Information.
Proceedings of Machine Translation Summit XIII: Papers, 2011