Jan Niehues

Orcid: 0000-0002-4231-6543

According to our database, Jan Niehues authored at least 163 papers between 2007 and 2024.

Bibliography

2024
Augmenting Automatic Speech Recognition Models with Disfluency Detection.
CoRR, 2024

Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024.
CoRR, 2024

SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading.
CoRR, 2024

Quality Estimation with k-nearest Neighbors and Automatic Evaluation for Model-specific Quality Estimation.
CoRR, 2024

Plug, Play, and Fuse: Zero-Shot Joint Decoding via Word-Level Re-ranking across Diverse Vocabularies.
Proceedings of the Ninth Conference on Machine Translation, 2024

Identifying the Information Gap for Visually Impaired Students during Lecture Talks.
Proceedings of the 2024 IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC), 2024

Language-Independent Representations Improve Zero-Shot Summarization.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

Contextual Refinement of Translations: Large Language Models for Sentence and Document-Level Post-Editing.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Quality Estimation with k-nearest Neighbors and Automatic Evaluation for Model-specific Quality Estimation.
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), 2024

How Transferable are Attribute Controllers on Pretrained Multilingual Translation Models?
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Benchmarking Diffusion Models for Machine Translation.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Evaluating the IWSLT2023 Speech Translation Tasks: Human Annotations, Automatic Metrics, and Segmentation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Speech Recognition Corpus of the Khinalug Language for Documenting Endangered Languages.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Joint modelling of audio-visual cues using attention mechanisms for emotion recognition.
Multim. Tools Appl., March, 2023

Audience-specific Explanations for Machine Translation.
CoRR, 2023

Gender Lost In Translation: How Bridging The Gap Between Languages Affects Gender Bias in Zero-Shot Multilingual Translation.
CoRR, 2023

Perturbation-based QE: An Explainable, Unsupervised Word-level Quality Estimation Method for Blackbox Machine Translation.
CoRR, 2023

Train Global, Tailor Local: Minimalist Multilingual Translation into Endangered Languages.
CoRR, 2023

Evaluating Multilingual Speech Translation under Realistic Conditions with Resegmentation and Terminology.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Towards Efficient Simultaneous Speech Translation: CUNI-KIT System for Simultaneous Track at IWSLT 2023.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

KIT's Multilingual Speech Translation System for IWSLT 2023.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023


Towards continually learning new languages.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

End-to-End Evaluation for Low-Latency Simultaneous Speech Translation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Analyzing Challenges in Neural Machine Translation for Software Localization.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2022
Efficient Speech Translation with Pre-trained Models.
CoRR, 2022

CUNI-KIT System for Simultaneous Speech Translation Task at IWSLT 2022.
CoRR, 2022

Multilingual Simultaneous Speech Translation.
CoRR, 2022

Cost-Effective Training in Low-Resource Neural Machine Translation.
CoRR, 2022

Learning an Artificial Language for Knowledge-Sharing in Multilingual Translation.
Proceedings of the Seventh Conference on Machine Translation, 2022

LibriS2S: A German-English Speech-to-Speech Translation Corpus.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

CUNI-KIT System for Simultaneous Speech Translation Task at IWSLT 2022.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022

Effective combination of pretrained models - KIT@IWSLT2022.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022


Adaptive multilingual speech recognition with pretrained models.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

TED Talk Teaser Generation with Pre-Trained Models.
Proceedings of the IEEE International Conference on Acoustics, 2022

Tackling Data Scarcity in Speech Translation Using Zero-Shot Multilingual Machine Translation Techniques.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Unsupervised Machine Translation On Dravidian Languages.
CoRR, 2021

Maastricht University's Large-Scale Multilingual Machine Translation System for WMT 2021.
Proceedings of the Sixth Conference on Machine Translation, 2021


Maastricht University's Multilingual Speech Translation System for IWSLT 2021.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021


Tutorial Proposal: End-to-End Speech Translation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Tutorial Abstracts, 2021

Continuous Learning in Neural Machine Translation using Bilingual Dictionaries.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Improving Zero-Shot Translation by Disentangling Positional Information.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Optimizing segmentation granularity for neural machine translation.
Mach. Transl., 2020

Low Latency ASR for Simultaneous Speech Translation.
CoRR, 2020

Adapting End-to-End Speech Recognition for Readable Subtitles.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020


Relative Positional Encoding for Speech Recognition and Direct Translation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Low-Latency Sequence-to-Sequence Speech Recognition and Translation by Partial Hypothesis Selection.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Multimodal Attention-Mechanism For Temporal Emotion Recognition.
Proceedings of the IEEE International Conference on Image Processing, 2020

Improving Sequence-To-Sequence Speech Recognition Training with On-The-Fly Data Augmentation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Detailed Analysis of Different Strategies for Phrase Table Adaptation in SMT.
Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Research Papers, 2020

Machine Translation with Unsupervised Length-Constraints.
Proceedings of the 14th Conference of the Association for Machine Translation in the Americas, 2020

2019
Attention-Passing Models for Robust and Data-Efficient End-to-End Speech Translation.
Trans. Assoc. Comput. Linguistics, 2019

Improving Zero-shot Translation with Language-Independent Constraints.
Proceedings of the Fourth Conference on Machine Translation, 2019

The IWSLT 2019 KIT Speech Translation System.
Proceedings of the 16th International Conference on Spoken Language Translation, 2019

The IWSLT 2019 Evaluation Campaign.
Proceedings of the 16th International Conference on Spoken Language Translation, 2019

Very Deep Self-Attention Networks for End-to-End Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Survey Talk: A Survey on Speech Translation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Modeling Confidence in Sequence-to-Sequence Models.
Proceedings of the 12th International Conference on Natural Language Generation, 2019

Incremental processing of noisy user utterances in the spoken language understanding task.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

2018
Open Source Toolkit for Speech to Text Translation.
Prague Bull. Math. Linguistics, 2018

Multi-task learning to improve natural language understanding.
CoRR, 2018

A Hierarchical Approach to Neural Context-Aware Modeling.
CoRR, 2018

The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2018.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Towards Fluent Translations From Disfluent Speech.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Automated Evaluation of Out-of-Context Errors.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

KIT-Multi: A Translation-Oriented Multilingual Embedding Corpus.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

KIT's IWSLT 2018 SLT Translation System.
Proceedings of the 15th International Conference on Spoken Language Translation, 2018

The IWSLT 2018 Evaluation Campaign.
Proceedings of the 15th International Conference on Spoken Language Translation, 2018

An End-to-End Goal-Oriented Dialog System with a Generative Natural Language Response Generation.
Proceedings of the 9th International Workshop on Spoken Dialogue System Technology, 2018

Self-Attentional Acoustic Models.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Low-Latency Neural Speech Translation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Term Extraction via Neural Sequence Labeling a Comparative Evaluation of Strategies Using Recurrent Neural Networks.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

KIT Lecture Translator: Multilingual Speech Translation with One-Shot Learning.
Proceedings of the COLING 2018, 2018

Towards one-shot learning for rare-word translation with external experts.
Proceedings of the 2nd Workshop on Neural Machine Translation and Generation, 2018

Robust and Scalable Differentiable Neural Computer for Question Answering.
Proceedings of the Workshop on Machine Reading for Question Answering@ACL 2018, 2018

2017
Transcribing against time.
Speech Commun., 2017

The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2017.
Proceedings of the Second Conference on Machine Translation, 2017

The QT21 Combined Machine Translation System for English to Latvian.
Proceedings of the Second Conference on Machine Translation, 2017

Exploiting Linguistic Resources for Neural Machine Translation Using Multi-task Learning.
Proceedings of the Second Conference on Machine Translation, 2017

Toward Robust Neural Machine Translation for Noisy Input Sequences.
Proceedings of the 14th International Conference on Spoken Language Translation, 2017

KIT's Multilingual Neural Machine Translation systems for IWSLT 2017.
Proceedings of the 14th International Conference on Spoken Language Translation, 2017

Effective Strategies in Zero-Shot Neural Machine Translation.
Proceedings of the 14th International Conference on Spoken Language Translation, 2017

Domain-independent Punctuation and Segmentation Insertion.
Proceedings of the 14th International Conference on Spoken Language Translation, 2017

Overview of the IWSLT 2017 Evaluation Campaign.
Proceedings of the 14th International Conference on Spoken Language Translation, 2017

Comparison of Decoding Strategies for CTC Acoustic Models.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

NMT-Based Segmentation and Punctuation Insertion for Real-Time Spoken Language Translation.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Neural Lattice-to-Sequence Models for Uncertain Inputs.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Analyzing Neural MT Search and Model Performance.
Proceedings of the First Workshop on Neural Machine Translation, 2017

2016
Reordering space design in statistical machine translation.
Lang. Resour. Evaluation, 2016


Using Factored Word Representation in Neural Network Language Models.
Proceedings of the First Conference on Machine Translation, 2016

The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2016.
Proceedings of the First Conference on Machine Translation, 2016

Lecture Translator - Speech translation framework for simultaneous lecture translation.
Proceedings of the Demonstrations Session, 2016

Integrating Encyclopedic Knowledge into Neural Language Models.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Toward Multilingual Neural Machine Translation with Universal Encoder and Decoder.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Multilingual Disfluency Removal using NMT.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Adaptation and Combination of NMT Systems: The KIT Translation Systems for IWSLT 2016.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

The IWSLT 2016 Evaluation Campaign.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Towards an Open-Domain Social Dialog System.
Proceedings of the Dialogues with Social Robots, 2016

Dynamic Transcription for Low-Latency Speech Translation.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Lightly Supervised Quality Estimation.
Proceedings of the COLING 2016, 2016

Pre-Translation for Neural Machine Translation.
Proceedings of the COLING 2016, 2016

Using Tweets as "Ice-Breaking" Sentences in a Social Dialog System.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

2015
ListNet-based MT Rescoring.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

The KIT-LIMSI Translation System for WMT 2015.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

The Karlsruhe Institute of Technology Translation Systems for the WMT 2015.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

Source discriminative word lexicon for translation disambiguation.
Proceedings of the 12th International Workshop on Spoken Language Translation: Papers, 2015

The KIT translation systems for IWSLT 2015.
Proceedings of the 12th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2015, 2015

Punctuation insertion for real-time spoken language translation.
Proceedings of the 12th International Workshop on Spoken Language Translation: Papers, 2015

The IWSLT 2015 Evaluation Campaign.
Proceedings of the 12th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2015, 2015

Combination of NN and CRF models for joint detection of punctuation and disfluencies.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Stripping Adjectives: Integration Techniques for Selective Stemming in SMT Systems.
Proceedings of the 18th Annual Conference of the European Association for Machine Translation, 2015

2014
Adaptation in Machine Translation.
PhD thesis, 2014

LIMSI @ WMT'14 Medical Translation Task.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

The Karlsruhe Institute of Technology Translation Systems for the WMT 2014.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

The KIT-LIMSI Translation System for WMT 2014.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

Manual Analysis of Structurally Informed Reordering in German-English Machine Translation.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

The KIT translation systems for IWSLT 2014.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

Lexical translation model using a deep neural network architecture.
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014

Combined spoken language translation.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

Machine translation of multi-party meetings: segmentation and disfluency removal strategies.
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014

Report on the 11th IWSLT evaluation campaign.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

Tight Integration of Speech Disfluency Removal into SMT.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Combining techniques from different NN-based language models for machine translation.
Proceedings of the 11th Conference of the Association for Machine Translation in the Americas: MT Researchers Track, 2014

2013
Joint WMT 2013 Submission of the QUAERO Project.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

An MT Error-Driven Discriminative Word Lexicon using Sentence Structure Features.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

The Karlsruhe Institute of Technology Translation Systems for the WMT 2013.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Combining Word Reordering Methods on different Linguistic Abstraction Levels for Statistical Machine Translation.
Proceedings of the Seventh Workshop on Syntax, 2013

Analyzing the potential of source sentence reordering in statistical machine translation.
Proceedings of the 10th International Workshop on Spoken Language Translation: Papers, 2013

The KIT translation systems for IWSLT 2013.
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

EU-BRIDGE MT: text translation of talks in the EU-BRIDGE project.
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

Report on the 10th IWSLT evaluation campaign.
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

A real-world system for simultaneous translation of German lectures.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Letter N-Gram-based Input Encoding for Continuous Space Language Models.
Proceedings of the Workshop on Continuous Vector Space Models and their Compositionality, 2013

2012
Parallel Phrase Scoring for Extra-large Corpora.
Prague Bull. Math. Linguistics, 2012

The Karlsruhe Institute of Technology Translation Systems for the WMT 2012.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

Joint WMT 2012 Submission of the QUAERO Project.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Continuous space language models using restricted Boltzmann machines.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

The KIT translation systems for IWSLT 2012.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Segmentation and punctuation prediction in speech language translation using a monolingual translation system.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

2011
Wider Context by Using Bilingual Language Models in Machine Translation.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

The Karlsruhe Institute of Technology Translation Systems for the WMT 2011.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Joint WMT Submission of the QUAERO Project.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Using Wikipedia to translate domain-specific terms in SMT.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

The KIT English-French translation systems for IWSLT 2011.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Advances on spoken language translation in the Quaero program.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

2010
The Karlsruhe Institute for Technology Translation System for the ACL-WMT 2010.
Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010

The KIT translation system for IWSLT 2010.
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010

Domain Adaptation in Statistical Machine Translation using Factored Translation Models.
Proceedings of the 14th Annual conference of the European Association for Machine Translation, 2010

Quaero Speech-to-Text and Text Translation Evaluation Systems.
Proceedings of the High Performance Computing in Science and Engineering '10, 2010

2009
A POS-Based Model for Long-Range Reorderings in SMT.
Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009

The Universität Karlsruhe Translation System for the EACL-WMT 2009.
Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009

2008
Discriminative Word Alignment via Alignment Matrix Modeling.
Proceedings of the Third Workshop on Statistical Machine Translation, 2008

Simultaneous machine translation of German lectures into English: Investigating research challenges for the future.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Simultaneous German-English lecture translation.
Proceedings of the 2008 International Workshop on Spoken Language Translation, 2008

2007
The ISL Phrase-Based MT System for the 2007 ACL Workshop on Statistical Machine Translation.
Proceedings of the Second Workshop on Statistical Machine Translation, 2007

