Guillaume Wisniewski

Orcid: 0000-0002-4445-080X

According to our database, Guillaume Wisniewski authored at least 93 papers between 2005 and 2024.

Évaluation de la Similarité Textuelle : Entre Sémantique et Surface dans les Représentations Neuronales.
Proceedings of the Actes de la 31ème Conférence sur le Traitement Automatique des Langues Naturelles, 2024

Mesure du niveau de proximité entre enregistrements audio et évaluation indirecte du niveau d'abstraction des représentations issues d'un grand modèle de langage.
Proceedings of the Actes des 35èmes Journées d'Études sur la Parole, 2024

Interprétation des mesures de similarité entre représentations neuronales de textes.
Proceedings of the Extraction et Gestion des Connaissances, 2024

Establishing degrees of closeness between audio recordings along different dimensions using large-scale cross-lingual models.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Assessing the Capacity of Transformer to Abstract Syntactic Representations: A Contrastive Analysis Based on Long-distance Agreement.
Trans. Assoc. Comput. Linguistics, 2023

From 'Snippet-lects' to Doculects and Dialects: Leveraging Neural Representations of Speech for Placing Audio Signals in a Language Landscape.
CoRR, 2023

Investigating Techniques for a Deeper Understanding of Neural Machine Translation (NMT) Systems through Data Filtering and Fine-tuning Strategies.
Proceedings of the Eighth Conference on Machine Translation, 2023

Multi-way Variational NMT for UGC: Improving Robustness in Zero-shot Scenarios via Mixture Density Networks.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

ProsAudit, a prosodic benchmark for self-supervised speech models.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Using Artificial French Data to Understand the Emergence of Gender Bias in Transformer Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

The SPECTRANS System Description for the WMT22 Biomedical Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

Les représentations distribuées sont-elles vraiment distribuées ? Observations sur la localisation de l'information syntaxique dans les tâches d'accord du verbe en français (How Distributed are Distributed Representations? An Observation on the Locality of Syntactic Information in Verb Agreement Tasks).
Proceedings of the Actes de la 29e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale, 2022

Probing phoneme, language and speaker information in unsupervised speech representations.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Plugging a neural phoneme recognizer into a simple language model: a workflow for low-resource setting.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Is the Language Familiarity Effect gradual ? A computational modelling approach.
Proceedings of the 44th Annual Meeting of the Cognitive Science Society, 2022

Analyzing Gender Translation Errors to Identify Information Flows between the Encoder and Decoder of a NMT System.
Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2022

How Distributed are Distributed Representations? An Observation on the Locality of Syntactic Information in Verb Agreement Tasks.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

User-friendly automatic transcription of low-resource languages: Plugging ESPnet into Elpis.
CoRR, 2021

The SPECTRANS System Description for the WMT21 Terminology Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

Biais de genre dans un système de traduction automatique neuronale : une étude préliminaire (Gender Bias in Neural Translation: a preliminary study).
Proceedings of the Actes de la 28e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale, 2021

Are Transformers a Modern Version of ELIZA? Observations on French Object Verb Agreement.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Are Neural Networks Extracting Linguistic Properties or Memorizing Training Data? An Observation with a Multilingual Probe for Predicting Tense.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Screening Gender Transfer in Neural Machine Translation.
Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2021

Noisy UGC Translation at the Character Level: Revisiting Open-Vocabulary Capabilities and Robustness of Char-Based Models.
Proceedings of the Seventh Workshop on Noisy User-generated Text, 2021

Understanding the Impact of UGC Specificities on Translation Quality.
Proceedings of the Seventh Workshop on Noisy User-generated Text, 2021

Analyse d'erreurs de transcriptions phonémiques automatiques d'une langue « rare » : le na (mosuo) (Analyzing errors in automatic phonemic transcriptions of the Na (Mosuo) language (Sino-Tibetan family)).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

Phonemic Transcription of Low-Resource Languages: To What Extent can Preprocessing be Automated?
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Non-linear n-best List Reranking with Few Features.
Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Research Papers, 2020

Combien d'exemples de tests sont-ils nécessaires à une évaluation fiable ? Quelques observations sur l'évaluation de l'analyse morphosyntaxique du français. (Some observations on the evaluation of PoS taggers).
Proceedings of the Actes de la Conférence sur le Traitement Automatique des Langues Naturelles (TALN) PFIA 2019. Volume II : Articles courts, 2019

Comparison between NMT and PBSMT Performance for Translating Noisy User-Generated Content.
Proceedings of the 22nd Nordic Conference on Computational Linguistics, NoDaLiDa 2019, Turku, Finland, September 30, 2019

How Bad are PoS Tagger in Cross-Corpora Settings? Evaluating Annotation Divergence in the UD Project.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Phonetic Normalization for Machine Translation of User Generated Content.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

PanParser: a Modular Implementation for Efficient Transition-Based Dependency Parsing.
Prague Bull. Math. Linguistics, 2018

Divergences entre annotations dans le projet Universal Dependencies et leur impact sur l'évaluation des performance d'étiquetage morpho-syntaxique (Evaluating Annotation Divergences in the UD Project).
Proceedings of the Actes de la Conférence TALN. CORIA-TALN-RJC 2018 - Volume 1, 2018

Analyse morpho-syntaxique en présence d'alternance codique (PoS tagging of Code Switching).
Proceedings of the Actes de la Conférence TALN. CORIA-TALN-RJC 2018 - Volume 1, 2018

Low-latency speaker spotting with online diarization and detection.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Automatically Selecting the Best Dependency Annotation Design with Dynamic Oracles.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Exploiting Dynamic Oracles to Train Projective Dependency Parsers on Non-Projective Trees.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Automated Paraphrase Lattice Creation for HyTER Machine Translation Evaluation.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Errator: a Tool to Help Detect Annotation Errors in the Universal Dependencies Project.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Quantifying training challenges of dependency parsers.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Reassessing the value of resources for cross-lingual transfer of POS tagging models.
Lang. Resour. Evaluation, 2017

LIMSI Submission for WMT'17 Shared Task on Bandit Learning.
Proceedings of the Second Conference on Machine Translation, 2017

Adaptation au domaine pour l'analyse morpho-syntaxique (Domain Adaptation for PoS tagging).
Proceedings of the Actes des 24ème Conférence sur le Traitement Automatique des Langues Naturelles. Orléans, France, June 26-30, 2017, Volume 2, 2017

Combining Speaker Turn Embedding and Incremental Structure Prediction for Low-Latency Speaker Diarization.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Don't Stop Me Now! Using Global Dynamic Oracles to Correct Training Biases of Transition-Based Dependency Parsers.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

LIMSI@CoNLL'17: UD Shared Task.
Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, 2017

A Systematic Comparison of Syntactic Representations of Dependency Parsing.
Proceedings of the NoDaLiDa Workshop on Universal Dependencies, 2017

LIMSI@WMT'16: Machine Translation of News.
Proceedings of the First Conference on Machine Translation, 2016

Apprentissage d'analyseur en dépendances cross-lingue par projection partielle de dépendances (Cross-lingual learning of dependency parsers from partially projected dependencies).
Proceedings of the Actes de la conférence conjointe JEP-TALN-RECITAL 2016. Volume 2 : TALN (Articles longs), 2016

Investigating gender adaptation for speech translation.
Proceedings of the Actes de la conférence conjointe JEP-TALN-RECITAL 2016. Volume 2 : TALN (Posters), 2016

Ne nous arrêtons pas en si bon chemin : améliorations de l'apprentissage global d'analyseurs en dépendances par transition (Don't Stop Me Now ! Improved Update Strategies for Global Training of Transition-Based).
Proceedings of the Actes de la conférence conjointe JEP-TALN-RECITAL 2016. Volume 2 : TALN (Articles longs), 2016

Frustratingly Easy Cross-Lingual Transfer for Transition-Based Dependency Parsing.
Proceedings of the NAACL HLT 2016, 2016

Cross-lingual and Supervised Models for Morphosyntactic Annotation: a Comparison on Romanian.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Zero-resource Dependency Parsing: Boosting Delexicalized Cross-lingual Transfer with Linguistic Knowledge.
Proceedings of the COLING 2016, 2016

Why Predicting Post-Edition is so Hard? Failure Analysis of LIMSI Submission to the APE Shared Task.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

Oublier ce qu'on sait, pour mieux apprendre ce qu'on ne sait pas : une étude sur les contraintes de type dans les modèles CRF.
Proceedings of the Actes de la 22e conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2015

Apprentissage par imitation pour l'étiquetage de séquences : vers une formalisation des méthodes d'étiquetage easy-first.
Proceedings of the Actes de la 22e conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2015

Structured prediction for speaker identification in TV series.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

LIMSI Submission for WMT'14 QE Task.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

Cross-Lingual POS Tagging through Ambiguous Learning: First Experiments (Apprentissage partiellement supervisé d'un étiqueteur morpho-syntaxique par transfert cross-lingue) [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2014

A Corpus of Machine Translation Errors Extracted from Translation Students Exercises.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Cross-Lingual Part-of-Speech Tagging through Ambiguous Learning.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Lattice BLEU oracles in machine translation.
ACM Trans. Speech Lang. Process., 2013

Oracle decoding as a new way to analyze phrase-based machine translation.
Mach. Transl., 2013

Quality estimation for machine translation: some lessons learned.
Mach. Transl., 2013

Fast Large-Margin Learning for Statistical Machine Translation.
Int. J. Comput. Linguistics Appl., 2013

LIMSI Submission for the WMT'13 Quality Estimation Task: an Experiment with N-Gram Posteriors.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

A corpus of post-edited translations (Un corpus d'erreurs de traduction) [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2013

Design and Analysis of a Large Corpus of Post-Edited Translations: Quality Estimation, Failure Analysis and the Variability of Post-Edition.
Proceedings of Machine Translation Summit XIV: Papers, 2013

On the Predictability of Human Assessment: when Matrix Completion Meets NLP Evaluation.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Non-Linear Models for Confidence Estimation.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

WSD for n-best reranking and local language modeling in SMT.
Proceedings of the Sixth Workshop on Syntax, 2012

Computing Lattice BLEU Oracle Scores for Machine Translation.
Proceedings of the EACL 2012, 2012


How good are your phrases? assessing phrase quality with single class classification.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Recueil et analyse d'un corpus écologique de corrections orthographiques extrait des révisions de Wikipédia.
Proceedings of the Actes de la 17e conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2010

Mining Naturally-occurring Corrections and Paraphrases from Wikipedia's Revision History.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010

Assessing Phrase-Based Translation Models with Oracle Decoding.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Training Continuous Space Language Models: Some Practical Issues.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Refining Word Alignment with Discriminative Training.
Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Research Papers, 2010

Discriminant Models for Word Alignment.
Trait. Autom. des Langues, 2009

Experimental Evaluation of the Value of Structure: How to Efficiently Exploit Interdependencies in Sequence Labeling.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

Modèle probabiliste pour l'extraction de structures dans les documents web.
Document Numérique, 2007

From Layout to Semantic: a Reranking Model for Mapping Web Documents to Mediated XML Representations.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications) - RIAO 2007, 8th International Conference, Carnegie Mellon University, Pittsburgh, PA, USA, May 30, 2007

Relaxation Labeling for Selecting and Exploiting Efficiently Non-local Dependencies in Sequence Labeling.
Proceedings of the Knowledge Discovery in Databases: PKDD 2007, 2007

Probabilistic Model for Structured Document Mapping.
Proceedings of the Machine Learning and Data Mining in Pattern Recognition, 2007

Ré-ordonnancement pour l'apprentissage de transformations de documents HTML.
Proceedings of the Extraction et gestion des connaissances (EGC'2007), 2007

Modèle probabiliste pour l'extraction de structures dans les documents semistructurés - Application aux documents Web.
Proceedings of the COnférence en Recherche d'Informations et Applications, 2006

Restructuration automatique de documents dans les corpus semi-structurés hétérogènes.
Proceedings of the Extraction et gestion des connaissances (EGC'2005), 2005

Classification automatique de documents structurés. Application au corpus d'arbres étiquetés de type XML.
Proceedings of the COnférence en Recherche d'Informations et Applications, 2005
