2024
Some Tradeoffs in Continual Learning for Parliamentary Neural Machine Translation Systems.
Proceedings of the 16th Conference of the Association for Machine Translation in the Americas, 2024
2023
Dialect and Variant Identification as a Multi-Label Classification Task: A Proposal Based on Near-Duplicate Analysis.
Proceedings of the Tenth Workshop on NLP for Similar Languages, Varieties and Dialects, 2023
2022
Refining an Almost Clean Translation Memory Helps Machine Translation.
Proceedings of the 15th biennial conference of the Association for Machine Translation in the Americas (Volume 1: Research Track), 2022
2021
N-gram and Neural Models for Uralic Language Identification: NRC at VarDial 2021.
Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects, 2021
2020
Application of machine learning techniques to assess the trends and alignment of the funded research output.
J. Informetrics, 2020
Challenges in Neural Language Identification: NRC at VarDial 2020.
Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects, 2020
Confident Learning Curves in Additive Factor Modeling.
Proceedings of the 13th International Conference on Educational Data Mining, 2020
Human or Neural Translation?
Proceedings of the 28th International Conference on Computational Linguistics, 2020
The Impact of Sentence Alignment Errors on Phrase-Based Machine Translation Performance.
Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Research Papers, 2020
2019
Event Detection using Images of Temporal Word Patterns.
Proceedings of the Third International Workshop on Recent Trends in News Information Retrieval, 2019
Identifying Misaligned Spans in Parallel Corpora Using Change Point Detection.
Proceedings of the Advances in Artificial Intelligence, 2019
2018
Accurate semantic textual similarity for cleaning noisy parallel corpora using semantic machine translation evaluation metric: The NRC supervised submissions to the Parallel Corpus Filtering task.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018
Measuring sentence parallelism using Mahalanobis distances: The NRC unsupervised submissions to the WMT18 Parallel Corpus Filtering shared task.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018
EuroGames16: Evaluating Change Detection in Online Conversation.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018
A diagnostic tool for competency-based program engineering.
Proceedings of the 8th International Conference on Learning Analytics and Knowledge, 2018
Standard error considerations on AFM parameters.
Proceedings of the 11th International Conference on Educational Data Mining, 2018
Real-time Change Point Detection using On-line Topic Models.
Proceedings of the 27th International Conference on Computational Linguistics, 2018
On the Learning Curve Attrition Bias in Additive Factor Modeling.
Proceedings of the Artificial Intelligence in Education - 19th International Conference, 2018
2017
Exploring Optimal Voting in Native Language Identification.
Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, 2017
Detecting Changes in Twitter Streams using Temporal Clusters of Hashtags.
Proceedings of the Events and Stories in the News Workshop@ACL 2017, 2017
2016
Competency Based Learning in the Web of Learning Data.
Proceedings of the 25th International Conference on World Wide Web, 2016
Advances in Ngram-based Discrimination of Similar Languages.
Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects, 2016
CNRC at SemEval-2016 Task 1: Experiments in Crosslingual Semantic Textual Similarity.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016
Discriminating Similar Languages: Evaluations and Explorations.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
Analysing and Refining Pilot Training.
Proceedings of the 9th International Conference on Educational Data Mining, 2016
Extracting Discriminative Keyphrases with Learned Semantic Hierarchies.
Proceedings of the COLING 2016, 2016
2015
A Probabilistic Model for Knowledge Component Naming.
Proceedings of the 8th International Conference on Educational Data Mining, 2015
Evaluation of Expert-Based Q-Matrices Predictive Quality in Matrix Factorization Models.
Proceedings of the Design for Teaching and Learning in a Networked World, 2015
Towards Automatic Description of Knowledge Components.
Proceedings of the Tenth Workshop on Innovative Use of NLP for Building Educational Applications, 2015
2014
Linear Mixture Models for Robust Machine Translation.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014
The NRC System for Discriminating Similar Languages.
Proceedings of the First Workshop on Applying NLP Tools to Similar Languages, 2014
CNRC-TMT: Second Language Writing Assistant System Description.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014
2013
Reuters RCV1 RCV2 Multilingual, Multiview Text Categorization Test collection.
Dataset, September, 2013
Feature Space Selection and Combination for Native Language Identification.
Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications, 2013
2012
Learning to Translate: A Statistical and Computational Analysis.
Adv. Artif. Intell., 2012
Fast on-line learning for multilingual categorization.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012
Learning Machine Translation from In-domain and Out-of-domain Data.
Proceedings of the 16th Annual conference of the European Association for Machine Translation, 2012
Filtering and routing multilingual documents for translation.
Proceedings of the 2012 IEEE Symposium on Computational Intelligence for Security and Defence Applications, 2012
2011
Learning aspect models with partially labeled data.
Pattern Recognit. Lett., 2011
Multiview Semi-supervised Learning for Ranking Multilingual Documents.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2011
2010
A co-classification approach to learning from multilingual corpora.
Mach. Learn., 2010
Multi-view clustering of multilingual documents.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010
Combining coregularization and consensus-based self-training for multilingual text categorization.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010
An Extension of the Aspect PLSA Model to Active and Semi-Supervised Learning for Text Classification.
Proceedings of the Artificial Intelligence: Theories, 2010
Discriminative Instance Weighting for Domain Adaptation in Statistical Machine Translation.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010
2009
Learning from Multiple Partially Observed Views - an Application to Multilingual Text Categorization.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009
Automatic Detection of Translated Text and its Impact on Machine Translation.
Proceedings of Machine Translation Summit XII: Papers, 2009
Improving SMT by learning translation direction.
Proceedings of the Workshop on Statistical Multilingual Analysis for Retrieval and Translation, 2009
2008
A boosting algorithm for learning bipartite ranking functions with partially labeled data.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008
Semi-supervised Document Classification with a Mislabeling Error Model.
Proceedings of the Advances in Information Retrieval , 2008
2007
Statistical Phrase-Based Post-Editing.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007
Domain adaptation of MT systems through automatic post-editing.
Proceedings of Machine Translation Summit XI: Papers, 2007
A probabilistic model for data cube compression and query approximation.
Proceedings of the DOLAP 2007, 2007
2006
Categorization in multiple category systems.
Proceedings of the Machine Learning, 2006
Lexical Entailment for Information Retrieval.
Proceedings of the Advances in Information Retrieval, 2006
2005
Assisting medical annotation in Swiss-Prot using statistical classifiers.
Int. J. Medical Informatics, 2005
Une approche à la traduction automatique statistique par segments discontinus.
Proceedings of the Actes de la 12ème conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2005
Relation between PLSA and NMF and implications.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005
Translating with Non-contiguous Phrases.
Proceedings of the HLT/EMNLP 2005, 2005
A Probabilistic Interpretation of Precision, Recall and <i>F</i>-Score, with Implication for Evaluation.
Proceedings of the Advances in Information Retrieval, 2005
2004
Corpus-Based vs. Model-Based Selection of Relevant Features.
Proceedings of the COnférence en Recherche d'Infomations et Applications, 2004
Confidence Estimation for Machine Translation.
Proceedings of the COLING 2004, 2004
Aligning words using matrix factorisation.
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, 2004
A Geometric View on Bilingual Lexicon Extraction from Comparable Corpora.
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, 2004
2003
J. Mach. Learn. Res., 2003
Reducing Parameter Space for Word Alignment.
Proceedings of the HLT-NAACL 2003 Workshop on Building and Using Parallel Texts: Data Driven Machine Translation and Beyond, 2003
A Probabilistic Information Retrieval Approach to Medical Annotation in SWISS-PROT.
Proceedings of the New Navigators: from Professionals to Patients, 2003
Combining NLP and probabilistic categorisation for document and term selection for Swiss-Prot medical annotation.
Proceedings of the Eleventh International Conference on Intelligent Systems for Molecular Biology, June 29, 2003
2002
Kernel Methods for Document Filtering.
Proceedings of The Eleventh Text REtrieval Conference, 2002
A Hierarchical Model for Clustering and Categorising Documents.
Proceedings of the Advances in Information Retrieval, 2002
Combining Labelled and Unlabelled Data: A Case Study on Fisher Kernels and Transductive Inference for Biological Entity Recognition.
Proceedings of the 6th Conference on Natural Language Learning, 2002
2001
Sélection de paramètres par pénalisation.
Rev. d'Intelligence Artif., 2001
2000
Adaptive Metric Kernel Regression.
J. VLSI Signal Process., 2000
Extraction of the relevant delays for temporal modeling.
IEEE Trans. Signal Process., 2000
Modelling the Haemodynamic Response in fMRI with Smooth FIR Filters.
IEEE Trans. Medical Imaging, 2000
1998
Behaviour in 0 of the Neural Networks Training Cost.
Neural Process. Lett., 1998
Adaptive regularization of neural networks using conjugate gradient.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
1997
Regularization with a Pruning Prior.
Neural Networks, 1997
Note on Free Lunches and Cross-validation.
Neural Comput., 1997
Lag space estimation in time series modelling.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997