Cyril Goutte

Orcid: 0000-0003-4939-6555

Affiliations:
  • Xerox


According to our database, Cyril Goutte authored at least 80 papers between 1997 and 2023.

Bibliography

2023
Dialect and Variant Identification as a Multi-Label Classification Task: A Proposal Based on Near-Duplicate Analysis.
Proceedings of the Tenth Workshop on NLP for Similar Languages, Varieties and Dialects, 2023

2022
Refining an Almost Clean Translation Memory Helps Machine Translation.
Proceedings of the 15th biennial conference of the Association for Machine Translation in the Americas (Volume 1: Research Track), 2022

2021
N-gram and Neural Models for Uralic Language Identification: NRC at VarDial 2021.
Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects, 2021

2020
Application of machine learning techniques to assess the trends and alignment of the funded research output.
J. Informetrics, 2020

Challenges in Neural Language Identification: NRC at VarDial 2020.
Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects, 2020

Confident Learning Curves in Additive Factor Modeling.
Proceedings of the 13th International Conference on Educational Data Mining, 2020

Human or Neural Translation?
Proceedings of the 28th International Conference on Computational Linguistics, 2020

The Impact of Sentence Alignment Errors on Phrase-Based Machine Translation Performance.
Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Research Papers, 2020

2019
Event Detection using Images of Temporal Word Patterns.
Proceedings of the Third International Workshop on Recent Trends in News Information Retrieval, 2019

Identifying Misaligned Spans in Parallel Corpora Using Change Point Detection.
Proceedings of the Advances in Artificial Intelligence, 2019

2018
Accurate semantic textual similarity for cleaning noisy parallel corpora using semantic machine translation evaluation metric: The NRC supervised submissions to the Parallel Corpus Filtering task.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Measuring sentence parallelism using Mahalanobis distances: The NRC unsupervised submissions to the WMT18 Parallel Corpus Filtering shared task.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

EuroGames16: Evaluating Change Detection in Online Conversation.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

A diagnostic tool for competency-based program engineering.
Proceedings of the 8th International Conference on Learning Analytics and Knowledge, 2018

Standard error considerations on AFM parameters.
Proceedings of the 11th International Conference on Educational Data Mining, 2018

Real-time Change Point Detection using On-line Topic Models.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

On the Learning Curve Attrition Bias in Additive Factor Modeling.
Proceedings of the Artificial Intelligence in Education - 19th International Conference, 2018

2017
Exploring Optimal Voting in Native Language Identification.
Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, 2017

Detecting Changes in Twitter Streams using Temporal Clusters of Hashtags.
Proceedings of the Events and Stories in the News Workshop@ACL 2017, 2017

2016
Competency Based Learning in the Web of Learning Data.
Proceedings of the 25th International Conference on World Wide Web, 2016

Advances in Ngram-based Discrimination of Similar Languages.
Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects, 2016

CNRC at SemEval-2016 Task 1: Experiments in Crosslingual Semantic Textual Similarity.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Discriminating Similar Languages: Evaluations and Explorations.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Analysing and Refining Pilot Training.
Proceedings of the 9th International Conference on Educational Data Mining, 2016

Extracting Discriminative Keyphrases with Learned Semantic Hierarchies.
Proceedings of the COLING 2016, 2016

2015
Multiview self-learning.
Neurocomputing, 2015

A Probabilistic Model for Knowledge Component Naming.
Proceedings of the 8th International Conference on Educational Data Mining, 2015

Evaluation of Expert-Based Q-Matrices Predictive Quality in Matrix Factorization Models.
Proceedings of the Design for Teaching and Learning in a Networked World, 2015

Towards Automatic Description of Knowledge Components.
Proceedings of the Tenth Workshop on Innovative Use of NLP for Building Educational Applications, 2015

2014
Linear Mixture Models for Robust Machine Translation.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

The NRC System for Discriminating Similar Languages.
Proceedings of the First Workshop on Applying NLP Tools to Similar Languages, 2014

CNRC-TMT: Second Language Writing Assistant System Description.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

2013
Reuters RCV1 RCV2 Multilingual, Multiview Text Categorization Test collection.
Dataset, September, 2013

Feature Space Selection and Combination for Native Language Identification.
Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications, 2013

2012
Learning to Translate: A Statistical and Computational Analysis.
Adv. Artif. Intell., 2012

Fast on-line learning for multilingual categorization.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

Learning Machine Translation from In-domain and Out-of-domain Data.
Proceedings of the 16th Annual conference of the European Association for Machine Translation, 2012

Filtering and routing multilingual documents for translation.
Proceedings of the 2012 IEEE Symposium on Computational Intelligence for Security and Defence Applications, 2012

2011
Learning aspect models with partially labeled data.
Pattern Recognit. Lett., 2011

Multiview Semi-supervised Learning for Ranking Multilingual Documents.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2011

2010
A co-classification approach to learning from multilingual corpora.
Mach. Learn., 2010

Multi-view clustering of multilingual documents.
Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Combining coregularization and consensus-based self-training for multilingual text categorization.
Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

An Extension of the Aspect PLSA Model to Active and Semi-Supervised Learning for Text Classification.
Proceedings of the Artificial Intelligence: Theories, 2010

Discriminative Instance Weighting for Domain Adaptation in Statistical Machine Translation.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

2009
Learning from Multiple Partially Observed Views - an Application to Multilingual Text Categorization.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Automatic Detection of Translated Text and its Impact on Machine Translation.
Proceedings of Machine Translation Summit XII: Papers, 2009

Improving SMT by learning translation direction.
Proceedings of the Workshop on Statistical Multilingual Analysis for Retrieval and Translation, 2009

2008
A boosting algorithm for learning bipartite ranking functions with partially labeled data.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

Semi-supervised Document Classification with a Mislabeling Error Model.
Proceedings of the Advances in Information Retrieval, 2008

2007
Statistical Phrase-Based Post-Editing.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Domain adaptation of MT systems through automatic post-editing.
Proceedings of Machine Translation Summit XI: Papers, 2007

A probabilistic model for data cube compression and query approximation.
Proceedings of the DOLAP 2007, 2007

2006
Categorization in multiple category systems.
Proceedings of the Machine Learning, 2006

Lexical Entailment for Information Retrieval.
Proceedings of the Advances in Information Retrieval, 2006

2005
Assisting medical annotation in Swiss-Prot using statistical classifiers.
Int. J. Medical Informatics, 2005

Une approche à la traduction automatique statistique par segments discontinus.
Proceedings of the Actes de la 12ème conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2005

Relation between PLSA and NMF and implications.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Translating with Non-contiguous Phrases.
Proceedings of the HLT/EMNLP 2005, 2005

A Probabilistic Interpretation of Precision, Recall and F-Score, with Implication for Evaluation.
Proceedings of the Advances in Information Retrieval, 2005

2004
Corpus-Based vs. Model-Based Selection of Relevant Features.
Proceedings of the COnférence en Recherche d'Informations et Applications, 2004

Confidence Estimation for Machine Translation.
Proceedings of the COLING 2004, 2004

Aligning words using matrix factorisation.
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, 2004

A Geometric View on Bilingual Lexicon Extraction from Comparable Corpora.
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, 2004

2003
Word-Sequence Kernels.
J. Mach. Learn. Res., 2003

Reducing Parameter Space for Word Alignment.
Proceedings of the HLT-NAACL 2003 Workshop on Building and Using Parallel Texts: Data Driven Machine Translation and Beyond, 2003

A Probabilistic Information Retrieval Approach to Medical Annotation in SWISS-PROT.
Proceedings of the New Navigators: from Professionals to Patients, 2003

Combining NLP and probabilistic categorisation for document and term selection for Swiss-Prot medical annotation.
Proceedings of the Eleventh International Conference on Intelligent Systems for Molecular Biology, June 29, 2003

2002
Kernel Methods for Document Filtering.
Proceedings of The Eleventh Text REtrieval Conference, 2002

A Hierarchical Model for Clustering and Categorising Documents.
Proceedings of the Advances in Information Retrieval, 2002

Combining Labelled and Unlabelled Data: A Case Study on Fisher Kernels and Transductive Inference for Biological Entity Recognition.
Proceedings of the 6th Conference on Natural Language Learning, 2002

2001
Sélection de paramètres par pénalisation.
Rev. d'Intelligence Artif., 2001

2000
Adaptive Metric Kernel Regression.
J. VLSI Signal Process., 2000

Extraction of the relevant delays for temporal modeling.
IEEE Trans. Signal Process., 2000

Modelling the Haemodynamic Response in fMRI with Smooth FIR Filters.
IEEE Trans. Medical Imaging, 2000

1998
Behaviour in 0 of the Neural Networks Training Cost.
Neural Process. Lett., 1998

Adaptive regularization of neural networks using conjugate gradient.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Regularization with a Pruning Prior.
Neural Networks, 1997

Note on Free Lunches and Cross-validation.
Neural Comput., 1997

Lag space estimation in time series modelling.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

