Dong Nguyen

Orcid: 0000-0002-6062-3117

Affiliations:
  • Utrecht University, The Netherlands
  • Alan Turing Institute, London, UK


According to our database, Dong Nguyen authored at least 59 papers between 2008 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Designing and Evaluating General-Purpose User Representations Based on Behavioral Logs from a Measurement Process Perspective: A Case Study with Snapchat.
CoRR, 2023

FTFT: efficient and robust Fine-Tuning by transFerring Training dynamics.
CoRR, 2023

Epicurus at SemEval-2023 Task 4: Improving Prediction of Human Values behind Arguments by Leveraging Their Definitions.
Proceedings of the 17th International Workshop on Semantic Evaluation, 2023

Perceived Algorithmic Fairness using Organizational Justice Theory: An Empirical Case Study on Algorithmic Hiring.
Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 2023

Measuring the Instability of Fine-Tuning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Template-based Abstractive Microblog Opinion Summarisation.
Trans. Assoc. Comput. Linguistics, 2022

Evaluating the construct validity of text embeddings with application to survey questions.
EPJ Data Sci., 2022

Same Author or Just Same Topic? Towards Content-Independent Style Representations.
Proceedings of the 7th Workshop on Representation Learning for NLP, 2022

2021
Introducing CAD: the Contextual Abuse Dataset.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

On learning and representing social meaning in NLP: a sociolinguistic perspective.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Semantic Journeys: Quantifying Change in Emoji Meaning from 2012-2018.
Workshop Proceedings of the 15th International AAAI Conference on Web and Social Media, 2021

Does It Capture STEL? A Modular, Similarity-based Linguistic Style Evaluation Framework.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Assessing the Reliability of Word Embedding Gender Bias Measures.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

HateCheck: Functional Tests for Hate Speech Detection Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
How We Do Things With Words: Analyzing Text as Social and Cultural Data.
Frontiers Artif. Intell., 2020

Better Early than Late: Fusing Topics with Word Embeddings for Neural Question Paraphrase Identification.
CoRR, 2020

Do Word Embeddings Capture Spelling Variation?
Proceedings of the 28th International Conference on Computational Linguistics, 2020

tBERT: Topic Models and BERT Joining Forces for Semantic Similarity Detection.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Room to Glo: A Systematic Comparison of Semantic Change Detection Approaches with Word Embeddings.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Aiming beyond the Obvious: Identifying Non-Obvious Cases in Semantic Similarity Datasets.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Comparing Automatic and Human Evaluation of Local Explanations for Text Classification.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

2017
Emo, Love, and God: Making Sense of Urban Dictionary, a Crowd-Sourced Online Dictionary.
CoRR, 2017

A Kernel Independence Test for Geographical Language Variation.
Comput. Linguistics, 2017

2016
The apocalypse on Twitter.
Digit. Scholarsh. Humanit., 2016

Predicting relevance based on assessor disagreement: analysis and practical applications for search evaluation.
Inf. Retr. J., 2016

Resource Selection for Federated Search on the Web.
CoRR, 2016

Computational Sociolinguistics: A Survey.
Comput. Linguistics, 2016

Automatic Detection of Intra-Word Code-Switching.
Proceedings of the 14th SIGMORPHON Workshop on Computational Research in Phonetics, 2016

2015
FedWeb Greatest Hits: Presenting the New Test Collection for Federated Web Search.
Proceedings of the 24th International Conference on World Wide Web Companion, 2015

Audience and the Use of Minority Languages on Twitter.
Proceedings of the Ninth International Conference on Web and Social Media, 2015

#SupportTheCause: Identifying Motivations to Participate in Online Health Campaigns.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

On the Impact of Twitter-based Health Campaigns: A Cross-Country Analysis of Movember.
Proceedings of the Sixth International Workshop on Health Text Mining and Information Analysis, 2015

2014
Exploiting user disagreement for web search evaluation: an experimental approach.
Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, 2014

Overview of the TREC 2014 Federated Web Search Track.
Proceedings of The Twenty-Third Text REtrieval Conference, 2014

TweetGenie: Development, Evaluation, and Lessons Learned.
Proceedings of COLING 2014, 2014

Why Gender and Age Prediction from Tweets is Hard: Lessons from a Crowdsourcing Experiment.
Proceedings of COLING 2014, 2014

Aligning Vertical Collection Relevance with User Intent.
Proceedings of the 23rd ACM International Conference on Information and Knowledge Management, 2014

Using Crowdsourcing to Investigate Perception of Narrative Similarity.
Proceedings of the 23rd ACM International Conference on Information and Knowledge Management, 2014

Predicting Code-switching in Multilingual Communication for Immigrant Communities.
Proceedings of the First Workshop on Computational Approaches to Code Switching@EMNLP 2014, 2014

2013
"TweetGenie: automatic age prediction from tweets" by D. Nguyen, R. Gravel, D. Trieschnigg, and T. Meder; with Ching-man Au Yeung as coordinator.
SIGWEB Newsl., 2013

Overview of the TREC 2013 Federated Web Search Track.
Proceedings of The Twenty-Second Text REtrieval Conference, 2013

Learning to Extract Folktale Keywords.
Proceedings of the 7th Workshop on Language Technology for Cultural Heritage, 2013

"How Old Do You Think I Am?" A Study of Language and Age in Twitter.
Proceedings of the Seventh International Conference on Weblogs and Social Media, 2013

Word Level Language Identification in Online Multilingual Communication.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Folktale Classification Using Learning to Rank.
Proceedings of the Advances in Information Retrieval, 2013

Snippet-Based Relevance Predictions for Federated Web Search.
Proceedings of the Advances in Information Retrieval, 2013

What Snippets Say About Pages.
Proceedings of the 13th Dutch-Belgian Workshop on Information Retrieval, 2013

2012
Ensemble Clustering for Result Diversification.
Proceedings of The Twenty-First Text REtrieval Conference, 2012

Automatic classification of folk narrative genres.
Proceedings of the 11th Conference on Natural Language Processing, 2012

Federated search in the wild: the combined power of over a hundred search engines.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

What Snippets Say about Pages in Federated Web Search.
Proceedings of the Information Retrieval Technology, 2012

2011
Author Age Prediction from Text using Linear Regression.
Proceedings of the 5th ACL Workshop on Language Technology for Cultural Heritage, 2011

2010
Combination of Evidence for Effective Web Search.
Proceedings of The Nineteenth Text REtrieval Conference, 2010

An analysis of perspectives in interactive settings.
Proceedings of the First Workshop on Social Media Analytics, 2010

DesignWebs: A Tool for Automatic Construction of Interactive Conceptual Maps from Document Collections.
Proceedings of the Intelligent Tutoring Systems, 10th International Conference, 2010

Exploring the Effectiveness of Social Capabilities and Goal Alignment in Computer Supported Collaborative Learning.
Proceedings of the Intelligent Tutoring Systems, 10th International Conference, 2010

2008
On the Evaluation of Snippet Selection for Information Retrieval.
Proceedings of the Working Notes for CLEF 2008 Workshop co-located with the 12th European Conference on Digital Libraries (ECDL 2008), 2008

On the Evaluation of Snippet Selection for WebCLEF.
Proceedings of the Evaluating Systems for Multilingual and Multimodal Information Access, 2008

WikiTranslate: Query Translation for Cross-Lingual Information Retrieval Using Only Wikipedia.
Proceedings of the Evaluating Systems for Multilingual and Multimodal Information Access, 2008

