Soroush Vosoughi

Orcid: 0000-0002-2564-8909

According to our database1, Soroush Vosoughi authored at least 100 papers between 2008 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Disordered-DABS: A Benchmark for Dynamic Aspect-Based Summarization in Disordered Texts.
CoRR, 2024

2023
Quantifying participation biases on social media.
EPJ Data Sci., December, 2023

Towards Sentence Level Inference Attack Against Pre-trained Language Models.
Proc. Priv. Enhancing Technol., July, 2023

SimVLG: Simple and Efficient Pretraining of Visual Language Generative Models.
CoRR, 2023

Training Socially Aligned Language Models in Simulated Human Society.
CoRR, 2023

Knowledge from Large-Scale Protein Contact Prediction Models Can Be Transferred to the Data-Scarce RNA Contact Prediction Task.
CoRR, 2023

Graph-Level Embedding for Time-Evolving Graphs.
Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

Joint Latent Topic Discovery and Expectation Modeling for Financial Markets.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2023

Bootstrapping Vision-Language Learning with Decoupled Language Pre-training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Language models are multilingual chain-of-thought reasoners.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Mind's Eye: Grounded Language Model Reasoning through Simulation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Hyperbolic Node Structural Role Embedding.
Proceedings of the IEEE International Conference on Data Mining, 2023

Improving Representation Learning for Histopathologic Images with Cluster Constraints.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Proto-lm: A Prototypical Network-Based Framework for Built-in Interpretability in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Deciphering Stereotypes in Pre-Trained Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Intersectional Stereotypes in Large Language Models: Dataset and Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Length Does Matter: Summary Length can Bias Summarization Metrics.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Improving Syntactic Probing Correctness and Robustness with Control Tasks.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
Dynamic Structural Role Node Embedding for User Modeling in Evolving Networks.
ACM Trans. Inf. Syst., 2022

Accurate intercensal estimates of energy access to track Sustainable Development Goal 7.
EPJ Data Sci., 2022

Robin: A Novel Online Suicidal Text Corpus of Substantial Breadth and Scale.
CoRR, 2022

Interpretation Quality Score for Measuring the Quality of interpretability methods.
CoRR, 2022

Quantifying and alleviating political bias in language models.
Artif. Intell., 2022

Dartmouth at SemEval-2022 Task 6: Detection of Sarcasm.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

DartmouthCS at SemEval-2022 Task 8: Predicting Multilingual News Article Similarity with Meta-Information and Translation.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

Second Thoughts are Best: Learning to Re-Align With Human Values from Text Edits.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

TWEETSPIN: Fine-grained Propaganda Detection in Social Media Using Multi-View Representations.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Aligning Generative Language Models with Human Values.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Contrastive Learning for Prompt-based Few-shot Language Learners.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Embedding Hallucination for Few-shot Language Fine-tuning.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

A Large-Scale Longitudinal Multimodal Dataset of State-Backed Information Operations on Twitter.
Proceedings of the Sixteenth International AAAI Conference on Web and Social Media, 2022

Measuring Media Bias via Masked Language Modeling.
Proceedings of the Sixteenth International AAAI Conference on Web and Social Media, 2022

Towards Interpretable Deep Reinforcement Learning Models via Inverse Reinforcement Learning.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Going Beyond Accuracy: Interpretability Metrics for CNN Representations of Physiological Signals.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Knowledge Infused Decoding.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Non-Parallel Text Style Transfer with Self-Parallel Supervision.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Capturing Topic Framing via Masked Language Modeling.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

EnCBP: A New Benchmark Dataset for Finer-Grained Cultural Background Prediction in English.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
SymptomID: A Framework for Rapid Symptom Identification in Pandemics Using News Reports.
ACM Trans. Manag. Inf. Syst., 2021

An Interpretable Model for Real-time Tracking of Economic Indicators Using Social Media Data.
Trans. Data Sci., 2021

A Transformer-based Framework for Neutralizing and Reversing the Political Polarity of News Articles.
Proc. ACM Hum. Comput. Interact., 2021

Hyperbolic node embedding for temporal networks.
Data Min. Knowl. Discov., 2021

GradTS: A Gradient-Based Automatic Auxiliary Task Selection Method Based on Transformer Networks.
CoRR, 2021

Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks.
CoRR, 2021

Emotion-based Modeling of Mental Disorders on Social Media.
Proceedings of the WI-IAT '21: IEEE/WIC/ACM International Conference on Web Intelligence, Melbourne VIC Australia, December 14, 2021

Lone Pine at SemEval-2021 Task 5: Fine-Grained Detection of Hate Speech Using BERToxic.
Proceedings of the 15th International Workshop on Semantic Evaluation, 2021

BigGreen at SemEval-2021 Task 1: Lexical Complexity Prediction with Assembly Models.
Proceedings of the 15th International Workshop on Semantic Evaluation, 2021

Few-Shot Text Classification with Triplet Networks, Data Augmentation, and Curriculum Learning.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Linguistic Complexity Loss in Text-Based Therapy.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Feature Selection for Multivariate Time Series via Network Pruning.
Proceedings of the 2021 International Conference on Data Mining, 2021

Political Depolarization of News Articles Using Attribute-Aware Word Embeddings.
Proceedings of the Fifteenth International AAAI Conference on Web and Social Media, 2021

GradTS: A Gradient-Based Automatic Auxiliary Task Selection Method Based on Transformer Networks.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Text Augmentation in a Multi-Task View.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Embedding Node Structural Role Identity Using Stress Majorization.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Graph Embedding via Diffusion-Wavelets-Based Node Feature Distribution Characterization.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Language Model Augmented Relevance Score.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Modulating Language Models with Emotions.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

A Survey of Data Augmentation Approaches for NLP.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Embedding Heterogeneous Networks into Hyperbolic Space Without Meta-path.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Mitigating Political Bias in Language Models through Reinforced Calibration.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Improvements and Extensions on Metaphor Detection.
CoRR, 2020

Enhanced Offensive Language Detection Through Data Augmentation.
CoRR, 2020

Not Judging a User by Their Cover: Understanding Harm in Multi-Modal Processing within Social Media Research.
CoRR, 2020

Towards Improved Model Design for Authorship Identification: A Survey on Writing Style Understanding.
CoRR, 2020

Emoji Prediction: Extensions and Benchmarking.
CoRR, 2020

Query-Free Adversarial Transfer via Undertrained Surrogates.
CoRR, 2020

What Are People Asking About COVID-19? A Question Classification Dataset.
CoRR, 2020

Salienteye: Maximizing Engagement While Maintaining Artistic Style on Instagram Using Deep Neural Networks.
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

Multi-Modal Identification of State-Sponsored Propaganda on Social Media.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Social media data reveals signal for public consumer perceptions.
Proceedings of the ICAIF '20: The First ACM International Conference on AI in Finance, 2020

Multi-resolution Annotations for Emoji Prediction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Data Boost: Text Data Augmentation Through Reinforcement Learning Guided Conditional Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Style Change Detection Using BERT.
Proceedings of the Working Notes of CLEF 2020, 2020

Embedding Node Structural Role Identity into Hyperbolic Space.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Dartmouth CS at WNUT-2020 Task 2: Informative COVID-19 Tweet Classification Using BERT.
Proceedings of the Sixth Workshop on Noisy User-generated Text, 2020

An Empirical Survey of Unsupervised Text Representation Methods on Twitter Data.
Proceedings of the Sixth Workshop on Noisy User-generated Text, 2020

Big Green at WNUT 2020 Shared Task-1: Relation Extraction as Contextualized Sequence Classification.
Proceedings of the Sixth Workshop on Noisy User-generated Text, 2020

2018
Me, My Echo Chamber, and I: Introspection on Social Media Polarization.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

2017
Rumor Gauge: Predicting the Veracity of Rumors on Twitter.
ACM Trans. Knowl. Discov. Data, 2017

TweetVista: An AI-Powered Interactive Tool for Exploring Conversations on Twitter.
Proceedings of the Companion Publication of the 22nd International Conference on Intelligent User Interfaces, 2017

Mapping Twitter Conversation Landscapes.
Proceedings of the Eleventh International Conference on Web and Social Media, 2017

Twitter Demographic Classification Using Deep Multi-modal Multi-task Learning.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Human Atlas: A Tool for Mapping Social Networks.
Proceedings of the 25th International Conference on World Wide Web, 2016

Tweet2Vec: Learning Tweet Embeddings Using Character-level CNN-LSTM Encoder-Decoder.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

DeepStance at SemEval-2016 Task 6: Detecting Stance in Tweets Using Character and Word-Level CNNs.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Tweet Acts: A Speech Act Classifier for Twitter.
Proceedings of the Tenth International Conference on Web and Social Media, 2016

A Semi-Automatic Method for Efficient Detection of Stories on Social Media.
Proceedings of the Tenth International Conference on Web and Social Media, 2016

Automatic Detection and Categorization of Election-Related Tweets.
Proceedings of the Tenth International Conference on Web and Social Media, 2016

2015
Enhanced Twitter Sentiment Classification Using Contextual Information.
Proceedings of the 6th Workshop on Computational Approaches to Subjectivity, 2015

Digital Stylometry: Linking Profiles Across Social Networks.
Proceedings of the Social Informatics - 7th International Conference, 2015

A Human-Machine Collaborative System for Identifying Rumors on Twitter.
Proceedings of the IEEE International Conference on Data Mining Workshop, 2015

2014
Grounding language models in spatiotemporal context.
Proceedings of the INTERSPEECH 2014, 2014

Improving automatic speech recognition through head pose driven visual grounding.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2014

2012
An Automatic Child-Directed Speech Detector for the Study of Child Language Development.
Proceedings of the INTERSPEECH 2012, 2012

A portable audio/video recorder for longitudinal study of child development.
Proceedings of the International Conference on Multimodal Interaction, 2012

2010
Automatic estimation of transcription accuracy and difficulty.
Proceedings of the INTERSPEECH 2010, 2010

2008
Object schemas for grounding language in a responsive robot.
Connect. Sci., 2008

Object schemas for responsive robotic language use.
Proceedings of the 3rd ACM/IEEE international conference on Human robot interaction, 2008


  Loading...