Sameena Shah

Orcid: 0009-0000-5960-5811

According to our database, Sameena Shah authored at least 75 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
BuDDIE: A Business Document Dataset for Multi-task Information Extraction.
CoRR, 2024

Large Language Models as Financial Data Annotators: A Study on Effectiveness and Efficiency.
CoRR, 2024

Belief and Persuasion in Scientific Discourse on Social Media: A Study of the COVID-19 Pandemic.
CoRR, 2024

TreeForm: End-to-end Annotation and Evaluation for Form Document Parsing.
CoRR, 2024

AliGATr: Graph-based layout generation for form understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Towards a new research agenda for multimodal enterprise document understanding: What are we missing?
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Can GPT models be Financial Analysts? An Evaluation of ChatGPT and GPT-4 on mock CFA Exams.
CoRR, 2023

Synthetic Text Generation using Hypergraph Representations.
CoRR, 2023

Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? An Examination on Several Typical Tasks.
CoRR, 2023

Unsupervised Domain Adaptation using Lexical Transformations and Label Injection for Twitter Data.
Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, 2023

DocGraphLM: Documental Graph Language Model for Information Extraction.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Knowledge Discovery from Unstructured Data in Financial Services (KDF) Workshop.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

REFinD: Relation Extraction Financial Dataset.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

BizGraphQA: A Dataset for Image-based Inference over Graph-structured Diagrams from Business Domains.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Code Revert Prediction with Graph Neural Networks: A Case Study at J.P. Morgan Chase.
Proceedings of the 1st International Workshop on Software Defect Datasets, 2023

Log Summarisation for Defect Evolution Analysis.
Proceedings of the 1st International Workshop on Software Defect Datasets, 2023

Robust NLP for Finance (RobustFin).
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

How Effective Are Neural Networks for Fixing Security Vulnerabilities.
Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023

An Automated Code Update Tool For Python Packages.
Proceedings of the IEEE International Conference on Software Maintenance and Evolution, 2023

Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? A Study on Several Typical Tasks.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

Bayesian Hierarchical Models for Counterfactual Estimation.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Using counterfactual contrast to improve compositional generalization for multi-step quantitative reasoning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Explicit Group Sparse Projection with Applications to Deep Learning and NMF.
Trans. Mach. Learn. Res., 2022

Synthetic document generator for annotation-free layout recognition.
Pattern Recognit., 2022

Neural Transition-based Parsing of Library Deprecations.
CoRR, 2022

Bandit Sampling for Multiplex Networks.
CoRR, 2022

Structure with Semantics: Exploiting Document Relations for Retrieval.
CoRR, 2022

Structure and Semantics Preserving Document Representations.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

AI for Automated Code Updates.
Proceedings of the 44th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2022

Online Learning for Mixture of Multivariate Hawkes Processes.
Proceedings of the 3rd ACM International Conference on AI in Finance, 2022

Improving compositional generalization for multi-step quantitative reasoning in question answering.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

ConvFinQA: Exploring the Chain of Numerical Reasoning in Conversational Finance Question Answering.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

AIR-JPMC@SMM4H'22: BERT + Ensembling = Too Cool: Using Multiple BERT Models Together for Various COVID-19 Tweet Identification Tasks.
Proceedings of The Seventh Workshop on Social Media Mining for Health Applications, 2022

AIR-JPMC@SMM4H'22: Identifying Self-Reported Spanish COVID-19 Symptom Tweets Through Multiple-Model Ensembling.
Proceedings of The Seventh Workshop on Social Media Mining for Health Applications, 2022

AIR-JPMC@SMM4H'22: Classifying Self-Reported Intimate Partner Violence in Tweets with Multiple BERT-based Models.
Proceedings of The Seventh Workshop on Social Media Mining for Health Applications, 2022

2021
Parameterized Explanations for Investor / Company Matching.
CoRR, 2021

A Framework for Institutional Risk Identification using Knowledge Graphs and Automated News Profiling.
CoRR, 2021

Are My Deep Learning Systems Fair? An Empirical Study of Fixed-Seed Training.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Domain-agnostic Document Representation Learning Using Latent Topics and Metadata.
Proceedings of the Thirty-Fourth International Florida Artificial Intelligence Research Society Conference, 2021

FinQA: A Dataset of Numerical Reasoning over Financial Data.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Debiasing classifiers: is reality at variance with expectation?
CoRR, 2020

Robust Document Representations using Latent Topics and Metadata.
CoRR, 2020

Simulating and classifying behavior in adversarial environments based on action-state traces: an application to money laundering.
Proceedings of the ICAIF '20: The First ACM International Conference on AI in Finance, 2020

2018
An Extensible Event Extraction System With Cross-Media Event Resolution.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

TipMaster: A Knowledge Base of Authoritative Local News Sources on Social Media.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
"Breaking" Disasters: Predicting and Characterizing the Global News Value of Natural and Man-made Disasters.
CoRR, 2017

funSentiment at SemEval-2017 Task 5: Fine-Grained Sentiment Analysis on Financial Microblogs Using Word Vectors Built from StockTwits and Twitter.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

funSentiment at SemEval-2017 Task 4: Topic-Based Message Sentiment Classification by Exploiting Word Embeddings, Text Features and Target Contexts.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

Data Sets: Word Embeddings Learned from Tweets and General Data.
Proceedings of the Eleventh International Conference on Web and Social Media, 2017

Real-Time Novel Event Detection from Social Media.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

Learning Stock Market Sentiment Lexicon and Sentiment-Oriented Word Vector from StockTwits.
Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), 2017

Reuters tracer: Toward automated news production using large scale social media data.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

2016
Tweet Topic Classification Using Distributed Language Representations.
Proceedings of the 2016 IEEE/WIC/ACM International Conference on Web Intelligence, 2016

Tweet Sentiment Analysis by Incorporating Sentiment-Specific Word Embedding and Weighted Text Features.
Proceedings of the 2016 IEEE/WIC/ACM International Conference on Web Intelligence, 2016

User Behaviors in Newsworthy Rumors: A Case Study of Twitter.
Proceedings of the Tenth International Conference on Web and Social Media, 2016

Perceived, Projected, and True Investment Expertise: Not All Experts Provide Expert Recommendations.
Proceedings of the 2016 IEEE International Conference on Data Science and Advanced Analytics, 2016

Reuters Tracer: A Large Scale System of Detecting & Verifying Real-Time News Events from Twitter.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

Hashtag Recommendation Based on Topic Enhanced Embedding, Tweet Entity Data and Learning to Rank.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

TweetSift: Tweet Topic Classification Based on Entity Knowledge Base and Topic Enhanced Word Embedding.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

Using paraphrases to improve tweet classification: Comparing WordNet and word embedding approaches.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

Table classification using both structure and content information: A case study of financial documents.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

Witness Identification in Twitter.
Proceedings of The Fourth International Workshop on Natural Language Processing for Social Media, 2016

Discovering Relevant Hashtags for Health Concepts: A Case Study of Twitter.
Proceedings of the World Wide Web and Population Health Intelligence, 2016

2015
Newsworthy Rumor Events: A Case Study of Twitter.
Proceedings of the IEEE International Conference on Data Mining Workshop, 2015

Real-time Rumor Debunking on Twitter.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

2014
Winning by Following the Winners: Mining the Behaviour of Stock Market Experts in Social Media.
Proceedings of the Social Computing, Behavioral-Cultural Modeling and Prediction, 2014

2013
Ants find the shortest path: a mathematical proof.
Swarm Intell., 2013

Convergence of the dynamic load balancing problem to Nash equilibrium using distributed local interactions.
Inf. Sci., 2013

Stock Prediction Using Event-Based Sentiment Analysis.
Proceedings of the 2013 IEEE/WIC/ACM International Conferences on Web Intelligence, 2013

2011
M-Unit EigenAnt: An Ant Algorithm to Find the M Best Solutions.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Debugging ants: How ants find the shortest route.
Proceedings of the 8th International Conference on Information, 2011

2010
Trail formation in ants. A generalized Polya urn process.
Swarm Intell., 2010

2009
Zero Norm Least Squares Proximal SVR.
Proceedings of the Pattern Recognition and Machine Intelligence, 2009

Kernel Optimization Using a Generalized Eigenvalue Approach.
Proceedings of the Pattern Recognition and Machine Intelligence, 2009

2008
Mathematical Modeling and Convergence Analysis of Trail Formation.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

