Juan M. Banda

Orcid: 0000-0001-8499-824X

According to our database1, Juan M. Banda authored at least 63 papers between 2009 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Standing on FURM ground - A framework for evaluating Fair, Useful, and Reliable AI Models in healthcare systems.
CoRR, 2024

2023
Using weak supervision to generate training datasets from social media data: a proof of concept to identify drug mentions.
Neural Comput. Appl., September, 2023

LatinX in AI research.
Neural Comput. Appl., September, 2023

Representing and utilizing clinical textual data for real world studies: An OHDSI approach.
J. Biomed. Informatics, June, 2023

Reproducible variability: assessing investigator discordance across 9 research teams attempting to reproduce the same observational study.
J. Am. Medical Informatics Assoc., April, 2023

Automatic Extraction of Medication Mentions from Tweets - Overview of the BioCreative VII Shared Task 3 Competition.
Database J. Biol. Databases Curation, February, 2023

Ontologizing health systems data at scale: making translational discovery a reality.
npj Digit. Medicine, 2023

Evaluation of GPT-3.5 and GPT-4 for supporting real-world information needs in healthcare delivery.
CoRR, 2023

Leveraging Large Language Models and Weak Supervision for Social Media Data Annotation: An Evaluation Using COVID-19 Self-reported Vaccination Tweets.
Proceedings of the HCI International 2023 - Late Breaking Papers, 2023

Towards automatic identification of self-reported COVID-19 tweets: Introducing a multilingual manually annotated dataset, baseline systems and exploratory evaluations.
Proceedings of the IEEE International Conference on Big Data, 2023

2022
Identifying epidemic related Tweets using noisy learning.
CoRR, 2022

Ontologizing Health Systems Data at Scale: Making Translational Discovery a Reality.
CoRR, 2022

Characterizing Anti-Asian Rhetoric During The COVID-19 Pandemic: A Sentiment Analysis Case Study on Twitter.
Proceedings of the Workshop Proceedings of the 16th International AAAI Conference on Web and Social Media, 2022

Overview of the Seventh Social Media Mining for Health Applications (#SMM4H) Shared Tasks at COLING 2022.
Proceedings of The Seventh Workshop on Social Media Mining for Health Applications, 2022

An Empirical Study on Characterizing Natural Disasters in Class Imbalanced Social Media Data using Weak Supervision.
Proceedings of the IEEE International Conference on Big Data, 2022

TweetDIS: A Large Twitter Dataset for Natural Disasters Built using Weak Supervision.
Proceedings of the IEEE International Conference on Big Data, 2022

2021
Pulse of the pandemic: Iterative topic filtering for clinical information extraction from social media.
J. Biomed. Informatics, 2021

ACE: the Advanced Cohort Engine for searching longitudinal patient records.
J. Am. Medical Informatics Assoc., 2021

A Biomedically oriented automatically annotated Twitter COVID-19 Dataset.
CoRR, 2021

Overview of the Sixth Social Media Mining for Health Applications (#SMM4H) Shared Tasks at NAACL 2021.
Proceedings of the Sixth Social Media Mining for Health Workshop and Shared Task, 2021

Characterization of Anonymous Physician Perspectives on COVID-19 Using Social Media Data.
Proceedings of the Biocomputing 2021: Proceedings of the Pacific Symposium, 2021

2020
Ten simple rules to run a successful BioHackathon.
PLoS Comput. Biol., 2020

Development and validation of phenotype classifiers across multiple sites in the observational health data sciences and informatics network.
J. Am. Medical Informatics Assoc., 2020

Characterization of Potential Drug Treatments for COVID-19 using Social Media Data and Machine Learning.
CoRR, 2020

GLEAKE: Global and Local Embedding Automatic Keyphrase Extraction.
CoRR, 2020

A large-scale COVID-19 Twitter chatter dataset for open scientific research - an international collaboration.
CoRR, 2020

A large-scale Twitter dataset for drug safety applications mined from publicly existing resources.
CoRR, 2020

Social Media Mining Toolkit (SMMT).
CoRR, 2020

Mining Archive.org's Twitter Stream Grab for Pharmacovigilance Research Gold.
Proceedings of the Fourteenth International AAAI Conference on Web and Social Media, 2020

Characterizing drug mentions in COVID-19 Twitter Chatter.
Proceedings of the 1st Workshop on NLP for COVID-19@ EMNLP 2020, Online, December 2020, 2020

Normalizing Clinical Document Titles to LOINC Document Ontology: an Initial Study.
Proceedings of the AMIA 2020, 2020

2019
Finding missed cases of familial hypercholesterolemia in health systems using machine learning.
npj Digit. Medicine, 2019

Extending Achilles Heel Data Quality Tool with New Rules Informed by Multi-Site Data Quality Comparison.
Proceedings of the MEDINFO 2019: Health and Wellbeing e-Networks for All, 2019

Solar Event Tracking with Deep Regression Networks: A Proof of Concept Evaluation.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

2018
Nanopublications: A Growing Resource of Provenance-Centric Scientific Linked Data.
Proceedings of the 14th IEEE International Conference on e-Science, 2018

Identifying cases of metastatic prostate cancer using machine learning on electronic health records.
Proceedings of the AMIA 2018, 2018

Scalable Electronic Phenotyping For Studying Patient Comorbidities.
Proceedings of the AMIA 2018, 2018

2017
Solar Event Classification Using Deep Convolutional Neural Networks.
Proceedings of the Artificial Intelligence and Soft Computing, 2017

Electronic phenotyping with APHRODITE and the Observational Health Sciences and Informatics (OHDSI) data network.
Proceedings of the Summit on Clinical Research Informatics, 2017

2016
Mining At Most Top-K% Spatiotemporal Co-occurrence Patterns in Datasets with Extended Spatial Representations.
ACM Trans. Spatial Algorithms Syst., 2016

Learning statistical models of phenotypes using noisy labeled training data.
J. Am. Medical Informatics Assoc., 2016

2015
On visualization techniques for solar data mining.
Astron. Comput., 2015

Regional content-based image retrieval for solar images: Traditional versus modern methods.
Astron. Comput., 2015

Provenance-Centered Dataset of Drug-Drug Interactions.
Proceedings of the Semantic Web - ISWC 2015, 2015

Unsupervised Learning Techniques for Detection of Regions of Interest in Solar Images.
Proceedings of the IEEE International Conference on Data Mining Workshop, 2015

2014
Image retrieval on compressed images: Can we tell the difference?
Proceedings of the 4th International Conference on Image Processing Theory, 2014

Large-Scale Region-Based Multimedia Retrieval for Solar Images.
Proceedings of the Artificial Intelligence and Soft Computing, 2014

Scalable solar image Retrieval with Lucene.
Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

2013
A large-scale solar image dataset with labeled event regions.
Proceedings of the IEEE International Conference on Image Processing, 2013

On Using SIFT Descriptors for Image Parameter Evaluation.
Proceedings of the 13th IEEE International Conference on Data Mining Workshops, 2013

Region-Based Querying of Solar Data Using Descriptor Signatures.
Proceedings of the 13th IEEE International Conference on Data Mining Workshops, 2013

A Comprehensive Study of iDistance Partitioning Strategies for kNN Queries and High-Dimensional Data Indexing.
Proceedings of the Big Data - 29th British National Conference on Databases, 2013

Extending High-Dimensional Indexing Techniques Pyramid and iMinMax(θ): Lessons Learned.
Proceedings of the Big Data - 29th British National Conference on Databases, 2013

Spatiotemporal Co-occurrence Rules.
Proceedings of the New Trends in Databases and Information Systems, 2013

When Too Similar Is Bad: A Practical Example of the Solar Dynamics Observatory Content-Based Image-Retrieval System.
Proceedings of the New Trends in Databases and Information Systems, 2013

Big Data New Frontiers: Mining, Search and Management of Massive Repositories of Solar Image Data and Solar Events.
Proceedings of the New Trends in Databases and Information Systems, 2013

2012
Spatio-temporal Co-occurrence Pattern Mining in Data Sets with Evolving Regions.
Proceedings of the 12th IEEE International Conference on Data Mining Workshops, 2012

Quantitative Comparison of Linear and Non-linear Dimensionality Reduction Techniques for Solar Image Archives.
Proceedings of the Twenty-Fifth International Florida Artificial Intelligence Research Society Conference, 2012

2011
On the surprisingly accurate transfer of image parameters between medical and solar images.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

2010
An Experimental Evaluation of Popular Image Parameters for Monochromatic Solar Image Categorization.
Proceedings of the Twenty-Third International Florida Artificial Intelligence Research Society Conference, 2010

Selection of Image Parameters as the First Step towards Creating a CBIR System for the Solar Dynamics Observatory.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2010

Usage of Dissimilarity Measures and Multidimensional Scaling for Large Scale Solar Data Analysis.
Proceedings of the 2010 Conference on Intelligent Data Understanding, 2010

2009
On the effectiveness of fuzzy clustering as a data discretization technique for large-scale classification of solar images.
Proceedings of the FUZZ-IEEE 2009, 2009


  Loading...