Jie Yang

ORCID: 0000-0001-5696-363X

Affiliations:
  • Harvard University, Harvard Medical School, USA
  • Singapore University of Technology and Design, Singapore (former)


According to our database, Jie Yang authored at least 47 papers between 2016 and 2025.

Bibliography

2025
Scalable Medication Extraction and Discontinuation Identification from Electronic Health Records Using Large Language Models.
CoRR, June, 2025

BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text.
CoRR, April, 2025

Natural language processing for scalable feature engineering and ultra-high-dimensional confounding adjustment in healthcare database studies.
J. Biomed. Informatics, 2025

Analysis of longitudinal social media for monitoring symptoms during a pandemic.
J. Biomed. Informatics, 2025

Identification of an ANCA-associated vasculitis cohort using deep learning and electronic health records.
Int. J. Medical Informatics, 2025

Comparative ranking of marginal confounding impact of natural language processing-derived versus structured features in pharmacoepidemiology.
Comput. Biol. Medicine, 2025

Grad: Guided Relation Diffusion Generation for Graph Augmentation in Graph Fraud Detection.
Proceedings of the ACM on Web Conference 2025, 2025

2024
Better Pay Attention Whilst Fuzzing.
IEEE Trans. Software Eng., February, 2024

Large language models leverage external knowledge to extend clinical insight beyond language boundaries.
J. Am. Medical Informatics Assoc., 2024

Streamlining social media information retrieval for public health research with deep learning.
J. Am. Medical Informatics Assoc., 2024

Revealing COVID-19's Social Dynamics: Diachronic Semantic Analysis of Vaccine and Symptom Discourse on Twitter.
CoRR, 2024

FinTruthQA: A Benchmark Dataset for Evaluating the Quality of Financial Information Disclosure.
CoRR, 2024

Generation is better than Modification: Combating High Class Homophily Variance in Graph Anomaly Detection.
CoRR, 2024

MedKP: Medical Dialogue with Knowledge Enhancement and Clinical Pathway Encoding.
CoRR, 2024

MedJourney: Benchmark and Evaluation of Large Language Models over Patient Clinical Journey.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Guiding Clinical Reasoning with Large Language Models via Knowledge Seeds.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Revealing COVID-19's Social Dynamics: Diachronic Semantic Analysis of Vaccine and Symptom Discourse on Twitter.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Pre-trained Online Contrastive Learning for Insurance Fraud Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Streamlining Social Media Information Retrieval for Public Health Research with Deep Learning.
CoRR, 2023

Qualifying Chinese Medical Licensing Examination with Knowledge Enhanced Generative Pre-training Model.
CoRR, 2023

Exploring Social Media for Early Detection of Depression in COVID-19 Patients.
Proceedings of the ACM Web Conference 2023, 2023

GreenPLM: Cross-Lingual Transfer of Monolingual Pre-Trained Language Models at Almost No Cost.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

YATO: Yet Another deep learning based Text analysis Open toolkit.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Using Twitter data to understand public perceptions of approved versus off-label use for COVID-19-related medications.
J. Am. Medical Informatics Assoc., 2022

GreenPLM: Cross-lingual pre-trained language models conversion with (almost) no cost.
CoRR, 2022

METS-CoV: A Dataset of Medical Entity and Targeted Sentiment on COVID-19 Related Tweets.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Low-resource Accent Classification in Geographically-proximate Settings: A Forensic and Sociophonetics Perspective.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Better Pay Attention Whilst Fuzzing.
CoRR, 2021

Natural Language Processing to Identify Abnormal Breast, Lung, and Cervical Cancer Screening Test Results from Unstructured Reports to Support Timely Follow-up.
Proceedings of the MEDINFO 2021: One World, One Health - Global Partnership for Digital Innovation, 2021

Comparison of Machine Learning Algorithms for Earlier Detection of Cognitive Decline from Clinical Notes in the Electronic Health Records.
Proceedings of the AMIA 2021, American Medical Informatics Association Annual Symposium, San Diego, CA, USA, October 30, 2021, 2021

2020
Lattice LSTM for Chinese Sentence Representation.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

From Genesis to Creole Language: Transfer Learning for Singlish Universal Dependencies Parsing and POS Tagging.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2020

Mining clinical phrases from nursing notes to discover risk factors of patient deterioration.
Int. J. Medical Informatics, 2020

Deep Learning to Detect Allergy Events from Hospital Safety Reports.
Proceedings of the AMIA 2020, 2020

Implementing an IT-Based Intervention to Improve Follow-up Rates of Abnormal Cancer Screening Results: the mFOCUS Trial.
Proceedings of the AMIA 2020, 2020

2019
Subword Encoding in Lattice LSTM for Chinese Word Segmentation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

2018
Design Challenges and Misconceptions in Neural Sequence Labeling.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Chinese NER Using Lattice LSTM.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

YEDDA: A Lightweight Collaborative Text Span Annotation Tool.
Proceedings of ACL 2018, Melbourne, Australia, July 15-20, 2018, System Demonstrations, 2018

NCRF++: An Open-source Neural Sequence Labeling Toolkit.
Proceedings of ACL 2018, Melbourne, Australia, July 15-20, 2018, System Demonstrations, 2018

2017
YEDDA: A Lightweight Collaborative Text Span Annotation Tool.
CoRR, 2017

Neural Reranking for Named Entity Recognition.
Proceedings of the International Conference Recent Advances in Natural Language Processing, 2017

Attention-based Recurrent Convolutional Neural Network for Automatic Essay Scoring.
Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), 2017

Neural Word Segmentation with Rich Pretraining.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Universal Dependencies Parsing for Colloquial Singaporean English.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
LibN3L: A Lightweight Package for Neural NLP.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Combining Discrete and Neural Features for Sequence Labeling.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2016
