Jingbo Shang

Orcid: 0000-0002-7249-4404

Affiliations:
  • University of California, San Diego, Department of Computer Science and Engineering, CA, USA


According to our database1, Jingbo Shang authored at least 141 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Can LLMs Learn from Previous Mistakes? Investigating LLMs' Errors to Boost for Reasoning.
CoRR, 2024

LDB: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step.
CoRR, 2024

TOOLVERIFIER: Generalization to New Tools via Self-Verification.
CoRR, 2024

Smaller Language Models are capable of selecting Instruction-Tuning Training Data for Larger Language Models.
CoRR, 2024

Answer is All You Need: Instruction-following Text Embedding via Answering the Question.
CoRR, 2024

MEMORYLLM: Towards Self-Updatable Large Language Models.
CoRR, 2024

Learning a Decision Tree Algorithm with Transformers.
CoRR, 2024

Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision.
CoRR, 2024

Large Language Models for Time Series: A Survey.
CoRR, 2024

MedLM: Exploring Language Models for Medical Question Answering Systems.
CoRR, 2024

Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding.
CoRR, 2024

2023
OMNIINPUT: A Model-centric Evaluation Framework through Output Distribution.
CoRR, 2023

Physics-Informed Data Denoising for Real-Life Sensing Systems.
CoRR, 2023

DAIL: Data Augmentation for In-Context Learning via Self-Paraphrase.
CoRR, 2023

EmojiLM: Modeling the New Emoji Language.
CoRR, 2023

Eliminating Reasoning via Inferring with Planning: A New Framework to Guide LLMs' Non-linear Thinking.
CoRR, 2023

Fast-ELECTRA for Efficient Pre-training.
CoRR, 2023

Critique Ability of Large Language Models.
CoRR, 2023

Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models.
CoRR, 2023

Learning Concise and Descriptive Attributes for Visual Recognition.
CoRR, 2023

Generating Efficient Training Data via LLM-based Attribute Manipulation.
CoRR, 2023

Towards Few-shot Entity Recognition in Document Images: A Graph Neural Network Approach Robust to Image Manipulation.
CoRR, 2023

Modeling Label Semantics Improves Activity Recognition.
CoRR, 2023

Federated Learning with Client-Exclusive Classes.
CoRR, 2023

Navigating Alignment for Non-identical Client Class Sets: A Label Name-Anchored Federated Learning Framework.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Text Is All You Need: Learning Language Representations for Sequential Recommendation.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

UCEpic: Unifying Aspect Planning and Lexical Constraints for Generating Explanations in Recommendation.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Minimally Supervised Contextual Inference from Human Mobility: An Iterative Collaborative Distillation Framework.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Gradient-based Wang-Landau Algorithm: A Novel Sampler for Output Distribution of Neural Networks over the Input Space.
Proceedings of the International Conference on Machine Learning, 2023

Understand and Modularize Generator Optimization in ELECTRA-style Pretraining.
Proceedings of the International Conference on Machine Learning, 2023

On Compositional Uncertainty Quantification for Seq2seq Graph Parsing.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Concise and Descriptive Attributes for Visual Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Diverse and Coherent Augmentation for Time-Series Forecasting.
Proceedings of the IEEE International Conference on Acoustics, 2023

Goal-Driven Explainable Clustering via Language Descriptions.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Less than One-shot: Named Entity Recognition via Extremely Weak Supervision.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

SELFOOD: Self-Supervised Out-Of-Distribution Detection via Learning to Rank.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Retrieval-Augmented Parsing for Complex Graphs by Exploiting Structure and Uncertainty.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real-World User-AI Conversation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Open-world Semi-supervised Generalized Relation Discovery Aligned in a Real-world Setting.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Debiasing Made State-of-the-art: Revisiting the Simple Seed-based Weak Supervision for Text Classification.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Towards Zero-shot Relation Extraction in Web Mining: A Multimodal Approach with Relative XML Path.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

ClusterLLM: Large Language Models as a Guide for Text Clustering.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

WOT-Class: Weakly Supervised Open-world Text Classification.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Unleashing the Power of Shared Label Structures for Human Activity Recognition.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Towards Open-World Product Attribute Mining: A Lightly-Supervised Approach.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

PV2TEA: Patching Visual Modality to Textual-Established Information Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Concept2Box: Joint Geometric Embeddings for Learning Two-View Knowledge Graphs.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

A Benchmark on Extremely Weakly Supervised Text Classification: Reconcile Seed Matching and Prompting Approaches.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

PrimeNet: Pre-training for Irregular Multivariate Time Series.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Waveformer: Linear-Time Attention with Forward and Backward Wavelet Transform.
CoRR, 2022

UCEpic: Unifying Aspect Planning and Lexical Constraints for Explainable Recommendation.
CoRR, 2022

SoTeacher: A Student-oriented Teacher Network Training Framework for Knowledge Distillation.
CoRR, 2022

Intermediate Training on Question Answering Datasets Improves Generative Data Augmentation.
CoRR, 2022

OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

ESC-GAN: Extending Spatial Coverage of Physical Sensors.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

SQEE: A Machine Perception Approach to Sensing Quality Evaluation at the Edge by Uncertainty Quantification.
Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems, 2022

Label Noise in Adversarial Training: A Novel Perspective to Study Robust Overfitting.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

TARNet: Task-Aware Reconstruction for Time-Series Transformer.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Progressive Sentiment Analysis for Code-Switched Text Data.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Leveraging QA Datasets to Improve Generative Data Augmentation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

LOPS: Learning Order Inspired Pseudo-Label Selection for Weakly Supervised Text Classification.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Neural-Symbolic Inference for Robust Autoregressive Graph Parsing via Compositional Uncertainty Quantification.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

WeDef: Weakly Supervised Backdoor Defense for Text Classification.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Fine-grained Contrastive Learning for Relation Extraction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

MGDoc: Pre-training with Multi-granular Hierarchy for Document Image Understanding.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Formulating Few-shot Fine-tuning Towards Language Model Pre-training: A Pilot Study on Named Entity Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

DeepViFi: detecting oncoviral infections in cancer genomes using transformers.
Proceedings of the BCB '22: 13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, Northbrook, Illinois, USA, August 7, 2022

Learning Adaptive Axis Attentions in Fine-tuning: Beyond Fixed Sparse Attention Patterns.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Towards Collaborative Neural-Symbolic Graph Semantic Parsing via Uncertainty.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

UCTopic: Unsupervised Contrastive Learning for Phrase Representations and Topic Mining.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Phrase-aware Unsupervised Constituency Parsing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Towards Comprehensive Patent Approval Predictions: Beyond Traditional Document Classification.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Towards Few-shot Entity Recognition in Document Images: A Label-aware Sequence-to-Sequence Framework.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Unsupervised Deep Keyphrase Generation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Double Descent in Adversarial Training: An Implicit Label Noise Perspective.
CoRR, 2021

News Meets Microblog: Hashtag Annotation via Retriever-Generator.
CoRR, 2021

Data Profiling for Adversarial Training: On the Ruin of Problematic Data.
CoRR, 2021

Minimally-Supervised Structure-Rich Text Categorization via Learning on Text-Rich Networks.
Proceedings of the WWW '21: The Web Conference 2021, 2021

AutoName: A Corpus-Based Set Naming Framework.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

UniTS: Short-Time Fourier Inspired Neural Networks for Sensory Time Series Classification.
Proceedings of the SenSys '21: The 19th ACM Conference on Embedded Networked Sensor Systems, Coimbra, Portugal, November 15, 2021

"Misc"-Aware Weakly Supervised Aspect Classification.
Proceedings of the 2021 SIAM International Conference on Data Mining, 2021

X-Class: Text Classification with Extremely Weak Supervision.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

TaxoClass: Hierarchical Multi-Label Text Classification Using Only Class Names.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

UCPhrase: Unsupervised Context-aware Quality Phrase Tagging.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

LayoutReader: Pre-training of Text and Layout for Reading Order Detection.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

"Average" Approximates "First Principal Component"? An Empirical Analysis on Representations from Neural Language Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Coarse2Fine: Fine-grained Text Classification on Coarsely-grained Annotated Data.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

BFClass: A Backdoor-free Text Classification Framework.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Sensei: Self-Supervised Sensor Name Segmentation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Weakly Supervised Named Entity Tagging with Learnable Logical Rules.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Overfitting or Underfitting? Understand Robustness Drop in Adversarial Training.
CoRR, 2020

User-Guided Aspect Classification for Domain-Specific Texts.
CoRR, 2020

NetTaxo: Automated Topic Taxonomy Construction from Text-Rich Network.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

FUSE: Multi-faceted Set Expansion by Coherent Clustering of Skip-Grams.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2020

Scientific Text Mining and Knowledge Graphs.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Towards Adaptive Residual Network Training: A Neural-ODE Perspective.
Proceedings of the 37th International Conference on Machine Learning, 2020

SynSetExpan: An Iterative Framework for Joint Entity Set Expansion and Synonym Discovery.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

META: Metadata-Empowered Weak Supervision for Text Classification.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

SeNsER: Learning Cross-Building Sensor Metadata Tagger.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Empower Entity Set Expansion via Language Model Probing.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Contextualized Weak Supervision for Text Classification.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Constructing and mining structured heterogeneous information networks from massive text corpora
PhD thesis, 2019

TextCube: Automated Construction and Multidimensional Exploration.
Proc. VLDB Endow., 2019

CubeNet: Multi-Facet Hierarchical Heterogeneous Network Construction, Analysis, and Mining.
CoRR, 2019

Raw-to-End Name Entity Recognition in Social Media.
CoRR, 2019

Cross-type biomedical named entity recognition with deep multi-task learning.
Bioinform., 2019

Integrating Local Context and Global Cohesiveness for Open Information Extraction.
Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, 2019

Arabic Named Entity Recognition: What Works and What's Next.
Proceedings of the Fourth Arabic Natural Language Processing Workshop, 2019

Constructing and Mining Heterogeneous Information Networks from Massive Text.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

CrossWeigh: Training Named Entity Tagger from Imperfect Annotations.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Distantly Supervised Biomedical Named Entity Recognition with Dictionary Expansion.
Proceedings of the 2019 IEEE International Conference on Bioinformatics and Biomedicine, 2019

2018
Automated Phrase Mining from Massive Text Corpora.
IEEE Trans. Knowl. Data Eng., 2018

DPPred: An Effective Prediction Framework with Concise Discriminative Patterns.
IEEE Trans. Knowl. Data Eng., 2018

Integrating Local Context and Global Cohesiveness for Open Information Extraction.
CoRR, 2018

Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling.
CoRR, 2018

Contrast Subgraph Mining from Coherent Cores.
CoRR, 2018

Open Information Extraction with Global Structure Constraints.
Proceedings of the Companion of the The Web Conference 2018 on The Web Conference 2018, 2018

Entity Set Search of Scientific Literature: An Unsupervised Ranking Approach.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Learning Named Entity Tagger using Domain-Specific Dictionary.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Investigating Rumor News Using Agreement-Aware Search.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Empower Sequence Labeling with Task-Aware Neural Language Model.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Phrase Mining from Massive Text and Its Applications
Synthesis Lectures on Data Mining and Knowledge Discovery, Morgan & Claypool Publishers, ISBN: 978-3-031-01910-4, 2017

Detection of Complexes in Biological Networks Through Diversified Dense Subgraph Mining.
J. Comput. Biol., 2017

Constructing Structured Information Networks from Massive Text Corpora.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Building Structured Databases of Factual Knowledge from Massive Text Corpora.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2017

MetaPAD: Meta Pattern Discovery from Massive Text Corpora.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

Detecting Multiple Periods and Periodic Patterns in Event Time Sequences.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

An Attention-based Collaboration Framework for Multi-View Network Representation Learning.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

2016
Meta-Path Guided Embedding for Similarity Search in Large-Scale Heterogeneous Information Networks.
CoRR, 2016

Representing Documents via Latent Keyphrase Inference.
Proceedings of the 25th International Conference on World Wide Web, 2016

DPClass: An Effective but Concise Discriminative Patterns-Based Classification Framework.
Proceedings of the 2016 SIAM International Conference on Data Mining, 2016

MACFP: Maximal Approximate Consecutive Frequent Pattern Mining under Edit Distance.
Proceedings of the 2016 SIAM International Conference on Data Mining, 2016

Data-Driven Contextual Valence Shifter Quantification for Multi-Theme Sentiment Analysis.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

Mining Discriminative Patterns to Predict Health Status for Cardiopulmonary Patients.
Proceedings of the 7th ACM International Conference on Bioinformatics, 2016

2015
Mining Quality Phrases from Massive Text Corpora.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

Detecting urban black holes based on human mobility data.
Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2015

2014
Inferring gas consumption and pollution emission of vehicles throughout a city.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

A Parallel and Efficient Algorithm for Learning to Match.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

Constrained Question Recommendation in MOOCs via Submodularity.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014


  Loading...