Yu Cheng

Orcid: 0000-0002-2315-5641

Affiliations:
  • Rice University, TX, USA
  • Microsoft Research, Redmond, WA, USA (former)
  • IBM T.J. Watson Center, Yorktown Heights, NY, USA (former)
  • Northwestern University, Evanston, IL, USA (PhD 2015)


According to our database1, Yu Cheng authored at least 140 papers between 2009 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Transform-Equivariant Consistency Learning for Temporal Sentence Grounding.
ACM Trans. Multim. Comput. Commun. Appl., April, 2024

ProS: Facial Omni-Representation Learning via Prototype-based Self-Distillation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Enhancing Low-Resource Relation Representations through Multi-View Decoupling.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy.
CoRR, 2023

GRILL: Grounded Vision-language Pre-training via Aligning Text and Image Regions.
CoRR, 2023

Filling the Information Gap between Video and Query for Language-Driven Moment Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Annotations Are Not All You Need: A Cross-modal Knowledge Transfer Network for Unsupervised Temporal Sentence Grounding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Robustness Challenges in Model Distillation and Pruning for Natural Language Understanding.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

You Are Catching My Attention: Are Vision Transformers Bad Learners under Backdoor Attacks?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DSEE: Dually Sparsity-embedded Efficient Tuning of Pre-trained Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Hypotheses Tree Building for One-Shot Temporal Sentence Localization.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Adversarial Feature Augmentation and Normalization for Visual Recognition.
Trans. Mach. Learn. Res., 2022

M<sup>3</sup>ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design.
CoRR, 2022

M³ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Backdoor Attacks on Crowd Counting.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Rethinking the Video Sampling and Reasoning Strategies for Temporal Sentence Grounding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training.
Proceedings of the Computer Vision - ECCV 2022, 2022

Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction.
Proceedings of the Computer Vision - ECCV 2022, 2022

DnA: Improving Few-Shot Transfer Learning with Low-Rank Decomposition and Alignment.
Proceedings of the Computer Vision - ECCV 2022, 2022

Scalable Learning to Optimize: A Learned Optimizer Can Train Big Models.
Proceedings of the Computer Vision - ECCV 2022, 2022

The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Unsupervised Temporal Video Grounding with Deep Semantic Clustering.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Memory-Guided Semantic Learning Network for Temporal Sentence Grounding.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Playing Lottery Tickets with Vision and Language.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Efficient Robust Training via Backward Smoothing.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Bayesian Cycle-Consistent Generative Adversarial Networks via Marginalizing Latent Sampling.
IEEE Trans. Neural Networks Learn. Syst., 2021

EnlightenGAN: Deep Light Enhancement Without Paired Supervision.
IEEE Trans. Image Process., 2021

What do Compressed Large Language Models Forget? Robustness Challenges in Model Compression.
CoRR, 2021

Playing Lottery Tickets with Vision and Language.
CoRR, 2021

CUPID: Adaptive Curation of Pre-training Data for Video-and-Language Representation Learning.
CoRR, 2021

Ultra-Data-Efficient GAN Training: Drawing A Lottery Ticket First, Then Training It Toughly.
CoRR, 2021

A novel dual-network architecture for mixed-supervised medical image segmentation.
Comput. Medical Imaging Graph., 2021

Meta Module Network for Compositional Visual Reasoning.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

MaxVA: Fast Adaptation of Step Sizes by Maximizing Observed Variance of Gradients.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2021

Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

The Elastic Lottery Ticket Hypothesis.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Chasing Sparsity in Vision Transformers: An End-to-End Exploration.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Data-Efficient GAN Training Beyond (Just) Augmentations: A Lottery Ticket Perspective.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

APo-VAE: Text Generation in Hyperbolic Space.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective.
Proceedings of the 9th International Conference on Learning Representations, 2021

UC2: Universal Cross-Lingual Cross-Modal Vision-and-Language Pre-Training.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Context-Aware Biaffine Localizing Network for Temporal Sentence Grounding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Cluster-Former: Clustering-based Sparse Transformer for Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Cluster-Former: Clustering-based Sparse Transformer for Long-Range Dependency Encoding.
CoRR, 2020

Fine-grained Iterative Attention Network for TemporalLanguage Localization in Videos.
CoRR, 2020

Adaptive Learning Rates with Maximum Variation Averaging.
CoRR, 2020

Large-Scale Adversarial Training for Vision-and-Language Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Fine-grained Iterative Attention Network for Temporal Language Localization in Videos.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Sequential Attention GAN for Interactive Image Editing.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Graph Optimal Transport for Cross-Domain Alignment.
Proceedings of the 37th International Conference on Machine Learning, 2020

FreeLB: Enhanced Adversarial Training for Natural Language Understanding.
Proceedings of the 8th International Conference on Learning Representations, 2020

Cross-Thought for Sentence Encoder Pre-training.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Contrastive Distillation on Intermediate Representations for Language Model Compression.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Multi-Fact Correction in Abstractive Text Summarization.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Contextual Text Style Transfer.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

UNITER: UNiversal Image-TExt Representation Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models.
Proceedings of the Computer Vision - ECCV 2020, 2020

Violin: A Large-Scale Dataset for Video-and-Language Inference.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

BachGAN: High-Resolution Image Synthesis From Salient Object Layout.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Adversarial Robustness: From Self-Supervised Pre-Training to Fine-Tuning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Discourse-Aware Neural Extractive Text Summarization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

INSET: Sentence Infilling with INter-SEntential Transformer.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Distilling Knowledge Learned in BERT for Text Generation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Contrastively Smoothed Class Alignment for Unsupervised Domain Adaptation.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

What Makes A Good Story? Designing Composite Rewards for Visual Storytelling.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Attend to count: Crowd counting with adaptive capacity multi-scale CNNs.
Neurocomputing, 2019

A hybrid approach with optimization-based and metric-based meta-learner for few-shot learning.
Neurocomputing, 2019

INSET: Sentence Infilling with Inter-sentential Generative Pre-training.
CoRR, 2019

Distilling the Knowledge of BERT for Text Generation.
CoRR, 2019

Discourse-Aware Neural Extractive Model for Text Summarization.
CoRR, 2019

Tell-the-difference: Fine-grained Visual Descriptor via a Discriminating Referee.
CoRR, 2019

FreeLB: Enhanced Adversarial Training for Language Understanding.
CoRR, 2019

UNITER: Learning UNiversal Image-TExt Representations.
CoRR, 2019

A Hybrid Approach with Optimization and Metric-based Meta-Learner for Few-Shot Learning.
CoRR, 2019

Few-shot Learning with Meta Metric Learners.
CoRR, 2019

Adversarial Category Alignment Network for Cross-domain Sentiment Classification.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Relation-Aware Graph Attention Network for Visual Question Answering.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Patient Knowledge Distillation for BERT Model Compression.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Domain Adaptive Text Style Transfer.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

StoryGAN: A Sequential Conditional GAN for Story Visualization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Early Fault Detection Approach With Deep Architectures.
IEEE Trans. Instrum. Meas., 2018

Model Compression and Acceleration for Deep Neural Networks: The Principles, Progress, and Challenges.
IEEE Signal Process. Mag., 2018

Switching-State Dynamical Modeling of Daily Behavioral Data.
J. Heal. Informatics Res., 2018

Sequential Attention GAN for Interactive Image Editing via Dialogue.
CoRR, 2018

Bayesian CycleGAN via Marginalizing Latent Sampling.
CoRR, 2018

Spatial-Temporal Synergic Residual Learning for Video Person Re-Identification.
CoRR, 2018

Dialog-based Interactive Image Retrieval.
CoRR, 2018

Pedestrian-Synthesis-GAN: Generating Pedestrian Data in Real Scene and Beyond.
CoRR, 2018

Dialog-based Interactive Image Retrieval.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Diverse Few-Shot Text Classification with Multiple Metrics.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

2017
Unsupervised Sequential Outlier Detection With Deep Architectures.
IEEE Trans. Image Process., 2017

Deep Model Based Domain Adaptation for Fault Diagnosis.
IEEE Trans. Ind. Electron., 2017

Robust extrinsic calibration from pedestrians.
Signal Process. Image Commun., 2017

Legislative prediction with dual uncertainty minimization from heterogeneous information.
Stat. Anal. Data Min., 2017

A Survey of Model Compression and Acceleration for Deep Neural Networks.
CoRR, 2017

Exploiting Convolutional Neural Network for Risk Prediction with Medical Feature Embedding.
CoRR, 2017

Boosting Deep Learning Risk Prediction with Generative Adversarial Networks for Electronic Health Records.
Proceedings of the 2017 IEEE International Conference on Data Mining, 2017

Jointly Attentive Spatial-Temporal Pooling Networks for Video-Based Person Re-identification.
Proceedings of the IEEE International Conference on Computer Vision, 2017

2015
Fast Neural Networks with Circulant Projections.
CoRR, 2015

An Exploration of Parameter Redundancy in Deep Networks with Circulant Projections.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Reducing infrequent-token perplexity via variational corpora.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
MuSES: Multilingual Sentiment Elicitation System for Social Media Data.
IEEE Intell. Syst., 2014

IBM-Northwestern@TRECVID 2014: Surveillance Event Detection.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Batch Mode Active Learning with Hierarchical-Structured Embedded Variance.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

RiskWheel: Interactive visual analytics for surveillance event detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Estimating Online User Location Distribution without GPS Location.
Proceedings of the 2014 IEEE International Conference on Data Mining Workshops, 2014

Dual Uncertainty Minimization Regularization and Its Applications on Heterogeneous Data.
Proceedings of the 2014 IEEE International Conference on Data Mining Workshops, 2014

Social Role Identification via Dual Uncertainty Minimization Regularization.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

Temporal Sequence Modeling for Video Event Detection.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
IBM Research and Columbia University TRECVID-2013 Multimedia Event Detection (MED), Multimedia Event Recounting (MER), Surveillance Event Detection (SED), and Semantic Indexing (SIN) Systems.
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

Graphical Modeling of Macro Behavioral Targeting in Social Networks.
Proceedings of the 13th SIAM International Conference on Data Mining, 2013

A robot-in-the-loop face detection and recognition system with coordinaive control platform.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2013

JobMiner: a real-time system for mining job-related patterns from social media.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Forecast Oriented Classification of Spatio-Temporal Extreme Events.
Proceedings of the IJCAI 2013, 2013

Mining diabetes complication and treatment patterns for clinical decision support.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Bootstrapping active name disambiguation with crowdsourcing.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Feedback-driven multiclass active learning for data streams.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Elver: Recommending Facebook pages in cold start situation without content features.
Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

A probabilistic graphical model for brand reputation assessment in social networks.
Proceedings of the Advances in Social Networks Analysis and Mining 2013, 2013

2012
Sentiment identification by incorporating syntax, semantics and context information.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

VOXSUP: a social engagement framework.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

CluChunk: clustering large scale user-generated content incorporating chunklet information.
Proceedings of the 1st International Workshop on Big Data, 2012

Probabilistic macro behavioral targeting.
Proceedings of the 2012 workshop on Data-driven User Behavioral Modelling and Mining from Social Media, 2012

On active learning in hierarchical classification.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
SES: Sentiment Elicitation System for Social Media Data.
Proceedings of the Data Mining Workshops (ICDMW), 2011

Learning to Group Web Text Incorporating Prior Information.
Proceedings of the Data Mining Workshops (ICDMW), 2011

Mining millions of reviews: a technique to rank products based on importance of reviews.
Proceedings of the 13th International Conference on Electronic Commerce, 2011

2010
A real-time face detection and recognition system for a mobile robot in a complex background.
Artif. Life Robotics, 2010

Design of Fast Multiple String Searching Based on Improved Prefix Tree.
Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, 2010

A fast clustering approach for effectively searching person specific image.
Proceedings of the Second International Conference on Digital Image Processing, 2010

2009
Formation and obstacle avoidance in the unknown environment of multi-robot system.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2009

Fast person-specific image retrieval using a simple and efficient clustering method.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2009

VisionSynaptics: a system convert hand-writing and image symbol into computer symbol.
Proceedings of the 2nd International Conference on Interaction Sciences: Information Technology, 2009


  Loading...