Yan Yan

Orcid: 0000-0002-3182-1739

Affiliations:
  • Illinois Institute of Technology, IL, USA
  • Texas State University, Department of Computer Science, San Marcos, TX, USA (former)
  • University of Michigan, Department of Electrical Engineering and Computer Science, Ann Arbor, MI, USA (2016 - 2017)
  • University of Trento, Department of Information Engineering and Computer Science, Italy (PhD 2014)


According to our database, Yan Yan authored at least 176 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
LLM Inference Unveiled: Survey and Roofline Model Insights.
CoRR, 2024

QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning.
CoRR, 2024

WaveFormer: Wavelet Transformer for Noise-Robust Video Inpainting.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Divide-and-Conquer Completion Network for Video Inpainting.
IEEE Trans. Circuits Syst. Video Technol., June, 2023

Egocentric Early Action Prediction via Adversarial Knowledge Distillation.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Cross-View Panorama Image Synthesis.
IEEE Trans. Multim., 2023

Stealthy 3D Poisoning Attack on Video Recognition Models.
IEEE Trans. Dependable Secur. Comput., 2023

ASVD: Activation-aware Singular Value Decomposition for Compressing Large Language Models.
CoRR, 2023

Unseen Image Synthesis with Diffusion Models.
CoRR, 2023

Physical-aware Cross-modal Adversarial Network for Wearable Sensor-based Human Action Recognition.
CoRR, 2023

BPT: Binary Point Cloud Transformer for Place Recognition.
CoRR, 2023

Boundary Guided Mixing Trajectory for Semantic Control with Diffusion Models.
CoRR, 2023

Optical Flow Estimation in 360° Videos: Dataset, Model and Application.
CoRR, 2023

Intelligent Constraint Classification for Symbolic Execution.
Proceedings of the IEEE International Conference on Software Analysis, 2023

Few-shot Medical Image Segmentation with Cycle-resemblance Attention.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Boundary Guided Learning-Free Semantic Control with Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MIM4DD: Mutual Information Maximization for Dataset Distillation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Cell Instance Segmentation VIA Multi-Scale Non-Local Correlation.
Proceedings of the 20th IEEE International Symposium on Biomedical Imaging, 2023

Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

PFTA-Net: Progressive Feature Alignment and Temporal Attention Fusion Networks for Video Inpainting.
Proceedings of the IEEE International Conference on Image Processing, 2023

Causal-DFQ: Causality Guided Data-free Network Quantization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Saner Deep Image Registration.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

MLP-GAN for Brain Vessel Image Segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Flow-Guided Deformable Alignment Network with Self-Supervision for Video Inpainting.
Proceedings of the IEEE International Conference on Acoustics, 2023

Normal-Abnormal Decoupling Memory for Medical Report Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Semi-Supervised Video Inpainting with Cycle Consistency Constraints.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Deep Stereo Video Inpainting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Post-Training Quantization on Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Skeleton Sequence and RGB Frame Based Multi-Modality Feature Fusion Network for Action Recognition.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching.
IEEE Trans. Multim., 2022

Unsupervised High-Resolution Portrait Gaze Correction and Animation.
IEEE Trans. Image Process., 2022

Divide-and-Conquer Predictor for Unbiased Scene Graph Generation.
IEEE Trans. Circuits Syst. Video Technol., 2022

Cross-view panorama image synthesis with progressive attention GANs.
Pattern Recognit., 2022

Saying the Unseen: Video Descriptions via Dialog Agents.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Vision+X: A Survey on Multimodal Learning in the Light of Data.
CoRR, 2022

Semi-Supervised Video Inpainting with Cycle Consistency Constraints.
CoRR, 2022

Learning Omnidirectional Flow in 360-degree Video via Siamese Representation.
CoRR, 2022

Visual Perturbation-aware Collaborative Learning for Overcoming the Language Prior Problem.
CoRR, 2022

Discrete Contrastive Diffusion for Cross-Modal and Conditional Generation.
CoRR, 2022

Robust Audio-Visual Instance Discrimination via Active Contrastive Set Mining.
CoRR, 2022

Supplementing Missing Visions via Dialog for Scene Graph Generations.
CoRR, 2022

Measuring Bias and Fairness in Multiclass Classification.
Proceedings of the IEEE International Conference on Networking, Architecture and Storage, 2022

C³CMR: Cross-Modality Cross-Instance Contrastive Learning for Cross-Media Retrieval.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Progressive Cross-modal Knowledge Distillation for Human Action Recognition.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Active Contrastive Set Mining for Robust Audio-Visual Instance Discrimination.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Win The Lottery Ticket Via Fourier Analysis: Frequencies Guided Network Pruning.
Proceedings of the IEEE International Conference on Acoustics, 2022

Cross-Modal Knowledge Distillation For Vision-To-Sensor Action Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Quantized GAN for Complex Music Generation from Dance Videos.
Proceedings of the Computer Vision - ECCV 2022, 2022

Network Binarization via Contrastive Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Lipschitz Continuity Retained Binary Neural Network.
Proceedings of the Computer Vision - ECCV 2022, 2022

Learning Omnidirectional Flow in 360° Video via Siamese Representation.
Proceedings of the Computer Vision - ECCV 2022, 2022

A Proposal-based Paradigm for Self-supervised Sound Source Localization in Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Deep Normalized Cross-Modal Hashing with Bi-Direction Relation Reasoning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
Parallel Blockwise Knowledge Distillation for Deep Neural Network Compression.
IEEE Trans. Parallel Distributed Syst., 2021

Segmenting Objects in Day and Night: Edge-Conditioned CNN for Thermal Image Semantic Segmentation.
IEEE Trans. Neural Networks Learn. Syst., 2021

Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos.
IEEE Trans. Image Process., 2021

Discriminative Cross-Modality Attention Network for Temporal Inconsistent Audio-Visual Event Localization.
IEEE Trans. Image Process., 2021

Multi-scale single-stage pose detection with adaptive sample training in the classroom scene.
Knowl. Based Syst., 2021

A Metamodel and Framework for Artificial General Intelligence From Theory to Practice.
J. Artif. Intell. Conscious., 2021

Structured discriminative tensor dictionary learning for unsupervised domain adaptation.
Neurocomputing, 2021

Expert and Crowd-Guided Affect Annotation and Prediction.
CoRR, 2021

Measure Twice, Cut Once: Quantifying Bias and Fairness in Deep Neural Networks.
CoRR, 2021

Simon Says: Evaluating and Mitigating Bias in Pruned Neural Networks with Knowledge Distillation.
CoRR, 2021

Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Cross-View Exocentric to Egocentric Video Synthesis.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Unsupervised Neural Tracing In Densely Labeled Multispectral Brainbow Images.
Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021

Lipschitz Continuity Guided Knowledge Distillation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning Audio-Visual Correlations From Variational Cross-Modal Generation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Learning To Aggregate and Personalize 3D Face From In-the-Wild Photo Collection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Guest editorial: Image/video understanding and analysis.
Pattern Recognit. Lett., 2020

A Weakly Supervised Multi-task Ranking Framework for Actor-Action Semantic Segmentation.
Int. J. Comput. Vis., 2020

Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation.
CoRR, 2020

Is Pruning Compression?: Investigating Pruning Via Network Layer Similarity.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Dual In-painting Model for Unsupervised Gaze Correction and Animation in the Wild.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Decoupled Self-attention Module for Person Re-identification.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Cascade Attention Guided Residue Learning GAN for Cross-Modal Translation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Revisiting Optical Flow Estimation in 360 Videos.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Egok360: A 360 Egocentric Kinetic Human Activity Video Dataset.
Proceedings of the IEEE International Conference on Image Processing, 2020

Local-Global Feature for Video-Based One-Shot Person Re-Identification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Exocentric to Egocentric Image Generation Via Parallel Generative Adversarial Network.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Hierarchical HMM for Eye Movement Classification.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Describing Unseen Videos via Multi-modal Cooperative Dialog Agents.
Proceedings of the Computer Vision - ECCV 2020, 2020

Online Depth Learning Against Forgetting in Monocular Videos.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Craft Distillation: Layer-wise Convolutional Neural Network Distillation.
Proceedings of the 7th IEEE International Conference on Cyber Security and Cloud Computing, 2020

Cross-Modal Attention Network for Temporal Inconsistent Audio-Visual Event Localization.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Recurrent Face Aging with Hierarchical AutoRegressive Memory.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

GazeCorrection: Self-Guided Eye Manipulation in the wild using Self-Supervised Generative Adversarial Networks.
CoRR, 2019

Structured Discriminative Tensor Dictionary Learning for Unsupervised Domain Adaptation.
CoRR, 2019

Multispectral tracing in densely labeled mouse brain with nTracer.
Bioinform., 2019

Deep Micro-Dictionary Learning and Coding Network.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation.
Proceedings of the International Joint Conference on Neural Networks, 2019

Joint Learning of Self-Representation and Indicator for Multi-View Image Clustering.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Expression Conditional Gan for Facial Expression-to-Expression Translation.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Dual Attention Matching for Audio-Visual Event Localization.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Attribute-Guided Sketch Generation.
Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019

Multi-Channel Attention Selection GAN With Cascaded Semantic Guidance for Cross-View Image Translation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Pattern-Affinitive Propagation Across Depth, Surface Normal and Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

A Bottom-Up Clustering Approach to Unsupervised Person Re-Identification.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Guest Editorial: Special Section on "Multimedia Understanding via Multimodal Analytics".
ACM Trans. Multim. Comput. Commun. Appl., 2018

Few-Shot Text and Image Classification via Analogical Transfer Learning.
ACM Trans. Intell. Syst. Technol., 2018

Flexible Manifold Learning With Optimal Graph for Image and Video Representation.
IEEE Trans. Image Process., 2018

Exploring Web Images to Enhance Skin Disease Analysis Under A Computer Vision Framework.
IEEE Trans. Cybern., 2018

Learn to model blurry motion via directional similarity and filtering.
Pattern Recognit., 2018

Guest Editors' Introduction to the Special Section on Learning with Shared Information for Computer Vision and Multimedia Analysis.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Guest Editorial: Semantic Concept Discovery in MM Data.
Multim. Tools Appl., 2018

Spatial query based virtual reality GIS analysis platform.
Neurocomputing, 2018

GestureGAN for Hand Gesture-to-Gesture Translation in the Wild.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Dual Generator Generative Adversarial Networks for Multi-domain Image-to-Image Translation.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Media Quality Assessment by Perceptual Gaze-Shift Patterns Discovery.
IEEE Trans. Multim., 2017

Perceptually Guided Photo Retargeting.
IEEE Trans. Cybern., 2017

Multiview Physician-Specific Attributes Fusion for Health Seeking.
IEEE Trans. Cybern., 2017

Guest Editorial: Intermediate representation for vision and multimedia applications.
J. Vis. Commun. Image Represent., 2017

Graph self-representation method for unsupervised feature selection.
Neurocomputing, 2017

Class-wise dictionary learning for hyperspectral image classification.
Neurocomputing, 2017

Guest Editorial: Language in Vision.
Comput. Vis. Image Underst., 2017

Detecting anomalous events in videos by learning deep representations of appearance and motion.
Comput. Vis. Image Underst., 2017

Indoor localization via multi-view images and videos.
Comput. Vis. Image Underst., 2017

Learn to Model Motion from Blurry Footages.
CoRR, 2017

A cross-modal adaptation approach for brain decoding.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Weakly Supervised Actor-Action Segmentation via Robust Multi-task Ranking.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Exploring Multitask and Transfer Learning Algorithms for Head Pose Estimation in Dynamic Multiview Scenarios.
Proceedings of the Group and Crowd Behavior for Computer Vision, 1st Edition, 2017

2016
Active domain adaptation with noisy labels for multimedia analysis.
World Wide Web, 2016

Semantic Photo Retargeting Under Noisy Image Labels.
ACM Trans. Multim. Comput. Commun. Appl., 2016

Category Specific Dictionary Learning for Attribute Specific Feature Selection.
IEEE Trans. Image Process., 2016

Optimized Graph Learning Using Partial Tags and Multiple Features for Image and Video Annotation.
IEEE Trans. Image Process., 2016

A Multi-Task Learning Framework for Head Pose Estimation under Target Motion.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Guest Editorial: Representation Learning for Multimedia Data Understanding.
Multim. Tools Appl., 2016

Deep and fast: Deep learning hashing with semi-supervised graph construction.
Image Vis. Comput., 2016

Guest editorial: Bridging the semantic gap in multimedia understanding.
Neurocomputing, 2016

Where am I in the dark: Exploring active transfer learning on the use of indoor localization based on thermal imaging.
Neurocomputing, 2016

Collaborative Sparse Coding for Multiview Action Recognition.
IEEE Multim., 2016

Computational Modeling of Affective Qualities of Abstract Paintings.
IEEE Multim., 2016

A Fast 3D Indoor-Localization Approach Based on Video Queries.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Sparse-coded cross-domain adaptation from the visual to the brain domain.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Person Re-identification via Recurrent Feature Aggregation.
Proceedings of the Computer Vision - ECCV 2016, 2016

Recurrent Face Aging.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Recognizing Emotions from Abstract Paintings Using Non-Linear Matrix Completion.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Projective Unsupervised Flexible Embedding with Optimal Graph.
Proceedings of the British Machine Vision Conference 2016, 2016

Sparse Code Filtering for Action Pattern Mining.
Proceedings of the Computer Vision - ACCV 2016, 2016

Fortune Teller: Predicting Your Career Path.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
L₁-Norm Low-Rank Matrix Factorization by Variational Bayesian Method.
IEEE Trans. Neural Networks Learn. Syst., 2015

Event Oriented Dictionary Learning for Complex Event Detection.
IEEE Trans. Image Process., 2015

Egocentric Daily Activity Recognition via Multitask Clustering.
IEEE Trans. Image Process., 2015

Evaluation of semi-supervised learning method on action recognition.
Multim. Tools Appl., 2015

Supervised Hashing with Pseudo Labels for Scalable Multimedia Retrieval.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Who's Afraid of Itten: Using the Art Theory of Color Combination to Analyze Emotions in Abstract Paintings.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Analyzing Free-standing Conversational Groups: A Multimodal Approach.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Attribute Guided Dictionary Learning.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Looking at Mondrian's Victory Boogie-Woogie: What Do I Feel?
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Inferring Painting Style with Multi-Task Dictionary Learning.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

PET: An eye-tracking dataset for animal-centric Pascal object classes.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Localize Me Anywhere, Anytime: A Multi-task Point-Retrieval Approach.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Optimal graph learning with partial tags and multiple features for image and video annotation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Learning Deep Representations of Appearance and Motion for Anomalous Event Detection.
Proceedings of the British Machine Vision Conference 2015, 2015

Complex Event Detection via Event Oriented Dictionary Learning.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Multiple Tasks are Better than One: Multi-task Learning and Feature Selection for Head Pose Estimation, Action Recognition and Event Detection.
PhD thesis, 2014

Multitask Linear Discriminant Analysis for View Invariant Action Recognition.
IEEE Trans. Image Process., 2014

GLocal tells you more: Coupling GLocal structural for feature selection with sparsity for image and video classification.
Comput. Vis. Image Underst., 2014

You Talkin' to Me?: Recognizing Complex Human Interactions in Unconstrained Videos.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

The Mystery of Faces: Investigating Face Contribution for Multimedia Event Detection.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Evaluating Multi-task Learning for Multi-view Head-Pose Classification in Interactive Environments.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Clustered Multi-task Linear Discriminant Analysis for View Invariant Color-Depth Action Recognition.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

It's all about habits: Exploiting multi-task clustering for activities of daily living analysis.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Minimizing dataset bias: Discriminative multi-task sparse coding through shared subspace learning for image classification.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Exploiting transfer learning for personalized view invariant gesture recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Knowing Where I Am: Exploiting Multi-Task Learning for Multi-view Indoor Image-based Localization.
Proceedings of the British Machine Vision Conference, 2014

Recognizing Daily Activities from First-Person Videos with Multi-task Clustering.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
GLocal structural feature selection with sparsity for multimedia data understanding.
Proceedings of the ACM Multimedia Conference, 2013

On the relationship between head pose, social attention and personality prediction for unstructured and dynamic group interactions.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Multi-task linear discriminant analysis for multi-view action recognition.
Proceedings of the IEEE International Conference on Image Processing, 2013

No Matter Where You Are: Flexible Graph-Guided Multi-task Learning for Multi-view Head Pose Classification under Target Motion.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2012
Active transfer learning for multi-view head-pose classification.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

