Yan Yan

Proceedings of the 20th IEEE International Conference on Automatic Face and Gesture Recognition, 2026

ProxyTTT: Proxy-driven Test-Time Training for Multi-modal Re-identification.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Distill Video Datasets into Images.

[BibT_eX]

[DOI]

CoRR, December, 2025

Consistent Instance Field for Dynamic Scene Understanding.

[BibT_eX]

[DOI]

CoRR, December, 2025

From Particles to Fields: Reframing Photon Mapping with Continuous Gaussian Photon Fields.

[BibT_eX]

[DOI]

CoRR, December, 2025

TraceFlow: Dynamic 3D Reconstruction of Specular Scenes Driven by Ray Tracing.

[BibT_eX]

[DOI]

CoRR, December, 2025

GLaD: Geometric Latent Distillation for Vision-Language-Action Models.

[BibT_eX]

[DOI]

CoRR, December, 2025

Motion Marionette: Rethinking Rigid Motion Transfer via Prior Guidance.

[BibT_eX]

[DOI]

CoRR, November, 2025

MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs.

[BibT_eX]

[DOI]

CoRR, November, 2025

Semantic-Cohesive Knowledge Distillation for Deep Cross-modal Hashing.

[BibT_eX]

[DOI]

Changchang Sun

Vickie Chen

CoRR, October, 2025

Efficient Multimodal Dataset Distillation via Generative Models.

[BibT_eX]

[DOI]

CoRR, September, 2025

CaO2: Rectifying Inconsistencies in Diffusion-Based Dataset Distillation.

[BibT_eX]

[DOI]

CoRR, June, 2025

3DResT: A Strong Baseline for Semi-Supervised 3D Referring Expression Segmentation.

[BibT_eX]

[DOI]

CoRR, April, 2025

MM-UNet: Meta Mamba UNet for Medical Image Segmentation.

[BibT_eX]

[DOI]

Bin Xie

Gady Agam

CoRR, March, 2025

X-Field: A Physically Grounded Representation for 3D X-ray Reconstruction.

[BibT_eX]

[DOI]

CoRR, March, 2025

RFMedSAM 2: Automatic Prompt Refinement for Enhanced Volumetric Medical Image Segmentation with SAM 2.

[BibT_eX]

[DOI]

CoRR, February, 2025

Self-Prompt SAM: Medical Image Segmentation via Automatic Prompt SAM Adaptation.

[BibT_eX]

[DOI]

CoRR, February, 2025

Orientation-anchored Hyper-Gaussian for 4D Reconstruction from Casual Videos.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

X-Field: A Physically Informed Representation for 3D X-ray Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

ZECO: ZeroFusion Guided 3D MRI Conditional Generation.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Machine Vision and Applications, 2025

Noise-based Regularized Training for Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Machine Vision and Applications, 2025

SSDL: Sensor-to-Skeleton Diffusion Model with Lipschitz Regularization for Human Activity Recognition.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling, 2025

Quantized-ViT Efficient Training via Fisher Matrix Regularization.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling, 2025

Visual Grounding with Attention-Driven Constraint Balancing.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Online Multispectral Neuron Tracing.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Symposium on Biomedical Imaging, 2025

Harnessing LLMs for Document-Guided Fuzzing of OpenCV Library.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Software Maintenance and Evolution, 2025

Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Tie-Breaking Conflict-Ease Cross-Modal Hashing.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Image Processing, ICIP 2025, 2025

MaskSAM: Auto-Prompt SAM with Mask Classification for Volumetric Medical Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

CaO2: Rectifying Inconsistencies in Diffusion-Based Dataset Distillation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

QuEST: Low-Bit Diffusion Model Quantization via Efficient Selective Finetuning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

LLaVA-Prumerge: Adaptive Token Reduction for Efficient Large Multimodal Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Robin3D Improving 3D Large Language Model via Robust Instruction Tuning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Distilling Long-tailed Datasets.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Enhancing Dance-to-Music Generation via Negative Conditioning Latent Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

PLRV-O: Advancing Differentially Private Deep Learning via Privacy Loss Random Variable Optimization.

[BibT_eX]

[DOI]

Proceedings of the 2025 ACM SIGSAC Conference on Computer and Communications Security, 2025

2024

Vision + X: A Survey on Multimodal Learning in the Light of Data.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Intelligent salivary biosensors for periodontitis: in vitro simulation of oral oxidative stress conditions.

[BibT_eX]

[DOI]

Medical Biol. Eng. Comput., August, 2024

Stochastic Latent Talking Face Generation Toward Emotional Expressions and Head Poses.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., April, 2024

Attribute-Guided Cross-Modal Interaction and Enhancement for Audio-Visual Matching.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Forensics Secur., 2024

THISNet: Tooth Instance Segmentation on 3D Dental Models via Highlighting Tooth Regions.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2024

Robust Audio-Visual Contrastive Learning for Proposal-Based Self-Supervised Sound Source Localization in Videos.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2024

Forget Vectors at Play: Universal Input Perturbations Driving Machine Unlearning in Image Classification.

[BibT_eX]

[DOI]

CoRR, 2024

GALOT: Generative Active Learning via Optimizable Zero-shot Text-to-image Generation.

[BibT_eX]

[DOI]

CoRR, 2024

E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling.

[BibT_eX]

[DOI]

CoRR, 2024

freePruner: A Training-free Approach for Large Multimodal Model Acceleration.

[BibT_eX]

[DOI]

CoRR, 2024

MambaReg: Mamba-Based Disentangled Convolutional Sparse Coding for Unsupervised Deformable Multi-Modal Image Registration.

[BibT_eX]

[DOI]

CoRR, 2024

Interpolating Video-LLMs: Toward Longer-sequence LMMs in a Training-free Manner.

[BibT_eX]

[DOI]

CoRR, 2024

ACTRESS: Active Retraining for Semi-supervised Visual Grounding.

[BibT_eX]

[DOI]

CoRR, 2024

LMO-DP: Optimizing the Randomization Mechanism for Differentially Private Fine-Tuning (Large) Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

A Survey on Multimodal Wearable Sensor-based Human Action Recognition.

[BibT_eX]

[DOI]

CoRR, 2024

MaskSAM: Towards Auto-prompt SAM with Mask Classification for Medical Image Segmentation.

[BibT_eX]

[DOI]

CoRR, 2024

Online Multi-spectral Neuron Tracing.

[BibT_eX]

[DOI]

CoRR, 2024

LLM Inference Unveiled: Survey and Roofline Model Insights.

[BibT_eX]

[DOI]

CoRR, 2024

QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning.

[BibT_eX]

[DOI]

CoRR, 2024

Mining and Unifying Heterogeneous Contrastive Relations for Weakly-Supervised Actor-Action Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Semi-supervised Prototype Semantic Association Learning for Robust Cross-modal Retrieval.

[BibT_eX]

[DOI]

Junsheng Wang

Tiantian Gong

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

PTQ4DiT: Post-training Quantization for Diffusion Transformers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Partially Aligned Cross-modal Retrieval via Optimal Transport-based Prototype Alignment Learning.

[BibT_eX]

[DOI]

Junsheng Wang

Tiantian Gong

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Monocular Expressive 3D Human Reconstruction of Multiple People.

[BibT_eX]

[DOI]

Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Gated Multi-Scale Attention Transformer For Few-Shot Medical Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

Towards Accurate And Robust Dynamics and Reward Modeling for Model-Based Offline Inverse Reinforcement Learning.

[BibT_eX]

[DOI]

Gengyu Zhang

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

FBPT: A Fully Binary Point Transformer.

[BibT_eX]

[DOI]

Zhixing Hou

Yuzhang Shang

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Audio-Visual Navigation with Anti-Backtracking.

[BibT_eX]

[DOI]

Zhenghao Zhao

Hao Tang

Proceedings of the Pattern Recognition - 27th International Conference, 2024

LightHART: Lightweight Human Activity Recognition Transformer.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition - 27th International Conference, 2024

Supplementing Missing Visions Via Dialog for Scene Graph Generations.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Text-Video Completion Networks With Motion Compensation And Attention Aggregation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Adaptive Cross-Architecture Mutual Knowledge Distillation.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024

Dataset Quantization with Active Learning Based Adaptive Sampling.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Versatile Navigation Under Partial Observability via Value-Guided Diffusion Policy.

[BibT_eX]

[DOI]

Gengyu Zhang

Hao Tang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

On the Faithfulness of Vision Transformer Explanations.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Token Transformation Matters: Towards Faithful Post-Hoc Explanation for Vision Transformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Enhancing Post-Training Quantization Calibration Through Contrastive Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Efficient Multitask Dense Predictor via Binarization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MS-UMLP: Medical Image Segmentation via Multi-Scale U-shape MLP-Mixer.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2024, 2024

WaveFormer: Wavelet Transformer for Noise-Robust Video Inpainting.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Divide-and-Conquer Completion Network for Video Inpainting.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., June, 2023

Egocentric Early Action Prediction via Adversarial Knowledge Distillation.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2023

Cross-View Panorama Image Synthesis.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Stealthy 3D Poisoning Attack on Video Recognition Models.

[BibT_eX]

[DOI]

Shangyu Xie

Yuan Hong

IEEE Trans. Dependable Secur. Comput., 2023

ASVD: Activation-aware Singular Value Decomposition for Compressing Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Unseen Image Synthesis with Diffusion Models.

[BibT_eX]

[DOI]

CoRR, 2023

BPT: Binary Point Cloud Transformer for Place Recognition.

[BibT_eX]

[DOI]

CoRR, 2023

Boundary Guided Mixing Trajectory for Semantic Control with Diffusion Models.

[BibT_eX]

[DOI]

CoRR, 2023

Optical Flow Estimation in 360° Videos: Dataset, Model and Application.

[BibT_eX]

[DOI]

CoRR, 2023

Intelligent Constraint Classification for Symbolic Execution.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Software Analysis, 2023

Few-shot Medical Image Segmentation with Cycle-resemblance Attention.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Modality Interference Decoupling and Representation Alignment for Caricature-Visual Face Recognition.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Boundary Guided Learning-Free Semantic Control with Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MIM4DD: Mutual Information Maximization for Dataset Distillation.

[BibT_eX]

[DOI]

Yuzhang Shang

Zhihang Yuan

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Cell Instance Segmentation VIA Multi-Scale Non-Local Correlation.

[BibT_eX]

[DOI]

Proceedings of the 20th IEEE International Symposium on Biomedical Imaging, 2023

Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

PFTA-Net: Progressive Feature Alignment and Temporal Attention Fusion Networks for Video Inpainting.

[BibT_eX]

[DOI]

Yanni Zhang

Zhiliang Wu

Proceedings of the IEEE International Conference on Image Processing, 2023

Causal-DFQ: Causality Guided Data-free Network Quantization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Saner Deep Image Registration.

[BibT_eX]

[DOI]

Bin Duan

Ming Zhong

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

MLP-GAN for Brain Vessel Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Flow-Guided Deformable Alignment Network with Self-Supervision for Video Inpainting.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Normal-Abnormal Decoupling Memory for Medical Report Generation.

[BibT_eX]

[DOI]

Guosheng Zhao

Zijian Zhao

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Semi-Supervised Video Inpainting with Cycle Consistency Constraints.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Deep Stereo Video Inpainting.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Post-Training Quantization on Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Skeleton Sequence and RGB Frame Based Multi-Modality Feature Fusion Network for Action Recognition.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2022

Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2022

Unsupervised High-Resolution Portrait Gaze Correction and Animation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Divide-and-Conquer Predictor for Unbiased Scene Graph Generation.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2022

Cross-view panorama image synthesis with progressive attention GANs.

[BibT_eX]

[DOI]

Pattern Recognit., 2022

Saying the Unseen: Video Descriptions via Dialog Agents.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Semi-Supervised Video Inpainting with Cycle Consistency Constraints.

[BibT_eX]

[DOI]

CoRR, 2022

Learning Omnidirectional Flow in 360-degree Video via Siamese Representation.

[BibT_eX]

[DOI]

CoRR, 2022

Visual Perturbation-aware Collaborative Learning for Overcoming the Language Prior Problem.

[BibT_eX]

[DOI]

CoRR, 2022

Discrete Contrastive Diffusion for Cross-Modal and Conditional Generation.

[BibT_eX]

[DOI]

CoRR, 2022

Robust Audio-Visual Instance Discrimination via Active Contrastive Set Mining.

[BibT_eX]

[DOI]

CoRR, 2022

Measuring Bias and Fairness in Multiclass Classification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Networking, Architecture and Storage, 2022

C3CMR: Cross-Modality Cross-Instance Contrastive Learning for Cross-Media Retrieval.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Progressive Cross-modal Knowledge Distillation for Human Action Recognition.

[BibT_eX]

[DOI]

Jianyuan Ni

Anne H. H. Ngu

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Active Contrastive Set Mining for Robust Audio-Visual Instance Discrimination.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud.

[BibT_eX]

[DOI]

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Win The Lottery Ticket Via Fourier Analysis: Frequencies Guided Network Pruning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Cross-Modal Knowledge Distillation For Vision-To-Sensor Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Quantized GAN for Complex Music Generation from Dance Videos.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Network Binarization via Contrastive Learning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Lipschitz Continuity Retained Binary Neural Network.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Learning Omnidirectional Flow in 360$^\circ $ Video via Siamese Representation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

A Proposal-based Paradigm for Self-supervised Sound Source Localization in Videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Deep Normalized Cross-Modal Hashing with Bi-Direction Relation Reasoning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021

Parallel Blockwise Knowledge Distillation for Deep Neural Network Compression.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2021

Segmenting Objects in Day and Night: Edge-Conditioned CNN for Thermal Image Semantic Segmentation.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2021

Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

Discriminative Cross-Modality Attention Network for Temporal Inconsistent Audio-Visual Event Localization.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

Multi-scale single-stage pose detection with adaptive sample training in the classroom scene.

[BibT_eX]

[DOI]

Knowl. Based Syst., 2021

A Metamodel and Framework for Artificial General Intelligence From Theory to Practice.

[BibT_eX]

[DOI]

Kristinn R. Thórisson

J. Artif. Intell. Conscious., 2021

Structured discriminative tensor dictionary learning for unsupervised domain adaptation.

[BibT_eX]

[DOI]

Neurocomputing, 2021

Expert and Crowd-Guided Affect Annotation and Prediction.

[BibT_eX]

[DOI]

CoRR, 2021

Measure Twice, Cut Once: Quantifying Bias and Fairness in Deep Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2021

Simon Says: Evaluating and Mitigating Bias in Pruned Neural Networks with Knowledge Distillation.

[BibT_eX]

[DOI]

CoRR, 2021

Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention.

[BibT_eX]

[DOI]

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Cross-View Exocentric to Egocentric Video Synthesis.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Unsupervised Neural Tracing In Densely Labeled Multispectral Brainbow Images.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021

Lipschitz Continuity Guided Knowledge Distillation.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning Audio-Visual Correlations From Variational Cross-Modal Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Learning To Aggregate and Personalize 3D Face From In-the-Wild Photo Collection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Guest editorial: Image/video understanding and analysis.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2020

A Weakly Supervised Multi-task Ranking Framework for Actor-Action Semantic Segmentation.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2020

Constraint Solving with Deep Learning for Symbolic Execution.

[BibT_eX]

[DOI]

CoRR, 2020

Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation.

[BibT_eX]

[DOI]

CoRR, 2020

Is Pruning Compression?: Investigating Pruning Via Network Layer Similarity.

[BibT_eX]

[DOI]

Cody Blakeney

Ziliang Zong

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Dual In-painting Model for Unsupervised Gaze Correction and Animation in the Wild.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Decoupled Self-attention Module for Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the 25th International Conference on Pattern Recognition, 2020

Cascade Attention Guided Residue Learning GAN for Cross-Modal Translation.

[BibT_eX]

[DOI]

Proceedings of the 25th International Conference on Pattern Recognition, 2020

Revisiting Optical Flow Estimation in 360 Videos.

[BibT_eX]

[DOI]

Keshav Bhandari

Ziliang Zong

Proceedings of the 25th International Conference on Pattern Recognition, 2020

Egok360: A 360 Egocentric Kinetic Human Activity Video Dataset.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Image Processing, 2020

Local-Global Feature for Video-Based One-Shot Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Exocentric to Egocentric Image Generation Via Parallel Generative Adversarial Network.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Hierarchical HMM for Eye Movement Classification.

[BibT_eX]

[DOI]

Ye Zhu

Oleg Komogortsev

Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Describing Unseen Videos via Multi-modal Cooperative Dialog Agents.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Online Depth Learning Against Forgetting in Monocular Videos.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Craft Distillation: Layer-wise Convolutional Neural Network Distillation.

[BibT_eX]

[DOI]

Proceedings of the 7th IEEE International Conference on Cyber Security and Cloud Computing, 2020

Cross-Modal Attention Network for Temporal Inconsistent Audio-Visual Event Localization.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Recurrent Face Aging with Hierarchical AutoRegressive Memory.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2019

GazeCorrection: Self-Guided Eye Manipulation in the wild using Self-Supervised Generative Adversarial Networks.

[BibT_eX]

[DOI]

CoRR, 2019

Structured Discriminative Tensor Dictionary Learning for Unsupervised Domain Adaptation.

[BibT_eX]

[DOI]

CoRR, 2019

Multispectral tracing in densely labeled mouse brain with nTracer.

[BibT_eX]

[DOI]

Madeleine Vandenbrink

Bioinform., 2019

Deep Micro-Dictionary Learning and Coding Network.

[BibT_eX]

[DOI]

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2019

Joint Learning of Self-Representation and Indicator for Multi-View Image Clustering.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Expression Conditional Gan for Facial Expression-to-Expression Translation.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Dual Attention Matching for Audio-Visual Event Localization.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Attribute-Guided Sketch Generation.

[BibT_eX]

[DOI]

Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019

Multi-Channel Attention Selection GAN With Cascaded Semantic Guidance for Cross-View Image Translation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Pattern-Affinitive Propagation Across Depth, Surface Normal and Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

A Bottom-Up Clustering Approach to Unsupervised Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Guest Editorial: Special Section on "Multimedia Understanding via Multimodal Analytics".

[BibT_eX]

[DOI]

Liqiang Nie

Rita Cucchiara

ACM Trans. Multim. Comput. Commun. Appl., 2018

Few-Shot Text and Image Classification via Analogical Transfer Learning.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2018

Flexible Manifold Learning With Optimal Graph for Image and Video Representation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2018

Exploring Web Images to Enhance Skin Disease Analysis Under A Computer Vision Framework.

[BibT_eX]

[DOI]

IEEE Trans. Cybern., 2018

Learn to model blurry motion via directional similarity and filtering.

[BibT_eX]

[DOI]

Pattern Recognit., 2018

Guest Editors' Introduction to the Special Section on Learning with Shared Information for Computer Vision and Multimedia Analysis.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2018

Guest Editorial: Semantic Concept Discovery in MM Data.

[BibT_eX]

[DOI]

Xiaojun Chang

Liqiang Nie

Multim. Tools Appl., 2018

Spatial query based virtual reality GIS analysis platform.

[BibT_eX]

[DOI]

Neurocomputing, 2018

GestureGAN for Hand Gesture-to-Gesture Translation in the Wild.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Dual Generator Generative Adversarial Networks for Multi-domain Image-to-Image Translation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2018, 2018

2017

Media Quality Assessment by Perceptual Gaze-Shift Patterns Discovery.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2017

Perceptually Guided Photo Retargeting.

[BibT_eX]

[DOI]

IEEE Trans. Cybern., 2017

Multiview Physician-Specific Attributes Fusion for Health Seeking.

[BibT_eX]

[DOI]

IEEE Trans. Cybern., 2017

Guest Editorial: Intermediate representation for vision and multimedia applications.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., 2017

Graph self-representation method for unsupervised feature selection.

[BibT_eX]

[DOI]

Neurocomputing, 2017

Class-wise dictionary learning for hyperspectral image classification.

[BibT_eX]

[DOI]

Neurocomputing, 2017

Guest Editorial: Language in Vision.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2017

Detecting anomalous events in videos by learning deep representations of appearance and motion.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2017

Indoor localization via multi-view images and videos.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2017

Learn to Model Motion from Blurry Footages.

[BibT_eX]

[DOI]

CoRR, 2017

A cross-modal adaptation approach for brain decoding.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Weakly Supervised Actor-Action Segmentation via Robust Multi-task Ranking.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Exploring Multitask and Transfer Learning Algorithms for Head Pose Estimation in Dynamic Multiview Scenarios.

[BibT_eX]

[DOI]

Anoop Kolar Rajagopal

Radu L. Vieriu

Proceedings of the Group and Crowd Behavior for Computer Vision, 1st Edition, 2017

2016

Active domain adaptation with noisy labels for multimedia analysis.

[BibT_eX]

[DOI]

Jingkuan Song

Guoyu Lu

World Wide Web, 2016

Semantic Photo Retargeting Under Noisy Image Labels.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2016

Category Specific Dictionary Learning for Attribute Specific Feature Selection.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2016

Optimized Graph Learning Using Partial Tags and Multiple Features for Image and Video Annotation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2016

A Multi-Task Learning Framework for Head Pose Estimation under Target Motion.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2016

Guest Editorial: Representation Learning for Multimedia Data Understanding.

[BibT_eX]

[DOI]

Zhigang Ma

Bingbing Ni

Multim. Tools Appl., 2016

Deep and fast: Deep learning hashing with semi-supervised graph construction.

[BibT_eX]

[DOI]

Image Vis. Comput., 2016

Guest editorial: Bridging the semantic gap in multimedia understanding.

[BibT_eX]

[DOI]

Jiwen Lu

Neurocomputing, 2016

Where am I in the dark: Exploring active transfer learning on the use of indoor localization based on thermal imaging.

[BibT_eX]

[DOI]

Neurocomputing, 2016

Collaborative Sparse Coding for Multiview Action Recognition.

[BibT_eX]

[DOI]

IEEE Multim., 2016

Computational Modeling of Affective Qualities of Abstract Paintings.

[BibT_eX]

[DOI]

IEEE Multim., 2016

A Fast 3D Indoor-Localization Approach Based on Video Queries.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Sparse-coded cross-domain adaptation from the visual to the brain domain.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Person Re-identification via Recurrent Feature Aggregation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2016, 2016

Recurrent Face Aging.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Recognizing Emotions from Abstract Paintings Using Non-Linear Matrix Completion.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Projective Unsupervised Flexible Embedding with Optimal Graph.

[BibT_eX]

[DOI]

Wei Wang

Feiping Nie

Shuicheng Yan

Proceedings of the British Machine Vision Conference 2016, 2016

Sparse Code Filtering for Action Pattern Mining.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2016, 2016

Fortune Teller: Predicting Your Career Path.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

L1-Norm Low-Rank Matrix Factorization by Variational Bayesian Method.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2015

Event Oriented Dictionary Learning for Complex Event Detection.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2015

Egocentric Daily Activity Recognition via Multitask Clustering.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2015

Evaluation of semi-supervised learning method on action recognition.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2015

Supervised Hashing with Pseudo Labels for Scalable Multimedia Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Who's Afraid of Itten: Using the Art Theory of Color Combination to Analyze Emotions in Abstract Paintings.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Analyzing Free-standing Conversational Groups: A Multimodal Approach.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Attribute Guided Dictionary Learning.

[BibT_eX]

[DOI]

Wei Wang

Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Looking at Mondrian's Victory Boogie-Woogie: What Do I Feel?

[BibT_eX]

[DOI]

Andreza Sartori

Gözde Özbal

Alkim Almila Akdag Salah

Albert Ali Salah

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Inferring Painting Style with Multi-Task Dictionary Learning.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

PET: An eye-tracking dataset for animal-centric Pascal object classes.

[BibT_eX]

[DOI]

Syed Omer Gilani

Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Localize Me Anywhere, Anytime: A Multi-task Point-Retrieval Approach.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Optimal graph learning with partial tags and multiple features for image and video annotation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Learning Deep Representations of Appearance and Motion for Anomalous Event Detection.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference 2015, 2015

Complex Event Detection via Event Oriented Dictionary Learning.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014

Multiple Tasks are Better than One: Multi-task Learning and Feature Selection for Head Pose Estimation, Action Recognition and Event Detection.

[BibT_eX]

[DOI]

PhD thesis, 2014

Multitask Linear Discriminant Analysis for View Invariant Action Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

GLocal tells you more: Coupling GLocal structural for feature selection with sparsity for image and video classification.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2014

You Talkin' to Me?: Recognizing Complex Human Interactions in Unconstrained Videos.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

The Mystery of Faces: Investigating Face Contribution for Multimedia Event Detection.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Multimedia Retrieval, 2014

Evaluating Multi-task Learning for Multi-view Head-Pose Classification in Interactive Environments.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Clustered Multi-task Linear Discriminant Analysis for View Invariant Color-Depth Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Conference on Pattern Recognition, 2014

It's all about habits: Exploiting multi-task clustering for activities of daily living analysis.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Minimizing dataset bias: Discriminative multi-task sparse coding through shared subspace learning for image classification.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Exploiting transfer learning for personalized view invariant gesture recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Knowing Where I Am: Exploiting Multi-Task Learning for Multi-view Indoor Image-based Localization.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference, 2014

Recognizing Daily Activities from First-Person Videos with Multi-task Clustering.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2014, 2014

2013

Informedia@TRECVID 2013.

[BibT_eX]

[DOI]

Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

GLocal structural feature selection with sparsity for multimedia data understanding.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

On the relationship between head pose, social attention and personality prediction for unstructured and dynamic group interactions.

[BibT_eX]

[DOI]

Subramanian Ramanathan

Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Multi-task linear discriminant analysis for multi-view action recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Image Processing, 2013

No Matter Where You Are: Flexible Graph-Guided Multi-task Learning for Multi-view Head Pose Classification under Target Motion.

[BibT_eX]

[DOI]

Subramanian Ramanathan

Proceedings of the IEEE International Conference on Computer Vision, 2013

2012

Active transfer learning for multi-view head-pose classification.

[BibT_eX]

[DOI]

Subramanian Ramanathan