Jun Yu

Orcid: 0000-0002-3197-8103

Affiliations:
  • University of Science and Technology of China, Department of Automation, Hefei, China


According to our database1, Jun Yu authored at least 150 papers between 2013 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Regularly Truncated M-Estimators for Learning With Noisy Labels.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Pin-CasNet: Detecting pin status in transmission lines based on cascade network.
Eng. Appl. Artif. Intell., January, 2024

Conditional Consistency Regularization for Semi-Supervised Multi-Label Image Classification.
IEEE Trans. Multim., 2024

AUD-TGN: Advancing Action Unit Detection with Temporal Convolution and GPT-2 in Wild Audiovisual Contexts.
CoRR, 2024

Efficient Feature Extraction and Late Fusion Strategy for Audiovisual Emotional Mimicry Intensity Estimation.
CoRR, 2024

2023
Dual-scale point cloud completion network based on high-frequency feature fusion.
Image Vis. Comput., November, 2023

Multi-Object Tracking: Decoupling Features to Solve the Contradictory Dilemma of Feature Requirements.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

A viable framework for semi-supervised learning on realistic dataset.
Mach. Learn., June, 2023

A Two-stage Fine-tuning Strategy for Generalizable Manipulation Skill of Embodied AI.
CoRR, 2023

Making Binary Classification from Multiple Unlabeled Datasets Almost Free of Supervision.
CoRR, 2023

SAR2EO: A High-resolution Image Translation Framework with Denoising Enhancement.
CoRR, 2023

A Dual Branch Network for Emotional Reaction Intensity Estimation.
CoRR, 2023

Exploring Large-scale Unlabeled Faces to Enhance Facial Expression Recognition.
CoRR, 2023

Local Region Perception and Relationship Learning Combined with Feature Fusion for Facial Action Unit Detection.
CoRR, 2023

MMT-GD: Multi-Modal Transformer with Graph Distillation for Cross-Cultural Humor Detection.
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023

Image- and Instance-Level Data Augmentation for Occluded Instance Segmentation.
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

Leveraging the Latent Diffusion Models for Offline Facial Multiple Appropriate Reactions Generation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Sliding Window Seq2seq Modeling for Engagement Estimation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Answer-Based Entity Extraction and Alignment for Visual Text Question Answering.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Efficient Micro-Expression Spotting Based on Main Directional Mean Optical Flow Feature.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

FSR-Net: Deep Fourier Network for Shadow Removal.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Responsive Listening Head Synthesis with 3DMM and Dual-Stream Prediction Network.
Proceedings of the 1st International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice, 2023

Prototypical Contrastive Learning for Domain Adaptive Semantic Segmentation.
Proceedings of the International Joint Conference on Neural Networks, 2023

CRNet: Combining CenterNet and R-CNN for Object Detection in Traffic Scenes.
Proceedings of the 19th International Conference on Natural Computation, 2023

SAGE-NDVI: A Stereotype-Breaking Evaluation Metric for Remote Sensing Image Dehazing Using Satellite-to-Ground NDVI Knowledge.
Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2023

Moderate Coreset: A Universal Method of Data Selection for Real-world Data-efficient Deep Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

RatiO R-CNN: An Efficient and Accurate Detection Method for Oriented Object Detection.
Proceedings of the Image and Graphics - 12th International Conference, 2023

Adaptive Fine-Grained Region Matching for Image Harmonization.
Proceedings of the Image and Graphics - 12th International Conference, 2023

Combating Noisy Labels with Sample Selection by Mining High-Discrepancy Examples.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Cross-Domain Transformer with Adaptive Thresholding for Domain Adaptive Semantic Segmentation.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2023, 2023

A Dual Branch Network for Emotional Reaction Intensity Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Local Region Perception and Relationship Learning Combined with Feature Fusion for Facial Action Unit Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Exploring Large-scale Unlabeled Faces to Enhance Facial Expression Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Robust Generalization Against Photon-Limited Corruptions via Worst-Case Sharpness Minimization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SAR2EO: A High-Resolution Image Translation Framework with Denoising Enhancement.
Proceedings of the AI 2023: Advances in Artificial Intelligence, 2023

2022
Densely Enhanced Semantic Network for Conversation System in Social Media.
ACM Trans. Multim. Comput. Commun. Appl., 2022

TWGAN: Twin Discriminator Generative Adversarial Networks.
IEEE Trans. Multim., 2022

LR-SVM+: Learning Using Privileged Information with Noisy Labels.
IEEE Trans. Multim., 2022

Embedding Pose Information for Multiview Vehicle Model Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2022

Learning from Noisy Pairwise Similarity and Unlabeled Data.
J. Mach. Learn. Res., 2022

Monocular depth estimation with spatially coherent sliced network.
Image Vis. Comput., 2022

Dual feature fusion network: A dual feature fusion network for point cloud completion.
IET Comput. Vis., 2022

Efficient 6D object pose estimation based on attentive multi-scale contextual information.
IET Comput. Vis., 2022

Strength-Adaptive Adversarial Training.
CoRR, 2022

Scene Clustering Based Pseudo-labeling Strategy for Multi-modal Aerial View Object Classification.
CoRR, 2022

Multi-model Ensemble Learning Method for Human Expression Recognition.
CoRR, 2022

Do We Need to Penalize Variance of Losses for Learning with Label Noise?
CoRR, 2022

Micro Expression Generation with Thin-plate Spline Motion Model and Face Parsing.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Facial Expression Spotting Based on Optical Flow Features.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Understanding Robust Overfitting of Adversarial Training and Beyond.
Proceedings of the International Conference on Machine Learning, 2022

DBCAN: Dual-Branch Cross-Attention Network for Scene Text Recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Sample Selection with Uncertainty of Losses for Learning with Noisy Labels.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Semi-Supervised Hyperspectral Object Detection Challenge Results - PBVS 2022.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Cross-modal Target Retrieval for Tracking by Natural Language.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Pseudo-label Generation and Various Data Augmentation for Semi-Supervised Hyperspectral Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Efficient Model Integration for Snake Classification.
Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 5th - to, 2022

Bag of Tricks and a Strong Baseline for FGVC.
Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 5th - to, 2022

2021
Learning Face Image Super-Resolution Through Facial Semantic Attribute Transformation and Self-Attentive Structure Enhancement.
IEEE Trans. Multim., 2021

A Multilayer Pyramid Network Based on Learning for Vehicle Logo Recognition.
IEEE Trans. Intell. Transp. Syst., 2021

Multimodal Inputs Driven Talking Face Generation With Spatial-Temporal Dependency.
IEEE Trans. Circuits Syst. Video Technol., 2021

Identity-Expression Dual Branch Network for Facial Expression Recognition.
IEEE Trans. Cogn. Dev. Syst., 2021

BiSTF: Bilateral-Branch Self-Training Framework for Semi-Supervised Large-scale Fine-Grained Recognition.
CoRR, 2021

A Weakly-Supervised Depth Estimation Network Using Attention Mechanism.
CoRR, 2021

Improving White-box Robustness of Pre-processing Defenses via Joint Adversarial Training.
CoRR, 2021

Instance Correction for Learning with Open-set Noisy Labels.
CoRR, 2021

Emotional Deep Learning Programming Controller for Automatic Voltage Control of Power Systems.
IEEE Access, 2021

Facial Expression Recognition With Confidence Guided Refined Horizontal Pyramid Network.
IEEE Access, 2021

Fine-Grained Language Identification in Scene Text Images.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Radar Object Detection Using Data Merging, Enhancement and Fusion.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Single-stage Face Detection under Extremely Low-light Conditions.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Boosting Fairness for Masked Face Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Removing Adversarial Noise in Class Activation Feature Space.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Deep Kinship Verification and Retrieval Based on Fusion Siamese Neural Network.
Proceedings of the 16th IEEE International Conference on Automatic Face and Gesture Recognition, 2021

2020
A Deep Learning Approach for Face Hallucination Guided by Facial Boundary Responses.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Deep Convolutional Neural Network with Optical Flow for Facial Micro-Expression Recognition.
J. Circuits Syst. Comput., 2020

Coordinated Complex-Valued Encoding Dragonfly Algorithm and Artificial Emotional Reinforcement Learning for Coordinated Secondary Voltage Control and Automatic Voltage Regulation in Multi-Generator Power Systems.
IEEE Access, 2020

Attention Based Beauty Product Retrieval Using Global and Local Descriptors.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Weakly Supervised Local-Global Relation Network for Facial Expression Recognition.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Mobile Centernet for Embedded Deep Learning Object Detection.
Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops, 2020

Deep Fusion Siamese Network for Automatic Kinship Verification.
Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition, 2020

Retrieval of Family Members Using Siamese Neural Network.
Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition, 2020

Fair Face Recognition Using Data Balancing, Enhancement and Fusion.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

2019
BLTRCNN-Based 3-D Articulatory Movement Prediction: Learning Articulatory Synchronicity From Both Text and Audio Inputs.
IEEE Trans. Multim., 2019

Real-Time Head Pose Estimation and Face Modeling From a Depth Image.
IEEE Trans. Multim., 2019

Synthesizing 3D Trump: Predicting and Visualizing the Relationship Between Text, Speech, and Articulatory Movements.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Deep Neural Network Based 3D Articulatory Movement Prediction Using Both Text and Audio Inputs.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

STDGAN: ResBlock Based Generative Adversarial Nets Using Spectral Normalization and Two Different Discriminators.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

3D Singing Head for Music VR: Learning External and Internal Articulatory Synchronicity from Lyric, Audio and Notes.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Beauty Product Retrieval Based on Regional Maximum Activation of Convolutions with Generalized Attention.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Multi-scale Densely U-Nets Refine Network for Face Alignment.
Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

Grand Challenge of 106-Point Facial Landmark Localization.
Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

Deep Learning Face Hallucination via Attributes Transfer and Enhancement.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Mining Audio, Text and Visual Information for Talking Face Generation.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Dense Semantic Matching Network for Multi-turn Conversation.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

D2PGGAN: Two Discriminators Used in Progressive Growing of GANS.
Proceedings of the IEEE International Conference on Acoustics, 2019


Towards the Gradient Vanishing, Divergence Mismatching and Mode Collapse of Generative Adversarial Nets.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018
Real-Time 3-D Facial Animation: From Appearance to Internal Articulators.
IEEE Trans. Circuits Syst. Video Technol., 2018

Probability contour guided depth map inpainting and superresolution using non-local total generalized variation.
Multim. Tools Appl., 2018

General-to-specific learning for facial attribute classification in the wild.
J. Vis. Commun. Image Represent., 2018

Synthesizing 3D Acoustic-Articulatory Mapping Trajectories: Predicting Articulatory Movements by Long-Term Recurrent Convolutional Neural Network.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

On the convergence and mode collapse of GAN.
Proceedings of the SIGGRAPH Asia 2018 Technical Briefs, Tokyo, Japan, December 04-07, 2018, 2018

A Cross-Layer Based Network for Faster Image Generation.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Simultaneous Facial Landmark and 3D Action Estimation Based on Probabilistic Random Forest.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

A Coarse-to-Fine Face Hallucination Method by Exploiting Facial Prior Knowledge.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Synthesizing Photo-Realistic 3D Talking Head: Learning Lip Synchronicity and Emotion from Audio and Video.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Deep Facial Attribute Detection in the Wild: From General to Specific.
Proceedings of the British Machine Vision Conference 2018, 2018

2017
A realistic 3D articulatory animation system for emotional visual pronunciation.
Multim. Tools Appl., 2017

A Video-Based Facial Motion Tracking and Expression Recognition System.
Multim. Tools Appl., 2017

Realistic emotion visualization by combining facial animation and hairstyle synthesis.
Multim. Tools Appl., 2017

Creating and simulating a realistic physiological tongue model for speech production.
Multim. Tools Appl., 2017

Image classification based on convolutional neural networks with cross-level strategy.
Multim. Tools Appl., 2017

Joint facial landmark detection and action estimation based on deep probabilistic random forest.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Multimodal 3D visible articulation system for syllable based Mandarin Chinese training.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Face Alignment Using Local Probabilistic Features.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

A Unified Framework for Monocular Video-Based Facial Motion Tracking and Expression Recognition.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Speech Synchronized Tongue Animation by Combining Physiology Modeling and X-ray Image Fitting.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

A Real-Time 3D Visual Singing Synthesis: From Appearance to Internal Articulators.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Facial Expression Recognition by Fusing Gabor and Local Binary Pattern Features.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Adaptively Weighted Facial Expression Recognition by Feature Fusion Under Intense Illumination Condition.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

From talking head to singing head: A significant enhancement for more natural human computer interaction.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Depth map super-resolution using non-local higher-order regularization with classified weights.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

HMM based speech-driven 3D tongue animation.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

A real-time 3D head mesh modeling and expressive articulatory animation system.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

An audio-visual 3D virtual articulation system for visual speech synthesis.
Proceedings of the IEEE International Symposium on Haptic, 2017

2016
Facial video coding/decoding at ultra-low bit-rate: a 2D/3D model-based approach.
Multim. Tools Appl., 2016

A monocular video-based facial expression recognition system by combining static and dynamic knowledge.
Proceedings of the 9th International Conference on Utility and Cloud Computing, 2016

A fast, accurate and complete 3D head mesh modeling system.
Proceedings of the 2016 IEEE International Conference on Digital Signal Processing, 2016

An Emotional Text-Driven 3D Visual Pronunciation System for Mandarin Chinese.
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016

A Color Model Based Fire Flame Detection System.
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016

A realistic and reliable 3D pronunciation visualization instruction system for computer-assisted language learning.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2016

A fast and precise speech-triggered tongue animation system by combining parameterized model and anatomical model.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2016

2015
A Video, Text, and Speech-Driven Realistic 3-D Virtual Head for Human-Machine Interface.
IEEE Trans. Cybern., 2015

Locating Facial Landmarks Using Probabilistic Random Forest.
IEEE Signal Process. Lett., 2015

Real-Time Robust Video Stabilization Based on Empirical Mode Decomposition and Multiple Evaluation Criteria.
Proceedings of the Image and Graphics - 8th International Conference, 2015

Video Based Face Tracking and Animation.
Proceedings of the Image and Graphics - 8th International Conference, 2015

A real-time 3D hair animation system for human-computer interaction.
Proceedings of the 12th International Conference on Fuzzy Systems and Knowledge Discovery, 2015

Electro-Magnetic Articulography data stabilization for speech synchronized articulatory animation.
Proceedings of the 12th International Conference on Fuzzy Systems and Knowledge Discovery, 2015

A Digital Video Stabilization System Based on Reliable SIFT Feature Matching and Adaptive Low-Pass Filtering.
Proceedings of the Computer Vision - CCF Chinese Conference, 2015

Cross-Level: A Practical Strategy for Convolutional Neural Networks Based Image Classification.
Proceedings of the Computer Vision - CCF Chinese Conference, 2015

2014
3D facial motion tracking by combining online appearance model and cylinder head model in particle filtering.
Sci. China Inf. Sci., 2014

A mass-spring tongue model with efficient collision detection and response during speech.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Realtime speech-driven facial animation using Gaussian Mixture Models.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Real-time control of 3D facial animation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Modeling a realistic 3D physiological tongue for visual speech synthesis.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Expressive facial animation from videos.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Synthesizing real-time speech-driven facial animation.
Proceedings of the IEEE International Conference on Acoustics, 2014

A Static Hand Gesture Recognition Algorithm Based on Krawtchouk Moments.
Proceedings of the Pattern Recognition - 6th Chinese Conference, 2014

2013
Data-driven 3D visual pronunciation of Chinese IPA for language learning.
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

2D/3D Model-Based Facial Video Coding/Decoding at Ultra-Low Bit-Rate.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013


  Loading...