Jingya Wang

Orcid: 0000-0003-1122-0634

Affiliations:
  • ShanghaiTech University, School of Information Science and Technology, Shanghai, China
  • Queen Mary University of London, UK (PhD)


According to our database, Jingya Wang authored at least 84 papers between 2016 and 2026.

Bibliography

2026
Conformal Reliability: A New Evaluation Metric for Conditional Generation.
CoRR, May, 2026

Sample-Efficient Diffusion-based Reinforcement Learning with Critic Guidance.
CoRR, May, 2026

DiscoForcing: A Unified Framework for Real-Time Audio-Driven Character Control with Diffusion Forcing.
CoRR, May, 2026

SCRIPT: Scalable Diffusion Policy with Multi-stage Training for Language-driven Physics-based Humanoid Control.
CoRR, May, 2026

Noise Perception Self-Supervised Learning for Unsupervised Person Re-Identification.
IEEE Trans. Emerg. Top. Comput. Intell., February, 2026

Distributional Reinforcement Learning with Diffusion Bridge Critics.
CoRR, February, 2026

2025
Learning Semantic Atomic Skills for Multi-Task Robotic Manipulation.
CoRR, December, 2025

InterAgent: Physics-based Multi-agent Command Execution via Diffusion on Interaction Graphs.
CoRR, December, 2025

Sample from What You See: Visuomotor Policy Learning via Diffusion Bridge with Observation-Embedded Stochastic Differential Equation.
CoRR, December, 2025

Commanding Humanoid by Free-form Language: A Large Language Action Model with Unified Motion Vocabulary.
CoRR, November, 2025

Diffusion Bridge or Flow Matching? A Unifying Framework and Comparative Analysis.
CoRR, September, 2025

Unlocking Personalized Knowledge in Federated Large Language Model: The Power of Mixture of Experts.
CoRR, June, 2025

UniDB++: Fast Sampling of Unified Diffusion Bridge.
CoRR, May, 2025

One Policy but Many Worlds: A Scalable Unified Policy for Versatile Humanoid Locomotion.
CoRR, May, 2025

Human-Object Interaction with Vision-Language Model Guided Relative Movement Dynamics.
CoRR, March, 2025

ARFlow: Human Action-Reaction Flow Matching with Physical Guidance.
CoRR, March, 2025

MouseGPT: A Large-scale Vision-Language Model for Mouse Behavior Analysis.
CoRR, March, 2025

OpenHOI: Open-World Hand-Object Interaction Synthesis with Multimodal Large Language Model.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

LithoSim: A Large, Holistic Lithography Simulation Benchmark for AI-Driven Semiconductor Manufacturing.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

THOR: Text to Human-Object Interaction Diffusion via Relation Intervention.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

SMGDiff: Soccer Motion Generation using Diffusion Probabilistic Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Multi-Modal Multi-Platform Person Re-Identification: Benchmark and Method.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

AffordDP: Generalizable Diffusion Policy with Transferable Affordance.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

NLPrompt: Noise-Label Prompt Learning for Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Capturing the Unseen: Vision-Free Facial Motion Capture Using Inertial Measurement Units.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
FedTP: Federated Learning by Transformer Personalization.
IEEE Trans. Neural Networks Learn. Syst., October, 2024

DCTracker: Rethinking MOT in soccer events under dual views via cascade association.
Knowl. Based Syst., 2024

HOI-M3: Capture Multiple Humans and Objects Interaction within Contextual Environment.
CoRR, 2024

Gaze-guided Hand-Object Interaction Synthesis: Benchmark and Method.
CoRR, 2024

IMUSIC: IMU-based Facial Expression Capture.
CoRR, 2024

Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Monocular Human-Object Reconstruction in the Wild.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Gait Recognition in Large-scale Free Environment via Single LiDAR.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Guidance with Spherical Gaussian Constraint for Conditional Diffusion.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Harmonizing Generalization and Personalization in Federated Prompt Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

LLM-HD: Layout Language Model for Hotspot Detection with GDS Semantic Encoding.
Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

I'M HOI: Inertia-Aware Monocular Capture of 3D Human-Object Interactions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HOI-M3: Capture Multiple Humans and Objects Interaction within Contextual Environment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

BOTH2Hands: Inferring 3D Hands from Both Text Prompts and Body Dynamics.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

A Unified Diffusion Framework for Scene-aware Human Motion Estimation from Sparse Signals.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

LiveHPS: LiDAR-Based Scene-Level Human Pose and Shape Estimation in Free Environment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Global and Local Prompts Cooperation via Optimal Transport for Federated Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

HybridGait: A Benchmark for Spatial-Temporal Cloth-Changing Gait Recognition with Hybrid Explorations.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Federated Unsupervised Cluster-Contrastive learning for person Re-identification: A coarse-to-fine approach.
Comput. Vis. Image Underst., December, 2023

Cross-domain multi-style merge for image captioning.
Comput. Vis. Image Underst., February, 2023

Fed-CO2: Cooperation of Online and Offline Models for Severe Data Heterogeneity in Federated Learning.
CoRR, 2023

HandDiffuse: Generative Controllers for Two-Hand Interactions via Diffusion Models.
CoRR, 2023

Robust Knowledge Adaptation for Federated Unsupervised Person ReID.
CoRR, 2023

Contextually Affinitive Neighborhood Refinery for Deep Clustering.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Reduced Policy Optimization for Continuous Control with Hard Constraints.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Two Sides of The Same Coin: Bridging Deep Equilibrium Models and Neural ODEs via Homotopy Continuation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

CSOT: Curriculum and Structure-Aware Optimal Transport for Learning with Noisy Labels.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Fed-CO2: Cooperation of Online and Offline Models for Severe Data Heterogeneity in Federated Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Alternating Differentiation for Optimization Layers.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Knowledge-Aware Federated Active Learning with Non-IID Data.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

NeuralDome: A Neural Modeling Pipeline on Multi-View Human-Object Interactions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

IKOL: Inverse Kinematics Optimization Layer for 3D Human Pose and Shape Estimation via Gauss-Newton Differentiation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Lifelong Person Re-identification via Knowledge Refreshing and Consolidation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

HybridCap: Inertia-Aid Monocular Capture of Challenging Human Motions.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Weakly Supervised 3D Multi-Person Pose Estimation for Large-Scale Scenes Based on Monocular Camera and Single LiDAR.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
SCULPTOR: Skeleton-Consistent Face Creation Using a Learned Parametric Generator.
ACM Trans. Graph., 2022

Adaptive Graph Convolutional Network for PolSAR Image Classification.
IEEE Trans. Geosci. Remote. Sens., 2022

Position-aware image captioning with spatial relation.
Neurocomputing, 2022

Responsible Active Learning via Human-in-the-loop Peer Study.
CoRR, 2022

LiCamGait: Gait Recognition in the Wild by Using LiDAR and Camera Multi-modal Visual Sensors.
CoRR, 2022

Unified Optimal Transport Framework for Universal Domain Adaptation.
CoRR, 2022

Unified Optimal Transport Framework for Universal Domain Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Mutual Adaptive Reasoning for Monocular 3D Multi-Person Pose Estimation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2021
Neural Free-Viewpoint Performance Rendering under Complex Human-object Interactions.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Online Multiple Object Tracking With Cross-Task Synergy.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Symbiotic Adversarial Learning for Attribute-Based Person Search.
Proceedings of the Computer Vision - ECCV 2020, 2020

Pose-Guided Visible Part Matching for Occluded Person ReID.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Deep Reinforcement Active Learning for Human-in-the-Loop Person Re-Identification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Person re-identification with activity prediction based on hierarchical spatial-temporal model.
Neurocomputing, 2018

Transferable Joint Attribute-Identity Deep Learning for Unsupervised Person Re-Identification.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Discovering visual concept structure with sparse and incomplete tags.
Artif. Intell., 2017

Attribute Recognition by Joint Recurrent Learning of Context and Correlation.
Proceedings of the IEEE International Conference on Computer Vision, 2017

2016
Video Semantic Clustering with Sparse and Incomplete Tags.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

