Xinyuan Chen

This is a disambiguation page: it lists papers by multiple people with the same or a similar name.

Bibliography

2025
Vinci: A Real-time Smart Assistant Based on Egocentric Vision-language Model for Portable Devices.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., September, 2025

CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models.
CoRR, August, 2025

LIA-X: Interpretable Latent Portrait Animator.
CoRR, August, 2025

Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers.
CoRR, August, 2025

Self-Improvement for Audio Large Language Model using Unlabeled Speech.
CoRR, July, 2025

XTransfer: Cross-Modality Model Transfer for Human Sensing with Few Data at the Edge.
CoRR, June, 2025

GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects.
CoRR, June, 2025

Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs.
ACM Trans. Embed. Comput. Syst., May, 2025

LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models.
Int. J. Comput. Vis., May, 2025

Training-free Stylized Text-to-Image Generation with Fast Inference.
CoRR, May, 2025

LEO: Generative Latent Image Animator for Human Video Synthesis.
Int. J. Comput. Vis., March, 2025

AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset.
CoRR, March, 2025

GMG: A Video Prediction Method Based on Global Focus and Motion Guided.
CoRR, March, 2025

MouseGPT: A Large-scale Vision-Language Model for Mouse Behavior Analysis.
CoRR, March, 2025

TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision.
CoRR, March, 2025

An Egocentric Vision-Language Model based Portable Real-time Smart Assistant.
CoRR, March, 2025

Efficient Projection-Based Algorithms for Tip Decomposition on Dynamic Bipartite Graphs.
IEEE Trans. Knowl. Data Eng., February, 2025

Research on the working principle and stability of CLC electric springs based on the impedance analysis method.
Int. J. Circuit Theory Appl., January, 2025

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models.
CoRR, January, 2025

Latte: Latent Diffusion Transformer for Video Generation.
Trans. Mach. Learn. Res., 2025

High-precision model predictive control and experiment of an unmanned surface vehicle with Gaussian process-based error model.
J. Syst. Control. Eng., 2025

Pool-mamba: Pooling state space model for low-light image enhancement.
Neurocomputing, 2025

Effects of Auditory Anticipatory Cues and Lead Time on Visually Induced Motion Sickness.
Hum. Factors, 2025

Diff-TST: Diffusion model for one-shot text-image style transfer.
Expert Syst. Appl., 2025

Predicting post-VR game experiences with wearable physiological sensors.
Entertain. Comput., 2025

A multi-perturbation consistency framework for semi-supervised person re-identification.
Comput. Electr. Eng., 2025

MuFAl: A Universal Drug-Target Interaction Prediction Framework.
Proceedings of the 19th International Conference on Ubiquitous Information Management and Communication, 2025

Efficient Projection-Based Algorithms for Tip Decomposition on Dynamic Bipartite Graphs (Extended Abstract).
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025

Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Consistent and Controllable Image Animation with Motion Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Diff-Font: Diffusion Model for Robust One-Shot Font Generation.
Int. J. Comput. Vis., November, 2024

Research on the working mode, working state, and control strategies of generalized electric spring.
Int. J. Circuit Theory Appl., June, 2024

Experimental study on autonomous docking and hook-locking control for unmanned surface vehicle platforms.
J. Syst. Control. Eng., March, 2024

Weakly supervised scene text generation for low-resource languages.
Expert Syst. Appl., March, 2024

An Efficient Group Federated Learning Framework for Large-Scale EEG-Based Driver Drowsiness Detection.
Int. J. Neural Syst., January, 2024

Uncertainty-aware image inpainting with adaptive feedback network.
Expert Syst. Appl., January, 2024

Construction and demolition waste disposal charging scheme design.
Comput. Aided Civ. Infrastructure Eng., January, 2024

Region-Based Unsupervised Low-Light Image Enhancement in the Wild With Explicit Domain Supervision.
IEEE Trans. Instrum. Meas., 2024

Semi-supervised breast cancer pathology image segmentation based on fine-grained classification guidance.
Medical Biol. Eng. Comput., 2024

MP2PMatch: A Mask-guided Part-to-Part Matching network based on transformer for occluded person re-identification.
J. Vis. Commun. Image Represent., 2024

Improving the performance of semi-supervised person Re-identification by selecting reliable unlabeled samples.
Eng. Appl. Artif. Intell., 2024

Combinatorial progressive architecture search for crowd counting.
Displays, 2024

Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model.
CoRR, 2024

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models.
CoRR, 2024

Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models.
CoRR, 2024

Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion.
CoRR, 2024

Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

4Diffusion: Multi-view Video Diffusion Model for 4D Generation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

CTHTC: A Hybrid Architecture for Temporal Knowledge Graph Completion.
Proceedings of the 18th International Conference on Ubiquitous Information Management and Communication, 2024

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Vlogger: Make Your Dream A Vlog.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SinSR: Diffusion-Based Image Super-Resolution in a Single Step.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VBench: Comprehensive Benchmark Suite for Video Generative Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

ConditionVideo: Training-Free Condition-Guided Video Generation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Two Cascade Control Strategy of Generalized Electric Spring.
IEICE Trans. Commun., November, 2023

An alternating direction method of multipliers for solving user equilibrium problem.
Eur. J. Oper. Res., November, 2023

ADD: An automatic desensitization fisheye dataset for autonomous driving.
Eng. Appl. Artif. Intell., November, 2023

EasyGraph: A multifunctional, cross-platform, and effective library for interdisciplinary network analysis.
Patterns, October, 2023

OCR-RTPS: an OCR-based real-time positioning system for the valet parking.
Appl. Intell., July, 2023

A customized two-stage parallel computing algorithm for solving the combined modal split and traffic assignment problem.
Comput. Oper. Res., June, 2023

Real-time and accurate detection of citrus in complex scenes based on HPL-YOLOv4.
Comput. Electron. Agric., February, 2023

Multi-level feature disentanglement network for cross-dataset face forgery detection.
Image Vis. Comput., 2023

ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation.
CoRR, 2023

PPD: A New Valet Parking Pedestrian Fisheye Dataset for Autonomous Driving.
CoRR, 2023

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.
CoRR, 2023

LEO: Generative Latent Image Animator for Human Video Synthesis.
CoRR, 2023

Long-Term Rhythmic Video Soundtracker.
Proceedings of the International Conference on Machine Learning, 2023

2022
Unsupervised Image Restoration With Quality-Task-Perception Loss.
IEEE Trans. Circuits Syst. Video Technol., 2022

Forecasting network-wide multi-step metro ridership with an attention-weighted multi-view graph to sequence learning approach.
Expert Syst. Appl., 2022

Nested Named Entity Recognition Based on Dual Stream Feature Complementation.
Entropy, 2022

DGFont++: Robust Deformable Generative Networks for Unsupervised Font Generation.
CoRR, 2022

Research on the Edge Resource Allocation and Load Balancing Algorithm Based on Vehicle Trajectory.
Complex., 2022

Knowledge representation combining quaternion path integration and depth-wise atrous circular convolution.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Cross Attention Based Style Distribution for Controllable Person Image Synthesis.
Proceedings of the Computer Vision - ECCV 2022, 2022

Research on improving driver's situational awareness in automatic driving by vibration information.
Proceedings of the Tenth International Symposium of Chinese CHI, 2022

Low-cost and lightweight indoor positioning based on computer vision.
Proceedings of the APIT 2022: 4th Asia Pacific Information Technology Conference, Virtual Event, Thailand, January 14, 2022

2021
Inertial Gyro Wave Energy Conversion Nonlinear Modeling and Power-Index Predictive Control for Autonomous Ship.
Complex., 2021

SUKE: Embedding Model for Prediction in Uncertain Knowledge Graph.
IEEE Access, 2021

Scene Text Transfer for Cross-Language.
Proceedings of the Image and Graphics - 11th International Conference, 2021

Multi-Order Adversarial Representation Learning for Composed Query Image Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2021

DG-Font: Deformable Generative Networks for Unsupervised Font Generation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Swin-Spectral Transformer for Cholangiocarcinoma Hyperspectral Image Segmentation.
Proceedings of the 14th International Congress on Image and Signal Processing, 2021

Bijective Multi-mode Deraining on Single Image.
Proceedings of the 14th International Congress on Image and Signal Processing, 2021

2020
Learning for Visual Synthesis and Transformation.
PhD thesis, 2020

Long-Term Video Prediction via Criticization and Retrospection.
IEEE Trans. Image Process., 2020

3DRTE: 3D Rotation Embedding in Temporal Knowledge Graph.
IEEE Access, 2020

2019
Gated-GAN: Adversarial Gated Networks for Multi-Collection Style Transfer.
IEEE Trans. Image Process., 2019

Modeling and Experimental Study for Online Measurement of Hydraulic Cylinder Micro Leakage Based on Convolutional Neural Network.
Sensors, 2019

Adversarial Watermarking to Attack Deep Neural Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
On the Robustness and Reliability in the Pose Deformation System of Mobile Robots.
IEEE Access, 2018

Attention-GAN for Object Transfiguration in Wild Images.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
A Multiway Semi-supervised Online Sequential Extreme Learning Machine for Facial Expression Recognition with Kinect RGB-D Images.
Proceedings of the Intelligent Computing Theories and Application, 2017

S-OHEM: Stratified Online Hard Example Mining for Object Detection.
Proceedings of the Computer Vision - Second CCF Chinese Conference, 2017

2016
Detecting Anomalous Ratings Using Matrix Factorization for Recommender Systems.
Proceedings of the Web-Age Information Management - 17th International Conference, 2016

A mobile recommendation system based on logistic regression and Gradient Boosting Decision Trees.
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

2014
Experiment and hydro-mechanical coupling simulation study on the human periodontal ligament.
Comput. Methods Programs Biomed., 2014

