Qifeng Chen

This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.

Known people with the same name:

Bibliography

2025
CML-Bench: A Framework for Evaluating and Enhancing LLM-Powered Movie Scripts Generation.
CoRR, October, 2025

Orchestrate, Generate, Reflect: A VLM-Based Multi-Agent Collaboration Framework for Automated Driving Policy Learning.
CoRR, September, 2025

VAInpaint: Zero-Shot Video-Audio inpainting framework with LLMs-driven Module.
CoRR, September, 2025

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data.
CoRR, September, 2025

Hierarchical Fine-grained Preference Optimization for Physically Plausible Video Generation.
CoRR, August, 2025

Follow-Your-Shape: Shape-Aware Image Editing via Trajectory-Guided Region Control.
CoRR, August, 2025

Follow-Your-Instruction: A Comprehensive MLLM Agent for World Data Synthesis.
CoRR, August, 2025

Rethinking Layered Graphic Design Generation with a Top-Down Approach.
CoRR, July, 2025

Calligrapher: Freestyle Text Image Customization.
CoRR, June, 2025

RGE-GS: Reward-Guided Expansive Driving Scene Reconstruction via Diffusion Priors.
CoRR, June, 2025

Fake it till You Make it: Reward Modeling as Discriminative Prediction.
CoRR, June, 2025

LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization.
CoRR, June, 2025

Follow-Your-Motion: Video Motion Transfer via Efficient Spatial-Temporal Decoupled Finetuning.
CoRR, June, 2025

Follow-Your-Creation: Empowering 4D Creation through Video Inpainting.
CoRR, June, 2025

Model as a Game: On Numerical and Spatial Consistency for Generative Games.
CoRR, March, 2025

MagicColor: Multi-Instance Sketch Colorization.
CoRR, March, 2025

Industrial-Grade Sensor Simulation via Gaussian Splatting: A Modular Framework for Scalable Editing and Full-Stack Validation.
CoRR, March, 2025

EEdit: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing.
CoRR, March, 2025

MagicStick: Controllable Video Editing via Control Handle Transformations.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

DocAI-TL: Structured Document Tampering Localization with DocAI Model.
Proceedings of the Document Analysis and Recognition - ICDAR 2025, 2025

LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MagicQuill: An Intelligent Interactive Image Editing System.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

VideoDPO: Omni-Preference Alignment for Video Diffusion Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MangaNinja: Line Art Colorization with Precise Reference Following.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

HR Human: Modeling Human Avatars with Triangular Mesh and High-Resolution Textures from Videos.
Proceedings of the Computational Visual Media - 13th International Conference, 2025

2024
DepthLab: From Partial to Complete.
CoRR, 2024

Large Motion Video Autoencoding with Cross-modal Video VAE.
CoRR, 2024

SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation.
CoRR, 2024

VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation.
CoRR, 2024

Hints of Prompt: Enhancing Visual Representation for Multimodal LLMs in Autonomous Driving.
CoRR, 2024

Towards Degradation-Robust Reconstruction in Generalizable NeRF.
CoRR, 2024

LiDAR-GS:Real-time LiDAR Re-Simulation using Gaussian Splatting.
CoRR, 2024

PLUTO: Pushing the Limit of Imitation Learning-based Planning for Autonomous Driving.
CoRR, 2024

HAWK: Learning to Understand Open-World Video Anomalies.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Using Left and Right Brains Together: Towards Vision and Language Planning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Latent Guard: A Safety Framework for Text-to-Image Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering.
Proceedings of the Computer Vision - ECCV 2024, 2024

Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Online Overexposed Pixels Hallucination in Videos with Adaptive Reference Frame Selection.
CoRR, 2023

SAD: Segment Any RGBD.
CoRR, 2023

HyperThumbnail: Real-time 6K Image Rescaling with Rate-distortion Optimization.
CoRR, 2023

Human MotionFormer: Transferring Human Motions with Vision Transformers.
CoRR, 2023

Federated Domain Generalization for Image Recognition via Cross-Client Style Transfer.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Rotating without Seeing: Towards In-hand Dexterity through Touch.
Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

4D Panoptic Scene Graph Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

TextDiffuser: Diffusion Models as Text Painters.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Human MotionFormer: Transferring Human Motions with Vision Transformers.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Real-time 6K Image Rescaling with Rate-distortion Optimization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Point Cloud Compression with Sibling Context and Surface Priors.
CoRR, 2022

DCMS: Motion Forecasting with Dual Consistency and Multi-Pseudo-Target Supervision.
CoRR, 2022

Towards Self-Supervised Category-Level Object Pose and Size Estimation.
CoRR, 2022

Non-Cooperative Game and Cooperative Operation of Multi-Level Supply Chain Under Background of Carbon Emission Reduction.
IEEE Access, 2022

Two-Layer Bandit Optimization for Recommendations.
Proceedings of the RecSys '22: Sixteenth ACM Conference on Recommender Systems, Seattle, WA, USA, September 18, 2022

One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Planning for Sample Efficient Imitation Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Real-time Streaming Video Denoising with Bidirectional Buffers.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

A Portable Multiscopic Camera for Novel View and Time Synthesis in Dynamic Scenes.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Harvesting Partially-Disjoint Time-Frequency Information for Improving Degenerate Unmixing Estimation Technique.
Proceedings of the IEEE International Conference on Acoustics, 2022

FS6D: Few-Shot 6D Pose Estimation of Novel Objects.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Robust Federated Learning with Attack-Adaptive Aggregation.
CoRR, 2021

Unsupervised Portrait Shadow Removal via Generative Priors.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

SinIR: Efficient General Image Manipulation with Single Image Reconstruction.
Proceedings of the 38th International Conference on Machine Learning, 2021

Normalized Human Pose Features for Human Action Video Alignment.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

TPCN: Temporal Point Cloud Networks for Motion Forecasting.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Invertible Image Signal Processing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Involution: Inverting the Inherence of Convolution for Visual Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

FFB6D: A Full Flow Bidirectional Fusion Network for 6D Pose Estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Video Deblurring by Fitting to Test Data.
CoRR, 2020

Evaluating Adversarial Robustness in Simulated Cerebellum.
Proceedings of the NeurIPS 2020 Workshop on Pre-registration in Machine Learning, 2020

Self-supervised Dance Video Synthesis Conditioned on Music.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

PSConv: Squeezing Feature Pyramid into One Compact Poly-Scale Convolutional Layer.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning to Learn Parameterized Classification Networks for Scalable Input Images.
Proceedings of the Computer Vision - ECCV 2020, 2020

Deep Reinforced Attention Learning for Quality-Aware Visual Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020

Fully Convolutional Networks for Continuous Sign Language Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020

Depth Sensing Beyond LiDAR Range.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Dynamic Hierarchical Mimicking Towards Consistent Optimization Objectives.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Attack-Resistant Federated Learning with Residual-based Reweighting.
CoRR, 2019

Music-oriented Dance Video Synthesis with Pose Perceptual Loss.
CoRR, 2019

LeapDetect: An Agile Platform for Inspecting Power Transmission Lines from Drones.
Proceedings of the 2019 International Conference on Data Mining Workshops, 2019

2017
Efficient optimization via approximation.
PhD thesis, 2017

2010
Clustering algorithms for area geographical entities in spatial data mining.
Proceedings of the Seventh International Conference on Fuzzy Systems and Knowledge Discovery, 2010


  Loading...