Jifei Song

Orcid: 0000-0002-3381-6685

According to our database1, Jifei Song authored at least 47 papers between 2013 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
FreeScale: Scaling 3D Scenes via Certainty-Aware Free-View Generation.
CoRR, April, 2026

FG-Portrait: 3D Flow Guided Editable Portrait Animation.
CoRR, March, 2026

Color When It Counts: Grayscale-Guided Online Triggering for Always-On Streaming Video Sensing.
CoRR, March, 2026

Relax Forcing: Relaxed KV-Memory for Consistent Long Video Generation.
CoRR, March, 2026

Diffusion-Based Makeup Transfer with Facial Region-Aware Makeup Features.
CoRR, March, 2026

LatSearch: Latent Reward-Guided Search for Faster Inference-Time Scaling in Video Diffusion.
CoRR, March, 2026

EgoGraph: Temporal Knowledge Graph for Egocentric Video Understanding.
CoRR, February, 2026

Optimizing Multimodal LLMs for Egocentric Video Understanding: A Solution for the HD-EPIC VQA Challenge.
CoRR, January, 2026

Egocentric Co-Pilot: Web-Native Smart-Glasses Agents for Assistive Egocentric AI.
Proceedings of the ACM Web Conference 2026, 2026

Plug-and-Play Clarifier: A Zero-Shot Multimodal Framework for Egocentric Intent Disambiguation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
ViMo: A Generative Visual GUI World Model for App Agent.
CoRR, April, 2025

UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Unlocking the Potential of Diffusion Priors in Blind Face Restoration.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Learning Precise Affordances From Egocentric Videos for Robotic Manipulation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Frequency-Guided Diffusion for Training-Free Text-Driven Image Translation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

CaricatureBooth: Data-Free Interactive Caricature Generation in a Photo Booth.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Single-view Image to Novel-view Generation for Hand-Object Interactions.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
ZeroGS: Training 3D Gaussian Splatting from Unposed Images.
CoRR, 2024

SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark.
CoRR, 2024

SCRREAM : SCan, Register, REnder And Map: A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

SAGS: Structure-Aware 3D Gaussian Splatting.
Proceedings of the Computer Vision - ECCV 2024, 2024

SWinGS: Sliding Windows for Dynamic 3D Gaussian Splatting.
Proceedings of the Computer Vision - ECCV 2024, 2024

HeadGaS: Real-Time Animatable Head Avatars via 3D Gaussian Splatting.
Proceedings of the Computer Vision - ECCV 2024, 2024

Human Gaussian Splatting: Real-Time Rendering of Animatable Avatars.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

NCRF: Neural Contact Radiance Fields for Free-Viewpoint Rendering of Hand-Object Interaction.
Proceedings of the International Conference on 3D Vision, 2024

2023
Deformable 3D Gaussian Splatting for Animatable Human Avatars.
CoRR, 2023

SWAGS: Sampling Windows Adaptively for Dynamic 3D Gaussian Splatting.
CoRR, 2023

Reality's Canvas, Language's Brush: Crafting 3D Avatars from Monocular Video.
CoRR, 2023


On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Is my Depth Ground-Truth Good Enough? HAMMER - Highly Accurate Multi-Modal Dataset for DEnse 3D Scene Regression.
CoRR, 2022

CroMo: Cross-Modal Learning for Monocular Depth Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Cross-modal learning for sketch visual understanding.
PhD thesis, 2021

Toward Fine-Grained Sketch-Based 3D Shape Retrieval.
IEEE Trans. Image Process., 2021

Fine-Grained Instance-Level Sketch-Based Image Retrieval.
Int. J. Comput. Vis., 2021

2019
Generalizable Person Re-Identification by Domain-Invariant Mapping Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Universal Perceptual Grouping.
CoRR, 2018

Deep Factorised Inverse-Sketching.
Proceedings of the Computer Vision - ECCV 2018, 2018

Universal Sketch Perceptual Grouping.
Proceedings of the Computer Vision - ECCV 2018, 2018

Learning to Sketch With Shortcut Cycle Consistency.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

A Transient-Enhanced Digital Low-Dropout Regulator with Bisection Method Tuning.
Proceedings of the 2018 IEEE Asia Pacific Conference on Circuits and Systems, 2018

2017
Deep Spatial-Semantic Attention for Fine-Grained Sketch-Based Image Retrieval.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Fine-Grained Image Retrieval: the Text/Sketch Input Dilemma.
Proceedings of the British Machine Vision Conference 2017, 2017

2016
Deep Multi-task Attribute-driven Ranking for Fine-grained Sketch-based Image Retrieval.
Proceedings of the British Machine Vision Conference 2016, 2016

2015
A novel real-time digital video stabilization algorithm based on the improved diamond search and modified Kalman filter.
Proceedings of the IEEE 7th International Conference on Awareness Science and Technology, 2015

Underdetermined blind separation of weak sparse sources via matrix transform layer by layer in the Time-Frequency domain.
Proceedings of the IEEE 7th International Conference on Awareness Science and Technology, 2015

2013
One source signal extraction based on metrics transform.
Proceedings of the International Joint Conference on Awareness Science and Technology & Ubi-Media Computing, 2013


  Loading...