Yongjia Ma

Orcid: 0000-0002-0070-4134

According to our database, Yongjia Ma authored at least 20 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2026
CogPortrait: Fine-Grained Eye-Region Control in Portrait Animation via Hierarchical Agent Planning.
CoRR, May, 2026

DiTalker: A unified DiT-based framework for high-quality and style-controllable portrait animation.
Comput. Vis. Image Underst., 2026

2025
EverybodyDance: Bipartite Graph-Based Identity Correspondence for Multi-Character Animation.
CoRR, December, 2025

DiTalker: A Unified DiT-based Framework for High-Quality and Speaking Styles Controllable Portrait Animation.
CoRR, August, 2025

Adams Bashforth Moulton Solver for Inversion and Editing in Rectified Flow.
CoRR, March, 2025

Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion.
CoRR, January, 2025

TrAME: Trajectory-Anchored Multi-View Editing for Text-Guided 3D Gaussian Manipulation.
IEEE Trans. Multim., 2025

MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention.
Proceedings of the 7th ACM International Conference on Multimedia in Asia, 2025

UniCP: A Unified Caching and Pruning Framework for Efficient Video Generation.
Proceedings of the 7th ACM International Conference on Multimedia in Asia, 2025

Multi-scale Feature Field with Anti-brightness-sensitivity Postprocessing for Few-shot Neural Panoptic Segmentation.
Proceedings of the 2025 International Conference on Multimedia Retrieval, 2025

QR-LoRA: Efficient and Disentangled Fine-Tuning via QR Decomposition for Customized Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

DH-FaceVid-1K: A Large-Scale High-Quality Dataset for Face Video Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
TV-3DG: Mastering Text-to-3D Customized Generation with Visual Prompt.
CoRR, 2024

FaceVid-1K: A Large-Scale High-Quality Multiracial Human Face Video Dataset.
CoRR, 2024

Real Face Video Animation Platform.
CoRR, 2024

One-Shot Pose-Driving Face Animation Platform.
CoRR, 2024

TrAME: Trajectory-Anchored Multi-View Editing for Text-Guided 3D Gaussian Splatting Manipulation.
CoRR, 2024

CoSSegGaussians: Compact and Swift Scene Segmenting 3D Gaussians with Dual Feature Fusion.
CoRR, 2024

Learning Segmented 3D Gaussians via Efficient Feature Unprojection for Zero-Shot Neural Scene Segmentation.
Proceedings of the Neural Information Processing - 31st International Conference, 2024

RD-NERF: Neural Robust Distilled Feature Fields for Sparse-View Scene Segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2024

