Feng Gao

Orcid: 0009-0006-1843-3180

Affiliations:
  • Tsinghua University, Future Laboratory, Beijing, China
  • Peking University, Department of computer science, Beijing, China (PhD 2018)


According to our database1, Feng Gao authored at least 53 papers between 2013 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Space-Time Gaussian Surfels for High-Fidelity Dynamic Objects Segmentation and Representation.
IEEE Trans. Circuits Syst. Video Technol., May, 2026

RTGSR: Real-Time Game Content Super-Resolution via Compressed-Domain Coding Priors.
IEEE Trans. Circuits Syst. Video Technol., May, 2026

CamGeo: Sparse Camera-Conditioned Image-to-Video Generation with 3D Geometry Priors.
CoRR, May, 2026

Exploring Talking Head Models with Adjacent Frame Prior for Speech-Preserving Facial Expression Manipulation.
ACM Trans. Multim. Comput. Commun. Appl., April, 2026

MLICv2: Enhanced Multi-Reference Entropy Modeling for Learned Image Compression.
ACM Trans. Multim. Comput. Commun. Appl., April, 2026

Personalized Cross-Modal Emotional Correlation Learning for Speech-Preserving Facial Expression Manipulation.
CoRR, April, 2026

ClipGStream: Clip-Stream Gaussian Splatting for Any Length and Any Motion Multi-View Dynamic Scene Reconstruction.
CoRR, April, 2026

Intrinsic Geometry-Appearance Consistency Optimization for Sparse-View Gaussian Splatting.
CoRR, March, 2026

Toward Top-Down Reasoning: An Explainable Multi-Agent Approach for Visual Question Answering.
IEEE Trans. Multim., 2026

High-Fidelity and Lip-Synced Talking Face Synthesis via Landmark-Based Diffusion Model.
IEEE Trans. Image Process., 2026

2025
MoCo: Motion-Consistent Human Video Generation via Structure-Appearance Decoupling.
CoRR, August, 2025

MLIC<sup>++</sup>: Linear Complexity Multi-Reference Entropy Modeling for Learned Image Compression.
ACM Trans. Multim. Comput. Commun. Appl., May, 2025

Grounding-MD: Grounded Video-language Pre-training for Open-World Moment Detection.
CoRR, April, 2025

Adaptive Prediction Structure for Learned Video Compression.
ACM Trans. Multim. Comput. Commun. Appl., February, 2025

BoxPolypSAM: Leveraging SAM in Box-Supervised Polyp Segmentation.
Proceedings of the 22nd IEEE International Symposium on Biomedical Imaging, 2025

RiverEcho: Real-Time Interactive Digital System for Ancient Yellow River Culture.
Proceedings of the IEEE International Conference on Multimedia and Expo, ICME 2025 - Workshops, Nantes, France, June 30, 2025

Multitrack Music Generation Combining Transformer and Diffusion Model.
Proceedings of the IEEE International Conference on Multimedia and Expo, ICME 2025 - Workshops, Nantes, France, June 30, 2025

Aligning Human Motion Generation with Human Perceptions.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Structure Embedded Nucleus Classification for Histopathology Images.
IEEE Trans. Medical Imaging, September, 2024

Human Motion Generation: A Survey.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2024

Editorial for Special Issue on Artificial Intelligence for Art.
Mach. Intell. Res., February, 2024

Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art.
Mach. Intell. Res., February, 2024

LLIC: Large Receptive Field Transform Coding With Adaptive Weights for Learned Image Compression.
IEEE Trans. Multim., 2024

Surface-SOS: Self-Supervised Object Segmentation via Neural Surface Representation.
IEEE Trans. Image Process., 2024

Annotation-Efficient Polyp Segmentation via Active Learning.
CoRR, 2024

Source-Free Semi-Supervised Domain Adaptation for Tuberculosis Recognition.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

MemoMusic 4.0: Personalized Emotion Music Generation Conditioned by Valence and Arousal as Virtual Tokens.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
MLIC: Multi-Reference Entropy Model for Learned Image Compression.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

MemoMusic 3.0: Considering Context at Music Recommendation and Combining Music Theory at Music Generation.
Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2023

CL-MVSNet: Unsupervised Multi-view Stereo with Dual-level Contrastive Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Lesion-Aware Dynamic Kernel for Polyp Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Cross-Level Contrastive Learning and Consistency Constraint for Semi-Supervised Medical Image Segmentation.
Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022

Early Prediction of Blastocyst Development via Time-Lapse Video Analysis.
Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022

Memomusic Version 2.0: Extending Personalized Music Recommendation with Automatic Music Generation.
Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2022

2021
Towards Large-Scale Object Instance Search: A Multi-Block N-Ary Trie.
IEEE Trans. Circuits Syst. Video Technol., 2021

MemoMusic: A Personalized Music Recommendation Framework Based on Emotion and Memory.
Proceedings of the 4th IEEE International Conference on Multimedia Information Processing and Retrieval, 2021

Design and Development of an Intelligent Pet-Type Quadruped Robot.
Proceedings of the 4th IEEE International Conference on Multimedia Information Processing and Retrieval, 2021

Deep Transformers For Fast Small Intestine Grounding In Capsule Endoscope Video.
Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021

2019
Codebook-Free Compact Descriptor for Scalable Visual Search.
IEEE Trans. Multim., 2019

A Generative Adversarial Network for AI-Aided Chair Design.
Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

Learning to Remove Reflections for Text Images.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

2018
Data-Driven Lightweight Interest Point Selection for Large-Scale Visual Search.
IEEE Trans. Multim., 2018

Group-Sensitive Triplet Embedding for Vehicle Reidentification.
IEEE Trans. Multim., 2018

Depth Structure Preserving Scene Image Generation.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

ChipGAN: A Generative Adversarial Network for Chinese Ink Wash Painting Style Transfer.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

2017
Incorporating Intra-Class Variance to Fine-Grained Visual Recognition.
CoRR, 2017

From Part to Whole: Who is Behind the Painting?
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Improving object detection with region similarity learning.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Incorporating intra-class variance to fine-grained visual recognition.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

2016
Adaptive Weighted Matching of Deep Convolutional Features for Painting Retrieval.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

2015
A Low Complexity Interest Point Detector.
IEEE Signal Process. Lett., 2015

2013
Compact descriptors for mobile visual search and MPEG CDVS standardization.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013


  Loading...