Feng Gao

ORCID: 0009-0006-1843-3180

Affiliations:

Tsinghua University, Future Laboratory, Beijing, China
Peking University, Department of Computer Science, Beijing, China (PhD 2018)

According to our database, Feng Gao authored at least 53 papers between 2013 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Space-Time Gaussian Surfels for High-Fidelity Dynamic Objects Segmentation and Representation.

IEEE Trans. Circuits Syst. Video Technol., May, 2026

RTGSR: Real-Time Game Content Super-Resolution via Compressed-Domain Coding Priors.

IEEE Trans. Circuits Syst. Video Technol., May, 2026

CamGeo: Sparse Camera-Conditioned Image-to-Video Generation with 3D Geometry Priors.

CoRR, May, 2026

Exploring Talking Head Models with Adjacent Frame Prior for Speech-Preserving Facial Expression Manipulation.

ACM Trans. Multim. Comput. Commun. Appl., April, 2026

MLICv2: Enhanced Multi-Reference Entropy Modeling for Learned Image Compression.

ACM Trans. Multim. Comput. Commun. Appl., April, 2026

Personalized Cross-Modal Emotional Correlation Learning for Speech-Preserving Facial Expression Manipulation.

CoRR, April, 2026

ClipGStream: Clip-Stream Gaussian Splatting for Any Length and Any Motion Multi-View Dynamic Scene Reconstruction.

CoRR, April, 2026

Intrinsic Geometry-Appearance Consistency Optimization for Sparse-View Gaussian Splatting.

CoRR, March, 2026

Toward Top-Down Reasoning: An Explainable Multi-Agent Approach for Visual Question Answering.

IEEE Trans. Multim., 2026

High-Fidelity and Lip-Synced Talking Face Synthesis via Landmark-Based Diffusion Model.

IEEE Trans. Image Process., 2026

2025

MoCo: Motion-Consistent Human Video Generation via Structure-Appearance Decoupling.

CoRR, August, 2025

MLIC++: Linear Complexity Multi-Reference Entropy Modeling for Learned Image Compression.

ACM Trans. Multim. Comput. Commun. Appl., May, 2025

Grounding-MD: Grounded Video-language Pre-training for Open-World Moment Detection.

CoRR, April, 2025

Adaptive Prediction Structure for Learned Video Compression.

ACM Trans. Multim. Comput. Commun. Appl., February, 2025

BoxPolypSAM: Leveraging SAM in Box-Supervised Polyp Segmentation.

Proceedings of the 22nd IEEE International Symposium on Biomedical Imaging, 2025

RiverEcho: Real-Time Interactive Digital System for Ancient Yellow River Culture.

Proceedings of the IEEE International Conference on Multimedia and Expo, ICME 2025 - Workshops, Nantes, France, June 30, 2025

Multitrack Music Generation Combining Transformer and Diffusion Model.

Proceedings of the IEEE International Conference on Multimedia and Expo, ICME 2025 - Workshops, Nantes, France, June 30, 2025

Aligning Human Motion Generation with Human Perceptions.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Structure Embedded Nucleus Classification for Histopathology Images.

IEEE Trans. Medical Imaging, September, 2024

Human Motion Generation: A Survey.

IEEE Trans. Pattern Anal. Mach. Intell., April, 2024

Editorial for Special Issue on Artificial Intelligence for Art.

Mach. Intell. Res., February, 2024

Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art.

Mach. Intell. Res., February, 2024

LLIC: Large Receptive Field Transform Coding With Adaptive Weights for Learned Image Compression.

IEEE Trans. Multim., 2024

Surface-SOS: Self-Supervised Object Segmentation via Neural Surface Representation.

IEEE Trans. Image Process., 2024

Annotation-Efficient Polyp Segmentation via Active Learning.

CoRR, 2024

Source-Free Semi-Supervised Domain Adaptation for Tuberculosis Recognition.

Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

MemoMusic 4.0: Personalized Emotion Music Generation Conditioned by Valence and Arousal as Virtual Tokens.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

MLIC: Multi-Reference Entropy Model for Learned Image Compression.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

MemoMusic 3.0: Considering Context at Music Recommendation and Combining Music Theory at Music Generation.

Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2023

CL-MVSNet: Unsupervised Multi-view Stereo with Dual-level Contrastive Learning.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

Lesion-Aware Dynamic Kernel for Polyp Segmentation.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Cross-Level Contrastive Learning and Consistency Constraint for Semi-Supervised Medical Image Segmentation.

Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022

Early Prediction of Blastocyst Development via Time-Lapse Video Analysis.

Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022

Memomusic Version 2.0: Extending Personalized Music Recommendation with Automatic Music Generation.

Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2022

2021

Towards Large-Scale Object Instance Search: A Multi-Block N-Ary Trie.

IEEE Trans. Circuits Syst. Video Technol., 2021

MemoMusic: A Personalized Music Recommendation Framework Based on Emotion and Memory.

Proceedings of the 4th IEEE International Conference on Multimedia Information Processing and Retrieval, 2021

Design and Development of an Intelligent Pet-Type Quadruped Robot.

Proceedings of the 4th IEEE International Conference on Multimedia Information Processing and Retrieval, 2021

Deep Transformers For Fast Small Intestine Grounding In Capsule Endoscope Video.

Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021

2019

Codebook-Free Compact Descriptor for Scalable Visual Search.

IEEE Trans. Multim., 2019

A Generative Adversarial Network for AI-Aided Chair Design.

Zhibo Liu, Feng Gao, Yizhou Wang

Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

Learning to Remove Reflections for Text Images.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

2018

Data-Driven Lightweight Interest Point Selection for Large-Scale Visual Search.

IEEE Trans. Multim., 2018

Group-Sensitive Triplet Embedding for Vehicle Reidentification.

IEEE Trans. Multim., 2018

Depth Structure Preserving Scene Image Generation.

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

ChipGAN: A Generative Adversarial Network for Chinese Ink Wash Painting Style Transfer.

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

2017

Incorporating Intra-Class Variance to Fine-Grained Visual Recognition.

CoRR, 2017

From Part to Whole: Who is Behind the Painting?

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Improving object detection with region similarity learning.

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Incorporating intra-class variance to fine-grained visual recognition.

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

2016

Adaptive Weighted Matching of Deep Convolutional Features for Painting Retrieval.

Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

2015

A Low Complexity Interest Point Detector.

IEEE Signal Process. Lett., 2015

2013

Compact descriptors for mobile visual search and MPEG CDVS standardization.

Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013
