Fei Ma
Orcid: 0009-0002-5388-9125Affiliations:
- Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ), Shenzhen, China
- Tsinghua University, Tsinghua-Berkeley Shenzhen Institute, DSIT Research Center, Shenzhen, China
According to our database1,
Fei Ma
authored at least 49 papers
between 2018 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
Rethinking Efficient and Effective Point-Based Networks for Event Camera Classification and Regression.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2025
Invert4TVG: A Temporal Video Grounding Framework with Inversion Tasks for Enhanced Action Understanding.
CoRR, August, 2025
CoRR, August, 2025
CoRR, May, 2025
Beyond Empathy: Integrating Diagnostic and Therapeutic Reasoning with Large Language Models for Mental Health Counseling.
CoRR, May, 2025
CoRR, March, 2025
Observation-Graph Interaction and Key-Detail Guidance for Vision and Language Navigation.
CoRR, March, 2025
Inter3D: A Benchmark and Strong Baseline for Human-Interactive 3D Object Reconstruction.
CoRR, February, 2025
EmoBench-M: Benchmarking Emotional Intelligence for Multimodal Large Language Models.
CoRR, February, 2025
MLLM-TA: Leveraging Multimodal Large Language Models for Precise Temporal Video Grounding.
IEEE Signal Process. Lett., 2025
Inf. Fusion, 2025
Inf. Fusion, 2025
Exploring Embodied Multimodal Large Models: Development, datasets, and future directions.
Inf. Fusion, 2025
Proceedings of the 2025 International Conference on Multimedia Retrieval, 2025
CCIS-DIFF: A Generative Model with Stable Diffusion Prior for Controlled Colonoscopy Image Synthesis.
Proceedings of the 22nd IEEE International Symposium on Biomedical Imaging, 2025
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
Training-Free Language-Guided Video Summarization via Multi-Grained Saliency Scoring.
Proceedings of the Computational Visual Media - 13th International Conference, 2025
Proceedings of the 31st International Conference on Computational Linguistics, 2025
PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Your blush gives you away: detecting hidden mental states with remote photoplethysmography and thermal imaging.
PeerJ Comput. Sci., 2024
SegTalker: Segmentation-based Talking Face Generation with Mask-guided Local Editing.
CoRR, 2024
GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting.
CoRR, 2024
Enhancing and Accelerating Large Language Models via Instruction-Aware Contextual Compression.
CoRR, 2024
Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMamba.
CoRR, 2024
SegTalker: Segmentation-based Talking Face Generation with Mask-guided Local Editing.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
A Language-Driven Navigation Strategy Integrating Semantic Maps and Large Language Models.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024
2023
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023
2022
Poster Abstract: Representation Learning from Multimodal Sensor Data with Maximally Correlated Autoencoders.
Proceedings of the 21st ACM/IEEE International Conference on Information Processing in Sensor Networks, 2022
2021
CoRR, 2021
A Semi-supervised Learning Approach for Visual Question Answering based on Maximal Correlation.
Proceedings of the 2021 IEEE International Conference on Systems, Man, and Cybernetics, 2021
An Efficient Approach for Audio-Visual Emotion Recognition With Missing Labels And Missing Modalities.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Proceedings of the 25th International Conference on Pattern Recognition, 2020
2019
Proceedings of the 18th International Conference on Information Processing in Sensor Networks, 2019
Proceedings of the Neural Information Processing - 26th International Conference, 2019
An End-to-End Learning Approach for Multimodal Emotion Recognition: Extracting Common and Private Information.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019
2018
Multimodal Emotion Recognition by extracting common and modality-specific information.
Proceedings of the 16th ACM Conference on Embedded Networked Sensor Systems, SenSys 2018, 2018
Proceedings of the 16th ACM Conference on Embedded Networked Sensor Systems, SenSys 2018, 2018
Proceedings of the 16th ACM Conference on Embedded Networked Sensor Systems, SenSys 2018, 2018