Yangfu Li

Orcid: 0000-0002-4087-2060

According to our database, Yangfu Li authored at least 15 papers between 2022 and 2026.

Bibliography

2026
Perceptual Flow Network for Visually Grounded Reasoning.
CoRR, May, 2026

DO-Bench: An Attributable Benchmark for Diagnosing Object Hallucination in Vision-Language Models.
CoRR, April, 2026

TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders.
CoRR, April, 2026

StruVis: Enhancing Reasoning-based Text-to-Image Generation via Thinking with Structured Vision.
CoRR, March, 2026

DeepScan: A Training-Free Framework for Visually Grounded Reasoning in Large Vision-Language Models.
CoRR, March, 2026

2025
Why 1 + 1 < 1 in Visual Token Pruning: Beyond Naive Integration via Multi-Objective Balanced Covering.
CoRR, May, 2025

MSA²: Multi-Task Framework With Structure-Aware and Style-Adaptive Character Representation for Open-Set Chinese Text Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
DS-TDNN: Dual-Stream Time-Delay Neural Network With Global-Aware Filter for Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Free Lunch: Frame-level Contrastive Learning with Text Perceiver for Robust Scene Text Recognition in Lightweight Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

LK-Net: Efficient Large Kernel ConvNet for Document Enhancement.
Proceedings of the Pattern Recognition - 27th International Conference, 2024

FaRE: A Feature-Aware Radical Encoding Strategy for Zero-Shot Chinese Character Recognition.
Proceedings of the Computer Vision - ACCV 2024, 2024

2023
DeflickerCycleGAN: Learning to Detect and Remove Flickers in a Single Image.
IEEE Trans. Image Process., 2023

Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification.
CoRR, 2023

UniVR: A Unified Framework for Pitch-Shifted Voice Restoration in Speaker Identification.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
A Conv-Attention Network for Detecting the Presence of ENF Signal in Short-Duration Audio.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

