Linjiang Huang

Orcid: 0000-0001-9701-6487

According to our database¹, Linjiang Huang authored at least 42 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

FreeEdit: Mask-Free Reference-Based Image Editing With Multi-Modal Instruction.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., March, 2026

VCBench: A Streaming Counting Benchmark for Spatial-Temporal State Maintenance in Long Videos.

[BibT_eX]

[DOI]

CoRR, March, 2026

InfoScale: Unleashing Training-free Variable-scaled Image Generation via Effective Utilization of Information.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2026

VaccineRAG: Boosting Multimodal Large Language Models' Immunity to Harmful RAG Samples.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

EditThinker: Unlocking Iterative Reasoning for Any Image Editor.

[BibT_eX]

[DOI]

CoRR, December, 2025

Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation.

[BibT_eX]

[DOI]

Hang Xu

Linjiang Huang

Feng Zhao

CoRR, December, 2025

FR-TTS: Test-Time Scaling for NTP-based Image Generation with Effective Filling-based Reward Signal.

[BibT_eX]

[DOI]

Hang Xu

Linjiang Huang

Feng Zhao

CoRR, December, 2025

AnyExperts: On-Demand Expert Allocation for Multimodal Language Models with Mixture of Expert.

[BibT_eX]

[DOI]

CoRR, November, 2025

MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning.

[BibT_eX]

[DOI]

CoRR, October, 2025

Group Critical-token Policy Optimization for Autoregressive Image Generation.

[BibT_eX]

[DOI]

CoRR, September, 2025

FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark.

[BibT_eX]

[DOI]

CoRR, September, 2025

InfoScale: Unleashing Training-free Variable-scaled Image Generation via Effective Utilization of Information.

[BibT_eX]

[DOI]

CoRR, September, 2025

SkeNa: Learning to Navigate Unseen Environments Based on Abstract Hand-Drawn Maps.

[BibT_eX]

[DOI]

CoRR, August, 2025

FreeDNA: Endowing Domain Adaptation of Diffusion-Based Dense Prediction with Training-Free Domain Noise Alignment.

[BibT_eX]

[DOI]

CoRR, June, 2025

GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, May, 2025

GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing.

[BibT_eX]

[DOI]

CoRR, March, 2025

FlexDrive: Toward Trajectory Flexibility in Driving Scene Reconstruction and Rendering.

[BibT_eX]

[DOI]

CoRR, February, 2025

AeroDuo: Aerial Duo for UAV-based Vision and Language Navigation.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

'Hi AirStar, Guide Me to the Badminton Court.'.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

DSACap: Enhancing Visual-Semantic Alignment with Diffusion-based Framework for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

FreeDNA: Endowing Domain Adaptation of Diffusion-Based Dense Prediction with Training-Free Domain Noise Alignment.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

PUMA: Empowering Unified MLLM with Multi-Granular Visual Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

FlexDrive: Toward Trajectory Flexibility in Driving Scene Gaussian Splatting Reconstruction and Rendering.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SOLVE: Synergy of Language-Vision and End-to-End Networks for Autonomous Driving.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Comprehensive Attribute Prediction Learning for Person Search by Language.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2024

FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

Teach-DETR: Better Training DETR With Teachers.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Improving Inconspicuous Attributes Modeling for Person Search by Language.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2023

Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Actor and Action Modular Network for Text-Based Video Segmentation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Multi-Modality Self-Distillation for Weakly Supervised Temporal Action Localization.

[BibT_eX]

[DOI]

Linjiang Huang

Liang Wang

Hongsheng Li

IEEE Trans. Image Process., 2022

Two-Branch Relational Prototypical Network for Weakly Supervised Temporal Action Localization.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Cross-modal Co-occurrence Attributes Alignments for Person Search by Language.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation.

[BibT_eX]

[DOI]

Linjiang Huang

Liang Wang

Hongsheng Li

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Modeling Sub-Actions for Weakly Supervised Temporal Action Localization.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization.

[BibT_eX]

[DOI]

Linjiang Huang

Liang Wang

Hongsheng Li

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

Global Context Enhanced Multi-modal Fusion for Referring Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision, Third Chinese Conference, 2020

Relational Prototypical Network for Weakly Supervised Temporal Action Localization.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Part-Level Graph Convolutional Network for Skeleton-Based Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Part-aligned pose-guided recurrent network for action recognition.

[BibT_eX]

[DOI]

Pattern Recognit., 2019

Hierarchical Graph Convolutional Network for Skeleton-Based Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the Image and Graphics - 10th International Conference, 2019

Linjiang Huang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...