Linjiang Huang

Orcid: 0000-0001-9701-6487

According to our database1, Linjiang Huang authored at least 42 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
FreeEdit: Mask-Free Reference-Based Image Editing With Multi-Modal Instruction.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2026

VCBench: A Streaming Counting Benchmark for Spatial-Temporal State Maintenance in Long Videos.
CoRR, March, 2026

InfoScale: Unleashing Training-free Variable-scaled Image Generation via Effective Utilization of Information.
Trans. Mach. Learn. Res., 2026

VaccineRAG: Boosting Multimodal Large Language Models' Immunity to Harmful RAG Samples.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
EditThinker: Unlocking Iterative Reasoning for Any Image Editor.
CoRR, December, 2025

Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation.
CoRR, December, 2025

FR-TTS: Test-Time Scaling for NTP-based Image Generation with Effective Filling-based Reward Signal.
CoRR, December, 2025

AnyExperts: On-Demand Expert Allocation for Multimodal Language Models with Mixture of Expert.
CoRR, November, 2025

MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning.
CoRR, October, 2025

Group Critical-token Policy Optimization for Autoregressive Image Generation.
CoRR, September, 2025

FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark.
CoRR, September, 2025

InfoScale: Unleashing Training-free Variable-scaled Image Generation via Effective Utilization of Information.
CoRR, September, 2025

SkeNa: Learning to Navigate Unseen Environments Based on Abstract Hand-Drawn Maps.
CoRR, August, 2025

FreeDNA: Endowing Domain Adaptation of Diffusion-Based Dense Prediction with Training-Free Domain Noise Alignment.
CoRR, June, 2025

GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning.
CoRR, May, 2025

GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing.
CoRR, March, 2025

FlexDrive: Toward Trajectory Flexibility in Driving Scene Reconstruction and Rendering.
CoRR, February, 2025

AeroDuo: Aerial Duo for UAV-based Vision and Language Navigation.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

'Hi AirStar, Guide Me to the Badminton Court.'.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

DSACap: Enhancing Visual-Semantic Alignment with Diffusion-based Framework for Image Captioning.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

FreeDNA: Endowing Domain Adaptation of Diffusion-Based Dense Prediction with Training-Free Domain Noise Alignment.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

PUMA: Empowering Unified MLLM with Multi-Granular Visual Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

FlexDrive: Toward Trajectory Flexibility in Driving Scene Gaussian Splatting Reconstruction and Rendering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SOLVE: Synergy of Language-Vision and End-to-End Networks for Autonomous Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Comprehensive Attribute Prediction Learning for Person Search by Language.
IEEE Trans. Image Process., 2024

FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Teach-DETR: Better Training DETR With Teachers.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Improving Inconspicuous Attributes Modeling for Person Search by Language.
IEEE Trans. Image Process., 2023

Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Actor and Action Modular Network for Text-Based Video Segmentation.
IEEE Trans. Image Process., 2022

Multi-Modality Self-Distillation for Weakly Supervised Temporal Action Localization.
IEEE Trans. Image Process., 2022

Two-Branch Relational Prototypical Network for Weakly Supervised Temporal Action Localization.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Cross-modal Co-occurrence Attributes Alignments for Person Search by Language.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Modeling Sub-Actions for Weakly Supervised Temporal Action Localization.
IEEE Trans. Image Process., 2021

Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Global Context Enhanced Multi-modal Fusion for Referring Image Segmentation.
Proceedings of the Pattern Recognition and Computer Vision, Third Chinese Conference, 2020

Relational Prototypical Network for Weakly Supervised Temporal Action Localization.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Part-Level Graph Convolutional Network for Skeleton-Based Action Recognition.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Part-aligned pose-guided recurrent network for action recognition.
Pattern Recognit., 2019

Hierarchical Graph Convolutional Network for Skeleton-Based Action Recognition.
Proceedings of the Image and Graphics - 10th International Conference, 2019


  Loading...