Weilin Huang

This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.

Bibliography

2026
Leveraging Verifier-Based Reinforcement Learning in Image Editing.
CoRR, April, 2026

Context Unrolling in Omni Models.
CoRR, April, 2026

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation.
CoRR, March, 2026

NeuroLoRA: Context-Aware Neuromodulation for Parameter-Efficient Multi-Task Adaptation.
CoRR, March, 2026

Align Video Diffusion Model with Online Video-Centric Preference Optimization.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

2025
End-to-End Training for Autoregressive Video Diffusion via Self-Resampling.
CoRR, December, 2025

Seedream 4.0: Toward Next-generation Multimodal Image Generation.
CoRR, September, 2025

MEF: A Systematic Evaluation Framework for Text-to-Image Models.
CoRR, September, 2025

RewardDance: Reward Scaling in Visual Generation.
CoRR, September, 2025

PixNerd: Pixel Neural Field Diffusion.
CoRR, July, 2025

Universal Video Temporal Grounding with Generative Multi-modal Large Language Models.
CoRR, June, 2025

Seedance 1.0: Exploring the Boundaries of Video Generation Models.
CoRR, June, 2025

SeedEdit 3.0: Fast and High-Quality Generative Image Editing.
CoRR, June, 2025

ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions.
CoRR, June, 2025

Scaling Diffusion Transformers Efficiently via μP.
CoRR, May, 2025

DanceGRPO: Unleashing GRPO on Visual Generation.
CoRR, May, 2025

Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation.
CoRR, May, 2025

SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL.
CoRR, April, 2025

Seedream 3.0 Technical Report.
CoRR, April, 2025

DDT: Decoupled Diffusion Transformer.
CoRR, April, 2025

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model.
CoRR, March, 2025

Robust Low-Rank Reconstruction of Seismic Data.
IEEE Trans. Geosci. Remote. Sens., 2025

Does Teacher Enthusiasm Facilitate Students' Chemistry Learning in Video Lectures Regardless of Students' Prior Chemistry Knowledge Levels?
J. Comput. Assist. Learn., 2025

An Enhanced EM Framework for Scanning Position Refinement in Ptychography via MCMC Sampling and Posterior Approximation.
Proceedings of the 22nd IEEE International Symposium on Biomedical Imaging, 2025

Equivariance-Based Theoretical Analysis and Self-Supervised Learning Framework for Missing Wedge Problem in Cryo-Electron Tomography.
Proceedings of the 22nd IEEE International Symposium on Biomedical Imaging, 2025

A Bayesian Method for Tracing Filamentous Structures in Cryo-Electron Microscopy Images.
Proceedings of the 22nd IEEE International Symposium on Biomedical Imaging, 2025

Prompt-A-Video: Prompt your Video Diffusion Model via Preference-Aligned LLM.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
Joint Sparse Locality Preserving Regression for Discriminative Learning.
IEEE Trans. Emerg. Top. Comput. Intell., February, 2024

Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2024

CCID-CAN: Cross-Chain Intrusion Detection on CAN Bus for Autonomous Vehicles.
IEEE Internet Things J., 2024

End-to-end dense video grounding via parallel regression.
Comput. Vis. Image Underst., 2024

OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization.
CoRR, 2024

Fast Prompt Alignment for Text-to-Image Generation.
CoRR, 2024

SeedEdit: Align Image Re-Generation to Image Editing.
CoRR, 2024

UniFL: Improve Stable Diffusion via Unified Feedback Learning.
CoRR, 2024

Enhancing Cross-Domain Click-Through Rate Prediction via Explicit Feature Augmentation.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

UniFL: Improve Latent Diffusion Model via Unified Feedback Learning.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
DRR: An open-source multi-platform package for the damped rank-reduction method and its applications in seismology.
Comput. Geosci., November, 2023

A Genetic Algorithm Optimized Undersampling Method for Seismic Sparse Acquisition and Reconstruction.
IEEE Trans. Geosci. Remote. Sens., 2023

Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Models.
CoRR, 2023

Forgedit: Text Guided Image Editing via Learning and Forgetting.
CoRR, 2023

Mixer: Image to Multi-Modal Retrieval Learning for Industrial Application.
CoRR, 2023

Cross-domain Augmentation Networks for Click-Through Rate Prediction.
CoRR, 2023

2022
Completeness and Coherence Learning for Fast Arbitrary Style Transfer.
Trans. Mach. Learn. Res., 2022

Automatic Extraction of Seismic Data Horizon Across Faults.
IEEE Trans. Geosci. Remote. Sens., 2022

Seismic Data Interpolation by Shannon Entropy-Based Shaping.
IEEE Trans. Geosci. Remote. Sens., 2022

Dual-stream pyramid registration network.
Medical Image Anal., 2022

Shannon Entropy-Based Seismic Local Correlation Measure and Enhancement.
IEEE Geosci. Remote. Sens. Lett., 2022

Cross-Architecture Self-supervised Video Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

InsCLR: Improving Instance Retrieval with Self-Supervision.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Surface Diffraction Noise Attenuation for Marine Seismic Data Processing With Mathematical Morphological Filtering.
IEEE Trans. Geosci. Remote. Sens., 2021

Mutually-aware Sub-Graphs Differentiable Architecture Search.
CoRR, 2021

Rethinking Deep Contrastive Learning with Embedding Memory.
CoRR, 2021

Brain Image Synthesis with Unsupervised Multivariate Canonical CSC𝓁<sub>4</sub>Net.
CoRR, 2021

Exploring Classification Equilibrium in Long-Tailed Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

TOOD: Task-aligned One-stage Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Brain Image Synthesis With Unsupervised Multivariate Canonical CSCl4Net.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Unchain the Search Space with Hierarchical Differentiable Architecture Search.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
MCMT-GAN: Multi-Task Coherent Modality Transferable GAN for 3D Brain Image Synthesis.
IEEE Trans. Image Process., 2020

Robust Seismic Image Interpolation With Mathematical Morphological Constraint.
IEEE Trans. Image Process., 2020

Brain SegNet: 3D local refinement network for brain lesion segmentation.
BMC Medical Imaging, 2020

V4D: 4D Convolutional Neural Networks for Video-level Representation Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Representation Sharing for Fast Object Detector Search and Beyond.
Proceedings of the Computer Vision - ECCV 2020, 2020

Deformable Siamese Attention Networks for Visual Object Tracking.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Cross-Batch Memory for Embedding Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

iFAN: Image-Instance Full Alignment Networks for Adaptive Object Detection.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Knowledge Integration Networks for Action Recognition.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Channel Interaction Networks for Fine-Grained Image Categorization.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Pedestrian detection with unsupervised multispectral feature learning using deep neural networks.
Inf. Fusion, 2019

Label-PEnet: Sequential Label Propagation and Enhancement Networks forWeakly Supervised Instance Segmentation.
CoRR, 2019

Compatible and Diverse Fashion Image Inpainting.
CoRR, 2019

Dual-Stream Pyramid Registration Network.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

The iMaterialist Fashion Attribute Dataset.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Convolutional Character Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

FiNet: Compatible and Diverse Fashion Image Inpainting.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

ClothFlow: A Flow-Based Model for Clothed Person Generation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Label-PEnet: Sequential Label Propagation and Enhancement Networks for Weakly Supervised Instance Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Decoupling Category-wise Independence and Relevance with Self-attention for Multi-label Image Classification.
Proceedings of the IEEE International Conference on Acoustics, 2019

Multi-Similarity Loss With General Pair Weighting for Deep Metric Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Damped Dreamlet Representation for Exploration Seismic Data Interpolation and Denoising.
IEEE Trans. Geosci. Remote. Sens., 2018

Iterative Deblending of Simultaneous-Source Seismic Data With Structuring Median Constraint.
IEEE Geosci. Remote. Sens. Lett., 2018

CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images.
Proceedings of the Computer Vision - ECCV 2018, 2018

Deep Metric Learning with Hierarchical Triplet Loss.
Proceedings of the Computer Vision - ECCV 2018, 2018

An End-to-End TextSpotter With Explicit Alignment and Attention.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Knowledge Guided Disambiguation for Large-Scale Scene Classification With Multi-Resolution CNNs.
IEEE Trans. Image Process., 2017

Locally Supervised Deep Hybrid Model for Scene Recognition.
IEEE Trans. Image Process., 2017

Heterogeneous Face Recognition: A Common Encoding Feature Discriminant Approach.
IEEE Trans. Image Process., 2017

Double Least-Squares Projections Method for Signal Estimation.
IEEE Trans. Geosci. Remote. Sens., 2017

Empirical Low-Rank Approximation for Seismic Noise Attenuation.
IEEE Trans. Geosci. Remote. Sens., 2017

Robust face recognition with structural binary gradient patterns.
Pattern Recognit., 2017

Simultaneous Coherent and Random Noise Attenuation by Morphological Filtering With Dual-Directional Structuring Element.
IEEE Geosci. Remote. Sens. Lett., 2017

Improving scale invariant feature transform with local color contrastive descriptor for image classification.
J. Electronic Imaging, 2017

Learning multiple local binary descriptors for image matching.
Neurocomputing, 2017

Learning Spatio-Temporal Aggregation for Fetal Heart Analysis in Ultrasound Video.
Proceedings of the Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support, 2017

Temporal HeartNet: Towards Human-Level Automatic Analysis of Fetal Cardiac Screening Video.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2017, 2017

Single Shot Text Detector with Regional Attention.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Orientation-Aware Text Proposals Network for Scene Text Detection.
Proceedings of the Biometric Recognition - 12th Chinese Conference, 2017

2016
Text-Attentional Convolutional Neural Network for Scene Text Detection.
IEEE Trans. Image Process., 2016

An open-source Matlab code package for improved rank-reduction 3D seismic data denoising and reconstruction.
Comput. Geosci., 2016

Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network.
CoRR, 2016

Locally-Supervised Deep Hybrid Model for Scene Recognition.
CoRR, 2016

Detecting Text in Natural Image with Connectionist Text Proposal Network.
Proceedings of the Computer Vision - ECCV 2016, 2016

Reading Scene Text in Deep Convolutional Sequences.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Local Multi-Grouped Binary Descriptor With Ring-Based Pooling Configuration and Optimization.
IEEE Trans. Image Process., 2015

Places205-VGGNet Models for Scene Recognition.
CoRR, 2015

Text-Attentional Convolutional Neural Networks for Scene Text Detection.
CoRR, 2015

Local Color Contrastive Descriptor for Image Classification.
CoRR, 2015

2014
Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees.
Proceedings of the Computer Vision - ECCV 2014, 2014

2013
Robust facial representation for recognition.
PhD thesis, 2013

Text Localization in Natural Images Using Stroke Feature Transform and Text Covariance Descriptors.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2010
Adaptive nonlinear manifolds and their applications to pattern recognition.
Inf. Sci., 2010

A dissimilarity kernel with local features for robust facial recognition.
Proceedings of the International Conference on Image Processing, 2010

2009
ViSOM for Dimensionality Reduction in Face Recognition.
Proceedings of the Advances in Self-Organizing Maps, 7th International Workshop, 2009

Nonlinear Dimensionality Reduction for Face Recognition.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2009

Linear and nonlinear dimensionality reduction for face recognition.
Proceedings of the International Conference on Image Processing, 2009


  Loading...