Zequn Jie

Orcid: 0000-0002-3038-5891

According to our database¹, Zequn Jie authored at least 75 papers between 2014 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance.

[BibT_eX]

[DOI]

CoRR, 2024

Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Cross-Regularization.

[BibT_eX]

[DOI]

CoRR, 2024

OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion.

[BibT_eX]

[DOI]

CoRR, 2024

MindBench: A Comprehensive Benchmark for Mind Map Structure Recognition and Analysis.

[BibT_eX]

[DOI]

CoRR, 2024

Fewer Tokens and Fewer Videos: Extending Video Understanding Abilities in Large Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Matten: Video Generation with Mamba-Attention.

[BibT_eX]

[DOI]

CoRR, 2024

Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models.

[BibT_eX]

[DOI]

CoRR, 2024

InstaGen: Enhancing Object Detection by Training on Synthetic Dataset.

[BibT_eX]

[DOI]

CoRR, 2024

LLaVA-MoLE: Sparse Mixture of LoRA Experts for Mitigating Data Conflicts in Instruction Finetuning MLLMs.

[BibT_eX]

[DOI]

Shaoxiang Chen

Zequn Jie

Lin Ma

CoRR, 2024

Instance-Aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Weakly supervised semantic segmentation via self-supervised destruction learning.

[BibT_eX]

[DOI]

Neurocomputing, December, 2023

Weakly Supervised Semantic Segmentation Via Progressive Patch Learning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Weakly-Supervised 3D Visual Grounding based on Visual Linguistic Alignment.

[BibT_eX]

[DOI]

CoRR, 2023

UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning.

[BibT_eX]

[DOI]

CoRR, 2023

FastPillars: A Deployment-friendly Pillar-based 3D Detector.

[BibT_eX]

[DOI]

CoRR, 2023

RecFormer: Recurrent Multi-modal Transformer with History-Aware Contrastive Learning for Visual Dialog.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Suspected Objects Matter: Rethinking Model's Prediction for One-stage Visual Grounding.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Multi View Action Recognition for Distracted Driver Behavior Localization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

AeDet: Azimuth-Invariant Multi-View 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Curriculum Multi-Negative Augmentation for Debiased Video Grounding.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Multiple Object Tracking Challenge Technical Report for Team MT_IoT.

[BibT_eX]

[DOI]

CoRR, 2022

MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2022

MT-Net Submission to the Waymo 3D Detection Leaderboard.

[BibT_eX]

[DOI]

CoRR, 2022

PromptDet: Expand Your Detector Vocabulary with Uncurated Images.

[BibT_eX]

[DOI]

CoRR, 2022

Suspected Object Matters: Rethinking Model's Prediction for One-stage Visual Grounding.

[BibT_eX]

[DOI]

CoRR, 2022

Expansion and Shrinkage of Localization for Weakly-Supervised Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion Design.

[BibT_eX]

[DOI]

Xujie Zhang

Yu Sha

Michael C. Kampffmeyer

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

PromptDet: Towards Open-Vocabulary Detection Using Uncurated Images.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

CASNet: A Cross-Attention Siamese Network for Video Salient Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2021

Anytime Recognition with Routing Convolutional Networks.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2021

Delving deep into the imbalance of positive proposals in two-stage object detection.

[BibT_eX]

[DOI]

Neurocomputing, 2021

Two-stage Visual Cues Enhancement Network for Referring Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020

Matching Image and Sentence With Multi-Faceted Representations.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2020

Joint Task-Recursive Learning for RGB-D Scene Understanding.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Delving into the Imbalance of Positive Proposals in Two-stage Object Detection.

[BibT_eX]

[DOI]

CoRR, 2020

PS-RCNN: Detecting Secondary Human Instances in a Crowd via Primary Object Suppression.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Central Similarity Quantization for Efficient Image and Video Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

NMS by Representative Region: Towards Crowded Pedestrian Detection by Proposal Pairing.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

MTL-NAS: Task-Agnostic Neural Architecture Search Towards General-Purpose Multi-Task Learning.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Bidirectional image-sentence retrieval by local and global deep matching.

[BibT_eX]

[DOI]

Neurocomputing, 2019

Central Similarity Hashing via Hadamard matrix.

[BibT_eX]

[DOI]

CoRR, 2019

A Sufficient Condition for Convergences of Adam and RMSProp.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning Object-Wise Semantic Representation for Detection in Remote Sensing Imagery.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Geometry-Aware Distillation for Indoor Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Localizing Natural Language in Videos.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Video-Based Person Re-Identification With Accumulative Motion Context.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2018

Object Proposal Generation With Fully Convolutional Networks.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2018

Learning with rethinking: Recurrently improving convolutional neural networks through feedback.

[BibT_eX]

[DOI]

Pattern Recognit., 2018

Real-Time Referring Expression Comprehension by Single-Stage Grounding Network.

[BibT_eX]

[DOI]

CoRR, 2018

Multi-View Image Generation from a Single-View.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Image-level to Pixel-wise Labeling: From Theory to Practice.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Policy Optimization with Demonstrations.

[BibT_eX]

[DOI]

Bingyi Kang

Zequn Jie

Jiashi Feng

Proceedings of the 35th International Conference on Machine Learning, 2018

Temporally Grounding Natural Sentence in Video.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Modular Generative Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Joint Task-Recursive Learning for Semantic Segmentation and Depth Estimation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Modeling Varying Camera-IMU Time Offset in Optimization-Based Visual-Inertial Odometry.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi-Supervised Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Left-Right Comparative Recurrent Model for Stereo Matching.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Predicting Scene Parsing and Motion Dynamics in the Future.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Neural Person Search Machines.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

FoveaNet: Perspective-Aware Urban Scene Parsing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Video Scene Parsing with Predictive Feature Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Deep Self-Taught Learning for Weakly Supervised Object Localization.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Multi-Path Feedback Recurrent Neural Networks for Scene Parsing.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Scale-Aware Pixelwise Object Proposal Networks.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2016

Learning to segment with image-level annotations.

[BibT_eX]

[DOI]

Pattern Recognit., 2016

Multi-Path Feedback Recurrent Neural Network for Scene Parsing.

[BibT_eX]

[DOI]

CoRR, 2016

Scale-aware Pixel-wise Object Proposal Networks.

[BibT_eX]

[DOI]

CoRR, 2016

Tree-Structured Reinforcement Learning for Sequential Object Localization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Accurate On-Road Vehicle Detection with Deep Fully Convolutional Networks.

[BibT_eX]

[DOI]

Zequn Jie

Wen Feng Lu

Francis Eng Hock Tay

Proceedings of the Machine Learning and Data Mining in Pattern Recognition, 2016

Reversible Recursive Instance-Level Object Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2014

Robust Scene Classification with Cross-Level LLC Coding on CNN Features.

[BibT_eX]

[DOI]

Zequn Jie

Shuicheng Yan

Proceedings of the Computer Vision - ACCV 2014, 2014

Zequn Jie

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...