Zequn Jie

Orcid: 0000-0002-3038-5891

According to our database1, Zequn Jie authored at least 69 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models.
CoRR, 2024

InstaGen: Enhancing Object Detection by Training on Synthetic Dataset.
CoRR, 2024

LLaVA-MoLE: Sparse Mixture of LoRA Experts for Mitigating Data Conflicts in Instruction Finetuning MLLMs.
CoRR, 2024

Instance-Aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Weakly supervised semantic segmentation via self-supervised destruction learning.
Neurocomputing, December, 2023

Weakly Supervised Semantic Segmentation Via Progressive Patch Learning.
IEEE Trans. Multim., 2023

Weakly-Supervised 3D Visual Grounding based on Visual Linguistic Alignment.
CoRR, 2023

UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning.
CoRR, 2023

FastPillars: A Deployment-friendly Pillar-based 3D Detector.
CoRR, 2023

RecFormer: Recurrent Multi-modal Transformer with History-Aware Contrastive Learning for Visual Dialog.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Suspected Objects Matter: Rethinking Model's Prediction for One-stage Visual Grounding.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Multi View Action Recognition for Distracted Driver Behavior Localization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

AeDet: Azimuth-Invariant Multi-View 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Curriculum Multi-Negative Augmentation for Debiased Video Grounding.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Multiple Object Tracking Challenge Technical Report for Team MT_IoT.
CoRR, 2022

AeDet: Azimuth-invariant Multi-view 3D Object Detection.
CoRR, 2022

MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection.
CoRR, 2022

MT-Net Submission to the Waymo 3D Detection Leaderboard.
CoRR, 2022

PromptDet: Expand Your Detector Vocabulary with Uncurated Images.
CoRR, 2022

Suspected Object Matters: Rethinking Model's Prediction for One-stage Visual Grounding.
CoRR, 2022

Expansion and Shrinkage of Localization for Weakly-Supervised Semantic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion Design.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes.
Proceedings of the Computer Vision - ECCV 2022, 2022

PromptDet: Towards Open-Vocabulary Detection Using Uncurated Images.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
CASNet: A Cross-Attention Siamese Network for Video Salient Object Detection.
IEEE Trans. Neural Networks Learn. Syst., 2021

Anytime Recognition with Routing Convolutional Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Delving deep into the imbalance of positive proposals in two-stage object detection.
Neurocomputing, 2021

Two-stage Visual Cues Enhancement Network for Referring Image Segmentation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020
Matching Image and Sentence With Multi-Faceted Representations.
IEEE Trans. Circuits Syst. Video Technol., 2020

Joint Task-Recursive Learning for RGB-D Scene Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Delving into the Imbalance of Positive Proposals in Two-stage Object Detection.
CoRR, 2020

PS-RCNN: Detecting Secondary Human Instances in a Crowd via Primary Object Suppression.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Central Similarity Quantization for Efficient Image and Video Retrieval.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

NMS by Representative Region: Towards Crowded Pedestrian Detection by Proposal Pairing.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

MTL-NAS: Task-Agnostic Neural Architecture Search Towards General-Purpose Multi-Task Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Bidirectional image-sentence retrieval by local and global deep matching.
Neurocomputing, 2019

Central Similarity Hashing via Hadamard matrix.
CoRR, 2019

A Sufficient Condition for Convergences of Adam and RMSProp.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning Object-Wise Semantic Representation for Detection in Remote Sensing Imagery.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Geometry-Aware Distillation for Indoor Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Localizing Natural Language in Videos.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Video-Based Person Re-Identification With Accumulative Motion Context.
IEEE Trans. Circuits Syst. Video Technol., 2018

Object Proposal Generation With Fully Convolutional Networks.
IEEE Trans. Circuits Syst. Video Technol., 2018

Learning with rethinking: Recurrently improving convolutional neural networks through feedback.
Pattern Recognit., 2018

Real-Time Referring Expression Comprehension by Single-Stage Grounding Network.
CoRR, 2018

Multi-View Image Generation from a Single-View.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Image-level to Pixel-wise Labeling: From Theory to Practice.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Policy Optimization with Demonstrations.
Proceedings of the 35th International Conference on Machine Learning, 2018

Temporally Grounding Natural Sentence in Video.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Modular Generative Adversarial Networks.
Proceedings of the Computer Vision - ECCV 2018, 2018

Joint Task-Recursive Learning for Semantic Segmentation and Depth Estimation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Modeling Varying Camera-IMU Time Offset in Optimization-Based Visual-Inertial Odometry.
Proceedings of the Computer Vision - ECCV 2018, 2018

Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi-Supervised Semantic Segmentation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Left-Right Comparative Recurrent Model for Stereo Matching.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Predicting Scene Parsing and Motion Dynamics in the Future.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Neural Person Search Machines.
Proceedings of the IEEE International Conference on Computer Vision, 2017

FoveaNet: Perspective-Aware Urban Scene Parsing.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Video Scene Parsing with Predictive Feature Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Deep Self-Taught Learning for Weakly Supervised Object Localization.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Multi-Path Feedback Recurrent Neural Networks for Scene Parsing.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Scale-Aware Pixelwise Object Proposal Networks.
IEEE Trans. Image Process., 2016

Learning to segment with image-level annotations.
Pattern Recognit., 2016

Multi-Path Feedback Recurrent Neural Network for Scene Parsing.
CoRR, 2016

Scale-aware Pixel-wise Object Proposal Networks.
CoRR, 2016

Tree-Structured Reinforcement Learning for Sequential Object Localization.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Accurate On-Road Vehicle Detection with Deep Fully Convolutional Networks.
Proceedings of the Machine Learning and Data Mining in Pattern Recognition, 2016

Reversible Recursive Instance-Level Object Segmentation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2014
Robust Scene Classification with Cross-Level LLC Coding on CNN Features.
Proceedings of the Computer Vision - ACCV 2014, 2014


  Loading...