Chongjian Ge

Orcid: 0000-0003-1142-9171

According to our database1, Chongjian Ge authored at least 25 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Rethinking Attentive Object Detection via Neural Attention Learning.
IEEE Trans. Image Process., 2024

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation.
CoRR, 2024

RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis.
CoRR, 2024

DeepAccident: A Motion and Accident Prediction Benchmark for V2X Autonomous Driving.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
CycleMLP: A MLP-Like Architecture for Dense Visual Predictions.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Advancing Vision Transformers with Group-Mix Attention.
CoRR, 2023

Large Language Models as Automated Aligners for benchmarking Vision-Language Models.
CoRR, 2023

InstructDET: Diversifying Referring Object Detection with Generalized Instructions.
CoRR, 2023

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis.
CoRR, 2023

Speed Co-Augmentation for Unsupervised Audio-Visual Pre-training.
CoRR, 2023

MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation.
CoRR, 2023

DeepAccident: A Motion and Accident Prediction Benchmark for V2X Autonomous Driving.
CoRR, 2023

Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

MetaBEV: Solving Sensor Failures for 3D Detection and Map Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Not All Patches are What You Need: Expediting Vision Transformers via Token Reorganizations.
CoRR, 2022

AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile Medical Image Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

EViT: Expediting Vision Transformers via Token Reorganizations.
Proceedings of the Tenth International Conference on Learning Representations, 2022

CycleMLP: A MLP-like Architecture for Dense Prediction.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning.
CoRR, 2021

CycleMLP: A MLP-like Architecture for Dense Prediction.
CoRR, 2021

Revitalizing CNN Attention via Transformers in Self-Supervised Visual Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Watch Only Once: An End-to-End Video Action Detection Framework.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Parser-Free Virtual Try-On via Distilling Appearance Flows.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Disentangled Cycle Consistency for Highly-Realistic Virtual Try-On.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021


  Loading...