Jianhua Han

According to our database1, Jianhua Han authored at least 47 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model.
CoRR, 2024

NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning.
CoRR, 2024

From Summary to Action: Enhancing Large Language Models for Complex Tasks with Open World APIs.
CoRR, 2024

Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models.
CoRR, 2024

Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion.
CoRR, 2023

G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model.
CoRR, 2023

Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving.
CoRR, 2023

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis.
CoRR, 2023

Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models.
CoRR, 2023

HiLM-D: Towards High-Resolution Understanding in Multimodal Large Language Models for Autonomous Driving.
CoRR, 2023

GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training.
CoRR, 2023

DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability.
CoRR, 2023

MO-VLN: A Multi-Task Benchmark for Open-set Zero-Shot Vision-and-Language Navigation.
CoRR, 2023

Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards.
CoRR, 2023

Towards Universal Vision-language Omni-supervised Segmentation.
CoRR, 2023

Task-customized Masked Autoencoder via Mixture of Cluster-conditional Experts.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DetGPT: Detect What You Need via Reasoning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

CLIP<sup>2</sup>: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CapDet: Unifying Dense Captioning and Open-World Detection Pretraining.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

NLIP: Noise-Robust Language-Image Pre-training.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
P<sup>3</sup>OVD: Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection.
CoRR, 2022

DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Generative Negative Text Replay for Continual Vision-Language Pretraining.
Proceedings of the Computer Vision - ECCV 2022, 2022

Open-World Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding.
Proceedings of the Computer Vision - ECCV 2022, 2022

CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving.
Proceedings of the Computer Vision - ECCV 2022, 2022

ONCE-3DLanes: Building Monocular 3D Lane Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Task-Customized Self-Supervised Pre-training with Scalable Dynamic Routing.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Laneformer: Object-Aware Row-Column Transformers for Lane Detection.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
SODA10M: Towards Large-Scale Object Detection Benchmark for Autonomous Driving.
CoRR, 2021

SODA10M: A Large-Scale 2D Self/Semi-Supervised Object Detection Dataset for Autonomous Driving.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

2019
Order-aware Embedding Neural Network for CTR Prediction.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Optimizing Ranking Algorithm in Recommender System via Deep Reinforcement Learning.
Proceedings of the International Conference on Artificial Intelligence and Advanced Manufacturing, 2019

2017
Aggregating Crowd Wisdoms with Label-aware Autoencoders.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

2016
Label Aggregation with Instance Grouping Model.
Proceedings of the 25th International Conference on World Wide Web, 2016

Aggregating Crowd Wisdom with Instance Grouping Methods.
Proceedings of the Web Technologies and Applications - 18th Asia-Pacific Web Conference, 2016

2014
Fault-Tolerant Control, Fault Diagnosis and Recovery in Runtime of Business Docking Service Composition Flow in the Cloud Environment.
Proceedings of the Intelligent Computing Methodologies - 10th International Conference, 2014

2013
A study on the scalable flow model of web services choreography and orchestration based on dynamic workflow.
Int. J. Inf. Commun. Technol., 2013

2011
Construction and Application of the Merging Network Teaching Platform.
Proceedings of the Frontiers in Computer Education [International Conference on Frontiers in Computer Education, 2011

2010
A study on academic performance and interpersonal interactions based on network.
Proceedings of the 2010 14th International Conference on Computer Supported Cooperative Work in Design, 2010

A New Network Collaborative Manufacturing Based on the STEP-NC.
Proceedings of the International Conference on Computational Aspects of Social Networks, 2010

2008
Cooperative Petition Processing Model Based on Dynamic Workflow Management.
Proceedings of the Fifth International Conference on Fuzzy Systems and Knowledge Discovery, 2008


  Loading...