Jun Wang

Orcid: 0009-0008-6066-4286

Affiliations:
  • Salesforce AI Research, Palo Alto, CA, USA
  • University of Maryland, College Park, MD, USA (PhD 2023)


According to our database1, Jun Wang authored at least 14 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models.
CoRR, 2024

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions.
CoRR, 2024

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models.
CoRR, 2024

EVE: Enabling Anyone to Train Robot using Augmented Reality.
CoRR, 2024

EVE: Enabling Anyone to Train Robots using Augmented Reality.
Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, 2024

xGen-VideoSyn-1: High-Fidelity Text-to-Video Synthesis with Compressed Representations.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

2023
Deep Learning for Scene perception and Understanding.
PhD thesis, 2023

Align and Attend: Multimodal Summarization with Dual Contrastive Losses.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
M3DETR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

ESSumm: Extractive Speech Summarization from Untranscribed Meeting.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Number-Adaptive Prototype Learning for 3D Point Cloud Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

PointMotionNet: Point-Wise Motion Learning for Large-Scale LiDAR Point Clouds Sequences.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2020
InfoFocus: 3D Object Detection for Autonomous Driving with Dynamic Information Modeling.
Proceedings of the Computer Vision - ECCV 2020, 2020


  Loading...