Bei Liu

Orcid: 0000-0001-8857-0953

Affiliations:
  • Microsoft Research Asia, Beijing, China


According to our database1, Bei Liu authored at least 40 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Spatiotemporal Predictive Pre-training for Robotic Motor Control.
CoRR, 2024

Revisiting Latent Space of GAN Inversion for Robust Real Image Editing.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

2023
Language-Guided Face Animation by Recurrent StyleGAN-Based Generator.
IEEE Trans. Multim., 2023

ViCo: Engaging Video Comment Generation with Human Preference Rewards.
CoRR, 2023

Revisiting Latent Space of GAN Inversion for Real Image Editing.
CoRR, 2023

Pave the Way to Grasp Anything: Transferring Foundation Models for Universal Pick-Place Robots.
CoRR, 2023

Balancing Reconstruction and Editing Quality of GAN Inversion for Real Image Editing with StyleGAN Prior Latent Space.
CoRR, 2023

AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation.
CoRR, 2023

SMP Challenge: An Overview and Analysis of Social Media Prediction Challenge.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Alignment.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Improving Diversity in Zero-Shot GAN Adaptation with Semantic Variations.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SINC: Self-Supervised In-Context Learning for Vision-Language Tasks.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Anchor-Based Detection for Natural Language Localization in Ego-Centric Videos.
Proceedings of the IEEE International Conference on Consumer Electronics, 2023

MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment.
CoRR, 2022

Exploring Anchor-based Detection for Ego4D Natural Language Query.
CoRR, 2022

Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Reference-Based Defect Detection Network.
IEEE Trans. Image Process., 2021

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training.
CoRR, 2021

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Searching the Search Space of Vision Transformer.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Fine-Grained Motion Embedding for Landscape Animation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Unifying Multimodal Transformer for Bi-directional Image and Text Generation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

A Picture is Worth a Thousand Words: A Unified System for Diverse Captions and Rich Images Generation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

MMPT'21: International Joint Workshop on Multi-Modal Pre-Training for Multimedia Understanding.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Seeing Out of the Box: End-to-End Pre-Training for Vision-Language Representation Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Transformers.
CoRR, 2020

Aesthetic-Aware Image Style Transfer.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

2019
Learning Rich Image Region Representation for Visual Question Answering.
CoRR, 2019

WSOD^2: Learning Bottom-up and Top-down Objectness Distillation for Weakly-supervised Object Detection.
CoRR, 2019

Activitynet 2019 Task 3: Exploring Contexts for Dense Captioning Events in Videos.
CoRR, 2019

Neural Storyboard Artist: Visualizing Stories with Coherent Image Sequences.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

SMP Challenge: An Overview of Social Media Prediction Challenge 2019.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Emotion Reinforced Visual Storytelling.
Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

WSOD2: Learning Bottom-Up and Top-Down Objectness Distillation for Weakly-Supervised Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial Training.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

2016
Cognition-Aware Summarization of Photos Representing Events.
IEICE Trans. Inf. Syst., 2016

2014
Finding Photo Sets of Events by Minimizing Misrecognition from Neighbor Events.
Proceedings of the Web-Age Information Management - 15th International Conference, 2014


  Loading...