Yalong Bai

ORCID: 0000-0002-8416-9027

According to our database, Yalong Bai authored at least 36 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models.
CoRR, 2024

2023
Augmentation Pathways Network for Visual Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Boosting Generic Visual-Linguistic Representation With Dynamic Contexts.
IEEE Trans. Multim., 2023

Interactive Conversational Head Generation.
CoRR, 2023

Deep Equilibrium Multimodal Fusion.
CoRR, 2023

Visual-Aware Text-to-Speech.
CoRR, 2023

Learning and Evaluating Human Preferences for Conversational Head Generation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Visual-Aware Text-to-Speech.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

2022
Visualizing and Understanding Patch Interactions in Vision Transformer.
CoRR, 2022

Freeform Body Motion Generation from Speech.
CoRR, 2022

Responsive Listening Head Generation: A Benchmark Dataset and Baseline.
Proceedings of the Computer Vision - ECCV 2022, 2022

Directional Self-supervised Learning for Heavy Image Augmentations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Responsive Listening Head Generation: A Benchmark Dataset and Baseline.
CoRR, 2021

Directional Self-supervised Learning for Risky Image Augmentations.
CoRR, 2021

Augmentation Pathways Network for Visual Recognition.
CoRR, 2021

Flat and Shallow: Understanding Fake Image Detection Models by Architecture Profiling.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Exploiting Relationship for Complex-scene Image Generation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Products-10K: A Large-scale Product Recognition Dataset.
CoRR, 2020

Look-Into-Object: Self-Supervised Structure Modeling for Object Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Rethinking Visual Relationships for High-level Image Understanding.
CoRR, 2019

VrR-VG: Refocusing Visually-Relevant Relationships.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Destruction and Construction Learning for Fine-Grained Image Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Automatic Data Augmentation from Massive Web Images for Deep Visual Recognition.
ACM Trans. Multim. Comput. Commun. Appl., 2018

Deep Attention Neural Tensor Network for Visual Question Answering.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
Automatic Dataset Augmentation.
CoRR, 2017

Convolutional neural networks for posed and spontaneous expression recognition.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

2016
Improve dog recognition by mining more information from both click-through logs and pre-trained models.
Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

2015
Learning Cross Space Mapping via DNN Using Large Scale Click-Through Logs.
IEEE Trans. Multim., 2015

Automatic Image Dataset Construction from Click-through Logs Using Deep Neural Network.
Proceedings of the 23rd Annual ACM Conference on Multimedia, MM '15, Brisbane, Australia, October 26, 2015

2014
Visualizing and Comparing Convolutional Neural Networks.
CoRR, 2014

Learning High-level Image Representation for Image Retrieval via Multi-Task DNN using Clickthrough Data.
Proceedings of the 2nd International Conference on Learning Representations, 2014

Bag-of-Words Based Deep Neural Network for Image Retrieval.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

RC-NET: A General Framework for Incorporating Knowledge into Word Representations.
Proceedings of the 23rd ACM International Conference on Information and Knowledge Management, 2014

DNN Flow: DNN Feature Pyramid based Image Matching.
Proceedings of the British Machine Vision Conference, 2014

2013
Learning Domain Differences Automatically for Dependency Parsing Adaptation.
Proceedings of the IJCAI 2013, 2013

Cross-lingual Projections between Languages from Different Families.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

