Song Bai

Orcid: 0000-0002-2570-9118

According to our database¹, Song Bai authored at least 142 papers between 2014 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Bibliography

2024

Understanding and Mitigating Dimensional Collapse in Federated Learning.

IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

CenterNet++ for Object Detection.

IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

You-Only-Look-Once Multiple-Strategy Printed Circuit Board Defect Detection Model.

IEEE Multim., 2024

Debiasing Text-to-Image Diffusion Models.

CoRR, 2024

Progress and Prospects in 3D Generative AI: A Technical Overview including 3D human.

Song Bai

Jie Li

CoRR, 2024

2023

Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised Learning.

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Smart Public Transportation Sensing: Enhancing Perception and Data Management for Efficient and Safety Operations.

Sensors, November, 2023

Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition.

IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

A probabilistic fatigue life prediction method under random combined high and low cycle fatigue load history.

Reliab. Eng. Syst. Saf., October, 2023

Patch-Based Separable Transformer for Visual Recognition.

IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Guest Editorial: Introduction to the Special Section on Graphs in Vision and Pattern Analysis.

IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

RIS-Assisted Joint Active and Passive Transmission With Distributed Reception.

IEEE Trans. Veh. Technol., May, 2023

Supervised Phenotype Discovery From Multimodal Brain Imaging.

Christian F. Beckmann

IEEE Trans. Medical Imaging, March, 2023

General Object Foundation Model for Images and Videos at Scale.

CoRR, 2023

Learning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery.

CoRR, 2023

Dataset Condensation via Generative Model.

CoRR, 2023

Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks.

CoRR, 2023

Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding.

CoRR, 2023

DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing.

CoRR, 2023

Intriguing Properties of Text-guided Diffusion Models.

CoRR, 2023

SRFormer: Permuted Self-Attention for Single Image Super-Resolution.

CoRR, 2023

Mixed Samples as Probes for Unsupervised Model Selection in Domain Adaptation.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A network architecture for carrying terminal functions on 5G and 6G satellites.

Proceedings of the International Wireless Communications and Mobile Computing, 2023

PV3D: A 3D Generative Model for Portrait Video Generation.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Towards Understanding and Mitigating Dimensional Collapse in Heterogeneous Federated Learning.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Is Synthetic Data from Generative Models Ready for Image Recognition?

Proceedings of the Eleventh International Conference on Learning Representations, 2023

SRFormer: Permuted Self-Attention for Single Image Super-Resolution.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

MOSE: A New Dataset for Video Object Segmentation in Complex Scenes.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

InstMove: Instance Motion for Object-centric Video Segmentation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PLA: Language-Driven Open-Vocabulary 3D Scene Understanding.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

RIS-Empowered Phase and Code Index Modulation.

Proceedings of the 8th International Conference on Computer and Communication Systems, 2023

2022

Stability analysis of explicit MPM.

Song Bai

Craig A. Schroeder

Comput. Graph. Forum, December, 2022

Wasserstein Loss With Alternative Reinforcement Learning for Severity-Aware Semantic Segmentation.

IEEE Trans. Intell. Transp. Syst., 2022

End-to-End Temporal Action Detection With Transformer.

IEEE Trans. Image Process., 2022

Author Correction: Advancing COVID-19 diagnosis with privacy-preserving collaboration in artificial intelligence.

Pattanasak Mongkolwat

Lorena Escudero Sanchez

Nat. Mach. Intell., 2022

AutoScale: Learning to Scale for Crowd Counting.

Int. J. Comput. Vis., 2022

Occluded Video Instance Segmentation: A Benchmark.

Int. J. Comput. Vis., 2022

Language-driven Open-Vocabulary 3D Scene Understanding.

CoRR, 2022

LUMix: Improving Mixup by Better Modelling Label Uncertainty.

CoRR, 2022

The Runner-up Solution for YouTube-VIS Long Video Challenge 2022.

CoRR, 2022

1st Place Solution to ECCV 2022 Challenge on Out of Vocabulary Scene Text Understanding: End-to-End Recognition of Out of Vocabulary Words.

CoRR, 2022

Runner-Up Solution to ECCV 2022 Challenge on Out of Vocabulary Scene Text Understanding: Cropped Word Recognition.

CoRR, 2022

Contextual Text Block Detection towards Scene Text Understanding.

CoRR, 2022

VMRF: View Matching Neural Radiance Fields.

CoRR, 2022

Language Matters: A Weakly Supervised Pre-training Approach for Scene Text Detection and Spotting.

CoRR, 2022

Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting.

Proceedings of the Computer Vision - ECCV 2022, 2022

Contextual Text Block Detection Towards Scene Text Understanding.

Proceedings of the Computer Vision - ECCV 2022, 2022

In Defense of Online Models for Video Instance Segmentation.

Proceedings of the Computer Vision - ECCV 2022, 2022

SeqFormer: Sequential Transformer for Video Instance Segmentation.

Proceedings of the Computer Vision - ECCV 2022, 2022

Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation.

Proceedings of the Computer Vision - ECCV 2022, 2022

Fourier Document Restoration for Robust Document Dewarping and Recognition.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

An Empirical Study of End-to-End Temporal Action Detection.

Xiaolong Liu

Song Bai

Xiang Bai

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Knowledge Distillation as Efficient Pre-training: Faster Convergence, Higher Data-efficiency, and Better Transferability.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

TransMix: Attend to Mix for Vision Transformers.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

YouMVOS: An Actor-centric Multi-shot Video Object Segmentation Dataset.

Anirudh Srinivasan Chakravarthy

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Hypergraph convolution and hypergraph attention.

Song Bai

Feihu Zhang

Philip H. S. Torr

Pattern Recognit., 2021

Learning Regional Attraction for Line Segment Detection.

IEEE Trans. Pattern Anal. Mach. Intell., 2021

Adversarial Metric Attack and Defense for Person Re-Identification.

IEEE Trans. Pattern Anal. Mach. Intell., 2021

Advancing COVID-19 diagnosis with privacy-preserving collaboration in artificial intelligence.

Pattanasak Mongkolwat

Lorena Escudero Sanchez

Nat. Mach. Intell., 2021

Deep learning for predicting COVID-19 malignant progression.

Medical Image Anal., 2021

SeqFormer: a Frustratingly Simple Model for Video Instance Segmentation.

CoRR, 2021

Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence.

Pattanasak Mongkolwat

Lorena Escudero Sanchez

Carola-Bibiane Schönlieb

Tian Xia

CoRR, 2021

Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation.

Anirudh Srinivasan Chakravarthy

CoRR, 2021

CAP-Net: Correspondence-Aware Point-view Fusion Network for 3D Shape Analysis.

CoRR, 2021

Visual Parser: Representing Part-whole Hierarchies with Transformers.

CoRR, 2021

End-to-end Temporal Action Detection with Transformer.

CoRR, 2021

I2C2W: Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition.

CoRR, 2021

Location-Sensitive Visual Recognition with Cross-IOU Loss.

CoRR, 2021

Anchor-Free Person Search.

CoRR, 2021

Occluded Video Instance Segmentation.

CoRR, 2021

Occluded Video Instance Segmentation: Dataset and ICCV 2021 Challenge.

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Deep Interactive Video Inpainting: An Invisibility Cloak for Harry Potter.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

PlaneTR: Structure-Guided Transformers for 3D Plane Recovery.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Anchor-Free Person Search.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

SwiftNet: Real-Time Video Object Segmentation.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Multi-Shot Temporal Event Localization: A Benchmark.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Application of data mining for predicting hemodynamics instability during pheochromocytoma surgery.

BMC Medical Informatics Decis. Mak., December, 2020

An Improved Multi-View Convolutional Neural Network for 3D Object Retrieval.

IEEE Trans. Image Process., 2020

PCL: Proposal Cluster Learning for Weakly Supervised Object Detection.

IEEE Trans. Pattern Anal. Mach. Intell., 2020

A comparison of methods for 3D scene shape retrieval.

Comput. Vis. Image Underst., 2020

Importance-Aware Semantic Segmentation in Self-Driving with Discrete Wasserstein Training.

CoRR, 2020

Reinforced Wasserstein Training for Severity-Aware Semantic Segmentation in Autonomous Driving.

CoRR, 2020

FedOCR: Communication-Efficient Federated Learning for Scene Text Recognition.

CoRR, 2020

Dual Attention GANs for Semantic Image Synthesis.

Hao Tang

Song Bai

Nicu Sebe

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Instance Segmentation of LiDAR Point Clouds.

Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

XingGAN for Person Image Generation.

Proceedings of the Computer Vision - ECCV 2020, 2020

Regional Homogeneity: Towards Learning Transferable Universal Adversarial Perturbations Against Defenses.

Proceedings of the Computer Vision - ECCV 2020, 2020

Corner Proposal Network for Anchor-Free, Two-Stage Object Detection.

Proceedings of the Computer Vision - ECCV 2020, 2020

Holistically-Attracted Wireframe Parsing.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Neural Architecture Search for Lightweight Non-Local Networks.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Bipartite Graph Reasoning GANs for Person Image Generation.

Proceedings of the 31st British Machine Vision Conference 2020, 2020

Learning Transferable Adversarial Examples via Ghost Networks.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Importance-Aware Semantic Segmentation in Self-Driving with Discrete Wasserstein Training.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Automatic Ensemble Diffusion for 3D Shape and Image Retrieval.

IEEE Trans. Image Process., 2019

Training convolutional neural network from multi-domain contour images for 3D shape retrieval.

Pattern Recognit. Lett., 2019

Regularized Diffusion Process on Bidirectional Context for Object Retrieval.

IEEE Trans. Pattern Anal. Mach. Intell., 2019

Asymmetric Non-local Neural Networks for Semantic Segmentation.

CoRR, 2019

Learn to Scale: Generating Multipolar Normalized Density Map for Crowd Counting.

CoRR, 2019

Adversarial Metric Attack for Person Re-identification.

CoRR, 2019

Feature context learning for human parsing.

Sci. China Inf. Sci., 2019

Semi-Supervised 3D Abdominal Multi-Organ Segmentation Via Deep Multi-Planar Co-Training.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Asymmetric Non-Local Neural Networks for Semantic Segmentation.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Prior-Aware Neural Network for Partially-Supervised Multi-Organ Segmentation.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Anchor Diffusion for Unsupervised Video Object Segmentation.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Symmetry-Constrained Rectification Network for Scene Text Recognition.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learn to Scale: Generating Multipolar Normalized Density Maps for Crowd Counting.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

View N-Gram Network for 3D Object Retrieval.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

CenterNet: Keypoint Triplets for Object Detection.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning Attraction Field Representation for Robust Line Segment Detection.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Improving Transferability of Adversarial Examples With Input Diversity.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Re-Ranking via Metric Fusion for Object Retrieval and Person Re-Identification.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Improving context-sensitive similarity via smooth neighborhood for object retrieval.

Pattern Recognit., 2018

Position Tracking Control for Permanent Magnet Linear Motor via Continuous-Time Fast Terminal Sliding Mode Control.

J. Control. Sci. Eng., 2018

Learn to Interpret Atari Agents.

CoRR, 2018

Hard-Aware Point-to-Set Deep Metric for Person Re-identification.

Proceedings of the Computer Vision - ECCV 2018, 2018

Triplet-Center Loss for Multi-View 3D Object Retrieval.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2D Scene Sketch-Based 3D Scene Retrieval.

Proceedings of the 11th Eurographics Workshop on 3D Object Retrieval, 2018

2D Image-Based 3D Scene Retrieval.

Proceedings of the 11th Eurographics Workshop on 3D Object Retrieval, 2018

2017

GIFT: Towards Scalable 3D Shape Retrieval.

IEEE Trans. Multim., 2017

The Fabrication and Characterization of Ni/4H-SiC Schottky Diode Radiation Detectors with a Sensitive Area of up to 4 cm².

Sensors, 2017

GraphHP: A Hybrid Platform for Iterative Graph Processing.

CoRR, 2017

Ensemble Diffusion for Retrieval.

Proceedings of the IEEE International Conference on Computer Vision, 2017

Scalable Person Re-identification on Supervised Smoothed Manifold.

Song Bai

Xiang Bai

Qi Tian

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Divide and Fuse: A Re-ranking Approach for Person Re-identification.

Proceedings of the British Machine Vision Conference 2017, 2017

Regularized Diffusion Process for Visual Retrieval.

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Multidimensional Scaling on Multiple Input Distance Matrices.

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Large-Scale 3D Shape Retrieval from ShapeNet Core55.

Proceedings of the 10th Eurographics Workshop on 3D Object Retrieval, 2017

2016

Multiple Stage Residual Model for Image Classification and Vector Compression.

Song Bai

Xiang Bai

Wenyu Liu

IEEE Trans. Multim., 2016

Sparse Contextual Activation for Efficient Visual Re-Ranking.

Song Bai

Xiang Bai

IEEE Trans. Image Process., 2016

Co-spectral for robust shape clustering.

Song Bai

Zhiyong Liu

Xiang Bai

Pattern Recognit. Lett., 2016

Deep Learning Representation using Autoencoder for 3D Shape Retrieval.

Neurocomputing, 2016

Smooth Neighborhood Structure Mining on Multiple Affinity Graphs with Applications to Context-Sensitive Similarity.

Proceedings of the Computer Vision - ECCV 2016, 2016

GIFT: A Real-Time and Scalable 3D Shape Search Engine.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Large-Scale 3D Shape Retrieval from ShapeNet Core55.

Proceedings of the 9th Eurographics Workshop on 3D Object Retrieval, 2016

2015

DeepPano: Deep Panoramic Representation for 3-D Shape Recognition.

IEEE Signal Process. Lett., 2015

Neural shape codes for 3D model retrieval.

Pattern Recognit. Lett., 2015

3D Shape Matching via Two Layer Coding.

IEEE Trans. Pattern Anal. Mach. Intell., 2015

Beyond diffusion process: Neighbor set similarity for fast re-ranking.

Xiang Bai

Song Bai

Xinggang Wang

Inf. Sci., 2015

2014

Aggregating contour fragments for shape classification.

Song Bai

Xinggang Wang

Xiang Bai

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Multiple Stage Residual Model for Accurate Image Classification.

Proceedings of the Computer Vision - ACCV 2014, 2014
