Song Bai

Orcid: 0000-0002-2570-9118

According to our database, Song Bai authored at least 142 papers between 2014 and 2024.

Bibliography

2024
Understanding and Mitigating Dimensional Collapse in Federated Learning.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

CenterNet++ for Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

You-Only-Look-Once Multiple-Strategy Printed Circuit Board Defect Detection Model.
IEEE Multim., 2024

Debiasing Text-to-Image Diffusion Models.
CoRR, 2024

Progress and Prospects in 3D Generative AI: A Technical Overview including 3D human.
CoRR, 2024

2023
Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised Learning.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Smart Public Transportation Sensing: Enhancing Perception and Data Management for Efficient and Safety Operations.
Sensors, November, 2023

Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

A probabilistic fatigue life prediction method under random combined high and low cycle fatigue load history.
Reliab. Eng. Syst. Saf., October, 2023

Patch-Based Separable Transformer for Visual Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Guest Editorial: Introduction to the Special Section on Graphs in Vision and Pattern Analysis.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

RIS-Assisted Joint Active and Passive Transmission With Distributed Reception.
IEEE Trans. Veh. Technol., May, 2023

Supervised Phenotype Discovery From Multimodal Brain Imaging.
IEEE Trans. Medical Imaging, March, 2023

General Object Foundation Model for Images and Videos at Scale.
CoRR, 2023

Learning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery.
CoRR, 2023

Dataset Condensation via Generative Model.
CoRR, 2023

Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks.
CoRR, 2023

Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding.
CoRR, 2023

DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing.
CoRR, 2023

Intriguing Properties of Text-guided Diffusion Models.
CoRR, 2023

SRFormer: Permuted Self-Attention for Single Image Super-Resolution.
CoRR, 2023

Mixed Samples as Probes for Unsupervised Model Selection in Domain Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A network architecture for carrying terminal functions on 5G and 6G satellites.
Proceedings of the International Wireless Communications and Mobile Computing, 2023

PV3D: A 3D Generative Model for Portrait Video Generation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Towards Understanding and Mitigating Dimensional Collapse in Heterogeneous Federated Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Is Synthetic Data from Generative Models Ready for Image Recognition?
Proceedings of the Eleventh International Conference on Learning Representations, 2023

SRFormer: Permuted Self-Attention for Single Image Super-Resolution.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

MOSE: A New Dataset for Video Object Segmentation in Complex Scenes.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

InstMove: Instance Motion for Object-centric Video Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PLA: Language-Driven Open-Vocabulary 3D Scene Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

RIS-Empowered Phase and Code Index Modulation.
Proceedings of the 8th International Conference on Computer and Communication Systems, 2023

2022
Stability analysis of explicit MPM.
Comput. Graph. Forum, December, 2022

Wasserstein Loss With Alternative Reinforcement Learning for Severity-Aware Semantic Segmentation.
IEEE Trans. Intell. Transp. Syst., 2022

End-to-End Temporal Action Detection With Transformer.
IEEE Trans. Image Process., 2022

Author Correction: Advancing COVID-19 diagnosis with privacy-preserving collaboration in artificial intelligence.
Nat. Mach. Intell., 2022

AutoScale: Learning to Scale for Crowd Counting.
Int. J. Comput. Vis., 2022

Occluded Video Instance Segmentation: A Benchmark.
Int. J. Comput. Vis., 2022

Language-driven Open-Vocabulary 3D Scene Understanding.
CoRR, 2022

LUMix: Improving Mixup by Better Modelling Label Uncertainty.
CoRR, 2022

The Runner-up Solution for YouTube-VIS Long Video Challenge 2022.
CoRR, 2022

1st Place Solution to ECCV 2022 Challenge on Out of Vocabulary Scene Text Understanding: End-to-End Recognition of Out of Vocabulary Words.
CoRR, 2022

Runner-Up Solution to ECCV 2022 Challenge on Out of Vocabulary Scene Text Understanding: Cropped Word Recognition.
CoRR, 2022

Contextual Text Block Detection towards Scene Text Understanding.
CoRR, 2022

VMRF: View Matching Neural Radiance Fields.
CoRR, 2022

Language Matters: A Weakly Supervised Pre-training Approach for Scene Text Detection and Spotting.
CoRR, 2022

Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting.
Proceedings of the Computer Vision - ECCV 2022, 2022

Contextual Text Block Detection Towards Scene Text Understanding.
Proceedings of the Computer Vision - ECCV 2022, 2022

In Defense of Online Models for Video Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

SeqFormer: Sequential Transformer for Video Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Fourier Document Restoration for Robust Document Dewarping and Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

An Empirical Study of End-to-End Temporal Action Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Knowledge Distillation as Efficient Pre-training: Faster Convergence, Higher Data-efficiency, and Better Transferability.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

TransMix: Attend to Mix for Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

YouMVOS: An Actor-centric Multi-shot Video Object Segmentation Dataset.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Hypergraph convolution and hypergraph attention.
Pattern Recognit., 2021

Learning Regional Attraction for Line Segment Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Adversarial Metric Attack and Defense for Person Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Advancing COVID-19 diagnosis with privacy-preserving collaboration in artificial intelligence.
Nat. Mach. Intell., 2021

Deep learning for predicting COVID-19 malignant progression.
Medical Image Anal., 2021

SeqFormer: a Frustratingly Simple Model for Video Instance Segmentation.
CoRR, 2021

Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence.
CoRR, 2021

Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation.
CoRR, 2021

CAP-Net: Correspondence-Aware Point-view Fusion Network for 3D Shape Analysis.
CoRR, 2021

Visual Parser: Representing Part-whole Hierarchies with Transformers.
CoRR, 2021

End-to-end Temporal Action Detection with Transformer.
CoRR, 2021

I2C2W: Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition.
CoRR, 2021

Location-Sensitive Visual Recognition with Cross-IOU Loss.
CoRR, 2021

Anchor-Free Person Search.
CoRR, 2021

Occluded Video Instance Segmentation.
CoRR, 2021

Occluded Video Instance Segmentation: Dataset and ICCV 2021 Challenge.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Deep Interactive Video Inpainting: An Invisibility Cloak for Harry Potter.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

PlaneTR: Structure-Guided Transformers for 3D Plane Recovery.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Anchor-Free Person Search.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

SwiftNet: Real-Time Video Object Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Multi-Shot Temporal Event Localization: A Benchmark.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Application of data mining for predicting hemodynamics instability during pheochromocytoma surgery.
BMC Medical Informatics Decis. Mak., December, 2020

An Improved Multi-View Convolutional Neural Network for 3D Object Retrieval.
IEEE Trans. Image Process., 2020

PCL: Proposal Cluster Learning for Weakly Supervised Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

A comparison of methods for 3D scene shape retrieval.
Comput. Vis. Image Underst., 2020

Importance-Aware Semantic Segmentation in Self-Driving with Discrete Wasserstein Training.
CoRR, 2020

Reinforced Wasserstein Training for Severity-Aware Semantic Segmentation in Autonomous Driving.
CoRR, 2020

FedOCR: Communication-Efficient Federated Learning for Scene Text Recognition.
CoRR, 2020

Dual Attention GANs for Semantic Image Synthesis.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Instance Segmentation of LiDAR Point Clouds.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

XingGAN for Person Image Generation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Regional Homogeneity: Towards Learning Transferable Universal Adversarial Perturbations Against Defenses.
Proceedings of the Computer Vision - ECCV 2020, 2020

Corner Proposal Network for Anchor-Free, Two-Stage Object Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

Holistically-Attracted Wireframe Parsing.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Neural Architecture Search for Lightweight Non-Local Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Bipartite Graph Reasoning GANs for Person Image Generation.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

Learning Transferable Adversarial Examples via Ghost Networks.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Importance-Aware Semantic Segmentation in Self-Driving with Discrete Wasserstein Training.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Automatic Ensemble Diffusion for 3D Shape and Image Retrieval.
IEEE Trans. Image Process., 2019

Training convolutional neural network from multi-domain contour images for 3D shape retrieval.
Pattern Recognit. Lett., 2019

Regularized Diffusion Process on Bidirectional Context for Object Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Asymmetric Non-local Neural Networks for Semantic Segmentation.
CoRR, 2019

Learn to Scale: Generating Multipolar Normalized Density Map for Crowd Counting.
CoRR, 2019

Adversarial Metric Attack for Person Re-identification.
CoRR, 2019

Feature context learning for human parsing.
Sci. China Inf. Sci., 2019

Semi-Supervised 3D Abdominal Multi-Organ Segmentation Via Deep Multi-Planar Co-Training.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Asymmetric Non-Local Neural Networks for Semantic Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Prior-Aware Neural Network for Partially-Supervised Multi-Organ Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Anchor Diffusion for Unsupervised Video Object Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Symmetry-Constrained Rectification Network for Scene Text Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learn to Scale: Generating Multipolar Normalized Density Maps for Crowd Counting.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

View N-Gram Network for 3D Object Retrieval.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

CenterNet: Keypoint Triplets for Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning Attraction Field Representation for Robust Line Segment Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Improving Transferability of Adversarial Examples With Input Diversity.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Re-Ranking via Metric Fusion for Object Retrieval and Person Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Improving context-sensitive similarity via smooth neighborhood for object retrieval.
Pattern Recognit., 2018

Position Tracking Control for Permanent Magnet Linear Motor via Continuous-Time Fast Terminal Sliding Mode Control.
J. Control. Sci. Eng., 2018

Learn to Interpret Atari Agents.
CoRR, 2018

Hard-Aware Point-to-Set Deep Metric for Person Re-identification.
Proceedings of the Computer Vision - ECCV 2018, 2018

Triplet-Center Loss for Multi-View 3D Object Retrieval.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018



2017
GIFT: Towards Scalable 3D Shape Retrieval.
IEEE Trans. Multim., 2017

The Fabrication and Characterization of Ni/4H-SiC Schottky Diode Radiation Detectors with a Sensitive Area of up to 4 cm².
Sensors, 2017

GraphHP: A Hybrid Platform for Iterative Graph Processing.
CoRR, 2017

Ensemble Diffusion for Retrieval.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Scalable Person Re-identification on Supervised Smoothed Manifold.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Divide and Fuse: A Re-ranking Approach for Person Re-identification.
Proceedings of the British Machine Vision Conference 2017, 2017

Regularized Diffusion Process for Visual Retrieval.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Multidimensional Scaling on Multiple Input Distance Matrices.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017


2016
Multiple Stage Residual Model for Image Classification and Vector Compression.
IEEE Trans. Multim., 2016

Sparse Contextual Activation for Efficient Visual Re-Ranking.
IEEE Trans. Image Process., 2016

Co-spectral for robust shape clustering.
Pattern Recognit. Lett., 2016

Deep Learning Representation using Autoencoder for 3D Shape Retrieval.
Neurocomputing, 2016

Smooth Neighborhood Structure Mining on Multiple Affinity Graphs with Applications to Context-Sensitive Similarity.
Proceedings of the Computer Vision - ECCV 2016, 2016

GIFT: A Real-Time and Scalable 3D Shape Search Engine.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016


2015
DeepPano: Deep Panoramic Representation for 3-D Shape Recognition.
IEEE Signal Process. Lett., 2015

Neural shape codes for 3D model retrieval.
Pattern Recognit. Lett., 2015

3D Shape Matching via Two Layer Coding.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Beyond diffusion process: Neighbor set similarity for fast re-ranking.
Inf. Sci., 2015

2014
Aggregating contour fragments for shape classification.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Multiple Stage Residual Model for Accurate Image Classification.
Proceedings of the Computer Vision - ACCV 2014, 2014

