Yutong Ban

ORCID: 0000-0001-5396-9251

According to our database, Yutong Ban authored at least 28 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Concept Graph Neural Networks for Surgical Video Understanding.
IEEE Trans. Medical Imaging, January, 2024

INViT: A Generalizable Routing Problem Solver with Invariant Nested View Transformer.
CoRR, 2024

Hypergraph-Transformer (HGT) for Interactive Event Prediction in Laparoscopic and Robotic Surgery.
CoRR, 2024

2023
TransCenter: Transformers With Dense Representations for Multiple-Object Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models.
CoRR, 2023

Infrastructure-based End-to-End Learning and Prevention of Driver Failure.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

On the Forward Invariance of Neural ODEs.
Proceedings of the International Conference on Machine Learning, 2023

2022
SUPR-GAN: SUrgical PRediction GAN for Event Anticipation in Laparoscopic and Robotic Surgery.
IEEE Robotics Autom. Lett., 2022

Enhancing direct-path relative transfer function using deep neural network for robust sound source localization.
CAAI Trans. Intell. Technol., 2022

A Deep Concept Graph Network for Interaction-Aware Trajectory Prediction.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

2021
Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

SUrgical PRediction GAN for Events Anticipation.
CoRR, 2021

TransCenter: Transformers with Dense Queries for Multiple-Object Tracking.
CoRR, 2021

Aggregating Long-Term Context for Learning Laparoscopic and Robot-Assisted Surgical Workflows.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

2020
Aggregating Long-Term Context for Learning Surgical Workflows.
CoRR, 2020

ODANet: Online Deep Appearance Network for Identity-Consistent Multi-person Tracking.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

How to Train Your Deep Multi-Object Tracker.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Audio-Visual Multiple-Speaker Tracking for Robot Perception. (Suivi multi-locuteurs avec des informations audio-visuelles pour la perception des robots).
PhD thesis, 2019

Tracking Multiple Audio Sources With the von Mises Distribution and Variational EM.
IEEE Signal Process. Lett., 2019

Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environments.
IEEE J. Sel. Top. Signal Process., 2019

DeepMOT: A Differentiable Framework for Training Multiple Object Trackers.
CoRR, 2019

Audio-Visual Variational Fusion for Multi-Person Tracking with Robots.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

2018
A cascaded multiple-speaker localization and tracking system.
CoRR, 2018

A Deep Network for Arousal-Valence Emotion Prediction with Acoustic-Visual Cues.
CoRR, 2018

Accounting for Room Acoustics in Audio-Visual Multi-Speaker Tracking.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Tracking a varying number of people with a visually-controlled robotic head.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Exploiting the Complementarity of Audio and Visual Data in Multi-speaker Tracking.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

2016
Tracking Multiple Persons Based on a Variational Bayesian Model.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

