Xinyu Li

Orcid: 0000-0001-5398-3707

Affiliations:
  • Amazon Web Service, AWS AI Labs, Seattle, USA
  • Rutgers University, Department of Electrical and Computer Engineering, NJ, USA


According to our database1, Xinyu Li authored at least 41 papers between 2016 and 2022.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2022
SSCAP: Self-supervised Co-occurrence Action Parsing for Unsupervised Temporal Action Segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

NUTA: Non-uniform Temporal Aggregation for Action Recognition.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

TubeR: Tubelet Transformer for Video Action Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Id-Free Person Similarity Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

What to look at and where: Semantic and Spatial Refined Transformer for detecting human-object interactions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Unsupervised Action Segmentation with Self-supervised Feature Learning and Co-occurrence Parsing.
CoRR, 2021

TubeR: Tube-Transformer for Action Detection.
CoRR, 2021

Long Short-Term Transformer for Online Action Detection.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Video Contrastive Learning with Global Context.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

VidTr: Video Transformer Without Convolutions.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Selective Feature Compression for Efficient Activity Recognition Inference.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multi-Label Activity Recognition Using Activity-Specific Features and Activity Correlations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

SiamMOT: Siamese Multi-Object Tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
A Comprehensive Study of Deep Video Action Recognition.
CoRR, 2020

Multi-Label Activity Recognition using Activity-specific Features.
CoRR, 2020

Application of Multi-Object Tracking with Siamese Track-RCNN to the Human in Events Dataset.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Directional Temporal Modeling for Action Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Mutual Correlation Attentive Factors in Dyadic Fusion Networks for Speech Emotion Recognition.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Multi-Stream Network with Temporal Attention for Environmental Sound Classification.
Proceedings of the Interspeech 2019, 2019

Speech Audio Super-Resolution for Speech Recognition.
Proceedings of the Interspeech 2019, 2019

2018
Tri-axial Self-Attention for Concurrent Activity Recognition.
CoRR, 2018

Human Conversation Analysis Using Attentive Multimodal Networks with Hierarchical Encoder-Decoder.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Hybrid Attention based Multimodal Network for Spoken Language Classification.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Multimodal Affective Analysis Using Hierarchical Attention Strategy with Word-Level Alignment.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Progress Estimation and Phase Detection for Sequential Processes.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2017

A Framework for Evaluating Trace Alignments.
CoRR, 2017

Concurrent Activity Recognition with Multimodal CNN-LSTM Structure.
CoRR, 2017

Process Progress Estimation and Phase Detection.
CoRR, 2017

Online People Tracking and Identification with RFID and Kinect.
CoRR, 2017

Region-based Activity Recognition Using Conditional GAN.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

CAR - a deep learning structure for concurrent activity recognition: poster abstract.
Proceedings of the 16th ACM/IEEE International Conference on Information Processing in Sensor Networks, 2017

3D activity localization with multiple sensors: poster abstract.
Proceedings of the 16th ACM/IEEE International Conference on Information Processing in Sensor Networks, 2017

Evaluation of Trace Alignment Quality and its Application in Medical Process Mining.
Proceedings of the 2017 IEEE International Conference on Healthcare Informatics, 2017

Language-Based Process Phase Detection in the Trauma Resuscitation.
Proceedings of the 2017 IEEE International Conference on Healthcare Informatics, 2017

Speech Intention Classification with Multimodal Deep Learning.
Proceedings of the Advances in Artificial Intelligence, 2017

2016
Online process phase detection using multimodal deep learning.
Proceedings of the 7th IEEE Annual Ubiquitous Computing, 2016

Deep Learning for RFID-Based Activity Recognition.
Proceedings of the 14th ACM Conference on Embedded Network Sensor Systems, SenSys 2016, 2016

Activity recognition for medical teamwork based on passive RFID.
Proceedings of the 2016 IEEE International Conference on RFID, 2016

Deep neural network for RFID-based activity recognition.
Proceedings of the Eighth Wireless of the Students, 2016

Privacy Preserving Dynamic Room Layout Mapping.
Proceedings of the Image and Signal Processing - 7th International Conference, 2016


  Loading...