Yin Li

Orcid: 0000-0003-4173-9453

Affiliations:
  • University of Wisconsin-Madison, Biostatistics & Medical Informatics, USA
  • Georgia Institute of Technology, USA (former)


According to our database1, Yin Li authored at least 76 papers between 2009 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
In the Eye of the Beholder: Gaze and Actions in First Person Video.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Weakly supervised foreground learning for weakly supervised localization and detection.
Pattern Recognit., May, 2023

Virtuoso: Energy- and Latency-aware Streamlining of Streaming Videos on Systems-on-Chips.
ACM Trans. Design Autom. Electr. Syst., 2023

NMS Threshold matters for Ego4D Moment Queries - 2nd place solution to the Ego4D Moment Queries Challenge 2023.
CoRR, 2023

Spike-Based Anytime Perception.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Learned Compressive Representations for Single-Photon 3D Imaging.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Eventful Transformers: Leveraging Temporal Redundancy in Vision Transformers.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Where a Strong Backbone Meets Strong Features - ActionFormer for Ego4D Moment Queries Challenge.
CoRR, 2022

A Simple Transformer-Based Model for Ego4D Natural Language Queries Challenge.
CoRR, 2022

Physics to the Rescue: Deep Non-line-of-sight Reconstruction for High-speed Imaging.
CoRR, 2022

Robust Scene Inference under Noise-Blur Dual Corruptions.
Proceedings of the IEEE International Conference on Computational Photography, 2022

LiteReconfig: cost and content aware reconfiguration of video object detection systems for mobile GPUs.
Proceedings of the EuroSys '22: Seventeenth European Conference on Computer Systems, Rennes, France, April 5, 2022

ActionFormer: Localizing Moments of Actions with Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

Egocentric Activity Recognition and Localization on a 3D Map.
Proceedings of the Computer Vision - ECCV 2022, 2022

3D Scene Inference from Transient Histograms.
Proceedings of the Computer Vision - ECCV 2022, 2022

Event Neural Networks.
Proceedings of the Computer Vision - ECCV 2022, 2022

RegionCLIP: Region-based Language-Image Pretraining.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Smartadapt: Multi-branch Object Detection Framework for Videos on Mobiles.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

3D Photo Stylization: Learning to Generate Stylized Novel Views from a Single Image.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Virtuoso: Video-based Intelligence for real-time tuning on SOCs.
CoRR, 2021

Weakly Supervised Foreground Learning for Weakly Supervised Localization and Detection.
CoRR, 2021

Benchmarking Video Object Detection Systems on Embedded Devices under Resource Contention.
Proceedings of the EMDL@MobiSys 2021: Proceedings of the 5th International Workshop on Embedded and Mobile Deep Learning, 2021

Learning to Generate Scene Graph from Natural Language Supervision.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

A Simple Baseline for Weakly-Supervised Scene Graph Generation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Dual-Stream Multiple Instance Learning Network for Whole Slide Image Classification With Self-Supervised Contrastive Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Nyströmformer: A Nyström-based Algorithm for Approximating Self-Attention.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
FingerTrak: Continuous 3D Hand Pose Tracking by Deep Learning Hand Silhouettes Captured by Miniature Thermal Cameras on Wrist.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2020

ApproxDet: content and contention-aware approximate object detection for mobiles.
Proceedings of the SenSys '20: The 18th ACM Conference on Embedded Networked Sensor Systems, 2020

Gradients as Features for Deep Representation Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Comprehensive Image Captioning via Scene Graph Decomposition.
Proceedings of the Computer Vision - ECCV 2020, 2020

Forecasting Human-Object Interaction: Joint Prediction of Motor Attention and Actions in First Person Video.
Proceedings of the Computer Vision - ECCV 2020, 2020

Interpretable and Accurate Fine-grained Recognition via Region Grouping.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Attention Distillation for Learning Video Representations.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

2019
Deep Crisp Boundaries: From Boundaries to Higher-Level Tasks.
IEEE Trans. Image Process., 2019

Focal Boundary Guided Salient Object Detection.
IEEE Trans. Image Process., 2019

Learning Two-Branch Neural Networks for Image-Text Matching Tasks.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Forecasting Human Object Interaction: Joint Prediction of Motor Attention and Egocentric Activity.
CoRR, 2019

Paying More Attention to Motion: Attention Distillation for Learning Video Representations.
CoRR, 2019

2018
Learning embodied models of actions from first person video.
PhD thesis, 2018

Adaptive Discrete Hypergraph Matching.
IEEE Trans. Cybern., 2018

Beyond Grids: Learning Graph Representations for Visual Recognition.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Learning to Grasp Without Seeing.
Proceedings of the 2018 International Symposium on Experimental Robotics, 2018

Densely Cascaded Shadow Detection Network via Deeply Supervised Parallel Fusion.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

In the Eye of Beholder: Joint Learning of Gaze and Actions in First Person Video.
Proceedings of the Computer Vision - ECCV 2018, 2018

Compositional Learning for Human Object Interaction.
Proceedings of the Computer Vision - ECCV 2018, 2018

3D-RCNN: Instance-Level 3D Object Reconstruction via Render-and-Compare.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Learning Two-Branch Neural Networks for Image-Text Matching Tasks.
CoRR, 2017

First-Person Action Decomposition and Zero-Shot Learning.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Paralinguistic Analysis of Children's Speech in Natural Environments.
Proceedings of the Mobile Health - Sensors, Analytic Methods, and Applications, 2017

2016
Learning Deep Structure-Preserving Image-Text Embeddings.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Unsupervised Learning of Edges.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Cardiac and Respiratory Parameter Estimation Using Head-mounted Motion-sensitive Sensors.
EAI Endorsed Trans. Pervasive Health Technol., 2015

Detecting bids for eye contact using a wearable camera.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Gaze-enabled egocentric video summarization via constrained submodular maximization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Delving into egocentric actions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Combining acoustic and visual features to detect laughter in adults' speech.
Proceedings of the Auditory-Visual Speech Processing, 2015

2014
BioGlass: Physiological parameter estimation using a head-mounted wearable device.
Proceedings of the 4th International Conference on Wireless Mobile Communication and Healthcare: "Transforming healthcare through innovations in mobile and wireless technologies", 2014

Graduated Consistency-Regularized Optimization for Multi-graph Matching.
Proceedings of the Computer Vision - ECCV 2014, 2014

Joint Semantic Segmentation and 3D Reconstruction from Monocular Video.
Proceedings of the Computer Vision - ECCV 2014, 2014

The Secrets of Salient Object Segmentation.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Learning to Predict Gaze in Egocentric Video.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Decoding Children's Social Behavior.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Detecting eye contact using wearable eye-tracking glasses.
Proceedings of the 2012 ACM Conference on Ubiquitous Computing, 2012

Learning to Recognize Daily Actions Using Gaze.
Proceedings of the Computer Vision - ECCV 2012, 2012

Learning sparse covariance patterns for natural scenes.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Hole-Filling by Rank Sparsity Tensor Decomposition for Medical Imaging.
IEICE Trans. Inf. Syst., 2011

Robust facial feature points extraction in color images.
Eng. Appl. Artif. Intell., 2011

2010
Exploration into Single Image Super-Resolution via Self Similarity by Sparse Representation.
IEICE Trans. Inf. Syst., 2010

Visual saliency detection via rank-sparsity decomposition.
Proceedings of the International Conference on Image Processing, 2010

Tensor error correction for corrupted values in visual data.
Proceedings of the International Conference on Image Processing, 2010

Optimum Subspace Learning and Error Correction for Tensors.
Proceedings of the Computer Vision, 2010

2009
Incremental sparse saliency detection.
Proceedings of the International Conference on Image Processing, 2009

An Accelerated Human Motion Tracking System Based on Voxel Reconstruction under Complex Environments.
Proceedings of the Computer Vision, 2009

Visual Saliency Based on Conditional Entropy.
Proceedings of the Computer Vision, 2009


  Loading...