Yifei Huang

Orcid: 0000-0001-8067-6227

According to our database1, Yifei Huang authored at least 85 papers between 2009 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World.
CoRR, 2024

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding.
CoRR, 2024

Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding.
CoRR, 2024

FineBio: A Fine-Grained Video Dataset of Biological Experiments with Hierarchical Annotation.
CoRR, 2024

Retrieval-Augmented Egocentric Video Captioning.
CoRR, 2024

2023
Predicting central lymph node metastasis in patients with papillary thyroid carcinoma based on ultrasound radiomic and morphological features analysis.
BMC Medical Imaging, December, 2023

Explainable Program Synthesis by Localizing Specifications.
Proc. ACM Program. Lang., October, 2023

High-order Runge-Kutta structure-preserving methods for the coupled nonlinear Schrödinger-KdV equations.
Math. Comput. Simul., June, 2023

MoVQA: A Benchmark of Versatile Question-Answering for Long-Form Movie Understanding.
CoRR, 2023

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives.
CoRR, 2023

Memory-and-Anticipation Transformer for Online Action Understanding.
CoRR, 2023

VideoLLM: Modeling Video Sequence with Large Language Models.
CoRR, 2023

Fine-grained Affordance Annotation for Egocentric Hand-Object Interaction Videos.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Enhancing Visual Understanding by Removing Dithering with Global and Self-Conditioned Transformation.
Proceedings of the 16th International Symposium on Visual Information Communication and Interaction, 2023

3D Segmenter: 3D Transformer based Semantic Segmentation via 2D Panoramic Distillation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Memory-and-Anticipation Transformer for Online Action Understanding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Pretraining Language Models with Text-Attributed Heterogeneous Graphs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Weakly Supervised Temporal Sentence Grounding with Uncertainty-Guided Self-training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

First Bite/Chew: distinguish different types of food by first biting/chewing and the corresponding hand movement.
Proceedings of the Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

Proposal-based Temporal Action Localization with Point-level Supervision.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

First Bite/Chew: distinguish typical allergic food by two IMUs.
Proceedings of the Augmented Humans International Conference 2023, 2023

2022
Spatio-Temporal Perturbations for Video Attribution.
IEEE Trans. Circuits Syst. Video Technol., 2022

InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges.
CoRR, 2022

Precise Affordance Annotation for Egocentric Action Video Datasets.
CoRR, 2022

Transfer learning meets sales engagement email classification: Evaluation, analysis, and strategies.
Concurr. Comput. Pract. Exp., 2022

Seeing our Blind Spots: Smart Glasses-based Simulation to Increase Design Students' Awareness of Visual Impairment.
Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology, 2022

Inner self drawing machine.
Proceedings of the SIGGRAPH Asia 2022 Art Gallery, 2022

GazeSync: Eye Movement Transfer Using an Optical Eye Tracker and Monochrome Liquid Crystal Displays.
Proceedings of the IUI 2022: 27th International Conference on Intelligent User Interfaces, Helsinki, Finland, March 22 - 25, 2022, 2022

Dual-Path Geometry-Aware Network for Semantic Segmentation of High-Resolution Aerial Images.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Le-BEiT: A Local-Enhanced Self-Supervised Transformer for Semantic Segmentation of High Resolution Remote Sensing Images.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Compound Prototype Matching for Few-Shot Action Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

CLRNet: Cross Layer Refinement Network for Lane Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Interact before Align: Leveraging Cross-Modal Knowledge for Domain Adaptive Action Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022


TopsBot: Facilitating Mental Health Peer Support with a Topic-based Writing Assistant in Instant Messaging Apps.
Proceedings of the Tenth International Symposium of Chinese CHI, 2022

2021
Learning Representations for High-Dynamic-Range Image Color Transfer in a Self-Supervised Way.
IEEE Trans. Multim., 2021

Ego4D: Around the World in 3, 000 Hours of Egocentric Video.
CoRR, 2021

Spatio-Temporal Perturbations for Video Attribution.
CoRR, 2021

EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2021: Team M3EM Technical Report.
CoRR, 2021

Towards Visually Explaining Video Understanding Networks with Perturbation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Adversarial Robustness of Stabilized Neural ODE Might be from Obfuscated Gradients.
Proceedings of the Mathematical and Scientific Machine Learning, 2021

Precise Multi-Modal In-Hand Pose Estimation using Low-Precision Sensors for Robotic Assembly.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

FACIAL: Synthesizing Dynamic Talking Face with Implicit Attribute Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Goal-Oriented Gaze Estimation for Zero-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Stacked Temporal Attention: Improving First-person Action Recognition by Emphasizing Discriminative Clips.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Leveraging Human Selective Attention for Medical Image Analysis with Limited Training Data.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Mutual Context Network for Jointly Estimating Egocentric Gaze and Action.
IEEE Trans. Image Process., 2020

An Ego-Vision System for Discovering Human Joint Attention.
IEEE Trans. Hum. Mach. Syst., 2020

Learn to Extract Building Outline from Misaligned Annotation through Nearest Feature Selector.
Remote. Sens., 2020

Handling Missing Sensors in Topology-Aware IoT Applications with Gated Graph Neural Network.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2020

Rethinking Breiman's Dilemma in Neural Networks: Phase Transitions of Margin Dynamics.
Frontiers Appl. Math. Stat., 2020

Adversarial Robustness of Stabilized NeuralODEs Might be from Obfuscated Gradients.
CoRR, 2020

A Comprehensive Study on Visual Explanations for Spatio-temporal Networks.
CoRR, 2020

DeSmoothGAN: Recovering Details of Smoothed Images via Spatial Feature-wise Transformation and Full Attention.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Learn to Recover Visible Color for Video Surveillance in a Day.
Proceedings of the Computer Vision - ECCV 2020, 2020

Improving Action Segmentation via Graph-Based Temporal Reasoning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Situation Awareness and Information Fusion in Sales and Customer Engagement: A Paradigm Shift.
Proceedings of the IEEE Conference on Cognitive and Computational Aspects of Situation Management, 2020

2019
GBRTVis: online analysis of gradient boosting regression tree.
J. Vis., 2019

Discovery of Bias and Strategic Behavior in Crowdsourced Performance Assessment.
CoRR, 2019

Mutual Context Network for Jointly Estimating Egocentric Gaze and Actions.
CoRR, 2019

Manipulation-Skill Assessment from Videos with Spatial Attention Network.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

An Evaluation of Transfer Learning for Classifying Sales Engagement Emails at Large Scale.
Proceedings of the 19th IEEE/ACM International Symposium on Cluster, 2019

2018
Differentiable Fine-grained Quantization for Deep Neural Network Compression.
CoRR, 2018

On Breiman's Dilemma in Neural Networks: Phase Transitions of Margin Dynamics.
CoRR, 2018

Translucent Image Recoloring through Homography Estimation.
Comput. Graph. Forum, 2018

Predicting Gaze in Egocentric Video by Learning Task-Dependent Attention Transition.
Proceedings of the Computer Vision - ECCV 2018, 2018

Semantic Aware Attention Based Deep Object Co-segmentation.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
A Proposed Network Balance Index for Heterogeneous Networks.
IEEE Wirel. Commun. Lett., 2017

Discrete Wigner Function Derivation of the Aaronson-Gottesman Tableau Algorithm.
Entropy, 2017

Base Station Preference Association with Network Dynamics.
Proceedings of the 85th IEEE Vehicular Technology Conference, 2017

Temporal Localization and Spatial Segmentation of Joint Attention in Multiple First-Person Videos.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

2016
Mode Selection, Resource Allocation, and Power Control for D2D-Enabled Two-Tier Cellular Network.
IEEE Trans. Commun., 2016

Effects of load dependent dynamic biasing and association order for cell range expansion.
Proceedings of the 10th International Conference on Signal Processing and Communication Systems, 2016

Cross-region traffic prediction for China on OpenStreetMap.
Proceedings of the 9th ACM SIGSPATIAL International Workshop on Computational Transportation Science, 2016

Graphical generalization of power control in multiuser interference channels.
Proceedings of the 2016 Australian Communications Theory Workshop, AusCTW 2016, Melbourne, 2016

2015
Interference Suppression Using Generalized Inverse Precoder for Downlink Heterogeneous Networks.
IEEE Wirel. Commun. Lett., 2015

Interference nulling for offloaded heterogeneous users using macro generalized inverse precoder.
Proceedings of the 15th International Symposium on Communications and Information Technologies, 2015

2013
Detection of Temporally Correlated Signals over Multipath Fading Channels.
IEEE Trans. Wirel. Commun., 2013

2012
Spectrum sensing over frequency-selective fading channel with tap and spatial correlations.
Proceedings of the 23rd IEEE International Symposium on Personal, 2012

Optimal spectrum sensing over multipath channels.
Proceedings of IEEE International Conference on Communications, 2012

2011
Clashes of femtocells with limited number of location area codes.
Proceedings of the 6th International ICST Conference on Communications and Networking in China, 2011

2010
Hierarchical Cooperative Data Caching for Wireless Mesh Networks.
Proceedings of the IEEE/IFIP 8th International Conference on Embedded and Ubiquitous Computing, 2010

2009
Decoding Movement Trajectories Through a T-Maze Using Point Process Filters Applied to Place Field Data from Rat Hippocampal Region CA1.
Neural Comput., 2009


  Loading...