Yan Lu

Orcid: 0000-0001-5383-6424

Affiliations:
  • Microsoft Research Asia, Beijing, China
  • Harbin Institute of Technology, China (PhD 2003)


According to our database1, Yan Lu authored at least 149 papers between 2001 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Exploring Neighbor Correspondence Matching for Multiple-hypotheses Video Frame Synthesis.
ACM Trans. Multim. Comput. Commun. Appl., April, 2024

Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

A Universal Optimization Framework for Learning-based Image Codec.
ACM Trans. Multim. Comput. Commun. Appl., January, 2024

Uncertainty-Aware Deep Video Compression With Ensembles.
IEEE Trans. Multim., 2024

2023
Micro-Doppler Effect and Sparse Representation Analysis of Underwater Targets.
Sensors, October, 2023

Temporal Context Mining for Learned Video Compression.
IEEE Trans. Multim., 2023

Video Instance Segmentation by Instance Flow Assembly.
IEEE Trans. Multim., 2023

Latent-Domain Predictive Neural Speech Coding.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Low-latency Speech Enhancement via Speech Token Generation.
CoRR, 2023

Rethinking Audiovisual Segmentation with Semantic Quantization and Decomposition.
CoRR, 2023

Time-Variance Aware Real-Time Speech Enhancement.
CoRR, 2023

Disentangle Propagation and Restoration for Efficient Video Recovery.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Masked Audio Modeling with CLAP and Multi-Objective Learning.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

EVC: Towards Real-Time Neural Image Compression with Mask Decay.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Real-Time Speech Enhancement with Dynamic Attention Span.
Proceedings of the IEEE International Conference on Acoustics, 2023

Contrast-PLC: Contrastive Learning for Packet Loss Concealment.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improving Speech Enhancement via Event-Based Query.
Proceedings of the IEEE International Conference on Acoustics, 2023

Dasformer: Deep Alternating Spectrogram Transformer For Multi/Single-Channel Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Disentangled Feature Learning for Real-Time Neural Speech Coding.
Proceedings of the IEEE International Conference on Acoustics, 2023

Two-shot Video Object Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Neural Video Compression with Diverse Contexts.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

High-Fidelity and Freely Controllable Talking Head Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Time-Variance Aware Dynamic Kernel Generation for Real-Time Acoustic Echo Cancellation.
IEEE Signal Process. Lett., 2022

MonoGRNet: A General Framework for Monocular 3D Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Disentangled Feature Learning for Real-Time Neural Speech Coding.
CoRR, 2022

Estimating Neural Reflectance Field from Radiance Field using Tree Structures.
CoRR, 2022

Predictive Neural Speech Coding.
CoRR, 2022

Online Video Instance Segmentation via Robust Context Fusion.
CoRR, 2022

R^2VOS: Robust Referring Video Object Segmentation via Relational Multimodal Cycle Consistency.
CoRR, 2022

End-to-End Neural Audio Coding for Real-Time Communications.
CoRR, 2022

Alignment-guided Temporal Attention for Video Action Recognition.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards Robust Video Object Segmentation with Adaptive Object Calibration.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Hybrid Spatial-Temporal Entropy Modelling for Neural Video Compression.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Neighbor Correspondence Matching for Flow-based Video Frame Synthesis.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Towards Error-Resilient Neural Speech Coding.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Multi-Modal Multi-Correlation Learning for Audio-Visual Speech Separation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Cross-Scale Vector Quantization for Scalable Neural Speech Coding.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

End-to-End Neural Speech Coding for Real-Time Communications.
Proceedings of the IEEE International Conference on Acoustics, 2022

Neural Capture of Animatable 3D Human from Monocular Video.
Proceedings of the Computer Vision - ECCV 2022, 2022

Semantic-aligned Fusion Transformer for One-shot Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Rethinking Minimal Sufficient Representation in Contrastive Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Neural Compression-Based Feature Learning for Video Restoration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Reliable Propagation-Correction Modulation for Video Object Segmentation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Hybrid Instance-Aware Temporal Fusion for Online Video Instance Segmentation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Affinity Derivation for Accurate Instance Segmentation.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Residual Refinement Network with Attribute Guidance for Precise Saliency Detection.
ACM Trans. Multim. Comput. Commun. Appl., 2021

A Deep Reinforcement Learning Approach to Multiple Streams' Joint Bitrate Allocation.
IEEE Trans. Circuits Syst. Video Technol., 2021

Deep Contextual Video Compression.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Self-Supervised Video Representation Learning with Meta-Contrastive Network.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Phoneme-Based Distribution Regularization for Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2021

A Universal Encoder Rate Distortion Optimization Framework for Learned Compression.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

SSAN: Separable Self-Attention Network for Video Representation Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Interactive Speech and Noise Modeling for Speech Enhancement.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Weakly-supervised Temporal Action Localization by Uncertainty Modeling.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Single-stage Instance Segmentation.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Background Modeling via Uncertainty Estimation for Weakly-supervised Action Localization.
CoRR, 2020

RT-VENet: A Convolutional Network for Real-time Video Enhancement.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Weakly Supervised 3D Object Detection from Point Clouds.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

2019
A Hardware-Accelerated System for High Resolution Real-Time Screen Sharing.
IEEE Trans. Circuits Syst. Video Technol., 2019

Reinforcement learning for bandwidth estimation and congestion control in real-time communications.
CoRR, 2019

IF-TTN: Information Fused Temporal Transformation Network for Video Action Recognition.
CoRR, 2019

In Defense of the Classification Loss for Person Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Triangulation Learning Network: From Monocular to Stereo 3D Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

MVPNet: Multi-View Point Regression Networks for 3D Object Reconstruction from A Single Image.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

MonoGRNet: A Geometric Reasoning Network for Monocular 3D Object Localization.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Parallel In-Loop Filtering in HEVC Encoder on GPU.
IEEE Trans. Consumer Electron., 2018

Weakly Supervised Local Attention Network for Fine-Grained Visual Classification.
CoRR, 2018

Affinity Derivation and Graph Merge for Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2018, 2018

The Sixth Visual Object Tracking VOT2018 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Intra Block Copy for Screen Content in the Emerging AV1 Video Codec.
Proceedings of the 2018 Data Compression Conference, 2018

Feature Selective Networks for Object Detection.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Delay-Rate-Distortion Optimization for Cloud Gaming With Hybrid Streaming.
IEEE Trans. Circuits Syst. Video Technol., 2017

2016
A High-Fidelity and Low-Interaction-Delay Screen Sharing System.
ACM Trans. Multim. Comput. Commun. Appl., 2016

GPU-based optimization for sample adaptive offset in HEVC.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

2015
Layered Compression for High-Precision Depth Data.
IEEE Trans. Image Process., 2015

Introduction to the Special Section on Visual Computing in the Cloud: Cloud Gaming and Virtualization.
IEEE Trans. Circuits Syst. Video Technol., 2015

Region-of-interest based coding scheme for synthesized video.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

2014
An adaptive multi-layer low-latency transmission scheme for H.264 based screen sharing system.
Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014

High frame rate screen video coding for screen sharing applications.
Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014

A low latency cloud gaming system using edge preserved image homography.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

A novel cloud gaming framework using joint video and graphics streaming.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Content adaptive screen image scaling.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013
Kinect-Like Depth Data Compression.
IEEE Trans. Multim., 2013

A Low-Complexity Screen Compression Scheme for Interactive Screen Sharing.
IEEE Trans. Circuits Syst. Video Technol., 2013

Depth sensor assisted real-time gesture recognition for interactive presentation.
J. Vis. Commun. Image Represent., 2013

Rate-distortion optimized block classification and bit allocation in screen video compression.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Layered screen video coding leveraging hardware video codec.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Effective hand segmentation and gesture recognition for browsing web pages on a large screen.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Arbitrary-sized motion detection in screen video coding.
Proceedings of the IEEE International Conference on Image Processing, 2013

2012
A low-complexity screen compression scheme.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

Layered compression for high dynamic range depth.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

Content-aware layered compound video compression.
Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

A low-latency transmission scheme for interactive screen sharing.
Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

Texture-assisted Kinect depth inpainting.
Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

Kinect-like depth denoising.
Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

Kinect-Like Depth Compression with 2D+T Prediction.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 2012

2011
Virtualized Screen: A Third Element for Cloud-Mobile Convergence.
IEEE Multim., 2011

Browser-friendly hybrid codec for compound image compression.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2011), 2011

2010
High-Dynamic-Range Texture Compression for Rendering Systems of Different Capacities.
IEEE Trans. Vis. Comput. Graph., 2010

ReDi: an interactive virtual display system for ubiquitous devices.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

A proxy-based mobile web browser.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Low-cost realtime screen sharing to multiple clients.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

2009
Complexity-Constrained H.264 Video Encoding.
IEEE Trans. Circuits Syst. Video Technol., 2009

A High-Performance Remote Computing Platform.
Proceedings of the Seventh Annual IEEE International Conference on Pervasive Computing and Communications, 2009

Real-time screen image scaling and its GPU acceleration.
Proceedings of the International Conference on Image Processing, 2009

Level embedded medical image compression based on value of interest.
Proceedings of the International Conference on Image Processing, 2009

2008
Efficient Multiple-Description Image Coding Using Directional Lifting-Based Transform.
IEEE Trans. Circuits Syst. Video Technol., 2008

Wyner-Ziv-Based Multiview Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2008

Wyner-Ziv Switching Scheme for Multiple Bit-Rate Video Streaming.
IEEE Trans. Circuits Syst. Video Technol., 2008

B-picture coding in AVS video compression standard.
Signal Process. Image Commun., 2008

Three-tiered network model for image hallucination.
Proceedings of the International Conference on Image Processing, 2008

DHTC: An Effective DXTC-based HDR Texture Compression Scheme.
Proceedings of the EUROGRAPHICS/ACM SIGGRAPH Conference on Graphics Hardware 2008, 2008

2007
Joint Source-Channel Rate-Distortion Optimization for H.264 Video Coding Over Error-Prone Networks.
IEEE Trans. Multim., 2007

Real-time video coding under power constraint based on H.264 codec.
Proceedings of the Visual Communications and Image Processing 2007, 2007

Rate-distortion optimized color quantization for compound image compression.
Proceedings of the Visual Communications and Image Processing 2007, 2007

Distributed Video Coding with Trellis Coded Quantization.
Proceedings of the Advances in Multimedia Modeling, 2007

Distributed Video Coding with Spatial Correlation Exploited Only at the Decoder.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

Enable Efficient Compound Image Compression in H.264/AVC Intra Coding.
Proceedings of the International Conference on Image Processing, 2007

2006
4-D Wavelet-Based Multiview Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2006

Adaptive rate control for H.264.
J. Vis. Commun. Image Represent., 2006

Distributed video coding using wavelet.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

Bit-Stream Switching in Multiple Bit-Rate Video Streaming using Wyner-Ziv Coding.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Practical Wyner-Ziv Switching Scheme for Multiple Bit-Rate Video Streaming.
Proceedings of the International Conference on Image Processing, 2006

Wyner-Ziv Video Coding Based on Set Partitioning in Hierarchical Tree.
Proceedings of the International Conference on Image Processing, 2006

Joint Power-Distortion Optimization on Devices with MPEG-4 AVC/H.264 Codec.
Proceedings of IEEE International Conference on Communications, 2006

2005
Rate-distortion analysis for H.264/AVC video coding and its application to rate control.
IEEE Trans. Circuits Syst. Video Technol., 2005

Directional Lifting-Based Wavelet Transform for Multiple Description Image Coding with Quincunx Segmentation.
Proceedings of the Advances in Multimedia Information Processing, 2005

Scalable multiview video coding using wavelet.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Viewpoint switching in multiview video streaming.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

2004
Optimum End-to-End Distortion Estimation for Error Resilient Video Coding.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

A Study on the Quantization Scheme in H.264/AVC and Its Application to Rate Control.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Enhanced direct mode coding for bi-predictive pictures.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

Multiple modes intra-prediction in intra coding.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Context-based 2D-VLC for video coding.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Improved error concealment algorithms based on H.264/AVC non-normative decoder.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

New bi-prediction techniques for B pictures coding.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Mode mapping method for h.264/avc spatial downscaling transcoding.
Proceedings of the 2004 International Conference on Image Processing, 2004

Error resilience video coding in H.264 encoder with potential distortion tracking.
Proceedings of the 2004 International Conference on Image Processing, 2004

New scaling technique for direct mode coding in B pictures.
Proceedings of the 2004 International Conference on Image Processing, 2004

2003
Efficient background video coding with static sprite generation and arbitrary-shape spatial prediction techniques.
IEEE Trans. Circuits Syst. Video Technol., 2003

Rate control for advance video coding (AVC) standard.
Proceedings of the 2003 International Symposium on Circuits and Systems, 2003

Latest arrival time leaky bucket for HRD constrained video coding.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

A novel coefficient scanning scheme for directional spatial prediction-based image compression.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Rate control for JVT video coding scheme with HRD considerations.
Proceedings of the 2003 International Conference on Image Processing, 2003

2002
High efficient sprite coding with directional spatial prediction.
Proceedings of the 2002 International Conference on Image Processing, 2002

2001
Fast and Robust Sprite Generation for MPEG-4 Video Coding.
Proceedings of the Advances in Multimedia Information Processing, 2001

Sprite generation for frame-based video coding.
Proceedings of the 2001 International Conference on Image Processing, 2001


  Loading...