Matthias Grundmann

Affiliations:
  • Google Research, Mountain View, CA, USA
  • Georgia Institute of Technology, Atlanta, GA, USA (former, PhD 2013)


According to our database, Matthias Grundmann authored at least 39 papers between 2008 and 2025.

Bibliography

2025
Scaling On-Device GPU Inference for Large Generative Models.
CoRR, May, 2025

Scaling On-Device GPU Inference for Large Generative Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

2024
Binaural Angular Separation Network.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

StreamVC: Real-Time Low-Latency Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Blendshapes GHUM: Real-time Monocular Facial Blendshape Prediction.
CoRR, 2023

Semi-Implicit Denoising Diffusion Models (SIDDMs).
Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On-device Real-time Custom Hand Gesture Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Authentic Face Restoration with Iterative Diffusion Models and Beyond.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Guided Speech Enhancement Network.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

BlazeStyleGAN: A Real-Time On-Device StyleGAN.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Efficient Heterogeneous Video Segmentation at the Edge.
CoRR, 2022

BlazePose GHUM Holistic: Real-time 3D Human Landmarks and Pose Estimation.
CoRR, 2022

2021
On-device Real-time Hand Gesture Recognition.
CoRR, 2021

Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild With Pose Annotations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Instant 3D Object Tracking with Applications in Augmented Reality.
CoRR, 2020

Attention Mesh: High-fidelity Face Mesh Prediction in Real-time.
CoRR, 2020

MediaPipe Hands: On-device Real-time Hand Tracking.
CoRR, 2020

BlazePose: On-device Real-time Body Pose tracking.
CoRR, 2020

MobilePose: Real-Time Pose Estimation for Unseen Objects with Weak Shape Supervision.
CoRR, 2020

2019
Instant Motion Tracking and Its Applications to Augmented Reality.
CoRR, 2019

Real-time Facial Surface Geometry from Monocular Video on Mobile GPUs.
CoRR, 2019

BlazeFace: Sub-millisecond Neural Face Detection on Mobile GPUs.
CoRR, 2019

On-Device Neural Net Inference with Mobile GPUs.
CoRR, 2019

MediaPipe: A Framework for Building Perception Pipelines.
CoRR, 2019

2015
Finding Temporally Consistent Occlusion Boundaries in Videos Using Geometric Context.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

2013
Computational video: post-processing methods for stabilization, retargeting and segmentation.
PhD thesis, 2013

Post-processing approach for radiometric self-calibration of video.
Proceedings of the IEEE International Conference on Computational Photography, 2013

Geometric Context from Videos.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Calibration-free rolling shutter removal.
Proceedings of the 2012 IEEE International Conference on Computational Photography, 2012

Weakly Supervised Learning of Object Segmentations from Web-Scale Video.
Proceedings of Computer Vision - ECCV 2012: Workshops and Demonstrations, 2012

2011
Auto-directed video stabilization with robust L1 optimal camera paths.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Motion fields to predict play evolution in dynamic sport scenes.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Player localization using multiple static cameras for sports visualization.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Efficient hierarchical graph-based video segmentation.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Discontinuous seam-carving for video retargeting.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2008
Real-Time Object Detection and Tracking for Industrial Applications.
Proceedings of the Third International Conference on Computer Vision Theory and Applications (VISAPP 2008), Funchal, Madeira, Portugal, January 22-25, 2008

3D Shape Context and Distance Transform for action recognition.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

