Thomas Sikora

Orcid: 0000-0001-5695-9614

Affiliations:
  • Technical University of Berlin, Department of Mathematics, Germany


According to our database1, Thomas Sikora authored at least 273 papers between 1985 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Denoising OCT Images Using Steered Mixture of Experts with Multi-Model Inference.
CoRR, 2024

2023
Segmentation-based Initialization for Steered Mixture of Experts.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

Edge-Aware Autoencoder Design for Real-Time Mixture-of-Experts Image Compression.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2023

Steered-Mixture-of-Experts Regression for Image Denoising with Multi-Model Inference.
Proceedings of the 31st European Signal Processing Conference, 2023

A software test bed for sharing and evaluating color transfer algorithms for images and 3D objects.
Proceedings of the 20th ACM SIGGRAPH European Conference on Visual Media Production, 2023

2022
Accelerated Deep Lossless Image Coding with Unified Paralleleized GPU Coding Architecture.
Proceedings of the Picture Coding Symposium, 2022

Bottleneck Detection in Crowded Video Scenes Utilizing Lagrangian Motion Analysis Via Density and Arc Length Measures.
Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2022

Utilizing Crowd Collectiveness to Enhance Bottleneck Detection Based on the Lagrangian Framework.
Proceedings of the 18th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2022

2021
Color Transfer of 3D Point Clouds For XR Applications.
Proceedings of the International Conference on 3D Immersion, 2021

2020
Steered Mixture-of-Experts for Light Field Images and Video: Representation and Coding.
IEEE Trans. Multim., 2020

Edge Oriented Hierarchical Motion Estimation For Video Coding.
Proceedings of the IEEE International Conference on Image Processing, 2020

2019
Quantized and Regularized Optimization for Coding Images Using Steered Mixtures-of-Experts.
Proceedings of the Data Compression Conference, 2019

Video-based Bottleneck Detection utilizing Lagrangian Dynamics in Crowded Scenes.
Proceedings of the 16th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2019

2018
Progressive Modeling of Steered Mixture-of-Experts for Light Field Video Approximation.
Proceedings of the 2018 Picture Coding Symposium, 2018

An MSE Approach For Training And Coding Steered Mixtures Of Experts.
Proceedings of the 2018 Picture Coding Symposium, 2018

Restricted Boltzmann Machine Image Compression.
Proceedings of the 2018 Picture Coding Symposium, 2018

Deep Active Learning for In Situ Plankton Classification.
Proceedings of the Pattern Recognition and Information Forensics, 2018

Hierarchical Learning of Sparse Image Representations Using Steered Mixture-of-Experts.
Proceedings of the 2018 IEEE International Conference on Multimedia & Expo Workshops, 2018

Scale-Adaptive Real-Time Crowd Detection and Counting for Drone Images.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Regularized Gradient Descent Training of Steered Mixture of Experts for Sparse Image Representation.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Deepstereobrush: Interactive Depth Map Creation.
Proceedings of the International Conference on 3D Immersion, 2018

Steered Mixture-of-Experts Approximation of Spherical Image Data.
Proceedings of the 26th European Signal Processing Conference, 2018


Optical Flow Dataset and Benchmark for Visual Crowd Analysis.
Proceedings of the 15th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2018


Extending IOU Based Multi-Object Tracking by Visual Information.
Proceedings of the 15th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2018

2017
Crowd Violence Detection Using Global Motion-Compensated Lagrangian Features and Scale-Sensitive Video-Level Representation.
IEEE Trans. Inf. Forensics Secur., 2017

Steered mixture-of-experts for light field coding, depth estimation, and processing.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

A consistent two-level metric for evaluation of automated abandoned object detection methods.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Hyper-parameter optimization for convolutional neural network committees based on evolutionary algorithms.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Color prediction in image coding using Steered Mixture-of-Experts.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017


Sequential sensor fusion combining probability hypothesis density and kernelized correlation filters for multi-object tracking in video data.
Proceedings of the 14th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2017

Assessing post-detection filters for a generic pedestrian detector in a tracking-by-detection scheme.
Proceedings of the 14th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2017

High-Speed tracking-by-detection without using image information.
Proceedings of the 14th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2017

Motion-aware video quality assessment.
Proceedings of the 51st Asilomar Conference on Signals, Systems, and Computers, 2017

2016
Video representation and coding using a sparse steered mixture-of-experts network.
Proceedings of the 2016 Picture Coding Symposium, 2016

Robust local optical flow: Dense motion vector field interpolation.
Proceedings of the 2016 Picture Coding Symposium, 2016

A universal image coding approach using sparse steered Mixture-of-Experts regression.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Robust local optical flow: Long-range motions and varying illuminations.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Training a convolutional neural network for multi-class object detection using solely virtual world data.
Proceedings of the 13th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2016

2015
Evaluation of the wavelet image two-line coder: A low complexity scheme for image compression.
Signal Process. Image Commun., 2015

Spatio-temporal crowd density model in a human detection and tracking framework.
Signal Process. Image Commun., 2015

Lossless image compression based on Kernel Least Mean Squares.
Proceedings of the 2015 Picture Coding Symposium, 2015

Motion modeling for motion vector coding in HEVC.
Proceedings of the 2015 Picture Coding Symposium, 2015

A novel Kernel PCA/KLT approach for transform coding of waveforms.
Proceedings of the 2015 Picture Coding Symposium, 2015

Image guided phase unwrapping for real-time 3D-scanning.
Proceedings of the 2015 Picture Coding Symposium, 2015

A local feature based on lagrangian measures for violent video classification.
Proceedings of the 6th International Conference on Imaging for Crime Detection and Prevention, 2015

An Ontology-Based Architecture for Storage Management.
Proceedings of the Formal Ontologies Meet Industry - 7th International Workshop, 2015

Georeferencing Flickr Resources Based on Multimodal Features.
Proceedings of the Multimodal Location Estimation of Videos and Images, 2015

Human Versus Machine: Establishing a Human Baseline for Multimodal Location Estimation.
Proceedings of the Multimodal Location Estimation of Videos and Images, 2015

2014
Creating Experts From the Crowd: Techniques for Finding Workers for Difficult Tasks.
IEEE Trans. Multim., 2014

Adaptively Splitted GMM With Feedback Improvement for the Task of Background Subtraction.
IEEE Trans. Inf. Forensics Secur., 2014

Real-time generation of multi-view video plus depth content using mixed narrow and wide baseline.
J. Vis. Commun. Image Represent., 2014

Multiview Point Cloud Filtering for Spatiotemporal Consistency.
Proceedings of the VISAPP 2014, 2014

A Multi-configuration Part-based Person Detector.
Proceedings of the SIGMAP 2014, 2014


TUB @ MediaEval 2014 Visual Privacy Task: Reversible Scrambling on Foreground Masks.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Lossy image coding in the pixel domain using a sparse steering kernel synthesis approach.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Crowd analysis in non-static cameras using feature tracking and multi-person density.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Cross based robust local optical flow.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Person Re-identification Using Region Covariance in a Multi-feature Approach.
Proceedings of the Image Analysis and Recognition - 11th International Conference, 2014

Theoretical Considerations Concerning Pixelwise Temporal Filtering.
Proceedings of the Data Compression Conference, 2014

Line-preserving hole-filling for 2D-to-3D conversion.
Proceedings of the 11th European Conference on Visual Media Production, 2014

2013
Adaptive Image Warping for Hole Prevention in 3D View Synthesis.
IEEE Trans. Image Process., 2013

Monte-Carlo-Based Parametric Motion Estimation Using a Hybrid Model Approach.
IEEE Trans. Circuits Syst. Video Technol., 2013

Image Adaptation and Dynamic Browsing Based on Two-Layer Saliency Combination.
IEEE Trans. Broadcast., 2013

Motion-based object segmentation using hysteresis and bidirectional inter-frame change detection in sequences with moving camera.
Signal Process. Image Commun., 2013

Image Guided Cost Aggregation for Hierarchical Depth Map Fusion.
Proceedings of the VISAPP 2013, 2013

Blip10000: a social video dataset containing SPUG content for tagging and retrieval.
Proceedings of the Multimedia Systems Conference 2013, 2013

DCT-based features for categorisation of social media in compressed domain.
Proceedings of the 15th IEEE International Workshop on Multimedia Signal Processing, 2013

A novel fusion method for integrating multiple modalities and knowledge for multimodal location estimation.
Proceedings of the 2nd ACM international workshop on Geotagging and its applications in multimedia, 2013

Human vs machine: establishing a human baseline for multimodal location estimation.
Proceedings of the ACM Multimedia Conference, 2013

TUB @ MediaEval 2013 Visual Privacy Task: Reversible Scrambling with colour-preservative Characteristic.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

MediaEval 2013 Visual Privacy Task: Using Adaptive Edge Detection for Privacy in Surveillance Videos.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

A dynamic model buffer for parametric motion vector prediction in random-access coding scenarios.
Proceedings of the IEEE International Conference on Image Processing, 2013

Robust local optical flow estimation using bilinear equations for sparse motion estimation.
Proceedings of the IEEE International Conference on Image Processing, 2013

Consensus-based multiview texturing and depth-map completion.
Proceedings of the IEEE International Conference on Image Processing, 2013

Adpative dense vector field interpolation for temporal filtering.
Proceedings of the IEEE International Conference on Image Processing, 2013

A decentralized privacy-sensitive video surveillance framework.
Proceedings of the 18th International Conference on Digital Signal Processing, 2013

Crowd context-dependent privacy protection filters.
Proceedings of the 18th International Conference on Digital Signal Processing, 2013

Video indexing and summarization as a tool for privacy protection.
Proceedings of the 18th International Conference on Digital Signal Processing, 2013

A Parametric Merge Candidate for High Efficiency Video Coding.
Proceedings of the 2013 Data Compression Conference, 2013

Efficient Quadtree Compression for Temporal Trajectory Filtering.
Proceedings of the 2013 Data Compression Conference, 2013

Multiple cue indexing and summarization of surveillance video.
Proceedings of the 10th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2013

Enhancing human detection using crowd density measures and an adaptive correction filter.
Proceedings of the 10th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2013

2012
Robust Local Optical Flow for Feature Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2012

Adaptive Global Motion Temporal Filtering for High Efficiency Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2012

Adaptive Temporal Trajectory Filtering for Video Compression.
IEEE Trans. Circuits Syst. Video Technol., 2012

Lossy parametric motion model compression for global motion temporal filtering.
Proceedings of the 2012 Picture Coding Symposium, 2012

Parametric motion vector prediction for hybrid video coding.
Proceedings of the 2012 Picture Coding Symposium, 2012

Adaptive global motion temporal filtering.
Proceedings of the 2012 Picture Coding Symposium, 2012

Weighted temporal long trajectory filtering for video compression.
Proceedings of the 2012 Picture Coding Symposium, 2012

A Lagrangian framework for video analytics.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012

Human action recognition using Lagrangian descriptors.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012

Pushing the limits of mechanical turk: qualifying the crowd for video geo-location.
Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimedia, 2012

Cross-modal categorisation of user-generated video sequences.
Proceedings of the International Conference on Multimedia Retrieval, 2012

TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visual)-Words Approaches.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

How Spatial Segmentation improves the Multimodal Geo-Tagging.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

Performance Evaluation of Feature Detection for Local Optical Flow Tracking.
Proceedings of the ICPRAM 2012, 2012

Multi-camera rectification using linearized trifocal tensor.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Quadtree-based temporal trajectory filtering.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Multimodal geo-tagging in social media websites using hierarchical spatial segmentation.
Proceedings of the 5th ACM SIGSPATIAL International Workshop on Location-Based Social Networks, 2012

Resampling of multiple camera point cloud data.
Proceedings of the 2012 IEEE Consumer Communications and Networking Conference (CCNC), 2012

Detecting People Carrying Objects Utilizing Lagrangian Dynamics.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012

Clustering Motion for Real-Time Optical Flow Based Tracking.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012

Boosting Multi-hypothesis Tracking by Means of Instance-Specific Models.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012

Splitting Gaussians in Mixture Models.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012

Real-Time Multi-human Tracking Using a Probability Hypothesis Density Filter and Multiple Detectors.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012

2011
Rate-Distortion Optimized Video Coding Using Automatic Sprites.
IEEE J. Sel. Top. Signal Process., 2011

Static Object Detection Based on a Dual Background Model and a Finite-State Machine.
EURASIP J. Image Video Process., 2011

Detecting people carrying objects based on an optical flow motion model.
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2011), 2011

Robust modified L<sup>2</sup> local optical flow estimation and feature tracking.
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2011), 2011

Detection of static objects for the task of video surveillance.
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2011), 2011

Multi-modal, multi-resource methods for placing Flickr videos on the map.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

TUB @ MediaEval 2011 Genre Tagging Task: Prediction using bag-of-words approaches.
Proceedings of the Working Notes Proceedings of the MediaEval 2011 Workshop, 2011

Short-term motion-based object segmentation.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Efficient real-time local optical flow estimation by means of integral projections.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Theoretical consideration of global motion temporal filtering.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

A block-adaptive skip mode for inter prediction basedonparametric motion models.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Temporal trajectory filtering for bi-directional predicted frames.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Feature-based global motion estimation using the Helmholtz principle.
Proceedings of the IEEE International Conference on Acoustics, 2011

Document Capturing Method with a Camera Using Robust Feature Points Detection.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Audio similarity matrices enhancement in an image processing framework.
Proceedings of the 9th International Workshop on Content-Based Multimedia Indexing, 2011

On building decentralized wide-area surveillance networks based on ONVIF.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

Real-time person counting by propagating networks flows.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

Complementary background models for the detection of static and moving objects in crowded environments.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

2010
Dynamic Spectral Envelope Modeling for Timbre Analysis of Musical Instrument Sounds.
IEEE Trans. Speech Audio Process., 2010

Automatic MPEG-4 sprite coding - Comparison of integrated object segmentation algorithms.
Multim. Tools Appl., 2010

Semi-automatic object tracking in video sequences by extension of the MRSST algorithm.
Proceedings of the 11th International Workshop on Image Analysis for Multimedia Interactive Services, 2010

Towards Detecting People Carrying Objects - A Periodicity Dependency Pattern Approach.
Proceedings of the VISAPP 2010 - Proceedings of the Fifth International Conference on Computer Vision Theory and Applications, Angers, France, May 17-21, 2010, 2010

Recent advances in video coding using static background models.
Proceedings of the Picture Coding Symposium, 2010

Adaptive global motion temporal prediction for video coding.
Proceedings of the Picture Coding Symposium, 2010

A novel inloop filter for video-compression based on temporal pixel trajectories.
Proceedings of the Picture Coding Symposium, 2010

Subjective evaluation of scalable video coding for content distribution.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Music Structure Discovery in Popular Music using Non-negative Matrix Factorization.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Background modeling for video coding: From sprites to Global Motion Temporal filtering.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010

Compressed domain global motion estimation using the Helmholtz Tradeoff Estimator.
Proceedings of the International Conference on Image Processing, 2010

Robust global motion estimation using motion vectors of variable size blocks and automatic motion model selection.
Proceedings of the International Conference on Image Processing, 2010

Global motion temporal filtering for in-loop deblocking.
Proceedings of the International Conference on Image Processing, 2010

II-LK - A Real-Time Implementation for Sparse Optical Flow.
Proceedings of the Image Analysis and Recognition, 7th International Conference, 2010

Counting People in Crowded Environments by Fusion of Shape and Motion Information.
Proceedings of the Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance, 2010

2009
3DTV: Capture, Transmission, and Display of 3D Video.
EURASIP J. Adv. Signal Process., 2009

Feature-based video key frame extraction for low quality video sequences.
Proceedings of the 10th Workshop on Image Analysis for Multimedia Interactive Services, 2009

Automatic topic detection strategy for information retrieval in spoken document.
Proceedings of the 10th Workshop on Image Analysis for Multimedia Interactive Services, 2009

Evaluation of pixel- and motion vector-based global motion estimation for camera motion characterization.
Proceedings of the 10th Workshop on Image Analysis for Multimedia Interactive Services, 2009

Global motion estimation using variable block sizes and its application to object segmentation.
Proceedings of the 10th Workshop on Image Analysis for Multimedia Interactive Services, 2009

Automating multi-camera self-calibration.
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2009), 2009

Multimodal person search combining information fusion and relevance feedback.
Proceedings of the 2009 IEEE International Workshop on Multimedia Signal Processing, 2009

Automatic Generation of Lead Sheets from Polyphonic Music Signals.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

Fast generation of cylindrical panoramic views from free-hand video sequences.
Proceedings of the 2nd International ICST Conference on Immersive Telecommunications, 2009

Rate-distortion optimization for automatic sprite video coding using H.264/AVC.
Proceedings of the International Conference on Image Processing, 2009

Video coding using global motion temporal filtering.
Proceedings of the International Conference on Image Processing, 2009

Incorporating prior knowledge on the digital media creation process into audio classifiers.
Proceedings of the IEEE International Conference on Acoustics, 2009

Motion-based object segmentation using local background sprites.
Proceedings of the IEEE International Conference on Acoustics, 2009

Polyphonic musical instrument recognition based on a dynamic model of the spectral envelope.
Proceedings of the IEEE International Conference on Acoustics, 2009

Combining confusion networks with probabilistic phone matching for open-vocabulary keyword spotting in spontaneous speech signal.
Proceedings of the 17th European Signal Processing Conference, 2009

2008
Stereoscopic 3D from 2D video with super-resolution capability.
Signal Process. Image Commun., 2008

Multimedia Retrieval and Delivery: Essential Metadata Challenges and Standards.
Proc. IEEE, 2008


New Real-Time Approaches for Video-Genre-Classification Using High-Level Descriptors and a Set of Classifiers.
Proceedings of the 2th IEEE International Conference on Semantic Computing (ICSC 2008), 2008

Camera motion-constraint video codec selection.
Proceedings of the International Workshop on Multimedia Signal Processing, 2008

On the detection and localization of facial occlusions and its use within different scenarios.
Proceedings of the International Workshop on Multimedia Signal Processing, 2008

Recent developments in panoramic image generation and sprite coding.
Proceedings of the International Workshop on Multimedia Signal Processing, 2008

Rushes video summarization using a collaborative approach.
Proceedings of the 2nd ACM Workshop on Video Summarization, 2008

Noise filtering method for color images based on LDA and nonlinear diffusion.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Multi-view video streaming over P2P networks with low start-up delay.
Proceedings of the International Conference on Image Processing, 2008

Extending H.264/AVC with a background sprite prediction mode.
Proceedings of the International Conference on Image Processing, 2008

Real-time detection of sport in MPEG-2 sequences using high-level AV-descriptors and SVM.
Proceedings of the Third IEEE International Conference on Digital Information Management (ICDIM), 2008

More robust face recognition by considering occlusion information.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

On the robustness of audio features for musical instrument classification.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

Towards Fully Automatic Image Segmentation Evaluation.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2008

2007
Components and Their Topology for Robust Face Detection in the Presence of Partial Occlusions.
IEEE Trans. Inf. Forensics Secur., 2007

Towards 3-D scene reconstruction from broadcast video.
Signal Process. Image Commun., 2007

Efficient allocation of packet-level forward error correction in video streaming over the Internet.
J. Electronic Imaging, 2007

Knowledge-Assisted Media Analysis for Interactive Multimedia Applications.
EURASIP J. Adv. Signal Process., 2007

Motion-based Object Segmentation using Sprites and Anisotropic Diffusion.
Proceedings of the Eighth International Workshop on Image Analysis for Multimedia Interactive Services, 2007

Image Denoising Method Using Diffusion Equation and Edge Map Estimated with K-Means Clustering Algorithm.
Proceedings of the Eighth International Workshop on Image Analysis for Multimedia Interactive Services, 2007

Optimal multiple sprite generation based on physical camera parameter estimation.
Proceedings of the Visual Communications and Image Processing 2007, 2007


Towards Person Google: Multimodal Person Search and Retrieval.
Proceedings of the Semantic Multimedia, 2007

Monaural Source Separation from Musical Mixtures Based on Time-Frequency Timbre Models.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Extensible Platform for Multimedia Analysis (XPMA).
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Object-Based Multiple Sprite Coding of Unsegmented Videos using H.264/AVC.
Proceedings of the International Conference on Image Processing, 2007

Window-Based Image Registration using Variable Window Sizes.
Proceedings of the International Conference on Image Processing, 2007

An Image-Based Rendering (IBR) Approach for Realistic Stereo View Synthesis of TV Broadcast Based on Structure from Motion.
Proceedings of the International Conference on Image Processing, 2007

Confocal Disparity Estimation and Recovery of Pinhole Image for Real-Aperture Stereo Camera Systems.
Proceedings of the International Conference on Image Processing, 2007

A generic approach for motion-based video parsing.
Proceedings of the 15th European Signal Processing Conference, 2007

Multiple-description coding of speech using forward error correction codes.
Proceedings of the 15th European Signal Processing Conference, 2007

Super-Resolution Stereo- and Multi-View Synthesis from Monocular Video Sequences.
Proceedings of the Sixth International Conference on 3-D Digital Imaging and Modeling, 2007

2006
Multi-view synthesis: A novel view creation approach for free viewpoint video.
Signal Process. Image Commun., 2006

Unbalanced Quantized Multiple State Video Coding.
EURASIP J. Adv. Signal Process., 2006


Improved Image Registration using the Up-Sampled Domain.
Proceedings of the IEEE 8th Workshop on Multimedia Signal Processing, 2006

Audiovisual Anchorperson Detection for Topic-Oriented Navigation in Broadcast News.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Recognizing Commercials in Real-Time using Three Visual Descriptors and a Decision-Tree.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Windowed Image Registration for Robust Mosaicing of Scenes with Large Background Occlusions.
Proceedings of the International Conference on Image Processing, 2006

Robust Anisotropic Disparity Estimation with Perceptual Maximum Variation Modeling.
Proceedings of the International Conference on Image Processing, 2006

Prioritized Sequential 3D Reconstruction in Video Sequences with Multiple Motions.
Proceedings of the International Conference on Image Processing, 2006

Extracting High Level Semantics by Means of Speech, Audio, and Image Primitives in Surveillance Applications.
Proceedings of the International Conference on Image Processing, 2006

Extending Single-View Scalable Video Coding to Multi-View Based on H.264/AVC.
Proceedings of the International Conference on Image Processing, 2006

Noise robust relative transfer function estimation.
Proceedings of the 14th European Signal Processing Conference, 2006

A geometric segmentation approach for the 3D reconstruction of dynamic scenes in 2D video sequences.
Proceedings of the 14th European Signal Processing Conference, 2006

Improving Wyner-Ziv video coding by block-based distortion estimation.
Proceedings of the 14th European Signal Processing Conference, 2006

A Framework for Evaluating Video Object Segmentation Algorithms.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2006

Online Image Retrieval System Using Long Term Relevance Feedback.
Proceedings of the Image and Video Retrieval, 5th International Conference, 2006

A Modular Scheme for 2D/3D Conversion of TV Broadcast.
Proceedings of the 3rd International Symposium on 3D Data Processing, 2006

2005
Announcement.
IEEE Trans. Circuits Syst. Video Technol., 2005

Trends and Perspectives in Image and Video Coding.
Proc. IEEE, 2005

Special Issue on Advances in Video Coding and Delivery.
Proc. IEEE, 2005

Coding with Temporal Layers or Multiple Descriptions for Lossy Video Transmission.
Proceedings of the Visual Content Processing and Representation, 2005

Detection of goal events in soccer videos.
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2005, 2005

Comparison of different phone-based spoken document retrieval methods with text and spoken queries.
Proceedings of the INTERSPEECH 2005, 2005

Hybrid recursive energy-based method for robust optical flow on large motion fields.
Proceedings of the 2005 International Conference on Image Processing, 2005

Hybrid Speaker-Based Segmentation System Using Model-Level Clustering.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Distortion Estimation for Temporal Layered Video Coding.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Cartoon-recognition using video & audio descriptors.
Proceedings of the 13th European Signal Processing Conference, 2005

Multiple-description coding of logarithmic PCM.
Proceedings of the 13th European Signal Processing Conference, 2005

Gaussian Scale-Space Dense Disparity Estimation with Anisotropic Disparity-Field Diffusion.
Proceedings of the Fifth International Conference on 3D Digital Imaging and Modeling (3DIM 2005), 2005

2004
Audio classification based on MPEG-7 spectral basis representations.
IEEE Trans. Circuits Syst. Video Technol., 2004

Fast Index Filtering in Vector Approximation File.
Fundam. Informaticae, 2004

Human body posture recognition using MPEG-7 descriptors.
Proceedings of the Visual Communications and Image Processing 2004, 2004

Temporal layered vs. multistate video coding.
Proceedings of the Visual Communications and Image Processing 2004, 2004

Automatic segmentation of speakers in broadcast audio material.
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2004, 2004

Performance of MPEG-7 spectral basis representations for retrieval of home video abstract.
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2004, 2004

Comparison of MPEG-7 basis projection features and MFCC applied to robust speaker recognition.
Proceedings of the ODYSSEY 2004 - The Speaker and Language Recognition Workshop, Toledo, Spain, May 31, 2004

Evaluation of distance measures for MPEG-7 melody contours.
Proceedings of the IEEE 6th Workshop on Multimedia Signal Processing, 2004

Phonetic confusion based document expansion for spoken document retrieval.
Proceedings of the INTERSPEECH 2004, 2004

Speech enhancement based on smoothing of spectral noise floor.
Proceedings of the INTERSPEECH 2004, 2004

A gradient based approach for stereoscopic error concealment.
Proceedings of the 2004 International Conference on Image Processing, 2004

Recursive decoder distortion estimation based on AF(1) source modeling for video.
Proceedings of the 2004 International Conference on Image Processing, 2004

Audio content description with wavelets and neural nets.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Comparison of MPEG-7 audio spectrum projection features and MFCC applied to speaker recognition, sound classification and audio segmentation.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

An Overview of a New European Consortium: Integrated Three-Dimensional Television - Capture, Transmission and Display (3DTV).
Proceedings of the Knowledge-Based Media Analysis for Self-Adaptive and Agile Multi-Media, 2004

Combination of phone N-grams for a MPEG-7-based spoken document retrieval system.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

Audio spectrum projection based on several basis decomposition algorithms applied to general sound recognition and audio segmentation.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

Robust Concealment for Erroneous Block Bursts in Stereoscopic Images.
Proceedings of the 2nd International Symposium on 3D Data Processing, 2004

2003
MPEG-4 objektbasierte Videocodierung (MPEG-4 Object-Based Video Coding).
it Inf. Technol., 2003

Model-based unbalanced multiple description video transmission using path diversity.
Proceedings of the Visual Communications and Image Processing 2003, 2003

Enhancement of noisy speech for noise robust front-end and speech reconstruction at back-end of DSR system.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Speaker recognition using MPEG-7 descriptors.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Three-dimensional browsing environment for MPEG-7 image databases.
Proceedings of the Storage and Retrieval for Media Databases 2002, 2002

Media semantics: who needs it and why?
Proceedings of the 10th ACM International Conference on Multimedia 2002, 2002

Hierarchical image database browsing environment with embedded relevance feedback.
Proceedings of the 2002 International Conference on Image Processing, 2002

2001
The MPEG-7 visual standard for content description-an overview.
IEEE Trans. Circuits Syst. Video Technol., 2001

Overview of the MPEG-7 standard.
IEEE Trans. Circuits Syst. Video Technol., 2001

Introduction to the special issue on MPEG-7.
IEEE Trans. Circuits Syst. Video Technol., 2001

Visualization and navigation in image database applications based on MPEG-7 descriptors.
Proceedings of the 2001 International Conference on Image Processing, 2001

2000
A family of single-user autostereoscopic displays with head-tracking capabilities.
IEEE Trans. Circuits Syst. Video Technol., 2000

Special Issue on Shape Coding for Emerging Multimedia Applications.
Signal Process. Image Commun., 2000

Image visualization based on MPEG-7 color descriptors.
Proceedings of the Visual Communications and Image Processing 2000, 2000

MPEG Standardization Activities.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications), 2000

1999
Long-term global motion estimation and its application for sprite coding, content description, and segmentation.
IEEE Trans. Circuits Syst. Video Technol., 1999

Real-time estimation of long-term 3-D motion parameters for SNHC face animation and model-based coding applications.
IEEE Trans. Circuits Syst. Video Technol., 1999

Special Issue On Representation And Coding Of Images And Video II.
IEEE Trans. Circuits Syst. Video Technol., 1999

1998
Special Issue On Representation And Coding Of Images And Video I [Guest Editorial].
IEEE Trans. Circuits Syst. Video Technol., 1998

Guest Editorial.
IEEE Trans. Circuits Syst. Video Technol., 1998

Image sequence analysis for emerging interactive multimedia services-the European COST 211 framework.
IEEE Trans. Circuits Syst. Video Technol., 1998

Texture coding on arbitrarily shaped image segments : transform methods.
Ann. des Télécommunications, 1998

1997
Optimal Wiener interpolation filters for multiresolution coding of images.
IEEE Trans. Circuits Syst. Video Technol., 1997

The MPEG-4 video standard verification model.
IEEE Trans. Circuits Syst. Video Technol., 1997

Functional coding of video using a shape-adaptive DCT algorithm and an object-based motion prediction toolbox.
IEEE Trans. Circuits Syst. Video Technol., 1997

MPEG digital video-coding standards.
IEEE Signal Process. Mag., 1997

MPEG Digital Audio-and Video-Coding Standards.
IEEE Signal Process. Mag., 1997

Estimation of Motion Parameters of a Rigid Body from a Monocular Image Sequence for MPEG-4 Applications.
Proceedings of the Interactive Distributed Multimedia Systems and Telecommunication Services, 1997

1996
Optimal block-overlapping synthesis transforms for coding images and video at very low bitrates.
IEEE Trans. Circuits Syst. Video Technol., 1996

Compression algorithms for software coding of video.
Signal Process. Image Commun., 1996

The European COST211ter activities-research towards advanced algorithms for coding of video signals at very low bit rates.
Proceedings of the Proceedings 1996 International Conference on Image Processing, 1996

1995
Shape-adaptive DCT for generic coding of video.
IEEE Trans. Circuits Syst. Video Technol., 1995

Efficiency of shape-adaptive 2-D transforms for coding of arbitrarily shaped image segments.
IEEE Trans. Circuits Syst. Video Technol., 1995

Low complexity shape-adaptive DCT for coding of arbitrarily shaped image segments.
Signal Process. Image Commun., 1995

Digital video coding standards and their role in video communications.
Proc. IEEE, 1995

1985
Simulation eines ungeregelten pulsatilen Modelles des Herzkreislaufsystems.
Proceedings of the Simulationstechnik, 1985


  Loading...