Hanseok Ko

Orcid: 0000-0002-8744-4514

Affiliations:
  • Korea University, Department of Electrical Engineering, Seoul, South Korea


According to our database, Hanseok Ko authored at least 253 papers between 1994 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Cognitive Refined Augmentation for Video Anomaly Detection in Weak Supervision.
Sensors, 2024

ConSeisGen: Controllable Synthetic Seismic Waveform Generation.
IEEE Geosci. Remote. Sens. Lett., 2024

Classification and Magnitude Estimation of Global and Local Seismic Events Using Conformer and Low-Rank Adaptation Fine-Tuning.
IEEE Geosci. Remote. Sens. Lett., 2024

Towards Multi-domain Face Landmark Detection with Synthetic Data from Diffusion model.
CoRR, 2024

4D Facial Avatar Reconstruction From Monocular Video via Efficient and Controllable Neural Radiance Fields.
IEEE Access, 2024

Hard Sample-aware Consistency for Low-resolution Facial Expression Recognition.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Sound source localization using complex-valued deep neural networks.
Proceedings of the IEEE International Conference on Consumer Electronics, 2024

2023
Fast Non-Local Attention network for light super-resolution.
J. Vis. Commun. Image Represent., September, 2023

Domain-agnostic single-image super-resolution via a meta-transfer neural architecture search.
Neurocomputing, March, 2023

Searching similar weather maps using convolutional autoencoder and satellite images.
ICT Express, February, 2023

Channel Shuffle Neural Architecture Search for Key Word Spotting.
IEEE Signal Process. Lett., 2023

ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models.
CoRR, 2023

Spatial-temporal Transformer-guided Diffusion based Data Augmentation for Efficient Skeleton-based Action Recognition.
CoRR, 2023

A teacher-student framework with Fourier Transform augmentation for COVID-19 infection segmentation in CT images.
Biomed. Signal Process. Control., 2023

The KU-ISPL entry to the GENEA Challenge 2023-A Diffusion Model for Co-speech Gesture generation.
Proceedings of the International Conference on Multimodal Interaction, 2023

MPE4G : Multimodal Pretrained Encoder for Co-Speech Gesture Generation.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Lightweight Dynamic Filter For Keyword Spotting.
Proceedings of the IEEE International Conference on Acoustics, 2023

Open Set Bioacoustic Signal Classification based on Class Anchor Clustering with Closed Set Unknown Bioacoustic Signals.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023

Frame Level Emotion Guided Dynamic Facial Expression Recognition with Emotion Grouping.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Information Bottleneck Measurement for Compressed Sensing Image Reconstruction.
IEEE Signal Process. Lett., 2022

Prototypical Knowledge Distillation for Noise Robust Keyword Spotting.
IEEE Signal Process. Lett., 2022

Discriminatory and Orthogonal Feature Learning for Noise Robust Keyword Spotting.
IEEE Signal Process. Lett., 2022

Learnable Maximum Amplitude Structure for Earthquake Event Classification.
IEEE Geosci. Remote. Sens. Lett., 2022

Feedback Network With Curriculum Learning for Earthquake Event Classification.
IEEE Geosci. Remote. Sens. Lett., 2022

Feature Sparse Coding With CoordConv for Side Scan Sonar Image Enhancement.
IEEE Geosci. Remote. Sens. Lett., 2022

Graph Convolution Networks for Seismic Events Classification Using Raw Waveform Data From Multiple Stations.
IEEE Geosci. Remote. Sens. Lett., 2022

Single Cell Training on Architecture Search for Image Denoising.
CoRR, 2022

Controllable Face Manipulation and UV Map Generation by Self-supervised Learning.
CoRR, 2022

Generate and Edit Your Own Character in a Canonical View.
CoRR, 2022

Efficient dynamic filter for robust and low computational feature extraction.
CoRR, 2022

Unsupervised domain adaptation based COVID-19 CT infection segmentation network.
Appl. Intell., 2022

Pose-Guided Graph Convolutional Networks for Skeleton-Based Action Recognition.
IEEE Access, 2022

Efficient Dynamic Filter For Robust and Low Computational Feature Extraction.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

CaraNet: context axial reverse attention network for segmentation of small medical objects.
Proceedings of the Medical Imaging 2022: Image Processing, 2022

DIFAI: Diverse Facial Inpainting using StyleGAN Inversion.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

3D Human Motion Generation from the Text Via Gesture Action Classification and the Autoregressive Model.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
COVID-19 CT Image Synthesis With a Conditional Generative Adversarial Network.
IEEE J. Biomed. Health Informatics, 2021

Sound Event Detection by Pseudo-Labeling in Weakly Labeled Dataset.
Sensors, 2021

TrSeg: Transformer for semantic segmentation.
Pattern Recognit. Lett., 2021

Earthquake Event Classification Using Multitasking Deep Learning.
IEEE Geosci. Remote. Sens. Lett., 2021

Attention-Based Convolutional Neural Network for Earthquake Event Classification.
IEEE Geosci. Remote. Sens. Lett., 2021

Multifeature Fusion-Based Earthquake Event Classification Using Transfer Learning.
IEEE Geosci. Remote. Sens. Lett., 2021

Side-Scan Sonar Image Synthesis Based on Generative Adversarial Network for Images in Multiple Frequencies.
IEEE Geosci. Remote. Sens. Lett., 2021

Deep Clustering for Improved Inter-Cluster Separability and Intra-Cluster Homogeneity with Cohesive Loss.
IEICE Trans. Inf. Syst., 2021

SpecMix : A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features.
CoRR, 2021

Two-Stream Learning-Based Compressive Sensing Network With High-Frequency Compensation for Effective Image Denoising.
IEEE Access, 2021

Multimodal Emotion Recognition Fusion Analysis Adapting BERT With Heterogeneous Feature Unification.
IEEE Access, 2021

Feedback Module Based Convolution Neural Networks for Sound Event Classification.
IEEE Access, 2021

Sketch-and-Fill Network for Semantic Segmentation.
IEEE Access, 2021

Memory-based Semantic Segmentation for Off-road Unstructured Natural Environments.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

SpecMix : A Mixed Sample Data Augmentation Method for Training with Time-Frequency Domain Features.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Few-Shot Learning for Ct Scan Based Covid-19 Diagnosis.
Proceedings of the IEEE International Conference on Acoustics, 2021

Reference Guided Image Inpainting using Facial Attributes.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Adaptive Content Feature Enhancement GAN for Multimodal Selfie to Anime Translation.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Adverse Weather Image Translation with Asymmetric and Uncertainty-aware GAN.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Deep Degradation Prior for Real-World Super-Resolution.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

PETS2021: Through-foliage detection and tracking challenge and evaluation.
Proceedings of the 17th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2021

Injecting Sparsity in Anomaly Detection for Efficient Inference.
Proceedings of the 17th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2021

CPNet: Cross-Parallel Network for Efficient Anomaly Detection.
Proceedings of the 17th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2021

Action Recognition with Domain Invariant Features of Skeleton Image.
Proceedings of the 17th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2021

2020
Fusion of Heterogeneous Adversarial Networks for Single Image Dehazing.
IEEE Trans. Image Process., 2020

Amphibian Sounds Generating Network Based on Adversarial Learning.
IEEE Signal Process. Lett., 2020

Spectro-Temporal Attention-Based Voice Activity Detection.
IEEE Signal Process. Lett., 2020

Seismic Data Augmentation Based on Conditional Generative Adversarial Networks.
Sensors, 2020

Fusion-ConvBERT: Parallel Convolution and BERT Fusion for Speech Emotion Recognition.
Sensors, 2020

Weighted Kernel Filter Based Anti-Air Object Tracking for Thermal Infrared Systems.
Sensors, 2020

Orthogonal Gradient Penalty for Fast Training of Wasserstein GAN Based Multi-Task Autoencoder toward Robust Speech Recognition.
IEICE Trans. Inf. Syst., 2020

Data Separability for Neural Network Classifiers and the Development of a Separability Index.
CoRR, 2020

Multimodal Deep Fusion Network for Visibility Assessment With a Small Training Dataset.
IEEE Access, 2020

KU-ISPL TRECVID 2020 VTT Model.
Proceedings of the 2020 TREC Video Retrieval Evaluation, 2020

Dual Stage Learning Based Dynamic Time-Frequency Mask Generation for Audio Event Classification.
Proceedings of the Interspeech 2020, 2020

Seismic Signal Synthesis by Generative Adversarial Network with Gated Convolutional Neural Network Structure.
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2020

Convolutional Recurrent Neural Networks for Earthquake Epicentral Distance Estimation Using Single-Channel Seismic Waveform.
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2020

CAFE-GAN: Arbitrary Face Attribute Editing with Complementary Attention Feature.
Proceedings of the Computer Vision - ECCV 2020, 2020

FBRNN: feedback recurrent neural network for extreme image super-resolution.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Unsupervised Real-World Super Resolution with Cycle Generative Adversarial Network and Domain Discriminator.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Nonhomogeneous Noise Removal From Side-Scan Sonar Images Using Structural Sparsity.
IEEE Geosci. Remote. Sens. Lett., 2019

Relay dueling network for visual tracking with broad field-of-view.
IET Comput. Vis., 2019

Side Scan Sonar Image Super Resolution via Region-Selective Sparse Coding.
IEICE Trans. Inf. Syst., 2019

Channel and Frequency Attention Module for Diverse Animal Sound Classification.
IEICE Trans. Inf. Syst., 2019

Correlation Distance Skip Connection Denoising Autoencoder (CDSK-DAE) for Speech Feature Enhancement.
CoRR, 2019

Sinusoidal wave generating network based on adversarial learning and its application: synthesizing frog sounds for data augmentation.
CoRR, 2019

Multi-task Learning for Animal Species and Group Category Classification.
Proceedings of the ICIT 2019, 2019

A Novel Probabilistic Appearance Model for Cigarette Detection Under Illumination Change.
Proceedings of the International Conference on Electronics, Information, and Communication, 2019

Self-Subtraction Network for End to End Noise Robust Classification.
Proceedings of the 16th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2019

2018
Robust Target Tracking with Multi-Static Sensors under Insufficient TDOA Information.
Sensors, 2018

Man-Made Radio Frequency Interference Suppression for Compact HF Surface Wave Radar.
IEEE Geosci. Remote. Sens. Lett., 2018

Accurate Target Motion Analysis from a Small Measurement Set Using RANSAC.
IEICE Trans. Inf. Syst., 2018

Analysis Acoustic Features for Acoustic Scene Classification and Score fusion of multi-classification systems applied to DCASE 2016 challenge.
CoRR, 2018

KU-ISPL TRECVID 2018 VTT Model.
Proceedings of the 2018 TREC Video Retrieval Evaluation, 2018

Multimodal Fusion Strategies: Human vs. Machine.
Proceedings of the 2018 Workshop on Audio-Visual Scene Understanding for Immersive Multimedia, 2018

A time delay convolutional neural network for acoustic scene classification.
Proceedings of the IEEE International Conference on Consumer Electronics, 2018

Robust remote heart rate estimation in car driving environment.
Proceedings of the IEEE International Conference on Consumer Electronics, 2018

Precise Regression for Bounding Box Correction for Improved Tracking Based on Deep Reinforcement Learning.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Real-time Optical Imaging of Microbubble Destruction with an Acoustic Lens Attached Ultrasonic Diagnostic Probe in Microfluidic Capillary Models.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

Convolutional Feature Vectors and Support Vector Machine for Animal Sound Classification.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

Hierarchical spatial object detection for ATM vandalism surveillance.
Proceedings of the 15th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2018

Image fusion and influence function for performance improvement of ATM vandalism action recognition.
Proceedings of the 15th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2018

2017
Continuous hand gesture recognition based on trajectory shape information.
Pattern Recognit. Lett., 2017

A feature descriptor based on the local patch clustering distribution for illumination-robust image matching.
Pattern Recognit. Lett., 2017

Compact HF Surface Wave Radar Data Generating Simulator for Ship Detection and Tracking.
IEEE Geosci. Remote. Sens. Lett., 2017

Online multi-person tracking with two-stage data association and online appearance model learning.
IET Comput. Vis., 2017

New Generalized Sidelobe Canceller with Denoising Auto-Encoder for Improved Speech Enhancement.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017

Enhancing Underwater Color Images via Optical Imaging Model and Non-Local Means Denoising.
IEICE Trans. Inf. Syst., 2017

DNN Transfer Learning Based Non-Linear Feature Extraction for Acoustic Event Classification.
IEICE Trans. Inf. Syst., 2017

A Novel Discriminative Feature Extraction for Acoustic Scene Classification Using RNN Based Source Separation.
IEICE Trans. Inf. Syst., 2017

License Plate Detection and Character Segmentation Using Adaptive Binarization Based on Superpixels under Illumination Change.
IEICE Trans. Inf. Syst., 2017

KU-ISPL Speaker Recognition Systems under Language mismatch condition for NIST 2016 Speaker Recognition Evaluation.
CoRR, 2017

KU-ISPL TRECVID 2017 VTT System.
Proceedings of the 2017 TREC Video Retrieval Evaluation, 2017

Target motion analysis with evolutionary search by fusion of two moving acoustic sensors.
Proceedings of the 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, 2017

Coastal ship monitoring based on multiple compact high frequency surface wave radars.
Proceedings of the 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, 2017

Autoencoder Based Domain Adaptation for Speaker Recognition Under Insufficient Channel Information.
Proceedings of the Interspeech 2017, 2017

Recursive Whitening Transformation for Speaker Recognition on Language Mismatched Condition.
Proceedings of the Interspeech 2017, 2017

Automated malaria cell counter using Hough transform based method.
Proceedings of the IEEE International Conference on Consumer Electronics, 2017

Subspace projection cepstral coefficients for noise robust acoustic event recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Deep Neural Network based learning and transferring mid-level audio features for acoustic scene classification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Acoustic Scene Classification Based on Convolutional Neural Network Using Double Image Features.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

Generative Adversarial Network Based Acoustic Scene Training Set Augmentation and Selection Using SVM Hyper-Plane.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

Online pedestrian tracking with multi-stage re-identification.
Proceedings of the 14th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2017

2016
Dialogue enabling speech-to-text user assistive agent system for hearing-impaired person.
Medical Biol. Eng. Comput., 2016

Joint patch clustering-based dictionary learning for multimodal image fusion.
Inf. Fusion, 2016

Key Frame Extraction Based on Chaos Theory and Color Information for Video Summarization.
IEICE Trans. Inf. Syst., 2016

Hybrid Retinal Image Registration Using Mutual Information and Salient Features.
IEICE Trans. Inf. Syst., 2016

Non-negative matrix factorization-based subband decomposition for acoustic source localization.
CoRR, 2016

KU-ISPL Language Recognition System for NIST 2015 i-Vector Machine Learning Challenge.
CoRR, 2016

KU-ISPL TRECVID 2016 Multimedia Event Detection System.
Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016

Deep Neural Network Bottleneck Features for Acoustic Event Recognition.
Proceedings of the Interspeech 2016, 2016

Nighttime image dehazing with local atmospheric light and weighted entropy.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Top-view people detection based on multiple subarea pose models for smart home system.
Proceedings of the IEEE International Conference on Consumer Electronics, 2016

SVM based dynamic classifier for sleep disorder monitoring wearable device.
Proceedings of the IEEE International Conference on Consumer Electronics, 2016

Enhancing underwater color images of diving mask mounted digital camera via non-local means denoising.
Proceedings of the IEEE International Conference on Consumer Electronics, 2016

Effective character segmentation for license plate recognition under illumination changing environment.
Proceedings of the IEEE International Conference on Consumer Electronics, 2016

Online Multi-object Tracking Based on Hierarchical Association Framework.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016

Single object tracking based on active and passive detection information in distributed heterogeneous sensor network.
Proceedings of the 13th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2016

2015
Video-Based Dynamic Stagger Measurement of Railway Overhead Power Lines Using Rotation-Invariant Feature Matching.
IEEE Trans. Intell. Transp. Syst., 2015

Acoustic event filterbank for enabling robust event recognition by cleaning robot.
IEEE Trans. Consumer Electron., 2015

A novel approach for denoising and enhancement of extremely low-light video.
IEEE Trans. Consumer Electron., 2015

Visual Speech Recognition Using Weighted Dynamic Time Warping.
IEICE Trans. Inf. Syst., 2015

Underwater Radiated Signal Analysis in the Modulation Spectrogram Domain.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2015

KU-ISPL TRECVID 2015 Multimedia Event Detection System.
Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015

Recognition of Human Group Activity for Video Analytics.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Acoustic event recognition using dominant spectral basis vectors.
Proceedings of the INTERSPEECH 2015, 2015

Robust speaker direction estimation with microphone array using NMF for smart TV interaction.
Proceedings of the IEEE International Conference on Consumer Electronics, 2015

Robust visual voice activity detection using local variance histogram in vehicular environments.
Proceedings of the IEEE International Conference on Consumer Electronics, 2015

Online multi-person tracking for intelligent video surveillance systems.
Proceedings of the IEEE International Conference on Consumer Electronics, 2015

Video summarization based on extracted key position of spotted objects.
Proceedings of the IEEE International Conference on Consumer Electronics, 2015

Maximum likelihood Linear Dimension Reduction of heteroscedastic feature for robust Speaker Recognition.
Proceedings of the 12th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2015

2014
Visual voice activity detection via chaos based lip motion measure robust under illumination changes.
IEEE Trans. Consumer Electron., 2014

Hidden Markov Model on a unit hypersphere space for gesture trajectory recognition.
Pattern Recognit. Lett., 2014

Rule-based trajectory segmentation for modeling hand motion trajectory.
Pattern Recognit., 2014

Pre-Filtering Algorithm for Dual-Microphone Generalized Sidelobe Canceller Using General Transfer Function.
IEICE Trans. Inf. Syst., 2014

KU-ISPL TRECVID 2014 Multimedia Event Detection System.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Single image haze removal using novel estimation of atmospheric light and transmission.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Single image dehazing with image entropy and information fidelity.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Robust visual voice activity detection using chaos theory under illumination varying environment.
Proceedings of the IEEE International Conference on Consumer Electronics, 2014

Acoustic feature extraction for robust event recognition on cleaning robot platform.
Proceedings of the IEEE International Conference on Consumer Electronics, 2014

A novel framework for extremely low-light video enhancement.
Proceedings of the IEEE International Conference on Consumer Electronics, 2014

2013
Acoustic signal based abnormal event detection in indoor environment using multiclass adaboost.
IEEE Trans. Consumer Electron., 2013

Fast Single Image De-Hazing Using Characteristics of RGB Channel of Foggy Image.
IEICE Trans. Inf. Syst., 2013

Multimodal image fusion via sparse representation with local patch dictionaries.
Proceedings of the IEEE International Conference on Image Processing, 2013

Dialogue enabling speech-to-text user assistive agent with auditory perceptual beamforming for hearing-impaired.
Proceedings of the IEEE International Conference on Consumer Electronics, 2013

Acoustic signal based abnormal event detection system with multiclass adaboost.
Proceedings of the IEEE International Conference on Consumer Electronics, 2013

Single image haze removal with WLS-based edge-preserving smoothing filter.
Proceedings of the IEEE International Conference on Acoustics, 2013

Robust sound source localization using a Wiener filter.
Proceedings of 2013 IEEE 18th Conference on Emerging Technologies & Factory Automation, 2013

Abnormal acoustic event localization based on selective frequency bin in high noise environment for audio surveillance.
Proceedings of the 10th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2013

2012
Full Azimuth Multiple Sound Source Localization with 3-Channel Microphone Array.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2012

Gesture recognition using depth-based hand tracking for contactless controller application.
Proceedings of the IEEE International Conference on Consumer Electronics, 2012

Sudden noise source localization system for intelligent automobile application with acoustic sensors.
Proceedings of the IEEE International Conference on Consumer Electronics, 2012

Fog-degraded image restoration using characteristics of RGB channel in single monocular image.
Proceedings of the IEEE International Conference on Consumer Electronics, 2012

Acoustic and visual signal based violence detection system for indoor security application.
Proceedings of the IEEE International Conference on Consumer Electronics, 2012

Crowd Density Estimation Using Multi-class Adaboost.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012

Selective Background Adaptation Based Abnormal Acoustic Event Recognition for Audio Surveillance.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012

Combining Infrared and Visible Images Using Novel Transform and Statistical Information.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012

2011
Acoustic and visual signal based context awareness system for mobile application.
IEEE Trans. Consumer Electron., 2011

Suppressing Ghost Targets via Gating and Track History in Y-Shaped Passive Linear Array Sonars.
IEEE Trans. Aerosp. Electron. Syst., 2011

Adaptive height-modified histogram equalization and chroma correction in YCbCr color space for fast backlight image compensation.
Image Vis. Comput., 2011

Robust video super resolution algorithm using measurement validation method and scene change detection.
EURASIP J. Adv. Signal Process., 2011

Hostile intent and behaviour detection in elevators.
Proceedings of the 4th International Conference on Imaging for Crime Detection and Prevention, 2011

Rule Based Trajectory Segmentation Applied to an HMM-Based Isolated Hand Gesture Recognizer.
Proceedings of the HCI International 2011 - Posters' Extended Abstracts, 2011

Robust background subtraction using data fusion for real elevator scene.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

Resolution enhancement of ROI from surveillance video using Bernstein interpolation.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

Hierarchical approach for abnormal acoustic event classification in an elevator.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

2010
Harmonic Components Based Post-Filter Design for Residual Echo Suppression.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2010

Sound source separation by using matched beamforming and time-frequency masking.
Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Reinforced blocking matrix with cross channel projection for speech enhancement.
Proceedings of the INTERSPEECH 2010, 2010

License Plate Detection Using Local Structure Patterns.
Proceedings of the Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance, 2010

Robust Dynamic Super Resolution under Inaccurate Motion Estimation.
Proceedings of the Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance, 2010

2009
Extension of two-channel transfer function based generalized sidelobe canceller for dealing with both background and point-source noise.
Speech Commun., 2009

W-Disjoint Orthogonality Based Residual Acoustic Echo Cancellation for Hands-Free Communication.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2009

Robust Relative Transfer Function Estimation for Dual Microphone-Based Generalized Sidelobe Canceller.
IEICE Trans. Inf. Syst., 2009

Optimal Gain Filter Design for Perceptual Acoustic Echo Suppressor.
IEICE Trans. Inf. Syst., 2009

2008
Real-Time Continuous Phoneme Recognition System Using Class-Dependent Tied-Mixture HMM With HBT Structure for Speech-Driven Lip-Sync.
IEEE Trans. Multim., 2008

Gradient-based local affine invariant feature extraction for mobile robot localization in indoor environments.
Pattern Recognit. Lett., 2008

Topological Mappings of Video and Audio Data.
Int. J. Neural Syst., 2008

Masking Property Based Residual Acoustic Echo Cancellation for Hands-Free Communication in Automobile Environment.
IEICE Trans. Inf. Syst., 2008

Effective lip localization and tracking for achieving multimodal speech recognition.
Proceedings of the IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, 2008

Enhancement of image degraded by fog using cost function based on human visual model.
Proceedings of the IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, 2008

Combining acoustic echo cancellation and adaptive beamforming for achieving robust speech interface in mobile robot.
Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

Feature Locations in Images.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2008

Bregman Divergences and the Self Organising Map.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2008

More powerful discriminants for classifying phylogenetic signals in dinucleotide frequencies.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Effective Energy Feature Compensation Using Modified Log-energy Dynamic Range Normalization for Robust Speech Recognition.
IEICE Trans. Commun., 2007

Enabling directional human-robot speech interface via adaptive beamforming and spatial noise reduction.
Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

Visualising and Clustering Video Data.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2007

Learning Kernel Subspace Classifier.
Proceedings of the Advances in Biometrics, International Conference, 2007

3D Environment Modeling and its Application to Human Robot Interaction.
Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies 2007, 2007

Combination of self-organization map and kernel mutual subspace method for video surveillance.
Proceedings of the Fourth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2007

2006
Dual channel based speech enhancement using novelty filter for robust speech recognition in automobile environment.
IEEE Trans. Consumer Electron., 2006

Achieving a reliable compact acoustic model for embedded speech recognition system with high confusion frequency model handling.
Speech Commun., 2006

Competing models-based text-prompted speaker independent verification algorithm.
Speech Commun., 2006

Prediction Based Occluded Multitarget Tracking Using Spatio-temporal Attention.
Int. J. Pattern Recognit. Artif. Intell., 2006

Svm-based Phoneme Classification and Lip Shape Refinement in Real-time Lip-synch System.
Int. J. Pattern Recognit. Artif. Intell., 2006

A new state-dependent phonetic tied-mixture model with head-body-tail structured HMM for real-time continuous phoneme recognition system.
Proceedings of the INTERSPEECH 2006, 2006

Indoor Environment Modeling for Interactive VR - Based Robot Security Service.
Proceedings of the Advances in Artificial Reality and Tele-Existence, 2006

Decision Theoretic Fusion Framework for Actionability Using Data Mining on an Embedded System.
Proceedings of the Data Mining - Theory, Methodology, Techniques, and Applications, 2006

2005
Background noise reduction via dual-channel scheme for speech recognition in vehicular environment.
IEEE Trans. Consumer Electron., 2005

Bayesian fusion of confidence measures for speech recognition.
IEEE Signal Process. Lett., 2005

Effective acoustic model clustering via decision-tree with supervised learning.
Speech Commun., 2005

Bayesian Confidence Scoring and Adaptation Techniques for Speech Recognition.
IEICE Trans. Commun., 2005

Environment-independent mask estimation for missing-feature reconstruction.
Proceedings of the INTERSPEECH 2005, 2005

Predictive Estimation Method to Track Occluded Multiple Objects Using Joint Probabilistic Data Association Filter.
Proceedings of the Image Analysis and Recognition, Second International Conference, 2005

Occlusion Activity Detection Algorithm Using Kalman Filter for Detecting Occluded Multiple Objects.
Proceedings of the Computational Science, 2005

Speaker Adaptive Confidence Scoring Using Bayesian Combining.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Spatio-temporal Attention Mechanism for More Complex Analysis to Track Multiple Objects.
Proceedings of the Brain, 2005

Model Based Abnormal Acoustic Source Detection Using a Microphone Array.
Proceedings of the AI 2005: Advances in Artificial Intelligence, 2005

2004
A New Feature Normalization Scheme Based on Eigenspace for Noisy Speech Recognition.
Proceedings of the String Processing and Information Retrieval, 2004

Compact acoustic model for embedded implementation.
Proceedings of the INTERSPEECH 2004, 2004

Multi-eigenspace normalization for robust speech recognition in noisy environments.
Proceedings of the INTERSPEECH 2004, 2004

Face detection using support vector domain description in color images.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

PCMM-based feature compensation schemes using model interpolation and mixture sharing.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Voice Code Verification Algorithm Using Competing Models for User Entrance Authentication.
Proceedings of the AI 2004: Advances in Artificial Intelligence, 2004

2003
Utterance verification under distributed detection and fusion framework.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Feature compensation scheme based on parallel combined mixture model.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Robust Reference Point Detection Using Gradient of Fingerprint Direction and Feature Extraction Method.
Proceedings of the Computational Science - ICCS 2003, 2003

GPD-Based State Modification by Weighted Linear Loss Function.
Proceedings of the Computational Science - ICCS 2003, 2003

Spectral Subtraction Using Spectral Harmonics for Robust Speech Recognition in Car Environments.
Proceedings of the Computational Science - ICCS 2003, 2003

A novel spectral subtraction scheme for robust speech recognition: spectral subtraction using spectral harmonics of speech.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Construction of decision tree from data driven clustering.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

On effective speaker verification based on subword model.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Achieving Real-Time Lip Synch via SVM-Based Phoneme Classification and Lip Shape Refinement.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Multiple vehicle tracking based on regional estimation in nighttime CCD images.
Proceedings of the IEEE International Conference on Acoustics, 2002

Improved acoustic modeling based on selective data-driven PMC.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Model based stress decision method.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

A Threshold Based Scheduling Algorithm for Input Queue Switch.
Proceedings of the 15th International Conference on Information Networking, 2001

Tracking of mobile phone using IMM in CDMA environment.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Dynamical behavior of autoassociative memory performing novelty filtering for signal enhancement.
IEEE Trans. Neural Networks Learn. Syst., 2000

Background noise suppression for signal enhancement by novelty filtering.
IEEE Trans. Aerosp. Electron. Syst., 2000

An effective acoustic modeling of names based on model induction.
Proceedings of the IEEE International Conference on Acoustics, 2000

Effective speaker adaptations for speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2000

1997
Design and implementation of dual processor block with shared external cache memory.
Microprocess. Microsystems, 1997

1994
Signal detectability enhancement with auto-associative backpropagation networks.
Neurocomputing, 1994
