Yoshihiko Gotoh

Orcid: 0000-0003-1668-0867

According to our database1, Yoshihiko Gotoh authored at least 54 papers between 1994 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Exploration of verbal descriptions and dynamic indoors environments for people with sight loss.
Proceedings of the Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

Improving Audiovisual Active Speaker Detection in Egocentric Recordings with the Data-Efficient Image Transformer.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2020
Graph-based topic models for trajectory clustering in crowd videos.
Mach. Vis. Appl., 2020

2019
3D Visual Speech Animation Using 2D Videos.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Graph-Based Correlated Topic Model for Trajectory Clustering in Crowded Videos.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Graph-based Correlated Topic Model for Motion Patterns Analysis in Crowded Scenes from Tracklets.
Proceedings of the British Machine Vision Conference 2018, 2018

2017
Generating natural language tags for video information management.
Mach. Vis. Appl., 2017

Medical Image Colorization for Better Visualization and Segmentation.
Proceedings of the Medical Image Understanding and Analysis - 21st Annual Conference, 2017

Natural Language Descriptions for Human Activities in Video Streams.
Proceedings of the 10th International Conference on Natural Language Generation, 2017

2016
The University of Sheffield and University of Engineering & Technology, Lahore at TRECVID 2016: Video to Text Description Task.
Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016

Natural Language Descriptions of Human Activities Scenes: Corpus Generation and Analysis.
Proceedings of the 5th Workshop on Vision and Language, 2016

2015
A framework for creating natural language descriptions of video streams.
Inf. Sci., 2015

A unified spatio-temporal human body region tracking approach to action recognition.
Neurocomputing, 2015

University of Engineering & Technology, Lahore / The University of Sheffield at TRECVID 2015: Instance Search.
Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015

Corpus Generation and Analysis: Incorporating Audio Data Towards Curbing Missing Information.
Proceedings of the 1st International Workshop on Knowledge Discovery on the WEB, 2015

2014
The University of Sheffield and University of Engineering & Technology, Lahore at TECVID 2014: Instance Search Task.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Manifold Matching with Application to Instance Search Based on Video Queries.
Proceedings of the Image and Signal Processing - 6th International Conference, 2014

Alignment of nearly-repetitive contents in a video stream with manifold embedding.
Proceedings of the IEEE International Conference on Acoustics, 2014

Video Clip Retrieval by Graph Matching.
Proceedings of the Advances in Information Retrieval, 2014

2013
The University of Sheffield , Harbin University and University of Engineering & Technology, Lahore at TRECVID 2013: Instance Search & Semantic Indexing.
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

Spatio-temporal Human Body Segmentation from Video Stream.
Proceedings of the Computer Analysis of Images and Patterns, 2013

Spatio-temporal Manifold Embedding for Nearly-Repetitive Contents in a Video Stream.
Proceedings of the Computer Analysis of Images and Patterns, 2013

2012
The University of Sheffield and Harbin Engineering University at TRECVID 2012: Instance Search.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

Generating coherent natural language annotations for video streams.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Natural Language Descriptions of Visual Scenes Corpus Generation and Analysis.
Proceedings of the Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation HyTra@EACL 2012, 2012

Spatio-temporal SIFT and Its Application to Human Action Classification.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

Spatio-temporal Video Representation with Locality-Constrained Linear Coding.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

2011
Video scene classification based on natural language description.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Human Focused Video Description.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Towards coherent natural language description of video streams.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

2010
Nearly-repetitive video synchronisation using nonlinear manifold embedding.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
On the subjectivity of human-authored summaries.
Nat. Lang. Eng., 2009

2008
A Cascaded Broadcast News Highlighter.
IEEE Trans. Speech Audio Process., 2008

University of Sheffield at TRECVID 2008: Rushes Summarisation and Video Copy Detection.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

2007
University of Sheffield at TRECVID 2007: Shot Boundary Detection and Rushes Summarisation.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

Speaker role based structural classification of broadcast news stories.
Proceedings of the INTERSPEECH 2007, 2007

Relative evaluation of informativeness in machine generated summaries.
Proceedings of the INTERSPEECH 2007, 2007

2006
Glasgow University at TRECVid 2006.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

2005
Multi-stage compaction approach to broadcast news summarisation.
Proceedings of the INTERSPEECH 2005, 2005

Maximum entropy segmentation of broadcast news.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
From Text Summarisation to Style-Specific Summarisation for Broadcast News.
Proceedings of the Advances in Information Retrieval, 2004

2000
Information Extraction from Broadcast News
CoRR, 2000

Variable word rate N-grams.
Proceedings of the IEEE International Conference on Acoustics, 2000

Statistical Language Modelling.
Proceedings of the Text- and Speech-Triggered Information Access, 2000

1999
Topic-based mixture language modelling.
Nat. Lang. Eng., 1999

Integrated transcription and identification of named entities in broadcast speech.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Named entity tagged language models.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
Efficient training algorithms for HMMs using incremental estimation.
IEEE Trans. Speech Audio Process., 1998

1997
Document space models using latent semantic analysis.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1996
Analysis of LPC/DFT features for an HMM-based alphadigit recognizer.
IEEE Signal Process. Lett., 1996

Taggers for Parsers.
Artif. Intell., 1996

Incremental ML estimation of HMM parameters for efficient training.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Microphone-array speech recognition via incremental map training.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1994
Using MAP estimated parameters to improve HMM speech recognition performance.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994


  Loading...