Prithwijit Guha

Orcid: 0000-0003-2885-0026

According to our database, Prithwijit Guha authored at least 76 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Designing a U-Net Architecture for Underwater Image Enhancement.
Proceedings of the National Conference on Communications, 2024

2023
FPGA Implementation of Batch-Mode Depth-Pipelined Two Means Decision Tree.
IEEE Embed. Syst. Lett., March, 2023

Dual Attention and Question Categorization-Based Visual Question Answering.
IEEE Trans. Artif. Intell., February, 2023

Clean vs. Overlapped Speech-Music Detection Using Harmonic-Percussive Features and Multi-Task Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

VQA with Cascade of Self- and Co-Attention Blocks.
CoRR, 2023

IndiRA: Design and Implementation of a Pipelined RISC-V Processor.
Proceedings of the 33rd International Conference Radioelektronika, 2023

A Novel Network Architecture for Microplankton Classification in Digital Holographic Images.
Proceedings of the Pattern Recognition and Machine Intelligence, 2023

Image Caption Synthesis for Low Resource Assamese Language using Bi-LSTM with Bilinear Attention.
Proceedings of the 37th Pacific Asia Conference on Language, 2023

S-VQA: Sentence-Based Visual Question Answering.
Proceedings of the Fourteenth Indian Conference on Computer Vision, 2023

Aggregated Co-attention based Visual Question Answering.
Proceedings of the Fourteenth Indian Conference on Computer Vision, 2023

Deciphering Storytelling Events: A Study of Neural and Prompt-Driven Event Detection in Short Stories.
Proceedings of the International Conference on Asian Language Processing, 2023

Relevance of Language-Specific Training on Image Caption Synthesis for Low Resource Assamese Language.
Proceedings of the International Conference on Asian Language Processing, 2023

2022
Speech/music classification using phase-based and magnitude-based features.
Speech Commun., 2022

Only overlay text: novel features for TV news broadcast video segmentation.
Multim. Tools Appl., 2022

Speech Music Overlap Detection Using Spectral Peak Evolutions.
Proceedings of the Speech and Computer - 24th International Conference, 2022

Overlapped Speech Detection Using AM-FM Based Time-Frequency Representations.
Proceedings of the Speech and Computer - 24th International Conference, 2022

Foreground-Background Audio Separation using Spectral Peaks based Time-Frequency Masks.
Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022

Design of a Low Power and Area Efficient Bfloat16 based Generalized Systolic Array for DNN Applications.
Proceedings of the 32nd International Conference Radioelektronika, 2022

Comparison of Floating-point Representations for the Efficient Implementation of Machine Learning Algorithms.
Proceedings of the 32nd International Conference Radioelektronika, 2022

Hardware Implementation of Low Complexity High-speed Perceptron Block.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2022

GAUR: Genetic Algorithm based Unlocking of Register Transfer Level Locking.
Proceedings of the GLSVLSI '22: Great Lakes Symposium on VLSI 2022, Irvine CA USA, June 6, 2022

2021
Efficient Hardware Implementation of Decision Tree Training Accelerator.
SN Comput. Sci., February, 2021

Training Accelerator for Two Means Decision Tree.
IEEE Trans. Very Large Scale Integr. Syst., 2021

FPGA Implementation of Low Complexity Hybrid Decision Tree Training Accelerator.
Proceedings of the 64th IEEE International Midwest Symposium on Circuits and Systems, 2021

Automatic Detection of Shouted Speech Segments in Indian News Debates.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2020
Speech/Music Classification Using Features From Spectral Peaks.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

A system for semantic segmentation of TV news broadcast videos.
Multim. Tools Appl., 2020

Facial Keypoint Sequence Generation from Audio.
CoRR, 2020

Classification of Speech vs. Speech with Background Music.
Proceedings of the International Conference on Signal Processing and Communications, 2020

Overlapped/Non-Overlapped Speech Transition Point Detection Using Bag-of-Audio-Words.
Proceedings of the International Conference on Signal Processing and Communications, 2020

Analysis of Excitation Source Characteristics for Shouted and Normal Speech Classification.
Proceedings of the 2020 National Conference on Communications, 2020

CQ-VQA: Visual Question Answering on Categorized Questions.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

A Novel Ensemble Framework for Face Search.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

Multi-stage Attention based Visual Question Answering.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

IQ-VQA: Intelligent Visual Question Answering.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

Siamese Fully Convolutional Tracker with Motion Correction.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

2019
Segmenting with style: detecting program and story boundaries in TV news broadcast videos.
Multim. Tools Appl., 2019

Visual Object Tracking Using Perceptron Forests and Optical Flow.
Proceedings of the Pattern Recognition and Machine Intelligence, 2019

Shouted and Normal Speech Classification Using 1D CNN.
Proceedings of the Pattern Recognition and Machine Intelligence, 2019

2018
Time-Frequency Audio Features for Speech-Music Classification.
CoRR, 2018

Excitation Source Feature for Discriminating Shouted and Normal Speech.
Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

Visual Tracking with Breeding Fireflies using Brightness from Background-Foreground Information.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

2017
Success based locally weighted Multiple Kernel combination.
Pattern Recognit., 2017

Object Tracking with Classification Score Weighted Histogram of Sparse Codes.
Proceedings of the Pattern Recognition and Machine Intelligence, 2017

2016
Overlay Text Extraction From TV News Broadcast.
CoRR, 2016

TV Commercial Detection Using Success Based Locally Weighted Kernel Combination.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

News Program Detection in TV Broadcast Videos.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Generic TV advertisement detection using progressively balanced perceptron trees.
Proceedings of the Tenth Indian Conference on Computer Vision, 2016

Reinforcement Learning via Recurrent Convolutional Neural Networks.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Multiple kernel learning using data envelopment analysis and feature vector selection and projection.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Story segmentation in TV news broadcast.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

2015
TV News Channel Commercial Detection Dataset.
Dataset, March, 2015

TV News Commercials Detection using Success based Locally Weighted Kernel Combination.
CoRR, 2015

PD-Shift: Patch Detector Shift based Tracker.
Proceedings of the 2015 Fifth National Conference on Computer Vision, 2015

A Hierarchical Frame-by-Frame Association Method Based on Graph Matching for Multi-object Tracking.
Proceedings of the Advances in Visual Computing - 11th International Symposium, 2015

An occlusion reasoning scheme for monocular pedestrian tracking in dynamic scenes.
Proceedings of the 12th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2015

A novel local success weighted ensemble classifier.
Proceedings of the 3rd IAPR Asian Conference on Pattern Recognition, 2015

2014
Commercial Block Detection in Broadcast News Videos.
Proceedings of the 2014 Indian Conference on Computer Vision, 2014

A Novel Method for Face Track Linking in Videos.
Proceedings of the 2014 Indian Conference on Computer Vision, 2014

2012
The Video Face Book.
Proceedings of the Advances in Multimedia Modeling - 18th International Conference, 2012

Unsupervised Language Learning for Discovered Visual Concepts.
Proceedings of the Computer Vision, 2012

2011
OSiMa: Human Pose Estimation from a Single Image.
Proceedings of the Pattern Recognition and Machine Intelligence, 2011

OCS-14: You Can Get Occluded in Fourteen Ways.
Proceedings of the IJCAI 2011, 2011

Activity Discovery Using Compressed Suffix Trees.
Proceedings of the Image Analysis and Processing - ICIAP 2011, 2011

Formulation, detection and application of occlusion states (Oc-7) in the context of multiple object tracking.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

2008
Objects from Animacy: Joint Discovery in Shape and Haar Feature Space.
Proceedings of the Sixth Indian Conference on Computer Vision, Graphics & Image Processing, 2008

Back to the future: Robust foreground extraction with reversed-time background modeling.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

2007
Language Label Learning for Visual Concepts Discovered from Video Sequences.
Proceedings of the Attention in Cognitive Systems. Theories and Systems from an Interdisciplinary Viewpoint, 2007

2006
Path Planning for a Statically Stable Biped Robot Using PRM and Reinforcement Learning.
J. Intell. Robotic Syst., 2006

Appearance Based Multiple Agent Tracking Under Complex Occlusions.
Proceedings of the PRICAI 2006: Trends in Artificial Intelligence, 2006

Spatio-temporal Discovery: Appearance + Behavior = Agent.
Proceedings of the Computer Vision, Graphics and Image Processing, 5th Indian Conference, 2006

Efficient Continuous Re-grasp Planning for Moving and Deforming Planar Objects.
Proceedings of the 2006 IEEE International Conference on Robotics and Automation, 2006

Activity Discovery from Surveillance Videos.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

A Multiscale Co-linearity Statistic Based Approach to Robust Background Modeling.
Proceedings of the Computer Vision, 2006

2005
Hybrid Hierarchical Learning from Dynamic Scenes.
Proceedings of the Pattern Recognition and Machine Intelligence, 2005

2004
Handling Occlusions in Monocular Surveillance Systems.
Proceedings of the ICVGIP 2004, 2004

