Ravi Kiran Sarvadevabhatla

Orcid: 0000-0003-4134-1154

According to our database1, Ravi Kiran Sarvadevabhatla authored at least 66 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
MAdVerse: A Hierarchical Dataset of Multi-Lingual Ads from Diverse Sources and Categories.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

2023
Automated Detection and Counting of Windows using UAV Imagery based Remote Sensing.
CoRR, 2023

DSAG: A Scalable Deep Framework for Action-Conditioned Multi-Actor Full Body Motion Synthesis.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

F3: Fair and Federated Face Attribute Classification with Heterogeneous Data.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2023

"Draw Fast, Guess Slow": Characterizing Interactions in Cooperative Partially Observable Settings with Online Pictionary as a Case Study.
Proceedings of the Human-Computer Interaction - INTERACT 2023 - 19th IFIP TC13 International Conference, York, UK, August 28, 2023

Action-GPT: Leveraging Large-scale Language Models for Improved and Generalized Action Generation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

SeamFormer: High Precision Text Line Segmentation for Handwritten Documents.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

A Cloud-Fog Architecture for Video Analytics on Large Scale Camera Networks Using Semantic Scene Analysis.
Proceedings of the 23rd IEEE/ACM International Symposium on Cluster, 2023

2022
A Fine-Grained Vehicle Detection (FGVD) Dataset for Unconstrained Roads.
Dataset, December, 2022

Action-GPT: Leveraging Large-scale Language Models for Improved and Generalized Zero Shot Action Generation.
CoRR, 2022

Counting in the 2020s: Binned Representations and Inclusive Performance Measures for Deep Crowd Counting Approaches.
CoRR, 2022

MUGL: Large Scale Multi Person Conditional Action Generation with Locomotion.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Hear Me out: Fusional Approaches for Audio Augmented Temporal Action Localization.
Proceedings of the 17th International Joint Conference on Computer Vision, 2022

DrawMon: A Distributed System for Detection of Atypical Sketch Content in Concurrent Pictionary Games.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

A Fine-Grained Vehicle Detection (FGVD) Dataset for Unconstrained Roads✱.
Proceedings of the Thirteenth Indian Conference on Computer Vision, 2022

PSUMNet: Unified Modality Part Streams Are All You Need for Efficient Pose-Based Action Recognition.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

UAV-Based Visual Remote Sensing for Automated Building Inspection.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Detecting, Tracking and Counting Motorcycle Rider Traffic Violations on Unconstrained Roads.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
Quo Vadis, Skeleton Action Recognition?
Int. J. Comput. Vis., 2021

MeronymNet: A Hierarchical Approach for Unified and Controllable Multi-Category Object Generation.
CoRR, 2021

RackLay: Multi-Layer Layout Estimation for Warehouse Racks.
CoRR, 2021

NTU60-X: Towards Skeleton-based Recognition of Subtle Human Actions.
CoRR, 2021

Early Bird: Loop Closures from Opposing Viewpoints for Perceptually-aliased Indoor Environments.
Proceedings of the 16th International Joint Conference on Computer Vision, 2021

Wisdom of (Binned) Crowds: A Bayesian Stratification Paradigm for Crowd Counting.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

MeronymNet: A Hierarchical Model for Unified and Controllable Multi-Category Object Generation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

NTU-X: an enhanced large-scale dataset for improving pose-based recognition of subtle human actions.
Proceedings of the ICVGIP '21: Indian Conference on Computer Vision, Graphics and Image Processing, Jodhpur, India, December 19, 2021

Monocular multi-layer layout estimation for warehouse racks.
Proceedings of the ICVGIP '21: Indian Conference on Computer Vision, Graphics and Image Processing, Jodhpur, India, December 19, 2021

Automatic quantification and visualization of street trees.
Proceedings of the ICVGIP '21: Indian Conference on Computer Vision, Graphics and Image Processing, Jodhpur, India, December 19, 2021

Deformable deep networks for instance segmentation of overlapping multi page handwritten documents.
Proceedings of the ICVGIP '21: Indian Conference on Computer Vision, Graphics and Image Processing, Jodhpur, India, December 19, 2021

Syntactically Guided Generative Embeddings for Zero-Shot Skeleton Action Recognition.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

BoundaryNet: An Attentive Deep Network with Fast Marching Distance Maps for Semi-automatic Layout Annotation.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Palmira: A Deep Deformable Network for Instance Segmentation of Dense and Uneven Layouts in Handwritten Manuscripts.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

MediTables: A New Dataset and Deep Network for Multi-category Table Localization in Medical Documents.
Proceedings of the Document Analysis and Recognition, 2021

DocVisor: A Multi-purpose Web-Based Interactive Visualizer for Document Image Analytics.
Proceedings of the Document Analysis and Recognition, 2021

2020
Operator-in-the-Loop Deep Sequential Multi-Camera Feature Fusion for Person Re-Identification.
IEEE Trans. Inf. Forensics Secur., 2020

Pictionary-Style Word Guessing on Hand-Drawn Object Sketches: Dataset, Analysis and Deep Network Models.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Early Bird: Loop Closures from Opposing Viewpoints for Perceptually-Aliased Indoor Environments.
CoRR, 2020

OPAL-Net: A Generative Model for Part-based Object Layout Generation.
CoRR, 2020

Topological Mapping for Manhattan-like Repetitive Environments.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

An OCR for Classical Indic Documents Containing Arbitrarily Long Words.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
HInDoLA: A Unified Cloud-Based Platform for Annotation, Visualization and Machine Learning-Based Layout Analysis of Historical Manuscripts.
Proceedings of the 2nd International Workshop on Open Services and Tools for Document Analysis, 2019

Indiscapes: Instance Segmentation Networks for Layout Parsing of Historical Indic Manuscripts.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

2018
Deep Sequential Multi-camera Feature Fusion for Person Re-identification.
CoRR, 2018

Game of Sketches: Deep Recurrent Models of Pictionary-Style Word Guessing.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Object Category Understanding via Eye Fixations on Freehand Sketches.
IEEE Trans. Image Process., 2017

SketchParse: Towards Rich Descriptions for Poorly Drawn Sketches using Multi-Task Hierarchical Deep Networks.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

DeLiGAN: Generative Adversarial Networks for Diverse and Limited Data.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

An Introduction to Deep Convolutional Neural Nets for Computer Vision.
Proceedings of the Deep Learning for Medical Image Analysis, 1st Edition, 2017

2016
Enabling My Robot To Play Pictionary: Recurrent Neural Networks For Sketch Recognition.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

SwiDeN: Convolutional Neural Networks For Depiction Invariant Object Recognition.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Analyzing Structural Characteristics of Object Category Representations From Their Semantic-part Distributions.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Analyzing object categories via novel category ranking measures defined on visual feature embeddings.
Proceedings of the Tenth Indian Conference on Computer Vision, 2016

'Part'ly First Among Equals: Semantic Part-Based Benchmarking for State-of-the-Art Object Recognition Systems.
Proceedings of the Computer Vision - ACCV 2016, 2016

2015
A Taxonomy of Deep Convolutional Neural Nets for Computer Vision.
Frontiers Robotics AI, 2015

Freehand Sketch Recognition Using Deep Features.
CoRR, 2015

Category-Epitomes : Discriminatively Minimalist Representations for Object Categories.
CoRR, 2015

Expresso : A user-friendly GUI for designing, training and using Convolutional Neural Networks.
CoRR, 2015

Eye of the Dragon: Exploring Discriminatively Minimalist Sketch-based Abstractions for Object Categories.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

2012
Captain may i?: proxemics study examining factors that influence distance between humanoid robots, children, and adults, during human-robot interaction.
Proceedings of the International Conference on Human-Robot Interaction, 2012

2011
Multimodal approach to affective human-robot interaction design with children.
ACM Trans. Interact. Intell. Syst., 2011

Adaptive facial expression recognition using inter-modal top-down context.
Proceedings of the 13th International Conference on Multimodal Interfaces, 2011

2010
Extended duration human-robot interaction: Tools and analysis.
Proceedings of the 19th IEEE International Conference on Robot and Human Interactive Communication, 2010

2009
Cognitive map architecture.
IEEE Robotics Autom. Mag., 2009

Learning together: ASIMO developing an interactive learning partnership with children.
Proceedings of the 18th IEEE International Symposium on Robot and Human Interactive Communication, 2009

Panoramic attention for humanoid robots.
Proceedings of the 9th IEEE-RAS International Conference on Humanoid Robots, 2009

2008
The memory game: Creating a human-robot interactive scenario for ASIMO.
Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008


  Loading...