Anirban Dasgupta

ORCID: 0000-0002-8494-3692

  • Indian Institute of Technology at Gandhinagar, Computer Science Department
  • Yahoo! Research
  • Cornell University, Department of Computer Science

According to our database, Anirban Dasgupta authored at least 65 papers between 2002 and 2024.

VPTDrone: Video Processing Toolkit for Smart Surveillance Drone.
Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD), 2024

Simple Weak Coresets for Non-decomposable Classification Measures.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Locality Sensitive Hashing in Fourier Frequency Domain For Soft Set Containment Search.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Online Coresets for Parameteric and Non-Parametric Bregman Clustering.
Trans. Mach. Learn. Res., 2022

On additive approximate submodularity.
Theor. Comput. Sci., 2022

On Coresets for Fair Regression and Individually Fair Clustering.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Analyzing Topic Transitions in Text-Based Social Cascades Using Dual-Network Hawkes Process.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2021

Online Coresets for Clustering with Bregman Divergences.
CoRR, 2020

Efficient Hierarchical Clustering for Classification and Anomaly Detection.
CoRR, 2020

Streaming Coresets for Symmetric Tensor Factorization.
Proceedings of the 37th International Conference on Machine Learning, 2020

On Coresets for Regularized Regression.
Proceedings of the 37th International Conference on Machine Learning, 2020

Improved linear embeddings via Lagrange duality.
Mach. Learn., 2019

On NC algorithms for problems on bounded rank-width graphs.
Inf. Process. Lett., 2018

Mallows Models for Top-k Lists.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Discovering Topical Interactions in Text-Based Cascades Using Hidden Markov Hawkes Processes.
Proceedings of the IEEE International Conference on Data Mining, 2018

Task-Specific Representation Learning for Web-Scale Entity Disambiguation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Caching with Dual Costs.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Saving Critical Nodes with Firefighters is FPT.
Proceedings of the 44th International Colloquium on Automata, Languages, and Programming, 2017

On Sampling Nodes in a Network.
Proceedings of the 25th International Conference on World Wide Web, 2016

A Framework for Estimating Stream Expression Cardinalities.
Proceedings of the 19th International Conference on Database Theory, 2016

On Learning Mixture Models for Permutations.
Proceedings of the 2015 Conference on Innovations in Theoretical Computer Science, 2015

Approximate Modularity.
Proceedings of the IEEE 56th Annual Symposium on Foundations of Computer Science, 2015

Enabling Compliance of Environmental Conditions.
Proceedings of the 2015 Annual Symposium on Computing for Development, 2015

On estimating the average degree.
Proceedings of the 23rd International World Wide Web Conference, 2014

Learning Entangled Single-Sample Gaussians.
Proceedings of the Twenty-Fifth Annual ACM-SIAM Symposium on Discrete Algorithms, 2014

Superposter behavior in MOOC forums.
Proceedings of the First (2014) ACM Conference on Learning @ Scale, 2014

On Reconstructing a Hidden Permutation.
Proceedings of the Approximation, 2014

Crowdsourced judgement elicitation with endogenous proficiency.
Proceedings of the 22nd International World Wide Web Conference, 2013

Aggregating information from the crowd and the network.
Proceedings of the 22nd International World Wide Web Conference, 2013

Optimal hashing schemes for entity matching.
Proceedings of the 22nd International World Wide Web Conference, 2013

Aggregating crowdsourced binary ratings.
Proceedings of the 22nd International World Wide Web Conference, 2013

Summarization Through Submodularity and Dispersion.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

A Constant-Factor Approximation Algorithm for Co-clustering.
Theory Comput., 2012

Overcoming browser cookie churn with clustering.
Proceedings of the Fifth International Conference on Web Search and Web Data Mining, 2012

Impact of Spam Exposure on User Engagement.
Proceedings of the 21st USENIX Security Symposium, Bellevue, WA, USA, August 8-10, 2012

Vote calibration in community question-answering systems.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

Selecting Diverse Features via Spectral Regularization.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Social sampling.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Sparse and Lopsided Set Disjointness via Information Theory.
Proceedings of the Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, 2012

Enhanced email spam filtering through combining similarity graphs.
Proceedings of the Fourth International Conference on Web Search and Web Data Mining, 2011

On scheduling in map-reduce and flow-shops.
Proceedings of the SPAA 2011: Proceedings of the 23rd Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2011

Fast locality-sensitive hashing.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

Spam or ham?: characterizing and detecting fraudulent "not spam" reports in web mail systems.
Proceedings of the 8th Annual Collaboration, 2011

A sparse Johnson-Lindenstrauss transform.
Proceedings of the 42nd ACM Symposium on Theory of Computing, 2010

Sampling Algorithms and Coresets for $\ell_p$ Regression.
SIAM J. Comput., 2009

Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters.
Internet Math., 2009

Online story scheduling in web advertising.
Proceedings of the Twentieth Annual ACM-SIAM Symposium on Discrete Algorithms, 2009

Feature hashing for large scale multitask learning.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Near-Optimal Network Design with Selfish Agents.
Theory Comput., 2008

The Price of Stability for Network Design with Fair Cost Allocation.
SIAM J. Comput., 2008

Statistical properties of community structure in large social and information networks.
Proceedings of the 17th International Conference on World Wide Web, 2008

Sampling algorithms and coresets for $\ell_p$ regression.
Proceedings of the Nineteenth Annual ACM-SIAM Symposium on Discrete Algorithms, 2008

Approximation algorithms for co-clustering.
Proceedings of the Twenty-Seventh ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, 2008

De-duping URLs via rewrite rules.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

Finding (Short) Paths in Social Networks.
Internet Math., 2007

Sampling Algorithms and Coresets for Lp Regression.
CoRR, 2007

The discoverability of the web.
Proceedings of the 16th International Conference on World Wide Web, 2007

Spectral clustering with limited independence.
Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms, 2007

Feature selection methods for text classification.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Learning Using Spectral Methods.
PhD thesis, 2006

Spectral Clustering by Recursive Partitioning.
Proceedings of the Algorithms, 2006

Variable latent semantic indexing.
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005

On Learning Mixtures of Heavy-Tailed Distributions.
Proceedings of the 46th Annual IEEE Symposium on Foundations of Computer Science (FOCS 2005), 2005

Spectral Analysis of Random Graphs with Skewed Degree Distributions.
Proceedings of the 45th Symposium on Foundations of Computer Science (FOCS 2004), 2004

Quantified Computation Tree Logic.
Inf. Process. Lett., 2002
