Sourav Dutta

Orcid: 0000-0002-8934-9166

Affiliations:
  • Huawei Research Centre, Dublin, Ireland
  • Nokia Bell Labs, Blanchardstown, Ireland (former)
  • Max Planck Institute for Informatics, Saarbrücken, Germany (former)


According to our database, Sourav Dutta authored at least 53 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
VeNoM: Approximate Subgraph Matching with Enhanced Neighbourhood Structural Information.
Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD), 2024

2023
Intent Classification by the Use of Automatically Generated Knowledge Graphs.
Inf., May, 2023

Learning fine-grained search space pruning and heuristics for combinatorial optimization.
J. Heuristics, 2023

Improved Vector Quantization For Dense Retrieval with Contrastive Distillation.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Gradient Sparsification For Masked Fine-Tuning of Transformers.
Proceedings of the International Joint Conference on Neural Networks, 2023

AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classification.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Self-Distilled Quantization: Achieving High Compression Rates in Transformer-Based Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
Self-distilled Pruning of Deep Neural Networks.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

Cage: A Hybrid Framework for Closed-Domain Conversational Agents.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

Semantic Aware Answer Sentence Selection Using Self-Learning Based Domain Adaptation.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

AX-MABSA: A Framework for Extremely Weakly Supervised Multi-label Aspect Based Sentiment Analysis.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Enhanced Sentence Meta-Embeddings for Textual Understanding.
Proceedings of the Advances in Information Retrieval, 2022

Multi-Stage Framework with Refinement Based Point Set Registration for Unsupervised Bi-Lingual Word Alignment.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Aligned Weight Regularizers for Pruning Pretrained Neural Networks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Deep Neural Compression Via Concurrent Pruning and Self-Distillation.
CoRR, 2021

Sequence-to-Sequence Learning on Keywords for Efficient FAQ Retrieval.
CoRR, 2021

DTAFA: Decoupled Training Architecture for Efficient FAQ Retrieval.
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021

Cross-lingual Sentence Embedding using Multi-Task Learning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

VerSaChI: Finding Statistically Significant Subgraph Matches using Chebyshev's Inequality.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Efficient Multi-Lingual Sentence Classification Framework with Sentence Meta Encoders.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

Mufin: Enriching Semantic Understanding of Sentence Embedding using Dual Tune Framework.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

Qasar: Self-Supervised Learning Framework for Extractive Question Answering.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

2020
ChiSeL: Graph Similarity Search using Chi-Squared Statistics in Large Probabilistic Graphs.
Proc. VLDB Endow., 2020

Unsupervised Word Translation Pairing using Refinement based Point Set Registration.
CoRR, 2020

RADAR: Fast Approximate Reverse Rank Queries.
Proceedings of the Intelligent Systems and Applications, 2020

Towards Quantifying the Distance between Opinions.
Proceedings of the Fourteenth International AAAI Conference on Web and Social Media, 2020

2019
Automated assessment of knowledge hierarchy evolution: comparing directed acyclic graphs.
Inf. Retr. J., 2019

Learning Multi-Stage Sparsification for Maximum Clique Enumeration.
CoRR, 2019

Finding a Maximum Clique in Dense Graphs via χ² Statistics.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

A System for Analysis and Remediation of Attrition.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

Fine-Grained Search Space Classification for Hard Enumeration Variants of Subset Problems.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Enriching Taxonomies With Functional Domain Knowledge.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Automated Knowledge Hierarchy Assessment.
Proceedings of the Joint Proceedings of the First International Workshop on Professional Search (ProfS2018); the Second Workshop on Knowledge Graphs and Semantics for Text Retrieval, 2018

Efficient Auto-Generation of Taxonomies for Structured Knowledge Discovery and Organization.
Proceedings of the 29th on Hypertext and Social Media, 2018

ANNOTATE: orgANizing uNstructured cOntenTs viA Topic labEls.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

2017
Efficient knowledge management for named entities from text.
PhD thesis, 2017

Credible Review Detection with Limited Information using Consistency Analysis.
CoRR, 2017

Neighbor-Aware Search for Approximate Labeled Graph Matching using the Chi-Square Statistics.
Proceedings of the 26th International Conference on World Wide Web, 2017

2016
Credible Review Detection with Limited Information Using Consistency Features.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2016

KOGNAC: Efficient Encoding of Large Knowledge Graphs.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015
Cross-Document Co-Reference Resolution using Sample-Based Clustering with Knowledge Enrichment.
Trans. Assoc. Comput. Linguistics, 2015

Unsupervised Rank Aggregation using Hierarchical User Similarity Clustering.
Proceedings of the Thirteenth Scandinavian Conference on Artificial Intelligence, 2015

Predictive Caching Framework for Mobile Wireless Networks.
Proceedings of the 16th IEEE International Conference on Mobile Data Management, 2015

C3EL: A Joint Model for Cross-Document Co-Reference Resolution and Entity Linking.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

MIST: Top-k Approximate Sub-string Mining Using Triplet Statistical Significance.
Proceedings of the Advances in Information Retrieval, 2015

2013
Streaming Quotient Filter: A Near Optimal Approximate Duplicate Detection Approach for Data Streams.
Proc. VLDB Endow., 2013

2012
CloudMap: Workload-aware placement in private heterogeneous clouds.
Proceedings of the 2012 IEEE Network Operations and Management Symposium, 2012

Towards "intelligent compression" in streams: a biased reservoir sampling based Bloom filter approach.
Proceedings of the 15th International Conference on Extending Database Technology, 2012

SmartScale: Automatic Application Scaling in Enterprise Clouds.
Proceedings of the 2012 IEEE Fifth International Conference on Cloud Computing, 2012

2011
Caching Stars in the Sky: A Semantic Caching Approach to Accelerate Skyline Queries.
Proceedings of the Database and Expert Systems Applications, 2011

2010
Mining Statistically Significant Substrings Based on the Chi-Square Measure.
CoRR, 2010

Most Significant Substring Mining Based on Chi-square Measure.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2010

INSTRUCT - Space-Efficient Structure for Indexing and Complete Query Management of String Databases.
Proceedings of the 16th International Conference on Management of Data, 2010

