Chi Wang

Orcid: 0000-0001-5610-5547

Affiliations:
  • Microsoft Research, Redmond, WA, USA
  • University of Illinois at Urbana-Champaign, IL, USA


According to our database1, Chi Wang authored at least 102 papers between 2008 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Universal Graph Compression: Stochastic Block Models.
IEEE Trans. Inf. Theory, February, 2024

StateFlow: Enhancing LLM Task-Solving through State-Driven Workflows.
CoRR, 2024

Training Language Model Agents without Modifying Language Models.
CoRR, 2024

Towards better Human-Agent Alignment: Assessing Task Utility in LLM-Powered Applications.
CoRR, 2024

2023
Towards Lightweight and Automated Representation Learning System for Networks.
IEEE Trans. Knowl. Data Eng., September, 2023

EcoAssistant: Using LLM Assistant More Affordably and Accurately.
CoRR, 2023

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework.
CoRR, 2023

An Empirical Study on Challenging Math Problem Solving with GPT-4.
CoRR, 2023

HyperTime: Hyperparameter Optimization for Combating Temporal Distribution Shifts.
CoRR, 2023

Targeted Hyperparameter Optimization with Lexicographic Preferences Over Multiple Objectives.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Cost-Effective Hyperparameter Optimization for Large Language Model Generation Inference.
Proceedings of the International Conference on Automated Machine Learning, 2023

2022
ACE: Adaptive Constraint-aware Early Stopping in Hyperparameter Optimization.
CoRR, 2022

Mining Robust Default Configurations for Resource-constrained AutoML.
CoRR, 2022

ISUM: Efficiently Compressing Large and Complex Workloads for Scalable Index Tuning.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Budget-aware Index Tuning with Reinforcement Learning.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Automated Machine Learning & Tuning with FLAML.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

2021
Bounding the Last Mile: Efficient Learned String Indexing.
CoRR, 2021

Fair AutoML.
CoRR, 2021

Q-error Bounds of Random Uniform Sampling for Cardinality Estimation.
CoRR, 2021

LightNE: A Lightweight Graph Processing System for Network Embedding.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Instance-Optimized Data Layouts for Cloud Analytics Workloads.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

FLAML: A Fast and Lightweight AutoML Library.
Proceedings of Machine Learning and Systems 2021, 2021

ChaCha for Online AutoML.
Proceedings of the 38th International Conference on Machine Learning, 2021

DynaTune: Dynamic Tensor Program Optimization in Deep Neural Network Compilation.
Proceedings of the 9th International Conference on Learning Representations, 2021

Economic Hyperparameter Optimization with Blended Search Strategy.
Proceedings of the 9th International Conference on Learning Representations, 2021

An Empirical Study on Hyperparameter Optimization for Fine-Tuning Pre-trained Language Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Frugal Optimization for Cost-related Hyperparameters.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Efficiently Approximating Selectivity Functions using Low Overhead Regression Models.
Proc. VLDB Endow., 2020

Understanding the hardness of approximate query processing with joins.
CoRR, 2020

Concentration Bounds for Co-occurrence Matrices of Markov Chains.
CoRR, 2020

Cost Effective Optimization for Cost-related Hyperparameters.
CoRR, 2020

TaxoExpan: Self-supervised Taxonomy Expansion with Position-Enhanced Graph Neural Network.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Qd-tree: Learning Data Layouts for Big Data Analytics.
Proceedings of the 2020 International Conference on Management of Data, 2020

ALEX: An Updatable Adaptive Learned Index.
Proceedings of the 2020 International Conference on Management of Data, 2020

A Matrix Chernoff Bound for Markov Chains and Its Application to Co-occurrence Matrices.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

AdaTune: Adaptive Tensor Program Compilation Made Efficient.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Faster Graph Embeddings via Coarsening.
Proceedings of the 37th International Conference on Machine Learning, 2020

Towards Extracting Highlights From Recorded Live Videos: An Implicit Crowdsourcing Approach.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

2019
Selectivity Estimation for Range Predicates using Lightweight Models.
Proc. VLDB Endow., 2019

FLO: Fast and Lightweight Hyperparameter Optimization for AutoML.
CoRR, 2019

ALEX: An Updatable Adaptive Learned Index.
CoRR, 2019

NetSMF: Large-Scale Network Embedding as Sparse Matrix Factorization.
Proceedings of the World Wide Web Conference, 2019

Fast Approximation of Empirical Entropy via Subsampling.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Efficient Identification of Approximate Best Configuration of Training in Large Datasets.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Role Discovery.
Proceedings of the Encyclopedia of Social Network Analysis and Mining, 2nd Edition, 2018

LinkSO: a dataset for learning to retrieve similar question answer pairs on software development forums.
Proceedings of the 4th ACM SIGSOFT International Workshop on NLP for Software Engineering, 2018

Efficient Attribute Recommendation with Probabilistic Guarantee.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

2017
Accounting for the Correspondence in Commented Data.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Identifying Outlier Arms in Multi-Armed Bandit.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Identifying Semantically Deviating Outlier Documents.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Trust, but Verify: Optimistic Visualizations of Approximate Queries for Exploring Big Data.
Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, 2017

2016
Automatic Entity Recognition and Typing in Massive Text Corpora.
Proceedings of the 25th International Conference on World Wide Web, 2016

Sample + Seek: Approximating Aggregates with Distribution Precision Guarantee.
Proceedings of the 2016 International Conference on Management of Data, 2016

2015
Mining Latent Entity Structures
Synthesis Lectures on Data Mining and Knowledge Discovery, Morgan & Claypool Publishers, ISBN: 978-3-031-01907-4, 2015

A privacy mechanism for mobile-based urban traffic monitoring.
Pervasive Mob. Comput., 2015

Constructing topical hierarchies in heterogeneous information networks.
Knowl. Inf. Syst., 2015

Concept Expansion Using Web Tables.
Proceedings of the 24th International Conference on World Wide Web, 2015

Mining Quality Phrases from Massive Text Corpora.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

GIN: A Clustering Model for Capturing Dual Heterogeneity in Networked Data.
Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30, 2015

Towards Interactive Construction of Topical Hierarchy: A Recursive Tensor Decomposition Approach.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

ClusType: Effective Entity Recognition and Typing by Relation Phrase-Based Clustering.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Automatic Entity Recognition and Typing from Massive Text Corpora: A Phrase and Network Mining Approach.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Constrained Information-Theoretic Tripartite Graph Clustering to Identify Semantically Similar Relations.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

2014
Role Discovery.
Encyclopedia of Social Network Analysis and Mining, 2014

Mining latent entity structures from massive unstructured and interconnected data
PhD thesis, 2014

Scalable Topical Phrase Mining from Text Corpora.
Proc. VLDB Endow., 2014

Scalable and Robust Construction of Topical Hierarchies.
CoRR, 2014

User profiling in an ego network: co-profiling attributes and relationships.
Proceedings of the 23rd International World Wide Web Conference, 2014

NewsNetExplorer: automatic construction and exploration of news information networks.
Proceedings of the International Conference on Management of Data, 2014

Mining latent entity structures from massive unstructured and interconnected data.
Proceedings of the International Conference on Management of Data, 2014

Automatic Construction and Ranking of Topical Keyphrases on Collections of Short Documents.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

Scalable Moment-Based Inference for Latent Dirichlet Allocation.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014

Bringing structure to text: mining phrases, entities, topics, and hierarchies.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

The Wisdom of Minority: Unsupervised Slot Filling Validation based on Multi-dimensional Truth-Finding.
Proceedings of the COLING 2014, 2014

2013
KERT: Automatic Extraction and Ranking of Topical Keyphrases from Content-Representative Document Titles.
CoRR, 2013

Research-insight: providing insight on research by publication network analysis.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Multi-View Clustering via Joint Nonnegative Matrix Factorization.
Proceedings of the 13th SIAM International Conference on Data Mining, 2013

On the Detectability of Node Grouping in Networks.
Proceedings of the 13th SIAM International Conference on Data Mining, 2013

Social patterns: Community detection using behavior-generated network datasets.
Proceedings of the 2nd IEEE Network Science Workshop, 2013

A phrase mining framework for recursive construction of a topical hierarchy.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

EventCube: multi-dimensional search and mining of structured and text data.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Ranking-based name matching for author disambiguation in bibliographic data.
Proceedings of the 2013 KDD Cup 2013 Workshop, 2013

Mining evidences for named entity disambiguation.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

AMETHYST: a system for mining and exploring topical hierarchies of heterogeneous data.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Large-Scale Spectral Clustering on Graphs.
Proceedings of the IJCAI 2013, 2013

Constructing Topical Hierarchies in Heterogeneous Information Networks.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

Semantic Frame-Based Document Representation for Comparable Corpora.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

Content coverage maximization on word networks for hierarchical topic summarization.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

2012
Scalable influence maximization for independent cascade model in large-scale social networks.
Data Min. Knowl. Discov., 2012

Targeted disambiguation of ad-hoc, homogeneous sets of named entities.
Proceedings of the 21st World Wide Web Conference 2012, 2012

Learning Hierarchical Relationships among Partially Ordered Objects with Heterogeneous Attributes and Links.
Proceedings of the Twelfth SIAM International Conference on Data Mining, 2012

2011
WINACS: construction and analysis of web-based computer science information networks.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

Learning online discussion structures by conditional random fields.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Learning relevance from heterogeneous social network and its application in online targeting.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

LikeMiner: a system for mining the power of 'like' in social media networks.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

Dynamic Social Influence Analysis through Time-Dependent Factor Graphs.
Proceedings of the International Conference on Advances in Social Networks Analysis and Mining, 2011

2010
Mining advisor-advisee relationships from research publication networks.
Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2010

On community outliers and their efficient detection in information networks.
Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2010

Scalable influence maximization for prevalent viral marketing in large-scale social networks.
Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2010

2009
Social influence analysis in large-scale networks.
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28, 2009

Decomposition: Privacy Preservation for Multiple Sensitive Attributes.
Proceedings of the Database Systems for Advanced Applications, 2009

2008
BSGI: An Effective Algorithm towards Stronger l-Diversity.
Proceedings of the Database and Expert Systems Applications, 19th International Conference, 2008


  Loading...