ChengXiang Zhai

According to our database1, ChengXiang Zhai authored at least 318 papers between 1990 and 2018.

Collaborative distances:

Awards

ACM Fellow

ACM Fellow 2017, "For contributions to information retrieval and text data mining".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepage:

On csauthors.net:

Bibliography

2018
NRF: A Naive Re-identification Framework.
Proceedings of the 2018 Workshop on Privacy in the Electronic Society, 2018

A Large-Scale Empirical Study on Android Runtime-Permission Rationale Messages.
Proceedings of the 2018 IEEE Symposium on Visual Languages and Human-Centric Computing, 2018

A Tutorial on Probabilistic Topic Models for Text Data Retrieval and Analysis.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

A Taxonomy of Queries for E-commerce Search.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Modeling Diverse Relevance Patterns in Ad-hoc Retrieval.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Are we on the Right Track?: An Examination of Information Retrieval Methodologies.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Mining Android App Descriptions for Permission Requirements Recommendation.
Proceedings of the 26th IEEE International Requirements Engineering Conference, 2018

Learning to Rank and Discover for E-Commerce Search.
Proceedings of the Machine Learning and Data Mining in Pattern Recognition, 2018

CLaDS: a cloud-based virtual lab for the delivery of scalable hands-on assignments for practical data science education.
Proceedings of the 23rd Annual ACM Conference on Innovation and Technology in Computer Science Education, 2018

Mining MOOC Lecture Transcripts to Construct Concept Dependency Graphs.
Proceedings of the 11th International Conference on Educational Data Mining, 2018

JIM: Joint Influence Modeling for Collective Search Behavior.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

2017
A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval.
SIGIR Forum, 2017

Document Language Models, Query Models, and Risk Minimization for Information Retrieval.
SIGIR Forum, 2017

Report on the SIGIR 2017 Workshop on Axiomatic Thinking for Information Retrieval and Related Tasks (ATIR).
SIGIR Forum, 2017

Dynamic credit allocation in scientific literature.
Scientometrics, 2017

Modeling the Influence of Popular Trending Events on User Search Behavior.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Numerical Facet Range Partition: Evaluation Metric and Methods.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Constructing and Embedding Abstract Event Causality Networks from Text Snippets.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

Probabilistic Topic Models for Text Data Retrieval and Analysis.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

On Application of Learning to Rank for E-Commerce Search.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Axiomatic Thinking for Information Retrieval: And Related Tasks.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Modeling MOOC Student Behavior With Two-Layer Hidden Markov Models.
Proceedings of the Fourth ACM Conference on Learning @ Scale, 2017

A Probabilistic Approach for Discovering Difficult Course Topics Using Clickstream Data.
Proceedings of the Fourth ACM Conference on Learning @ Scale, 2017

ContextCare: Incorporating Contextual Information Networks to Representation Learning on Medical Forum Data.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Information Retrieval Evaluation as Search Simulation: A General Formal Framework for IR Evaluation.
Proceedings of the ACM SIGIR International Conference on Theory of Information Retrieval, 2017

High-Dimensional Variance-Reduced Stochastic Gradient Expectation-Maximization Algorithm.
Proceedings of the 34th International Conference on Machine Learning, 2017

Identifying Humor in Reviews using Background Text Sources.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Modeling MOOC Student Behavior With Two-Layer Hidden Markov Models.
Proceedings of the 10th International Conference on Educational Data Mining, 2017

A Study of Feature Construction for Text-based Forecasting of Time Series Variables.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

TextScope: Enhance human perception via text mining.
Proceedings of the 2017 IEEE International Conference on Big Data, BigData 2017, 2017

Text-based geolocation prediction of social media users with neural networks.
Proceedings of the 2017 IEEE International Conference on Big Data, BigData 2017, 2017

Temporal reflected logistic regression for probabilistic heart failure survival score prediction.
Proceedings of the 2017 IEEE International Conference on Bioinformatics and Biomedicine, 2017

HEMnet: Integration of Electronic Medical Records with Molecular Interaction Networks and Domain Knowledge for Survival Analysis.
Proceedings of the 8th ACM International Conference on Bioinformatics, 2017

Framing Electronic Medical Records as Polylingual Documents in Query Expansion.
Proceedings of the AMIA 2017, 2017

Towards Privacy-Preserving Evaluation for Information Retrieval Models Over Industry Data Sets.
Proceedings of the Information Retrieval Technology, 2017

Dual-Clustering Maximum Entropy with Application to Classification and Word Embedding.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Non-native text analysis: A survey.
Natural Language Engineering, 2016

Personalized generation of word clouds from tweets.
JASIST, 2016

Towards a game-theoretic framework for text data retrieval.
IEEE Data Eng. Bull., 2016

DeepMeSH: deep semantic representation for improving large-scale MeSH indexing.
Bioinformatics, 2016

A Sequential Decision Formulation of the Interface Card Model for Interactive IR.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

Learning Query and Document Relevance from a Web-scale Click Graph.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

An Exploration of Automated Grading of Complex Assignments.
Proceedings of the Third ACM Conference on Learning @ Scale, 2016

Scaling up Online Question Answering via Similar Question Retrieval.
Proceedings of the Third ACM Conference on Learning @ Scale, 2016

Blind Men and The Elephant: Thurstonian Pairwise Preference for Ranking in Crowdsourcing.
Proceedings of the IEEE 16th International Conference on Data Mining, 2016

Generative Feature Language Models for Mining Implicit Features from Customer Reviews.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

Mobile App Retrieval for Social Media Users via Inference of Implicit Intent in Social Media Text.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

Exploiting temporal divergence of topic distributions for event detection.
Proceedings of the 2016 IEEE International Conference on Big Data, 2016

A conditional probabilistic model for joint analysis of symptoms, diseases, and herbs in traditional Chinese medicine patient records.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2016

PaReCat: Patient Record Subcategorization for Precision Traditional Chinese Medicine.
Proceedings of the 7th ACM International Conference on Bioinformatics, 2016

MeTA: A Unified Toolkit for Text Retrieval and Analysis.
Proceedings of ACL-2016 System Demonstrations, Berlin, Germany, August 7-12, 2016, 2016

2015
Understanding User Intents in Online Health Forums.
IEEE J. Biomedical and Health Informatics, 2015

Beyond Independent Relevance: Methods and Evaluation Metrics for Subtopic Retrieval.
SIGIR Forum, 2015

Overcoming bias to learn about controversial topics.
JASIST, 2015

Negative query generation: bridging the gap between query likelihood retrieval models and relevance.
Inf. Retr. Journal, 2015

OpinoFetch: a practical and efficient approach to collecting opinions on arbitrary entities.
Inf. Retr. Journal, 2015

Exploiting ontology graph for predicting sparsely annotated gene function.
Bioinformatics, 2015

MeSHLabeler: improving the accuracy of large-scale MeSH indexing by integrating diverse evidence.
Bioinformatics, 2015

Information Retrieval as Card Playing: A Formal Model for Optimizing Interactive Retrieval Interface.
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

Towards a Game-Theoretic Framework for Information Retrieval.
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

Leveraging User Reviews to Improve Accuracy for Mobile App Retrieval.
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

Retrieval of Relevant Opinion Sentences for New Products.
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

SpecLDA: Modeling Product Reviews and Specifications to Generate Augmented Specifications.
Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30, 2015

Joint adaptive loss and l2/l0-norm minimization for unsupervised feature selection.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

Beomap: Ad Hoc Topic Maps for Enhanced Exploration of Social Media Data.
Proceedings of the Engineering the Web in the Big Data Era - 15th International Conference, 2015

Axiomatic Analysis of Smoothing Methods in Language Models for Pseudo-Relevance Feedback.
Proceedings of the 2015 International Conference on The Theory of Information Retrieval, 2015

Mining Coordinated Intent Representation for Entity Search and Recommendation.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

SyntacticDiff: Operator-based transformation for comparative text mining.
Proceedings of the 2015 IEEE International Conference on Big Data, 2015

Hotspots of news articles: Joint mining of news text & social media to discover controversial points in news.
Proceedings of the 2015 IEEE International Conference on Big Data, 2015

Recommending forum posts to designated experts.
Proceedings of the 2015 IEEE International Conference on Big Data, 2015

2014
Exploiting rich user information for one-class collaborative filtering.
Knowl. Inf. Syst., 2014

Content-based citation analysis: The next generation of citation analysis.
JASIST, 2014

Bug characteristics in open source software.
Empirical Software Engineering, 2014

User modeling in search logs via a nonparametric bayesian approach.
Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, 2014

A two-dimensional click model for query auto-completion.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

VIRLab: A Platform for Privacy-Preserving Evaluation for Information Retrieval Models.
Proceedings of the Proceeding of the 1st International Workshop on Privacy-Preserving IR: When Information Retrieval Meets Privacy and Security co-located with 37th Annual International ACM SIGIR conference, 2014

Axiomatic analysis and optimization of information retrieval models.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

VIRLab: a web-based virtual lab for learning and studying information retrieval models.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

A Constrained Hidden Markov Model Approach for Non-Explicit Citation Context Extraction.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

The Fudan-UIUC Participation in the BioASQ Challenge Task 2a: The Antinomyra system.
Proceedings of the Working Notes for CLEF 2014 Conference, 2014

Mining Semi-Structured Online Knowledge Bases to Answer Natural Language Questions on Community QA Websites.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

Unsupervised Feature Selection for Multi-View Clustering on Text-Image Web News Data.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

Revisiting the Divergence Minimization Feedback Model.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

Random walks on adjacency graphs for mining lexical relations from big text data.
Proceedings of the 2014 IEEE International Conference on Big Data, 2014

Understanding user intents in online health forums.
Proceedings of the 5th ACM Conference on Bioinformatics, 2014

SideEffectPTM: an unsupervised topic model to mine adverse drug reactions from health forums.
Proceedings of the 5th ACM Conference on Bioinformatics, 2014

Resolving healthcare forum posts via similar thread retrieval.
Proceedings of the 5th ACM Conference on Bioinformatics, 2014

Text Classification.
Proceedings of the Data Classification: Algorithms and Applications, 2014

2013
MiTexCube: MicroTextCluster Cube for online analysis of text cells and its applications.
Statistical Analysis and Data Mining, 2013

Supporting Keyword Search in Product Database: A Probabilistic Approach.
PVLDB, 2013

Leveraging comparable corpora for cross-lingual information retrieval in resource-lean language pairs.
Inf. Retr., 2013

A learning approach to optimizing exploration-exploitation tradeoff in relevance feedback.
Inf. Retr., 2013

Content-aware click modeling.
Proceedings of the 22nd International World Wide Web Conference, 2013

Ranking explanatory sentences for opinion summarization.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

Structural Parse Tree Features for Text Representation.
Proceedings of the 2013 IEEE Seventh International Conference on Semantic Computing, 2013

Understanding evolution of research themes: a probabilistic generative model for citations.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

EventCube: multi-dimensional search and mining of structured and text data.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Robust Unsupervised Feature Selection.
Proceedings of the IJCAI 2013, 2013

Axiomatic Analysis and Optimization of Information Retrieval Models.
Proceedings of the International Conference on the Theory of Information Retrieval, 2013

Exploiting Forum Thread Structures to Improve Thread Clustering.
Proceedings of the International Conference on the Theory of Information Retrieval, 2013

Information Retrieval with Time Series Query.
Proceedings of the International Conference on the Theory of Information Retrieval, 2013

Statistical Translation Language Model for Twitter Search.
Proceedings of the International Conference on the Theory of Information Retrieval, 2013

Content coverage maximization on word networks for hierarchical topic summarization.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Mining entity attribute synonyms via compact clustering.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Unsupervised identification of synonymous query intent templates for attribute intents.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Mining causal topics in text data: iterative topic modeling with time series feedback.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Compact explanatory opinion summarization.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

FindiLike: a preference driven entity search engine for evaluating entity retrieval and opinion summarization.
Proceedings of the 2013 workshop on Living labs for information retrieval evaluation, 2013

A probabilistic mixture model for mining and analyzing product search log.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

2012
Leveraging medical thesauri and physician feedback for improving medical literature retrieval for case queries.
JAMIA, 2012

Opinion-based entity ranking.
Inf. Retr., 2012

Integer linear programming for Constrained Multi-Aspect Committee Review Assignment.
Inf. Process. Manage., 2012

CloudSpeller: query spelling correction by using a unified hidden markov model with web-scale resources.
Proceedings of the 21st World Wide Web Conference, 2012

Micropinion generation: an unsupervised approach to generating ultra-concise summaries of opinions.
Proceedings of the 21st World Wide Web Conference 2012, 2012

FindiLike: preference driven entity search.
Proceedings of the 21st World Wide Web Conference, 2012

Tapping into knowledge base for concept feedback: leveraging conceptnet to improve search results for difficult queries.
Proceedings of the Fifth International Conference on Web Search and Web Data Mining, 2012

A generalized hidden Markov model with discriminative training for query spelling correction.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

SympGraph: a framework for mining clinical notes through symptom relation graphs.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Building enriched web page representations using link paths.
Proceedings of the 23rd ACM Conference on Hypertext and Social Media, 2012

A Discriminative Model for Query Spelling Correction with Latent Structural SVM.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Reliability Prediction of Webpages in the Medical Domain.
Proceedings of the Advances in Information Retrieval, 2012

A Log-Logistic Model-Based Interpretation of TF Normalization of BM25.
Proceedings of the Advances in Information Retrieval, 2012

Axiomatic Analysis of Translation Language Model for Information Retrieval.
Proceedings of the Advances in Information Retrieval, 2012

Score Transformation in Linear Combination for Multi-criteria Relevance Ranking.
Proceedings of the Advances in Information Retrieval, 2012

BiasTrust: teaching biased users about controversial topics.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Mining long-lasting exploratory user interests from search history.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Query likelihood with negative query generation.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Unsupervised discovery of opposing opinion networks from forum discussions.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

InCaToMi: integrative causal topic miner between textual and non-textual time series data.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Click patterns: an empirical representation of complex query intents.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Unbiased learning of controversial topics.
Proceedings of the Information, Interaction, Innovation: Celebrating the Past, Constructing the Present and Creating the Future, 2012

Enriching text representation with frequent pattern mining for probabilistic topic modeling.
Proceedings of the Information, Interaction, Innovation: Celebrating the Past, Constructing the Present and Creating the Future, 2012

Predicting future popularity trend of events in microblogging platforms.
Proceedings of the Information, Interaction, Innovation: Celebrating the Past, Constructing the Present and Creating the Future, 2012

A Survey of Text Classification Algorithms.
Proceedings of the Mining Text Data, 2012

A Survey of Text Clustering Algorithms.
Proceedings of the Mining Text Data, 2012

An Introduction to Text Mining.
Proceedings of the Mining Text Data, 2012

2011
Diagnostic Evaluation of Information Retrieval Models.
ACM Trans. Inf. Syst., 2011

Efficient Keyword-Based Search for Top-K Cells in Text Cube.
IEEE Trans. Knowl. Data Eng., 2011

BeeSpace Navigator: exploratory analysis of gene function using semantic indexing of biological literature.
Nucleic Acids Research, 2011

Investigating task performance of probabilistic topic models: an empirical study of PLSA and LDA.
Inf. Retr., 2011

Geographical topic discovery and comparison.
Proceedings of the 20th International Conference on World Wide Web, 2011

Automatic construction of a context-aware sentiment lexicon: an optimization approach.
Proceedings of the 20th International Conference on World Wide Web, 2011

Mining named entities with temporally correlated bursts from multilingual web news streams.
Proceedings of the Forth International Conference on Web Search and Web Data Mining, 2011

Beyond search: statistical topic models for text analysis.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Learning online discussion structures by conditional random fields.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

A boosting approach to improving pseudo-relevance feedback.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

When documents are very long, BM25 fails!
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Unsupervised query segmentation using clickthrough for information retrieval.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Latent aspect rating analysis without aspect keyword supervision.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

Content-driven trust propagation framework.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

Axiomatic Analysis and Optimization of Information Retrieval Models.
Proceedings of the Advances in Information Retrieval Theory, 2011

LPTA: A Probabilistic Model for Latent Periodic Topic Analysis.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Exploiting Thread Structures to Improve Smoothing of Language Models for Forum Post Retrieval.
Proceedings of the Advances in Information Retrieval, 2011

Adaptive term frequency normalization for BM25.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Lower-bounding term frequency normalization.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Interactive sense feedback for difficult queries.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Improving retrieval accuracy of difficult queries through generalizing negative document language models.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Automatic query reformulation with syntactic operators to alleviate search difficulty.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

MiTexCube: MicroTextCluster Cube for Online Analysis of Text Cells.
Proceedings of the 2011 Conference on Intelligent Data Understanding, 2011

Structural Topic Model for Latent Topical Structure Analysis.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2010
Web N-gram workshop 2010.
SIGIR Forum, 2010

BSQA: integrated text mining using entity relation semantics extracted from biological literature of insects.
Nucleic Acids Research, 2010

Introduction to special issue on learning to rank for information retrieval.
Inf. Retr., 2010

Discovery of gene network variability across samples representing multiple classes.
IJBRA, 2010

Identifying overrepresented concepts in gene lists from literature: a statistical approach based on Poisson mixture model.
BMC Bioinformatics, 2010

Towards natural question guided search.
Proceedings of the 19th International Conference on World Wide Web, 2010

Positional relevance model for pseudo-relevance feedback.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Estimation of statistical translation models based on mutual information for ad hoc information retrieval.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Latent aspect rating analysis on review text data: a rating regression approach.
Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2010

TopCells: Keyword-based search of top-k aggregated documents in text cube.
Proceedings of the 26th International Conference on Data Engineering, 2010

Summarizing Contrastive Viewpoints in Opinionated Text.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Aggregation of Multiple Judgments for Evaluating Ordered Lists.
Proceedings of the Advances in Information Retrieval, 2010

Shallow Information Extraction from Medical Forum Data.
Proceedings of the COLING 2010, 2010

Exploiting Structured Ontology to Organize Scattered Online Opinions.
Proceedings of the COLING 2010, 2010

Opinosis: A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions.
Proceedings of the COLING 2010, 2010

Medical Case-based Retrieval by Leveraging Medical Ontology and Physician Feedback: UIUC-IBM at ImageCLEF 2010.
Proceedings of the CLEF 2010 LABs and Workshops, 2010

PTM: probabilistic topic mapping model for mining parallel document collections.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Improving one-class collaborative filtering by incorporating rich user information.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Exploration-exploitation tradeoff in interactive relevance feedback.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Keyword Search in Text Cube: Finding Top-k Aggregated Cell Documents.
Proceedings of the 2010 Conference on Intelligent Data Understanding, 2010

Cross-Lingual Latent Topic Extraction.
Proceedings of the ACL 2010, 2010

2009
Web Search Result De-duplication and Clustering.
Proceedings of the Encyclopedia of Database Systems, 2009

Web Search Relevance Feedback.
Proceedings of the Encyclopedia of Database Systems, 2009

Learning to rank for information retrieval (LR4IR 2009).
SIGIR Forum, 2009

Topic modeling for OLAP on multidimensional text databases: topic cube and its applications.
Statistical Analysis and Data Mining, 2009

iNextCube: Information Network-Enhanced Text Cube.
PVLDB, 2009

An empirical study of gene synonym query expansion in biomedical information retrieval.
Inf. Retr., 2009

Inference of gene pathways using mixture Bayesian networks.
BMC Systems Biology, 2009

Rated aspect summarization of short comments.
Proceedings of the 18th International Conference on World Wide Web, 2009

Adaptive Clustering of Search Results.
Proceedings of the User Modeling, 2009

Finding Related Entities by Retrieving Relations: UIUC at TREC 2009 Entity Track.
Proceedings of The Eighteenth Text REtrieval Conference, 2009

A Study of Term Proximity and Document Weighting Normalization in Pseudo Relevance Feedback--UIUC at TREC 2009 Million Query Track.
Proceedings of The Eighteenth Text REtrieval Conference, 2009

Massive Implicit Feedback: Organizing Search Logs into Topic Maps for Collaborative Surfing.
Proceedings of the Workshop on Understanding the User, 2009

Positional language models for information retrieval.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Topic Cube: Topic Modeling for OLAP on Multidimensional Text Databases.
Proceedings of the SIAM International Conference on Data Mining, 2009


Parallel PathFinder Algorithms for Mining Structures from Graphs.
Proceedings of the ICDM 2009, 2009

Beyond hyperlinks: organizing information footprints in search logs to support effective browsing.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

A comparative study of methods for estimating query language models with pseudo feedback.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Adaptive relevance feedback in information retrieval.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Generating comparative summaries of contradictory opinions in text.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Constrained multi-aspect expertise matching for committee review assignment.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Evaluation of methods for relative comparison of retrieval systems based on clickthroughs.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
Statistical Language Models for Information Retrieval
Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers, 2008

DirichletRank: Solving the zero-one gap problem of PageRank.
ACM Trans. Inf. Syst., 2008

Learning to rank for information retrieval (LR4IR 2008).
SIGIR Forum, 2008

Smoothing document language models with probabilistic term count propagation.
Inf. Retr., 2008

Statistical Language Models for Information Retrieval: A Critical Review.
Foundations and Trends in Information Retrieval, 2008

Multi-label literature classification based on the Gene Ontology graph.
BMC Bioinformatics, 2008

Topic modeling with network regularization.
Proceedings of the 17th International Conference on World Wide Web, 2008

Opinion integration through semi-supervised topic modeling.
Proceedings of the 17th International Conference on World Wide Web, 2008

A Study of Adaptive Relevance Feedback - UIUC TREC 2008 Relevance Feedback Experiments.
Proceedings of The Seventeenth Text REtrieval Conference, 2008

Opinion Summarization Using Entity Features and Probabilistic Sentence Coherence Optimization: UIUC at TAC 2008 Opinion Summarization Pilot.
Proceedings of the First Text Analysis Conference, 2008

A study of methods for negative relevance feedback.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

A general optimization framework for smoothing language models on graph structures.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

Mining multi-faceted overviews of arbitrary topics in a text collection.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

Ranking Database Queries with User Feedback: A Neural Network Approach.
Proceedings of the Database Systems for Advanced Applications, 2008

Mining term association patterns from search logs for effective query reformulation.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

Multi-aspect expertise matching for review assignment.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

Modeling hidden topics on document manifold.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

Generating Impact-Based Summaries for Scientific Literature.
Proceedings of the ACL 2008, 2008

2007
Semantic annotation of frequent patterns.
TKDD, 2007

Privacy protection in personalized search.
SIGIR Forum, 2007

Learning to rank for information retrieval (LR4IR 2007).
SIGIR Forum, 2007

Meeting of the MINDS: an information retrieval research agenda.
SIGIR Forum, 2007

An empirical study of tokenization strategies for biomedical information retrieval.
Inf. Retr., 2007

Generating gene summaries from biomedical literature: A study of semi-structured summarization.
Inf. Process. Manage., 2007

Topic sentiment mixture: modeling facets and opinions in weblogs.
Proceedings of the 16th International Conference on World Wide Web, 2007

Context-Aware Wrapping: Synchronized Data Extraction.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Language Models for Genomics Information Retrieval: UIUC at TREC 2007 Genomics Track.
Proceedings of The Sixteenth Text REtrieval Conference, 2007

Learn from web search logs to organize search results.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

An exploration of proximity measures in information retrieval.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Term feedback for information retrieval with language models.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

A study of Poisson query generation model for information retrieval.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Statistical Language Models for Information Retrieval.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

A Systematic Exploration of the Feature Space for Relation Extraction.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Mining correlated bursty topic patterns from coordinated text streams.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Automatic labeling of multinomial topic models.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Collaborative Wrapping: A Turbo Framework for Web Data Extraction.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Probabilistic Models for Expert Finding.
Proceedings of the Advances in Information Retrieval, 2007

Improve retrieval accuracy for difficult queries using negative feedback.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007

A two-stage approach to domain adaptation for statistical classifiers.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007

Inference of Gene Pathways Using Gaussian Mixture Models.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2007

Instance Weighting for Domain Adaptation in NLP.
Proceedings of the ACL 2007, 2007

2006
Extraction of coherent relevant passages using hidden Markov models.
ACM Trans. Inf. Syst., 2006

Research Paper: Enhancing Text Categorization with Semantic-enriched Representation and Training Data Augmentation.
JAMIA, 2006

A study of mixture models for collaborative filtering.
Inf. Retr., 2006

A risk minimization framework for information retrieval.
Inf. Process. Manage., 2006

A probabilistic approach to spatiotemporal theme pattern mining on weblogs.
Proceedings of the 15th international conference on World Wide Web, 2006

Robust Pseudo Feedback Estimation and HMM Passage Extraction: UIUC at TREC 2006 Genomics Track.
Proceedings of the Fifteenth Text REtrieval Conference, 2006

Language Models for Expert Finding--UIUC TREC 2006 Enterprise Track Experiments.
Proceedings of the Fifteenth Text REtrieval Conference, 2006

Latent semantic analysis for multiple-type interrelated data objects.
Proceedings of the SIGIR 2006: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2006

Regularized estimation of mixture models for robust pseudo-relevance feedback.
Proceedings of the SIGIR 2006: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2006

Semantic term matching in axiomatic approaches to information retrieval.
Proceedings of the SIGIR 2006: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2006

Automatically Generating Gene Summaries from Biomedical Literature.
Proceedings of the Biocomputing 2006, 2006

Language Model Information Retrieval with Document Expansion.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Exploiting Domain Structure for Named Entity Recognition.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Mining long-term search history to improve search accuracy.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

A mixture model for contextual text mining.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Generating semantic annotations for frequent patterns with context analysis.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Unsupervised Named Entity Transliteration Using Temporal and Phonetic Correlation.
Proceedings of the EMNLP 2006, 2006

Best-k queries on database systems.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

A probabilistic relevance propagation model for hypertext retrieval.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

Have things changed now?: an empirical study of bug characteristics in modern open source software.
Proceedings of the 1st Workshop on Architectural and System Support for Improving Software Dependability, 2006

Named Entity Transliteration with Comparable Corpora.
Proceedings of the ACL 2006, 2006

2005
UIUC/MUSC at TREC 2005 Genomics Track.
Proceedings of the Fourteenth Text REtrieval Conference, 2005

Interactive Construction of Query Language Models - UIUC TREC 2005 HARD Track Experiments.
Proceedings of the Fourteenth Text REtrieval Conference, 2005

An Axiomatic Approach to IR--UIUC TREC 2005 Robust Track Experiments.
Proceedings of the Fourteenth Text REtrieval Conference, 2005

Active feedback in ad hoc information retrieval.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

UCAIR: a personalized search toolbar.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Context-sensitive information retrieval using implicit feedback.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

An exploration of axiomatic approaches to information retrieval.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Mining comparable bilingual text corpora for cross-language information integration.
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005

Discovering evolutionary theme patterns from text: an exploration of temporal text mining.
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005

Accurate language model estimation with document expansion.
Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, Bremen, Germany, October 31, 2005

Implicit user modeling for personalized search.
Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, Bremen, Germany, October 31, 2005

Accurately extracting coherent relevant passages using hidden Markov models.
Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, Bremen, Germany, October 31, 2005

2004
A study of smoothing methods for language models applied to information retrieval.
ACM Trans. Inf. Syst., 2004

Automatic annotation of protein motif function with Gene Ontology terms.
BMC Bioinformatics, 2004

UIUC in HARD 2004--Passage Retrieval Using HMMs.
Proceedings of the Thirteenth Text REtrieval Conference, 2004

A two-stage mixture model for pseudo feedback.
Proceedings of the SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004

A session-based search engine.
Proceedings of the SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004

ACES: a contextual engine for search.
Proceedings of the SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004

A formal study of information retrieval heuristics.
Proceedings of the SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004

A cross-collection mixture model for comparative text mining.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

Audio segment retrieval using a short duration example query.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Subspace Clustering for Microarray Data Analysis: Multiple Criteria and Significance Assessment.
Proceedings of the 3rd International IEEE Computer Society Computational Systems Bioinformatics Conference, 2004

2003
Challenges in information retrieval and language modeling: report of a workshop held at the center for intelligent information retrieval, University of Massachusetts Amherst, September 2002.
SIGIR Forum, 2003

Building Data Integration Systems: A Mass Collaboration Approach.
Proceedings of the International Workshop on Web and Databases, 2003

Preference-based Graphic Models for Collaborative Filtering.
Proceedings of the UAI '03, 2003

Improving the Robustness of Language Models - UIUC TREC 2003 Robust and Genomics Experiments.
Proceedings of The Twelfth Text REtrieval Conference, 2003

Active Feedback - UIUC TREC-2003 HARD Experiments.
Proceedings of The Twelfth Text REtrieval Conference, 2003

Relevance Propagation for Topic Distillation UIUC TREC 2003 Web Track Experiments.
Proceedings of The Twelfth Text REtrieval Conference, 2003

Beyond independent relevance: methods and evaluation metrics for subtopic retrieval.
Proceedings of the SIGIR 2003: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 28, 2003

Exploiting query history for document ranking in interactive information retrieval.
Proceedings of the SIGIR 2003: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 28, 2003

Error analysis of difficult TREC topics.
Proceedings of the SIGIR 2003: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 28, 2003

Information retrieval for OCR documents: a content-based probabilistic correction model.
Proceedings of the Document Recognition and Retrieval X, 2003

Text classification from positive and unlabeled documents.
Proceedings of the 2003 ACM CIKM International Conference on Information and Knowledge Management, 2003

Collaborative filtering with decoupled models for preferences and ratings.
Proceedings of the 2003 ACM CIKM International Conference on Information and Knowledge Management, 2003

2002
Database Research at the University of Illinois at Urbana-Champaign.
SIGMOD Record, 2002

Risk minimization and language modeling in text retrieval dissertation abstract.
SIGIR Forum, 2002

Two-stage language models for information retrieval.
Proceedings of the SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

Title language model for information retrieval.
Proceedings of the SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

2001
A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval.
Proceedings of the SIGIR 2001: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2001

Document Language Models, Query Models, and Risk Minimization for Information Retrieval.
Proceedings of the SIGIR 2001: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2001

Model-based Feedback in the Language Modeling Approach to Information Retrieval.
Proceedings of the 2001 ACM CIKM International Conference on Information and Knowledge Management, 2001

2000
Exploration of a heuristic approach to threshold learning in adaptive filtering.
Proceedings of the SIGIR 2000: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2000

1999
Optimization in CLARIT TREC-8 Adaptive Filtering.
Proceedings of The Eighth Text REtrieval Conference, 1999

CLARIT TREC-8 Manual Ad-Hoc Experiments.
Proceedings of The Eighth Text REtrieval Conference, 1999

1998
Threshold Calibration in CLARIT Adaptive Filtering.
Proceedings of The Seventh Text REtrieval Conference, 1998

1997
Fast Statistical Parsing of Noun Phrases for Document Indexing.
Proceedings of the 5th Applied Natural Language Processing Conference, 1997

1996
Evaluation of Syntactic Phrase Indexing -- CLARIT NLP Track Report.
Proceedings of The Fifth Text REtrieval Conference, 1996

OCR Correction and Query Expansion for Retrieval on OCR Data -- CLARIT TREC-5 Confusion Track Report.
Proceedings of The Fifth Text REtrieval Conference, 1996

Experiments on Chinese Text Indexing -- CLARIT TREC-5 Chinese Track Report.
Proceedings of The Fifth Text REtrieval Conference, 1996

CLARIT Compound Queries and Constraint-Controlled Feedback in TREC-5 Ad-Hoc Experiments.
Proceedings of The Fifth Text REtrieval Conference, 1996

Noun-Phrase Analysis in Unrestricted Text for Information Retrieval.
Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics, 1996

1995
CLARIT TREC-4 Interactive Experiments.
Proceedings of The Fourth Text REtrieval Conference, 1995

1990
Preliminary ideas of a conceptual programming language.
SIGPLAN Notices, 1990


  Loading...