Yiming Yang

Orcid: 0000-0001-8322-607X

Affiliations:
  • Carnegie Mellon University, Language Technologies Institute, Pittsburgh, PA, USA
  • Kyoto University, Japan (PhD 1996)


According to our database1, Yiming Yang authored at least 244 papers between 1984 and 2026.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Alternative mixed integer linear programming optimization for joint job scheduling and data allocation in grid computing.
Future Gener. Comput. Syst., 2026

2025
Training Proactive and Personalized LLM Agents.
CoRR, November, 2025

Scaling Long-Horizon LLM Agent via Context-Folding.
CoRR, October, 2025

ZeroGR: A Generalizable and Scalable Framework for Zero-Shot Generative Retrieval.
CoRR, October, 2025

Data Management System Analysis for Distributed Computing Workloads.
CoRR, October, 2025

CGSim: A Simulation Framework for Large Scale Distributed Computing Environment.
CoRR, October, 2025

Generalizable End-to-End Tool-Use RL with Synthetic CodeGym.
CoRR, September, 2025

Machine Learning-Driven Predictive Resource Management in Complex Science Workflows.
CoRR, September, 2025

SPL-LNS: Sampling-Enhanced Large Neighborhood Search for Solving Integer Linear Programs.
CoRR, August, 2025

Agentic-R1: Distilled Dual-Strategy Reasoning.
CoRR, July, 2025

Towards Community-Driven Agents for Machine Learning Engineering.
CoRR, June, 2025

Towards an Introspective Dynamic Model of Globally Distributed Computing Infrastructures.
CoRR, June, 2025

Sample Complexity and Representation Ability of Test-time Scaling Paradigms.
CoRR, June, 2025

Enhancing Training Data Attribution with Representational Optimization.
CoRR, May, 2025

A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization.
CoRR, May, 2025

CodePDE: An Inference Framework for LLM-driven PDE Solver Generation.
CoRR, May, 2025

CO-Bench: Benchmarking Language Model Agents in Algorithm Search for Combinatorial Optimization.
CoRR, April, 2025

Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning.
CoRR, January, 2025

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Few-shot Personalization of LLMs with Mis-aligned Responses.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Maximal Update Parametrization and Zero-Shot Hyperparameter Transfer for Fourier Neural Operators.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Regularized Langevin Dynamics for Combinatorial Optimization.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Optimizing Temperature for Language Models with Multi-Sample Inference.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Self-Play Preference Optimization for Language Model Alignment.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for LLM Problem-Solving.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Lean-STaR: Learning to Interleave Thinking and Proving.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Improve Vision Language Model Chain-of-thought Reasoning.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

SORREL: Suboptimal-Demonstration-Guided Reinforcement Learning for Learning to Branch.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models.
CoRR, 2024

HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild.
CoRR, 2024

Self-Imagine: Effective Unimodal Reasoning with Multimodal Models using Self-Imagination.
CoRR, 2024

Representation Learning and Information Retrieval.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024


Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AutoMix: Automatically Mixing Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

In-Context Principle Learning from Mistakes.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

SALMON: Self-Alignment with Instructable Reward Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning Performance-Improving Code Edits.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Functional Interpolation for Relative Positions improves Long Context Transformers.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning to Correct for QA Reasoning with Black-box LLMs.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Generation-driven Contrastive Self-training for Zero-shot Text Classification with Instruction-following LLM.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Aligning Large Multimodal Models with Factually Augmented RLHF.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
A Self-enhancement Approach for Domain-specific Chatbot Training via Knowledge Mining and Digest.
CoRR, 2023

SALMON: Self-Alignment with Principle-Following Reward Models.
CoRR, 2023

Accelerating Diffusion-based Combinatorial Optimization Solvers by Progressive Distillation.
CoRR, 2023

Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs.
CoRR, 2023

Generation-driven Contrastive Self-training for Zero-shot Text Classification with Instruction-tuned GPT.
CoRR, 2023

Self-Refine: Iterative Refinement with Self-Feedback.
CoRR, 2023

Learning Performance-Improving Code Edits.
CoRR, 2023

Retrieval-Enhanced Generative Model for Large-Scale Knowledge Graph Completion.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Self-Refine: Iterative Refinement with Self-Feedback.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Neural PDE Solver with Temporal Stencil Modeling.
Proceedings of the International Conference on Machine Learning, 2023

PAL: Program-aided Language Models.
Proceedings of the International Conference on Machine Learning, 2023

Recitation-Augmented Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

CompleQA: Benchmarking the Impacts of Knowledge Graph Completion Methods on Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Active Retrieval Augmented Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Long-tailed Extreme Multi-label Text Classification by the Retrieval of Generated Pseudo Label Descriptions.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

PESCO: Prompt-enhanced Self Contrastive Learning for Zero-shot Text Classification.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
FLOWGEN: Fast and slow graph generation.
CoRR, 2022

Long-tailed Extreme Multi-label Text Classification with Generated Pseudo Label Descriptions.
CoRR, 2022

Exploiting Local and Global Features in Transformer-based Extreme Multi-label Text Classification.
CoRR, 2022

DIMES: A Differentiable Meta Solver for Combinatorial Optimization Problems.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning to repair: Repairing model output errors after deployment using a dynamic memory of feedback.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Sparse Attention with Learning to Hash.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Language Models of Code are Few-Shot Commonsense Learners.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Memory-assisted prompt editing to improve GPT-3 after deployment.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Conditional set generation using Seq2seq models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

KG-FiD: Infusing Knowledge Graph in Fusion-in-Decoder for Open-Domain Question Answering.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

JAKET: Joint Pre-training of Knowledge Graph and Language Understanding.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Improving scripts with a memory of natural feedback.
CoRR, 2021

Interscript: A dataset for interactive learning of scripts through error feedback.
CoRR, 2021

Improving Neural Model Performance through Natural Language Feedback on Their Explanations.
CoRR, 2021

Improving Hyper-Relational Knowledge Graph Completion.
CoRR, 2021

CURIE: An Iterative Querying Approach for Reasoning About Situations.
CoRR, 2021

Knowledge Embedding Based Graph Convolutional Network.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Unsupervised Extractive Text Summarization with Distance-Augmented Sentence Graphs.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Enhancing Summarization with Text Classification via Topic Consistency.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2021

Neural Language Modeling for Contextualized Temporal Graph Generation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Meta Back-Translation.
Proceedings of the 9th International Conference on Learning Representations, 2021

Rethinking Transformer-based Set Prediction for Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Think about it! Improving defeasible reasoning by first modeling the question scenario.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Towards Using Heterogeneous Relation Graphs for End-to-End TTS.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Could you give me a hint ? Generating inference graphs for defeasible reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
EIGEN: Event Influence GENeration using Pre-trained Language Models.
CoRR, 2020

Unsupervised Parallel Corpus Mining on Web Data.
CoRR, 2020

Kernel Stein Generative Modeling.
CoRR, 2020

Generalized Multi-Relational Graph Convolution Network.
CoRR, 2020

Practical Comparable Data Collection for Low-Resource Languages via Images.
CoRR, 2020

Explainable Unsupervised Change-point Detection via Graph Neural Networks.
CoRR, 2020

Graph-Revised Convolutional Network.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2020

Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Taming Pretrained Transformers for Extreme Multi-label Text Classification.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Correlation-Aware Change-Point Detection via Graph Neural Networks.
Proceedings of the Neural Information Processing - 27th International Conference, 2020

An EM Approach to Non-autoregressive Conditional Sequence Generation.
Proceedings of the 37th International Conference on Machine Learning, 2020

Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework.
Proceedings of the 8th International Conference on Learning Representations, 2020

Pre-training Tasks for Embedding-based Large-scale Retrieval.
Proceedings of the 8th International Conference on Learning Representations, 2020

On the Sentence Embeddings from Pre-trained Language Models.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Violin: A Large-Scale Dataset for Video-and-Language Inference.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning Relation Entailment with Structured and Textual Information.
Proceedings of the Conference on Automated Knowledge Base Construction, 2020

Predicting Performance for Natural Language Processing Tasks.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

A Re-evaluation of Knowledge Graph Completion Methods.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Politeness Transfer: A Tag and Generate Approach.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
XL-Editor: Post-editing Sentences with XLNet.
CoRR, 2019

Active Learning for Graph Neural Networks via Node Feature Propagation.
CoRR, 2019

Bridging the domain gap in cross-lingual document classification.
CoRR, 2019

A Modular Deep Learning Approach for Extreme Multi-label Text Classification.
CoRR, 2019

The ARIEL-CMU Systems for LoReHLT18.
CoRR, 2019

An Adversarial Approach to High-Quality, Sentiment-Controlled Neural Dialogue Generation.
CoRR, 2019

XLNet: Generalized Autoregressive Pretraining for Language Understanding.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Re-examination of the Role of Latent Variables in Sequence Modeling.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

DARTS: Differentiable Architecture Search.
Proceedings of the 7th International Conference on Learning Representations, 2019

Kernel Change-point Detection with Auxiliary Deep Generative Models.
Proceedings of the 7th International Conference on Learning Representations, 2019

A Surprisingly Effective Fix for Deep Latent Variable Modeling of Text.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Implicit Kernel Learning.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

Transformer-XL: Attentive Language Models beyond a Fixed-Length Context.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Switch-Based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
The ARIEL-CMU situation frame detection pipeline for LoReHLT16: a model translation approach.
Mach. Transl., 2018

Stochastic WaveNet: A Generative Latent Variable Model for Sequential Data.
CoRR, 2018

Deep Learning for Epidemiological Predictions.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Modeling Long- and Short-Term Temporal Patterns with Deep Neural Networks.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Large-scale Machine Learning over Graphs.
Proceedings of the 2018 ACM SIGIR International Conference on Theory of Information Retrieval, 2018

Graph Convolutional Matrix Completion for Bipartite Edge Prediction.
Proceedings of the 10th International Joint Conference on Knowledge Discovery, 2018

Unsupervised Cross-lingual Transfer of Word Embedding Spaces.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Low-resource Cross-lingual Event Type Detection via Distant Supervision with Minimal Effort.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017
Likelihood Almost Free Inference Networks.
CoRR, 2017

Convolutional Normalizing Flows.
CoRR, 2017

Learning Graph Convolution Filters from Data Manifold.
CoRR, 2017

Co-Clustering for Multitask Learning.
CoRR, 2017

CMU CS Event TAC-KBP2017 Event Argument Extraction System.
Proceedings of the 2017 Text Analysis Conference, 2017

Deep Learning for Extreme Multi-label Text Classification.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

MMD GAN: Towards Deeper Understanding of Moment Matching Network.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Data-driven Random Fourier Features using Stein Effect.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Experiments in Curation: Towards Machine-Assisted Construction of Software Architecture Knowledge Bases.
Proceedings of the 2017 IEEE International Conference on Software Architecture, 2017

Analogical Inference for Multi-relational Embeddings.
Proceedings of the 34th International Conference on Machine Learning, 2017

RACE: Large-scale ReAding Comprehension Dataset From Examinations.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Cross-lingual Distillation for Text Classification.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Cross-Domain Kernel Induction for Transfer Learning.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Learning Concept Graphs from Online Educational Data.
J. Artif. Intell. Res., 2016

CMU CS Event TAC-KBP2016 Event Argument Extraction System.
Proceedings of the 2016 Text Analysis Conference, 2016

Adaptive Smoothed Online Multi-Task Learning.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Efficient Shift-Invariant Dictionary Learning.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Cross-Graph Learning of Multi-Relational Associations.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Data-driven Automated Induction of Prerequisite Structure Graphs.
Proceedings of the 9th International Conference on Educational Data Mining, 2016

Leveraging Multilingual Training for Limited Resource Event Extraction.
Proceedings of the COLING 2016, 2016

Cross-lingual Text Classification via Model Translation with Limited Dictionaries.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

Semi-Supervised Learning with Adaptive Spectral Transform.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

2015
Hierarchical Bayesian Inference and Recursive Regularization for Large-Scale Classification.
ACM Trans. Knowl. Discov. Data, 2015

Concept Graph Learning from Educational Data.
Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, 2015

Modeling Event Extraction via Multilingual Data Sources.
Proceedings of the 2015 Text Analysis Conference, 2015

Bipartite Edge Prediction via Transductive Learning over Product Graphs.
Proceedings of the 32nd International Conference on Machine Learning, 2015

2014
Transformation-based Probabilistic Clustering with Supervision.
Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence, 2014

Von Mises-Fisher Clustering Models.
Proceedings of the 31th International Conference on Machine Learning, 2014

2013
Recursive regularization for large-scale classification with hierarchical and graphical dependencies.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Distributed training of Large-scale Logistic models.
Proceedings of the 30th International Conference on Machine Learning, 2013

2012
Multilabel classification with meta-level features in a learning-to-rank framework.
Mach. Learn., 2012

Bayesian models for Large-scale Hierarchical Classification.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

A unified optimization framework for auction and guaranteed delivery in online advertising.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Statistical Learning for File-Type Identification.
Proceedings of the 10th International Conference on Machine Learning and Applications and Workshops, 2011

Modeling personalized email prioritization: classification-based and regression-based approaches.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

2010
Personalized Email Prioritization Based on Content and Social Network Analysis.
IEEE Intell. Syst., 2010

Multilabel classification with meta-level features.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Active Ordering of Interactive Prediction Tasks.
Proceedings of the SIAM International Conference on Data Mining, 2010

Active Learning for Multi-Task Adaptive Filtering.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Learning to rank relevant and novel documents through user feedback.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

CiteData: a new multi-faceted dataset for evaluating personalized search performance.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

2009
Protein identification as an information retrieval problem.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Multi-field Correlated Topic Modeling.
Proceedings of the SIAM International Conference on Data Mining, 2009

Toward Optimal Ordering of Prediction Tasks.
Proceedings of the SIAM International Conference on Data Mining, 2009

Protein Identification from Tandem Mass Spectra with Probabilistic Language Modeling.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2009

Mining social networks for personalized email prioritization.
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28, 2009

Modeling Expected Utility of Multi-session Information Distillation.
Proceedings of the Advances in Information Retrieval Theory, 2009

Graph Structure Learning for Task Ordering.
Proceedings of the ICEIS 2009, 2009

2008
Text categorization.
Scholarpedia, 2008

Flexible latent variable models for multi-task learning.
Mach. Learn., 2008

An evaluation of adaptive filtering in the context of realistic task-based information exploration.
Inf. Process. Manag., 2008

Personalized active learning for collaborative filtering.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

Corpus microsurgery: criteria optimization for medical cross-language ir.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

2007
Utility-based information distillation over temporally sequenced documents.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Generalizing from relevance feedback using named entity wildcards.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007

2005
Support vector machines classification with a very large-scale taxonomy.
SIGKDD Explor., 2005

Analysis of recursive gene selection approaches from microarray data.
Bioinform., 2005

An experimental study on large-scale web categorization.
Proceedings of the 14th international conference on World Wide Web, 2005

Robustness of adaptive filtering methods in a cross-benchmark evaluation.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Analysis of recursive feature elimination methods.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Using recursive classification to discover predictive features.
Proceedings of the 2005 ACM Symposium on Applied Computing (SAC), 2005

Learning Multiple Related Tasks using Latent Independent Component Analysis.
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005

From Lasso regression to Feature vector machine.
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005

Combining Categorization-based and Corpus-based Approaches for CLIR.
Proceedings of the Eighteenth International Florida Artificial Intelligence Research Society Conference, 2005

Using Modified Lasso Regression to Learn Large Undirected Graphs in a Probabilistic Framework.
Proceedings of the Proceedings, 2005

2004
RCV1: A New Benchmark Collection for Text Categorization Research.
J. Mach. Learn. Res., 2004

Resource selection for domain-specific cross-lingual IR.
Proceedings of the SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004

A Probabilistic Model for Online Document Clustering with Application to Novelty Detection.
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

Probabilistic score estimation with piecewise logistic regression.
Proceedings of the Machine Learning, 2004

The Enron Corpus: A New Dataset for Email Classification Research.
Proceedings of the Machine Learning: ECML 2004, 2004

Learning Table Extraction from Examples.
Proceedings of the COLING 2004, 2004

Introducing the Enron Corpus.
Proceedings of the CEAS 2004, 2004

Applying CLIR Techniques to Event Tracking.
Proceedings of the Information Retrieval Technology, Asia Information Retrieval Symposium, 2004

Customizing Parallel Corpora at the Document Level.
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain, July 21-26, 2004, 2004

2003
Robustness of regularized linear classification methods in text categorization.
Proceedings of the SIGIR 2003: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 28, 2003

A scalability analysis of classifiers in text categorization.
Proceedings of the SIGIR 2003: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 28, 2003

Modified Logistic Regression: An Approximation to SVM and Its Applications in Large-Scale Text Categorization.
Proceedings of the Machine Learning, 2003

A Loss Function Analysis for Classification Methods in Text Categorization.
Proceedings of the Machine Learning, 2003

CONTROL: CLEF-2003 with Open, Transparent Resources Off-Line.
Proceedings of the Working Notes for CLEF 2003 Workshop co-located with the 7th European Conference on Digital Libraries (ECDL 2003), 2003

Multilingual Information Retrieval Using Open, Transparent Resources in CLEF 2003.
Proceedings of the Comparative Evaluation of Multilingual Information Access Systems, 2003

Margin-based local regression for adaptive filtering.
Proceedings of the 2003 ACM CIKM International Conference on Information and Knowledge Management, 2003

Unsupervised Learning of Arabic Stemming Using a Parallel Corpus.
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003

2002
A Study of Approaches to Hypertext Categorization.
J. Intell. Inf. Syst., 2002

Information Filtering in TREC-9 and TDT-3: A Comparative Analysis.
Inf. Retr., 2002

CMU in Cross-Language Information Retrieval at NTCIR-3.
Proceedings of the Third NTCIR Workshop on Research in Information Retrieval, 2002

Topic-conditioned novelty detection.
Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2002

High-performing feature selection for text classification.
Proceedings of the 2002 ACM CIKM International Conference on Information and Knowledge Management, 2002

Boosting to correct inductive bias in text classification.
Proceedings of the 2002 ACM CIKM International Conference on Information and Knowledge Management, 2002

Stochastic Link and Group Detection.
Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002

2001
kNN, Rocchio and Metrics for Information Filtering at TREC-10.
Proceedings of The Tenth Text REtrieval Conference, 2001

A Study on Thresholding Strategies for Text Categorization.
Proceedings of the SIGIR 2001: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2001

Hypertext Categorization using Hyperlink Patterns and Meta Data.
Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001

CMU PRF using a Comparable Corpus.
Proceedings of the Working Notes for CLEF 2001 Workshop co-located with the 5th European Conference on Digital Libraries (ECDL 2001), 2001

Cross-Lingual Pseudo-Relevance Feedback Using a Comparable Corpus.
Proceedings of the Evaluation of Cross-Language Information Retrieval Systems, 2001

2000
Special Issue of Machine Learning on Information Retrieval - Introduction.
Mach. Learn., 2000

kNN at TREC-9.
Proceedings of The Ninth Text REtrieval Conference, 2000

Improving text categorization methods for event tracking.
Proceedings of the SIGIR 2000: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2000

Combining Multiple Learning Strategies for Effective Cross Validation.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

1999
An Evaluation of Statistical Approaches to Text Categorization.
Inf. Retr., 1999

Intelligent information retrieval.
IEEE Intell. Syst., 1999

Learning approaches for detecting and tracking news events.
IEEE Intell. Syst., 1999

A Re-Examination of Text Categorization Methods.
Proceedings of the SIGIR '99: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1999

1998
Translingual Information Retrieval: Learning from Bilingual Corpora.
Artif. Intell., 1998

A Study of Retrospective and On-Line Event Detection.
Proceedings of the SIGIR '98: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1998

1997
Translingual Information Retrieval: A Comparative Evaluation.
Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, 1997

A Comparative Study on Feature Selection in Text Categorization.
Proceedings of the Fourteenth International Conference on Machine Learning (ICML 1997), 1997

1996
Using Corpus Statistics to Remove Redundant Words in Text Categorization.
J. Am. Soc. Inf. Sci., 1996

An analysis of statistical term strength and its use in the indexing and retrieval of molecular biology texts.
Comput. Biol. Medicine, 1996

1995
Noise Reduction in a Statistical Approach to Text Categorization.
Proceedings of the SIGIR'95, 1995

1994
An Example-Based Mapping Method for Text Categorization and Retrieval.
ACM Trans. Inf. Syst., 1994

TREC-3 Retrieval Evaluation Using Expert Network.
Proceedings of The Third Text REtrieval Conference, 1994

1993
An Application of Least Squares Fit Mapping to Text Information Retrieval.
Proceedings of the 16th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval. Pittsburgh, PA, USA, June 27, 1993

1992
A Linear Least Squares Fit Mapping Method For Information Retrieval From Natural Language Texts.
Proceedings of the 14th International Conference on Computational Linguistics, 1992

1985
Partial Constraints in Chinese Analysis.
Proceedings of the 9th International Joint Conference on Artificial Intelligence. Los Angeles, 1985

1984
Use Of Heuristic Knowledge In Chinese Language Analysis.
Proceedings of the 10th International Conference on Computational Linguistics and 22nd Annual Meeting of the Association for Computational Linguistics, 1984


  Loading...