Wei Wang

Orcid: 0000-0002-8180-2886

Affiliations:
  • University of California Los Angeles, Department of Computer Science, CA, USA
  • University of North Carolina at Chapel Hill, Department of Computer Science, NC, USA (2002 - 2012)
  • IBM Thomas J. Watson Research Center, Yorktown Heights, NY, USA (1999 - 2002)
  • University of California Los Angeles, Department of Computer Science, CA, USA (PhD 1999)


According to our database1, Wei Wang authored at least 285 papers between 1996 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
A Survey on Self-Supervised Learning for Non-Sequential Tabular Data.
CoRR, 2024

STAR: Boosting Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
ShuttleSHAP: A Turn-Based Feature Attribution Approach for Analyzing Forecasting Models in Badminton.
CoRR, 2023

From Scroll to Misbelief: Modeling the Unobservable Susceptibility to Misinformation on Social Media.
CoRR, 2023

PGraphDTA: Improving Drug Target Interaction Prediction using Protein Language Models and Contact Maps.
CoRR, 2023

Know2BIO: A Comprehensive Dual-View Benchmark for Evolving Biomedical Knowledge Graphs.
CoRR, 2023

Large Language Models Can Be Good Privacy Protection Learners.
CoRR, 2023

Learning Over Molecular Conformer Ensembles: Datasets and Benchmarks.
CoRR, 2023

Unveiling Invariances via Neural Network Pruning.
CoRR, 2023

SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models.
CoRR, 2023

Professional Basketball Player Behavior Synthesis via Planning with Diffusion.
CoRR, 2023

Can Directed Graph Neural Networks be Adversarially Robust?
CoRR, 2023

STAR: Boosting Low-Resource Event Extraction by Structure-to-Text Data Generation with Large Language Models.
CoRR, 2023

Code Recommendation for Open Source Software Developers.
Proceedings of the ACM Web Conference 2023, 2023

Where Does Your News Come From? Predicting Information Pathways in Social Media.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data Augmentation.
Proceedings of the 2023 SIAM International Conference on Data Mining, 2023

Universality and Limitations of Prompt Tuning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Generalizing Graph ODE for Learning Complex System Dynamics across Environments.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

InfluencerRank: Discovering Effective Influencers via Graph Convolutional Attentive Recurrent Neural Networks.
Proceedings of the Seventeenth International AAAI Conference on Web and Social Media, 2023

Learning under Label Proportions for Text Classification.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Introducing Semantics into Speech Encoders.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Concept2Box: Joint Geometric Embeddings for Learning Two-View Knowledge Graphs.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Multivariate time-series classification with hierarchical variational graph pooling.
Neural Networks, 2022

Knowledge Source Rankings for Semi-Supervised Topic Modeling.
Inf., 2022

A Mobility-Aware Deep Learning Model for Long-Term COVID-19 Pandemic Prediction and Policy Impact Analysis.
CoRR, 2022

Introducing Semantics into Speech Encoders.
CoRR, 2022

Subgraph Matching via Query-Conditioned Subgraph Matching Neural Networks and Bi-Level Tree Search.
CoRR, 2022

Multi-source Inductive Knowledge Graph Transfer.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

A Bayesian Topic Model for Human-Evaluated Interpretability.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Dual-Geometric Space Embedding Model for Two-View Knowledge Graphs.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

RLogic: Recursive Logical Rule Learning from Knowledge Graphs.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Optimizing Alignment of Speech and Language Latent Spaces for End-To-End Speech Recognition and Understanding.
Proceedings of the IEEE International Conference on Acoustics, 2022

OpBerg: Discovering Causal Sentences Using Optimal Alignments.
Proceedings of the Big Data Analytics and Knowledge Discovery, 2022

Scalable Graph Representation Learning via Locality-Sensitive Hashing.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

ReLiable: Offline Reinforcement Learning for Tactical Strategies in Professional Basketball Games.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Multilingual Knowledge Graph Completion with Self-Supervised Adaptive Graph Alignment.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Towards Fine-Grained Reasoning for Fake News Detection.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
SEIZE: Runtime Inspection for Parallel Dataflow Systems.
IEEE Trans. Parallel Distributed Syst., 2021

Drug-Target Interaction Prediction with Graph Attention networks.
CoRR, 2021

Clinical Named Entity Recognition using Contextualized Token Representations.
CoRR, 2021

JEDI: circular RNA prediction based on junction encoders and deep interaction among splice sites.
Bioinform., 2021

Experiment Selection in Meta-Analytic Piecemeal Causal Discovery.
IEEE Access, 2021

Discovering Undisclosed Paid Partnership on Social Media via Aspect-Attentive Sponsored Post Learning.
Proceedings of the WSDM '21, 2021

MEDTO: Medical Data to Ontology Matching Using Hybrid Graph Neural Networks.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Coupled Graph ODE for Learning Interacting System Dynamics.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Evaluating Audience Loyalty and Authenticity in Influencer Marketing via Multi-task Multi-relational Learning.
Proceedings of the Fifteenth International AAAI Conference on Web and Social Media, 2021

GLSearch: Maximum Common Subgraph Detection via Learning to Search.
Proceedings of the 38th International Conference on Machine Learning, 2021

Bi-Level Attention Graph Neural Networks.
Proceedings of the IEEE International Conference on Data Mining, 2021

CREATe: Clinical Report Extraction and Annotation Technology.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

The Biased Coin Flip Process for Nonparametric Topic Modeling.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Powering Comparative Classification with Sentiment Analysis via Domain Adaptive Knowledge Transfer.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Recommend for a Reason: Unlocking the Power of Unsupervised Aspect-Sentiment Co-Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

#StayHome or #Marathon?: Social Media Enhanced Pandemic Surveillance on Spatial-temporal Dynamic Graphs.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

You Are What and Where You Are: Graph Enhanced Attention Network for Explainable POI Recommendation.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Clinical Temporal Relation Extraction with Probabilistic Soft Logic Regularization and Global Inference.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
PolyCluster: Minimum Fragment Disagreement Clustering for Polyploid Phasing.
IEEE ACM Trans. Comput. Biol. Bioinform., 2020

Correction to: Memory-based random walk for multi-query local community detection.
Knowl. Inf. Syst., 2020

Memory-based random walk for multi-query local community detection.
Knowl. Inf. Syst., 2020

Multivariate Time Series Classification with Hierarchical Variational Graph Pooling.
CoRR, 2020

Bi-Level Graph Neural Networks for Drug-Drug Interaction Prediction.
CoRR, 2020

Hierarchical and Fast Graph Similarity Computation via Graph Coarsening and Deep Graph Learning.
CoRR, 2020

Software Language Comprehension using a Program-Derived Semantic Graph.
CoRR, 2020

Fast Detection of Maximum Common Subgraph via Deep Q-Learning.
CoRR, 2020

Measuring Time-Sensitive and Topic-Specific Influence in Social Networks With LSTM and Self-Attention.
IEEE Access, 2020

Recommending Themes for Ad Creative Design via Visual-Linguistic Representations.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Adversarial Cooperative Imitation Learning for Dynamic Treatment Regimes✱.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Few-Shot Learning for New User Recommendation in Location-based Social Networks.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Multimodal Post Attentive Profiling for Influencer Marketing.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

End-to-End Deep Attentive Personalized Item Retrieval for Online Content-sharing Platforms.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Clustering and Constructing User Coresets to Accelerate Large-scale Top-K Recommender Systems.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Adversarial Learning to Compare: Self-Attentive Prospective Customer Recommendation in Location based Social Networks.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

Automatic Speaker Recognition with Limited Data.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

Interpretable Click-Through Rate Prediction through Hierarchical Attention.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

Social Media User Geolocation via Hybrid Attention.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Node Classification in Temporal Graphs Through Stochastic Sparsification and Temporal Structural Convolution.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2020

Learning Continuous System Dynamics from Irregularly-Sampled Partial Observations.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Robust Graph Representation Learning via Neural Sparsification.
Proceedings of the 37th International Conference on Machine Learning, 2020

Fast Adaptation for Cold-start Collaborative Filtering with Meta-learning.
Proceedings of the 20th IEEE International Conference on Data Mining, 2020

SEIZE User Desired Moments: Runtime Inspection for Parallel Dataflow Systems.
Proceedings of the 40th IEEE International Conference on Distributed Computing Systems, 2020

Bridging Mixture Density Networks with Meta-Learning for Automatic Speaker Identification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Long Document Ranking with Query-Directed Sparse Transformer.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

SpEC: Sparse Embedding-Based Community Detection in Attributed Graphs.
Proceedings of the Database Systems for Advanced Applications, 2020

On-demand Influencer Discovery on Social Media.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Learning to Create Better Ads: Generation and Ranking Approaches for Ad Creative Refinement.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

MARU: Meta-context Aware Random Walks for Heterogeneous Network Representation Learning.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

P-Companion: A Principled Framework for Diversified Complementary Product Recommendation.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Bio-JOIE: Joint Representation Learning of Biological Knowledge Bases.
Proceedings of the BCB '20: 11th ACM International Conference on Bioinformatics, 2020

"The Boating Store Had Its Best Sail Ever": Pronunciation-attentive Contextualized Pun Recognition.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Learning-Based Efficient Graph Similarity Computation via Multi-Scale Convolutional Set Matching.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Unsupervised Inductive Whole-Graph Embedding by Preserving Graph Proximity.
CoRR, 2019

De novo Nanopore read quality improvement using deep learning.
BMC Bioinform., 2019

Multifaceted protein-protein interaction prediction based on Siamese residual RCNN.
Bioinform., 2019

Click Feedback-Aware Query Recommendation Using Adversarial Examples.
Proceedings of the World Wide Web Conference, 2019

CORALS: Who Are My Potential New Customers? Tapping into the Wisdom of Customers' Decisions.
Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, 2019

SimGNN: A Neural Network Approach to Fast Graph Similarity Computation.
Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, 2019

Universal Representation Learning of Knowledge Bases by Jointly Embedding Instances and Ontological Concepts.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Learn Smart with Less: Building Better Online Decision Trees with Fewer Training Examples.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Unsupervised Inductive Graph-Level Representation Learning via Graph-Graph Proximity.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Self-Attentive Attributed Network Embedding Through Adversarial Learning.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Learning Robust Representations with Graph Denoising Policy Network.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Learning to Discriminate Perturbations for Blocking Adversarial Attacks in Text Classification.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

DynGraphGAN: Dynamic Graph Embedding via Generative Adversarial Networks.
Proceedings of the Database Systems for Advanced Applications, 2019

Learning to Predict Human Stress Level with Incomplete Sensor Data from Wearable Devices.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

On Generating Dominators of Customer Preferences.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

Enhancing Air Quality Prediction with Social Media and Natural Language Processing.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Personalized Question Routing via Heterogeneous Network Embedding.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Convolutional Set Matching for Graph Similarity.
CoRR, 2018

Convolutional Neural Networks for Fast Approximation of Graph Edit Distance.
CoRR, 2018

Graph Edit Distance Computation via Graph Neural Networks.
CoRR, 2018

A randomized approach to speed up the analysis of large-scale read-count data in the application of CNV detection.
BMC Bioinform., 2018

Identifying Users behind Shared Accounts in Online Streaming Services.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Modeling Co-Evolution Across Multiple Networks.
Proceedings of the 2018 SIAM International Conference on Data Mining, 2018

Learning to Disentangle Interleaved Conversational Threads with a Siamese Hierarchical Network and Similarity Ranking.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Learning Deep Network Representations with Adversarially Regularized Autoencoders.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

NetWalk: A Flexible Deep Embedding Approach for Anomaly Detection in Dynamic Networks.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

On Multi-query Local Community Detection.
Proceedings of the IEEE International Conference on Data Mining, 2018

Learning Gender-Neutral Word Embeddings.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

RIN: Reformulation Inference Network for Context-Aware Query Suggestion.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Inferring Microbial Communities for City Scale Metagenomics Using Neural Networks.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018

Predicting Disease-related Associations by Heterogeneous Network Embedding.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018

2017
Ranking Causal Anomalies for System Fault Diagnosis via Temporal and Dynamical Analysis on Vanishing Correlations.
ACM Trans. Knowl. Discov. Data, 2017

Efficient Approach to Correct Read Alignment for Pseudogene Abundance Estimates.
IEEE ACM Trans. Comput. Biol. Bioinform., 2017

Computer-Aided Experiment Planning toward Causal Discovery in Neuroscience.
Frontiers Neuroinformatics, 2017

Aztec: A Platform to Render Biomedical Software Findable, Accessible, Interoperable, and Reusable.
CoRR, 2017

Temporally Factorized Network Modeling for Evolutionary Network Analysis.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

Open Source Repository Recommendation in Social Coding.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Event Detection and Summarization Using Phrase Network.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2017

Link Prediction with Spatial and Temporal Consistency in Dynamic Networks.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Source-LDA: Enhancing Probabilistic Topic Models Using Prior Knowledge Sources.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

Translating literature into causal graphs: Toward automated experiment selection.
Proceedings of the 2017 IEEE International Conference on Bioinformatics and Biomedicine, 2017

Fleximer: Accurate Quantification of RNA-Seq via Variable-Length k-mers.
Proceedings of the 8th ACM International Conference on Bioinformatics, 2017

2016
Introduction to the Special Issue of Best Papers in ACM SIGKDD 2014.
ACM Trans. Knowl. Discov. Data, 2016

CGC: A Flexible and Robust Approach to Integrating Co-Regularized Multi-Domain Graph for Clustering.
ACM Trans. Knowl. Discov. Data, 2016

HICC: an entropy splitting-based framework for hierarchical co-clustering.
Knowl. Inf. Syst., 2016

Sparse regression models for unraveling group and individual associations in eQTL mapping.
BMC Bioinform., 2016

MSAcquisitionSimulator: data-dependent acquisition simulator for LC-MS shotgun proteomics.
Bioinform., 2016

Towards customer trouble tickets resolution automation in large cellular services: demo.
Proceedings of the 22nd Annual International Conference on Mobile Computing and Networking, 2016

Ranking Causal Anomalies via Temporal and Dynamical Analysis on Vanishing Correlations.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

2015
Fast and robust group-wise eQTL mapping using sparse graphical models.
BMC Bioinform., 2015

REAFUM: Representative Approximate Frequent Subgraph Mining.
Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30, 2015

Max-Intensity: Detecting Competitive Advertiser Communities in Sponsored Search Market.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

Robust Multi-Network Clustering via Joint Cross-Domain Cluster Alignment.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

HapColor: A graph coloring framework for polyploidy phasing.
Proceedings of the 2015 IEEE International Conference on Bioinformatics and Biomedicine, 2015

Data Science for Social Good - 2014 KDD Highlights.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Searching Dimension Incomplete Databases.
IEEE Trans. Knowl. Data Eng., 2014

Total orderings defined on the set of all fuzzy numbers.
Fuzzy Sets Syst., 2014

Performance research on time-triggered Ethernet based on network calculus.
EURASIP J. Wirel. Commun. Netw., 2014

RNA-Skim: a rapid method for RNA-Seq quantification at transcript level.
Bioinform., 2014

FastHap: fast and accurate single individual haplotype reconstruction using fuzzy conflict graphs.
Bioinform., 2014

Graph-regularized dual Lasso for robust eQTL mapping.
Bioinform., 2014

A novel multi-alignment pipeline for high-throughput sequencing data.
Database J. Biol. Databases Curation, 2014

Big Data, Big Challenges.
Proceedings of the 2014 IEEE International Conference on Semantic Computing, 2014

Individual haplotyping prediction agreements.
Proceedings of the 5th ACM Conference on Bioinformatics, 2014

PseudoLasso: leveraging read alignment in homologous regions to correct pseudogene expression estimates via RNASeq.
Proceedings of the 5th ACM Conference on Bioinformatics, 2014

2013
GeneScissors: a comprehensive approach to detecting and correcting spurious transcriptome inference owing to RNA-seq reads misalignment.
Bioinform., 2013

Flexible and robust co-regularized multi-domain graph clustering.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Transforming Genomes Using MOD Files with Applications.
Proceedings of the ACM Conference on Bioinformatics, 2013

Read Annotation Pipeline for High-Throughput Sequencing Data.
Proceedings of the ACM Conference on Bioinformatics, 2013

Grid-Based Clustering.
Proceedings of the Data Clustering: Algorithms and Applications, 2013

2012
Chapter 10: Mining Genome-Wide Genetic Markers.
PLoS Comput. Biol., 2012

seeQTL: a searchable database for human eQTLs.
Bioinform., 2012

Dual Transfer Learning.
Proceedings of the Twelfth SIAM International Conference on Data Mining, 2012

Metric Learning from Relative Comparisons by Minimizing Squared Residual.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

Hierarchical co-clustering based on entropy splitting.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Inferring novel associations between SNP sets and gene sets in eQTL study using sparse graphical model.
Proceedings of the ACM International Conference on Bioinformatics, 2012

2011
Tools for efficient epistasis detection in genome-wide association study.
Source Code Biol. Medicine, 2011

Measuring Opinion Relevance in Latent Topic Space.
Proceedings of the PASSAT/SocialCom 2011, Privacy, 2011

Clustering with relative constraints.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

LTS: Discriminative subgraph mining by learning from search history.
Proceedings of the 27th International Conference on Data Engineering, 2011

2010
Mining High-Dimensional Data.
Proceedings of the Data Mining and Knowledge Discovery Handbook, 2nd ed., 2010

TKDD Special Issue SIGKDD 2009.
ACM Trans. Knowl. Discov. Data, 2010

Functional neighbors: inferring relationships between nonhomologous protein families using family-specific packing motifs.
IEEE Trans. Inf. Technol. Biomed., 2010

COE: A General Approach for Efficient Genome-Wide Two-Locus Epistasis Test in Disease Association Study.
J. Comput. Biol., 2010

Discriminative Subgraph Mining for Protein Classification.
Int. J. Knowl. Discov. Bioinform., 2010

TEAM: efficient two-locus epistasis tests in human genome-wide association study.
Bioinform., 2010

Efficient genome ancestry inference in complex pedigrees with inbreeding.
Bioinform., 2010

GAIA: graph classification using evolutionary computation.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Genome-wide compatible SNP intervals and their properties.
Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology, 2010

Gene set analysis using principal components.
Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology, 2010

Finding High-Order Correlations in High-Dimensional Biological Data.
Proceedings of the Link Mining: Models, Algorithms, and Applications, 2010

2009
Efficient algorithms for genome-wide association study.
ACM Trans. Knowl. Discov. Data, 2009

Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: II. Case studies and applications.
J. Comput. Aided Mol. Des., 2009

Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: I. Method development.
J. Comput. Aided Mol. Des., 2009

Split-Order Distance for Clustering and Classification Hierarchies.
Proceedings of the Scientific and Statistical Database Management, 2009

FastChi: An Efficient Algorithm for Analyzing Gene-Gene Interactions.
Proceedings of the Biocomputing 2009: Proceedings of the Pacific Symposium, 2009

Inferring Genome-Wide Mosaic Structure.
Proceedings of the Biocomputing 2009: Proceedings of the Pacific Symposium, 2009

TreeQA: Quantitative Genome Wide Association Mapping Using Local Perfect Phylogeny Trees.
Proceedings of the Biocomputing 2009: Proceedings of the Pacific Symposium, 2009

Map-matching for low-sampling-rate GPS trajectories.
Proceedings of the 17th ACM SIGSPATIAL International Symposium on Advances in Geographic Information Systems, 2009

Graph classification based on pattern co-occurrence.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
Introduction to special issue on bioinformatics.
ACM Trans. Knowl. Discov. Data, 2008

Mining non-redundant high order correlations in binary data.
Proc. VLDB Endow., 2008

Genotype Sequence Segmentation: Handling Constraints and Noise.
Proceedings of the Algorithms in Bioinformatics, 8th International Workshop, 2008

CRD: fast co-clustering on large datasets utilizing sampling-based matrix decomposition.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

Fastanova: an efficient algorithm for genome-wide association study.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

Quantitative Association Analysis Using Tree Hierarchies.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

Mining Approximate Order Preserving Clusters in the Presence of Noise.
Proceedings of the 24th International Conference on Data Engineering, 2008

CARE: Finding Local Linear Correlations in High Dimensional Data.
Proceedings of the 24th International Conference on Data Engineering, 2008

Approximate Clustering on Distributed Data Streams.
Proceedings of the 24th International Conference on Data Engineering, 2008

A General Framework for Fast Co-clustering on Large Datasets Using Matrix Decomposition.
Proceedings of the 24th International Conference on Data Engineering, 2008

REDUS: finding reducible subspaces in high dimensional data.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

Functional Neighbors: Inferring Relationships between Non-Homologous Protein Families Using Family-Specific Packing Motifs.
Proceedings of the 2008 IEEE International Conference on Bioinformatics and Biomedicine, 2008

Efficient Data-Mining Methods Enabling Genome-Wide Computing.
Proceedings of the Next Generation of Data Mining., 2008

2007
Benchmarking the effectiveness of sequential pattern mining methods.
Data Knowl. Eng., 2007

An Efficient Algorithm for Mining Coherent Patterns from Heterogeneous Microarrays.
Proceedings of the 19th International Conference on Scientific and Statistical Database Management, 2007

A Fast Algorithm for Approximate Quantiles in High Speed Data Streams.
Proceedings of the 19th International Conference on Scientific and Statistical Database Management, 2007

Mining RNA Tertiary Motifs with Structure Graphs.
Proceedings of the 19th International Conference on Scientific and Statistical Database Management, 2007

On Demand Phenotype Ranking through Subspace Clustering.
Proceedings of the Seventh SIAM International Conference on Data Mining, 2007

PoClustering: Lossless Clustering of Dissimilarity Data.
Proceedings of the Seventh SIAM International Conference on Data Mining, 2007

Intelligent Sequential Mining Via Alignment: Optimization Techniques for Very Large DB.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2007

Inferring missing genotypes in large SNP panels using fast nearest-neighbor searches over sliding windows.
Proceedings of the Proceedings 15th International Conference on Intelligent Systems for Molecular Biology (ISMB) & 6th European Conference on Computational Biology (ECCB), 2007

Incremental Subspace Clustering over Multiple Data Streams.
Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007

Sample Selection for Maximal Diversity.
Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007

Graph Database Indexing Using Structured Graph Decomposition.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Accelerating Profile Queries in Elevation Maps.
Proceedings of the 23rd International Conference on Data Engineering, 2007

An efficient algorithm for approximate biased quantile computation in data streams.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007

2006
Sequential Pattern Mining in Multi-Databases via Multiple Alignment.
Data Min. Knowl. Discov., 2006

Local Structure Comparison of Proteins.
Adv. Comput., 2006

Human motion estimation from a reduced marker set.
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2006

Mining Approximate Frequent Itemsets In the Presence of Noise: Algorithm and Analysis.
Proceedings of the Sixth SIAM International Conference on Data Mining, 2006

Clustering pair-wise dissimilarity data into partially ordered sets.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Mining Shifting-and-Scaling Co-Regulation Patterns on Gene Expression Profiles.
Proceedings of the 22nd International Conference on Data Engineering, 2006

Mining coherent patterns from heterogeneous microarray data.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

2005
Comparative Study of Sequential Pattern Mining Models.
Proceedings of the Foundations of Data Mining and knowledge Discovery, 2005

Mining Sequential Patterns from Large Data Sets
Advances in Database Systems 28, Kluwer, ISBN: 978-0-387-24246-0, 2005

Guest Editors' Introduction: Special Issue on Mining Biological Data.
IEEE Trans. Knowl. Data Eng., 2005

BIOKDD 2005 workshop report.
SIGKDD Explor., 2005

Comparing Graph Representations of Protein Structure for Mining Family-Specific Residue-Based Packing Motifs.
J. Comput. Biol., 2005

An Improved Biclustering Method for Analyzing Gene Expression Profiles.
Int. J. Artif. Intell. Tools, 2005

A system for analyzing and indexing human-motion databases.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2005

Finding Representative Set from Massive Data.
Proceedings of the 5th IEEE International Conference on Data Mining (ICDM 2005), 2005

Mining Approximate Frequent Itemsets from Noisy Data.
Proceedings of the 5th IEEE International Conference on Data Mining (ICDM 2005), 2005

Mining High-Dimensional Data.
Proceedings of the Data Mining and Knowledge Discovery Handbook., 2005

2004
Discovering High-Order Periodic Patterns.
Knowl. Inf. Syst., 2004

WAR: Weighted Association Rules for Item Intensities.
Knowl. Inf. Syst., 2004

Mining Surprising Periodic Patterns.
Data Min. Knowl. Discov., 2004

BASS: Approximate Search on Large String Databases.
Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM 2004), 2004

Fast Computation of Database Operations using Graphics Processors.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2004

Mining protein family specific residue packing patterns from protein structure graphs.
Proceedings of the Eighth Annual International Conference on Computational Molecular Biology, 2004

Accurate Classification of Protein Structural Families Using Coherent Subgraph Analysis.
Proceedings of the Biocomputing 2004, 2004

A framework for ontology-driven subspace clustering.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

SPIN: mining maximal frequent subgraphs from graph databases.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

AGILE: A General Approach to Detect Transitions in Evolving Data Streams.
Proceedings of the 4th IEEE International Conference on Data Mining (ICDM 2004), 2004

Revealing True Subspace Clusters in High Dimensions.
Proceedings of the 4th IEEE International Conference on Data Mining (ICDM 2004), 2004

Understanding Social Welfare Service Patterns Using Sequential Analysis.
Proceedings of the 2004 Annual National Conference on Digital Government Research, 2004

Successfully Adopting IT for Social Welfare Program Management.
Proceedings of the 2004 Annual National Conference on Digital Government Research, 2004

Adopting IT for Effective Management of Social Welfare Programs.
Proceedings of the 2004 Annual National Conference on Digital Government Research, 2004

Biclustering in Gene Expression Data by Tendency.
Proceedings of the 3rd International IEEE Computer Society Computational Systems Bioinformatics Conference, 2004

Gene Ontology Friendly Biclustering of Expression Profiles.
Proceedings of the 3rd International IEEE Computer Society Computational Systems Bioinformatics Conference, 2004

2003
Mining Asynchronous Periodic Patterns in Time Series Data.
IEEE Trans. Knowl. Data Eng., 2003

Recent Progress on Selected Topics in Database Research - A Report by Nine Young Chinese Researchers Working in the United States.
J. Comput. Sci. Technol., 2003

STAMP: On Discovery of Statistically Important Pattern Repeats in Long Sequential Data.
Proceedings of the Third SIAM International Conference on Data Mining, 2003

ApproxMAP: Approximate Mining of Consensus Sequential Patterns.
Proceedings of the Third SIAM International Conference on Data Mining, 2003

OP-Cluster: Clustering by Tendency in High Dimensional Space.
Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM 2003), 2003

Efficient Mining of Frequent Subgraphs in the Presence of Isomorphism.
Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM 2003), 2003

CLUSEQ: Efficient and Effective Sequence Clustering.
Proceedings of the 19th International Conference on Data Engineering, 2003

Social Welfare Program Administration and Evaluation and Policy Analysis Using Knowledge Discovery and Data Mining (KDD) on Administrative Data.
Proceedings of the 2003 Annual National Conference on Digital Government Research, 2003

Management Assistance for Work First via a Dynamic Website.
Proceedings of the 2003 Annual National Conference on Digital Government Research, 2003

Discovering Compact and Highly Discriminative Features or Feature Combinations of Drug Activities Using Support Vector Machines.
Proceedings of the 2nd IEEE Computer Society Bioinformatics Conference, 2003

Reconstruction of Ancestral Gene Order after Segmental Duplication and Gene Loss.
Proceedings of the 2nd IEEE Computer Society Bioinformatics Conference, 2003

Enhanced Biclustering on Expression Data.
Proceedings of the 3rd IEEE International Symposium on BioInformatics and BioEngineering (BIBE 2003), 2003

2002
Mining long sequential patterns in a noisy environment.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

Clustering by pattern similarity in large data sets.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

Efficient Filtering of Large DatasetA User-Centric Paradigm.
Proceedings of the Second SIAM International Conference on Data Mining, 2002

InfoMiner+: Mining Partial Periodic Patterns with Gap Penalties.
Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM 2002), 2002

delta-Clusters: Capturing Subspace Correlation in a Large Data Set.
Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, February 26, 2002

A Framework Towards Efficient and Effective Sequence Clusterin.
Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, February 26, 2002

Accelerating Approximate Subsequence Search on Large Protein Sequence Databases.
Proceedings of the 1st IEEE Computer Society Bioinformatics Conference, 2002

Towards Automatic Clustering of Protein Sequences.
Proceedings of the 1st IEEE Computer Society Bioinformatics Conference, 2002

2001
Infominer: mining surprising periodic patterns.
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, 2001

Meta-patterns: Revealing Hidden Periodic Patterns.
Proceedings of the 2001 IEEE International Conference on Data Mining, 29 November, 2001

TAR: Temporal Association Rules on Evolving Numerical Attributes.
Proceedings of the 17th International Conference on Data Engineering, 2001

2000
An Approach to Active Spatial Data Mining Based on Statistical Information.
IEEE Trans. Knowl. Data Eng., 2000

Dynamo: design, implementation, and evaluation of cooperative persistent object management in a local area network.
Softw. Pract. Exp., 2000

Mining Patterns in Long Sequential Data with Noise.
SIGKDD Explor., 2000

Collaborative Web caching based on proxy affinities.
Proceedings of the 2000 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, 2000

Efficient mining of weighted association rules (WAR).
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000

Dynamic Adaptive File Management in a Local Area Network.
Proceedings of the 20th International Conference on Distributed Computing Systems, 2000

1999
STING+: An Approach to Active Spatial Data Mining.
Proceedings of the 15th International Conference on Data Engineering, 1999

1998
Performance Analysis of Three Text-Join Algorithms.
IEEE Trans. Knowl. Data Eng., 1998

Genetic algorithms for determining fuzzy measures from data.
J. Intell. Fuzzy Syst., 1998

DynamO: Dynamic Objects with Persistent Storage.
Proceedings of the Advances in Persistent Object Systems, 1998

PK-tree: A Spatial Index Structure for High Dimensional Point Data.
Proceedings of the 5th International Conference of Foundations of Data Organization (FODO'98), 1998

1997
STING: A Statistical Information Grid Approach to Spatial Data Mining.
Proceedings of the VLDB'97, 1997

1996
Monotone set functions defined by Choquet integral.
Fuzzy Sets Syst., 1996

Performance Analysis of Several Algorithms for Processing Joins between Textual Attributes.
Proceedings of the Twelfth International Conference on Data Engineering, February 26, 1996


  Loading...