Wei Wang

According to our database, Wei Wang
  • authored at least 154 papers between 1996 and 2017.
  • has a "Dijkstra number" of four.

Bibliography

2017
Ranking Causal Anomalies for System Fault Diagnosis via Temporal and Dynamical Analysis on Vanishing Correlations.
TKDD, 2017

Efficient Approach to Correct Read Alignment for Pseudogene Abundance Estimates.
IEEE/ACM Trans. Comput. Biology Bioinform., 2017

Computer-Aided Experiment Planning toward Causal Discovery in Neuroscience.
Front. Neuroinform., 2017

Temporally Factorized Network Modeling for Evolutionary Network Analysis.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

Source-LDA: Enhancing Probabilistic Topic Models Using Prior Knowledge Sources.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

2016
Introduction to the Special Issue of Best Papers in ACM SIGKDD 2014.
TKDD, 2016

CGC: A Flexible and Robust Approach to Integrating Co-Regularized Multi-Domain Graph for Clustering.
TKDD, 2016

HICC: an entropy splitting-based framework for hierarchical co-clustering.
Knowl. Inf. Syst., 2016

Sparse regression models for unraveling group and individual associations in eQTL mapping.
BMC Bioinformatics, 2016

Ranking Causal Anomalies via Temporal and Dynamical Analysis on Vanishing Correlations.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

2015
Fast and robust group-wise eQTL mapping using sparse graphical models.
BMC Bioinformatics, 2015

REAFUM: Representative Approximate Frequent Subgraph Mining.
Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30, 2015

Max-Intensity: Detecting Competitive Advertiser Communities in Sponsored Search Market.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

Robust Multi-Network Clustering via Joint Cross-Domain Cluster Alignment.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

HapColor: A graph coloring framework for polyploidy phasing.
Proceedings of the 2015 IEEE International Conference on Bioinformatics and Biomedicine, 2015

Data Science for Social Good - 2014 KDD Highlights.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Searching Dimension Incomplete Databases.
IEEE Trans. Knowl. Data Eng., 2014

Total orderings defined on the set of all fuzzy numbers.
Fuzzy Sets and Systems, 2014

Performance research on time-triggered Ethernet based on network calculus.
EURASIP J. Wireless Comm. and Networking, 2014

RNA-Skim: a rapid method for RNA-Seq quantification at transcript level.
Bioinformatics, 2014

FastHap: fast and accurate single individual haplotype reconstruction using fuzzy conflict graphs.
Bioinformatics, 2014

Graph-regularized dual Lasso for robust eQTL mapping.
Bioinformatics, 2014

A novel multi-alignment pipeline for high-throughput sequencing data.
Database, 2014

PseudoLasso: leveraging read alignment in homologous regions to correct pseudogene expression estimates via RNASeq.
Proceedings of the 5th ACM Conference on Bioinformatics, 2014

2013
GeneScissors: a comprehensive approach to detecting and correcting spurious transcriptome inference owing to RNA-seq reads misalignment.
Bioinformatics, 2013

Flexible and robust co-regularized multi-domain graph clustering.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Transforming Genomes Using MOD Files with Applications.
Proceedings of the ACM Conference on Bioinformatics, 2013

Read Annotation Pipeline for High-Throughput Sequencing Data.
Proceedings of the ACM Conference on Bioinformatics, 2013

Grid-Based Clustering.
Data Clustering: Algorithms and Applications, 2013

2012
Chapter 10: Mining Genome-Wide Genetic Markers.
PLoS Computational Biology, 2012

seeQTL: a searchable database for human eQTLs.
Bioinformatics, 2012

Dual Transfer Learning.
Proceedings of the Twelfth SIAM International Conference on Data Mining, 2012

Metric Learning from Relative Comparisons by Minimizing Squared Residual.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

Hierarchical co-clustering based on entropy splitting.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Inferring novel associations between SNP sets and gene sets in eQTL study using sparse graphical model.
Proceedings of the ACM International Conference on Bioinformatics, 2012

2011
Tools for efficient epistasis detection in genome-wide association study.
Source Code for Biology and Medicine, 2011

Measuring Opinion Relevance in Latent Topic Space.
Proceedings of the PASSAT/SocialCom 2011, Privacy, 2011

Clustering with relative constraints.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

LTS: Discriminative subgraph mining by learning from search history.
Proceedings of the 27th International Conference on Data Engineering, 2011

2010
Mining High-Dimensional Data.
Proceedings of the Data Mining and Knowledge Discovery Handbook, 2nd ed., 2010

Functional neighbors: inferring relationships between nonhomologous protein families using family-specific packing motifs.
IEEE Trans. Information Technology in Biomedicine, 2010

COE: A General Approach for Efficient Genome-Wide Two-Locus Epistasis Test in Disease Association Study.
Journal of Computational Biology, 2010

Discriminative Subgraph Mining for Protein Classification.
IJKDB, 2010

TEAM: efficient two-locus epistasis tests in human genome-wide association study.
Bioinformatics [ISMB], 2010

Efficient genome ancestry inference in complex pedigrees with inbreeding.
Bioinformatics [ISMB], 2010

GAIA: graph classification using evolutionary computation.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Genome-wide compatible SNP intervals and their properties.
Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology, 2010

Gene set analysis using principal components.
Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology, 2010

2009
Efficient algorithms for genome-wide association study.
TKDD, 2009

Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: II. Case studies and applications.
Journal of Computer-Aided Molecular Design, 2009

Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: I. Method development.
Journal of Computer-Aided Molecular Design, 2009

Split-Order Distance for Clustering and Classification Hierarchies.
Proceedings of the Scientific and Statistical Database Management, 2009

COE: A General Approach for Efficient Genome-Wide Two-Locus Epistasis Test in Disease Association Study.
Proceedings of the Research in Computational Molecular Biology, 2009

FastChi: An Efficient Algorithm for Analyzing Gene-Gene Interactions.
Proceedings of the Biocomputing 2009: Proceedings of the Pacific Symposium, 2009

Inferring Genome-Wide Mosaic Structure.
Proceedings of the Biocomputing 2009: Proceedings of the Pacific Symposium, 2009

TreeQA: Quantitative Genome Wide Association Mapping Using Local Perfect Phylogeny Trees.
Proceedings of the Biocomputing 2009: Proceedings of the Pacific Symposium, 2009

Graph classification based on pattern co-occurrence.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
Introduction to special issue on bioinformatics.
TKDD, 2008

Mining non-redundant high order correlations in binary data.
PVLDB, 2008

Genotype Sequence Segmentation: Handling Constraints and Noise.
Proceedings of the Algorithms in Bioinformatics, 8th International Workshop, 2008

CRD: fast co-clustering on large datasets utilizing sampling-based matrix decomposition.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

Fastanova: an efficient algorithm for genome-wide association study.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

Quantitative Association Analysis Using Tree Hierarchies.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

Mining Approximate Order Preserving Clusters in the Presence of Noise.
Proceedings of the 24th International Conference on Data Engineering, 2008

CARE: Finding Local Linear Correlations in High Dimensional Data.
Proceedings of the 24th International Conference on Data Engineering, 2008

Approximate Clustering on Distributed Data Streams.
Proceedings of the 24th International Conference on Data Engineering, 2008

A General Framework for Fast Co-clustering on Large Datasets Using Matrix Decomposition.
Proceedings of the 24th International Conference on Data Engineering, 2008

REDUS: finding reducible subspaces in high dimensional data.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

Functional Neighbors: Inferring Relationships between Non-Homologous Protein Families Using Family-Specific Packing Motifs.
Proceedings of the 2008 IEEE International Conference on Bioinformatics and Biomedicine, 2008

2007
Benchmarking the effectiveness of sequential pattern mining methods.
Data Knowl. Eng., 2007

An Efficient Algorithm for Mining Coherent Patterns from Heterogeneous Microarrays.
Proceedings of the 19th International Conference on Scientific and Statistical Database Management, 2007

A Fast Algorithm for Approximate Quantiles in High Speed Data Streams.
Proceedings of the 19th International Conference on Scientific and Statistical Database Management, 2007

Mining RNA Tertiary Motifs with Structure Graphs.
Proceedings of the 19th International Conference on Scientific and Statistical Database Management, 2007

On Demand Phenotype Ranking through Subspace Clustering.
Proceedings of the Seventh SIAM International Conference on Data Mining, 2007

PoClustering: Lossless Clustering of Dissimilarity Data.
Proceedings of the Seventh SIAM International Conference on Data Mining, 2007

Intelligent Sequential Mining Via Alignment: Optimization Techniques for Very Large DB.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2007

Inferring missing genotypes in large SNP panels using fast nearest-neighbor searches over sliding windows.
Proceedings of the 15th International Conference on Intelligent Systems for Molecular Biology (ISMB) & 6th European Conference on Computational Biology (ECCB), 2007

Incremental Subspace Clustering over Multiple Data Streams.
Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007

Sample Selection for Maximal Diversity.
Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007

Graph Database Indexing Using Structured Graph Decomposition.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Accelerating Profile Queries in Elevation Maps.
Proceedings of the 23rd International Conference on Data Engineering, 2007

An efficient algorithm for approximate biased quantile computation in data streams.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007

2006
Sequential Pattern Mining in Multi-Databases via Multiple Alignment.
Data Min. Knowl. Discov., 2006

Local Structure Comparison of Proteins.
Advances in Computers, 2006

Human motion estimation from a reduced marker set.
Proceedings of the 33rd International Conference on Computer Graphics and Interactive Techniques, 2006

Human motion estimation from a reduced marker set.
Proceedings of the 2006 Symposium on Interactive 3D Graphics, 2006

Mining Approximate Frequent Itemsets In the Presence of Noise: Algorithm and Analysis.
Proceedings of the Sixth SIAM International Conference on Data Mining, 2006

Ranking Outliers Using Symmetric Neighborhood Relationship.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2006

Clustering pair-wise dissimilarity data into partially ordered sets.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Mining Shifting-and-Scaling Co-Regulation Patterns on Gene Expression Profiles.
Proceedings of the 22nd International Conference on Data Engineering, 2006

Mining coherent patterns from heterogeneous microarray data.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

2005
Comparative Study of Sequential Pattern Mining Models.
Proceedings of the Foundations of Data Mining and Knowledge Discovery, 2005

Mining Sequential Patterns from Large Data Sets
Advances in Database Systems 28, Kluwer, ISBN: 978-0-387-24246-0, 2005

Guest Editors' Introduction: Special Issue on Mining Biological Data.
IEEE Trans. Knowl. Data Eng., 2005

BIOKDD 2005 workshop report.
SIGKDD Explorations, 2005

Comparing Graph Representations of Protein Structure for Mining Family-Specific Residue-Based Packing Motifs.
Journal of Computational Biology, 2005

An Improved Biclustering Method for Analyzing Gene Expression Profiles.
International Journal on Artificial Intelligence Tools, 2005

A system for analyzing and indexing human-motion databases.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2005

Fast computation of database operations using graphics processors.
Proceedings of the 32nd International Conference on Computer Graphics and Interactive Techniques, 2005

Finding Representative Set from Massive Data.
Proceedings of the 5th IEEE International Conference on Data Mining (ICDM 2005), 2005

Mining Approximate Frequent Itemsets from Noisy Data.
Proceedings of the 5th IEEE International Conference on Data Mining (ICDM 2005), 2005

Mining High-Dimensional Data.
Proceedings of the Data Mining and Knowledge Discovery Handbook, 2005

2004
Discovering High-Order Periodic Patterns.
Knowl. Inf. Syst., 2004

WAR: Weighted Association Rules for Item Intensities.
Knowl. Inf. Syst., 2004

Mining Surprising Periodic Patterns.
Data Min. Knowl. Discov., 2004

BASS: Approximate Search on Large String Databases.
Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM 2004), 2004

Fast Computation of Database Operations using Graphics Processors.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2004

Mining protein family specific residue packing patterns from protein structure graphs.
Proceedings of the Eighth Annual International Conference on Computational Molecular Biology, 2004

Accurate Classification of Protein Structural Families Using Coherent Subgraph Analysis.
Proceedings of the Biocomputing 2004, 2004

A framework for ontology-driven subspace clustering.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

SPIN: mining maximal frequent subgraphs from graph databases.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

AGILE: A General Approach to Detect Transitions in Evolving Data Streams.
Proceedings of the 4th IEEE International Conference on Data Mining (ICDM 2004), 2004

Revealing True Subspace Clusters in High Dimensions.
Proceedings of the 4th IEEE International Conference on Data Mining (ICDM 2004), 2004

Understanding Social Welfare Service Patterns Using Sequential Analysis.
Proceedings of the 2004 Annual National Conference on Digital Government Research, 2004

Successfully Adopting IT for Social Welfare Program Management.
Proceedings of the 2004 Annual National Conference on Digital Government Research, 2004

Adopting IT for Effective Management of Social Welfare Programs.
Proceedings of the 2004 Annual National Conference on Digital Government Research, 2004

Biclustering in Gene Expression Data by Tendency.
Proceedings of the 3rd International IEEE Computer Society Computational Systems Bioinformatics Conference, 2004

Gene Ontology Friendly Biclustering of Expression Profiles.
Proceedings of the 3rd International IEEE Computer Society Computational Systems Bioinformatics Conference, 2004

2003
Mining Asynchronous Periodic Patterns in Time Series Data.
IEEE Trans. Knowl. Data Eng., 2003

Recent Progress on Selected Topics in Database Research - A Report by Nine Young Chinese Researchers Working in the United States.
J. Comput. Sci. Technol., 2003

STAMP: On Discovery of Statistically Important Pattern Repeats in Long Sequential Data.
Proceedings of the Third SIAM International Conference on Data Mining, 2003

ApproxMAP: Approximate Mining of Consensus Sequential Patterns.
Proceedings of the Third SIAM International Conference on Data Mining, 2003

OP-Cluster: Clustering by Tendency in High Dimensional Space.
Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM 2003), 2003

Efficient Mining of Frequent Subgraphs in the Presence of Isomorphism.
Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM 2003), 2003

CLUSEQ: Efficient and Effective Sequence Clustering.
Proceedings of the 19th International Conference on Data Engineering, 2003

Social Welfare Program Administration and Evaluation and Policy Analysis Using Knowledge Discovery and Data Mining (KDD) on Administrative Data.
Proceedings of the 2003 Annual National Conference on Digital Government Research, 2003

Management Assistance for Work First via a Dynamic Website.
Proceedings of the 2003 Annual National Conference on Digital Government Research, 2003

Discovering Compact and Highly Discriminative Features or Feature Combinations of Drug Activities Using Support Vector Machines.
Proceedings of the 2nd IEEE Computer Society Bioinformatics Conference, 2003

Reconstruction of Ancestral Gene Order after Segmental Duplication and Gene Loss.
Proceedings of the 2nd IEEE Computer Society Bioinformatics Conference, 2003

Enhanced Biclustering on Expression Data.
Proceedings of the 3rd IEEE International Symposium on BioInformatics and BioEngineering (BIBE 2003), 2003

2002
Mining long sequential patterns in a noisy environment.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

Clustering by pattern similarity in large data sets.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

Efficient Filtering of Large Dataset - A User-Centric Paradigm.
Proceedings of the Second SIAM International Conference on Data Mining, 2002

InfoMiner+: Mining Partial Periodic Patterns with Gap Penalties.
Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM 2002), 2002

delta-Clusters: Capturing Subspace Correlation in a Large Data Set.
Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, February 26, 2002

A Framework Towards Efficient and Effective Sequence Clustering.
Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, February 26, 2002

Accelerating Approximate Subsequence Search on Large Protein Sequence Databases.
Proceedings of the 1st IEEE Computer Society Bioinformatics Conference, 2002

Towards Automatic Clustering of Protein Sequences.
Proceedings of the 1st IEEE Computer Society Bioinformatics Conference, 2002

2001
Infominer: mining surprising periodic patterns.
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, 2001

Meta-patterns: Revealing Hidden Periodic Patterns.
Proceedings of the 2001 IEEE International Conference on Data Mining, 29 November, 2001

TAR: Temporal Association Rules on Evolving Numerical Attributes.
Proceedings of the 17th International Conference on Data Engineering, 2001

2000
An Approach to Active Spatial Data Mining Based on Statistical Information.
IEEE Trans. Knowl. Data Eng., 2000

Dynamo: design, implementation, and evaluation of cooperative persistent object management in a local area network.
Softw., Pract. Exper., 2000

Mining Patterns in Long Sequential Data with Noise.
SIGKDD Explorations, 2000

Collaborative Web caching based on proxy affinities.
Proceedings of the 2000 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, 2000

Mining asynchronous periodic patterns in time series data.
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000

Efficient mining of weighted association rules (WAR).
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000

Dynamic Adaptive File Management in a Local Area Network.
Proceedings of the 20th International Conference on Distributed Computing Systems, 2000

1999
STING+: An Approach to Active Spatial Data Mining.
Proceedings of the 15th International Conference on Data Engineering, 1999

1998
Performance Analysis of Three Text-Join Algorithms.
IEEE Trans. Knowl. Data Eng., 1998

DynamO: Dynamic Objects with Persistent Storage.
Proceedings of the Advances in Persistent Object Systems, 1998

PK-tree: A Spatial Index Structure for High Dimensional Point Data.
Proceedings of the 5th International Conference of Foundations of Data Organization (FODO'98), 1998

1997
STING: A Statistical Information Grid Approach to Spatial Data Mining.
Proceedings of the VLDB'97, 1997

1996
Performance Analysis of Several Algorithms for Processing Joins between Textual Attributes.
Proceedings of the Twelfth International Conference on Data Engineering, February 26, 1996

