Joshua Zhexue Huang

Orcid: 0000-0002-6797-2571

Affiliations:
  • University of Hong Kong


According to our database1, Joshua Zhexue Huang authored at least 256 papers between 1991 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Density estimation-based method to determine sample size for random sample partition of big data.
Frontiers Comput. Sci., October, 2024

A scalable and flexible basket analysis system for big transaction data in Spark.
Inf. Process. Manag., March, 2024

Clustering approximation via a fusion of multiple random samples.
Inf. Fusion, January, 2024

2023
An ensemble method for estimating the number of clusters in a big data set using multiple random samples.
J. Big Data, December, 2023

A novel observation points-based positive-unlabeled learning algorithm.
CAAI Trans. Intell. Technol., December, 2023

Data quality model for assessing public COVID-19 big datasets.
J. Supercomput., November, 2023

Approximate Clustering Ensemble Method for Big Data.
IEEE Trans. Big Data, August, 2023

Observation points classifier ensemble for high-dimensional imbalanced classification.
CAAI Trans. Intell. Technol., June, 2023

Survey of Distributed Computing Frameworks for Supporting Big Data Analysis.
Big Data Min. Anal., June, 2023

A novel correlation Gaussian process regression-based extreme learning machine.
Knowl. Inf. Syst., May, 2023

A review of optimization methods for computation offloading in edge computing networks.
Digit. Commun. Networks, April, 2023

Offloading dependent tasks in MEC-enabled IoT systems: A preference-based hybrid optimization method.
Peer Peer Netw. Appl., March, 2023

A Hybrid Method to Measure Distribution Consistency of Mixed-Attribute Datasets.
IEEE Trans. Artif. Intell., February, 2023

An intelligent hybrid method: Multi-objective optimization for MEC-enabled devices of IoE.
J. Parallel Distributed Comput., January, 2023

Wireless Network Slice Assignment With Incremental Random Vector Functional Link Network.
IEEE Trans. Netw. Sci. Eng., 2023

Random vector functional link network with subspace-based local connections.
Appl. Intell., 2023

MMCo-Clus - An Evolutionary Co-clustering Algorithm for Gene Selection (Extended abstract).
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

RSP-gcForest: A Distributed Deep Forest via Random Sample Partition.
Proceedings of the IEEE International Conference on Big Data, 2023

2022
MMCo-Clus - An Evolutionary Co-clustering Algorithm for Gene Selection.
IEEE Trans. Knowl. Data Eng., 2022

Wealth Flow Model: Online Portfolio Selection Based on Learning Wealth Flow Matrices.
ACM Trans. Knowl. Discov. Data, 2022

Bayesian Attribute Bagging-Based Extreme Learning Machine for High-Dimensional Classification and Regression.
ACM Trans. Intell. Syst. Technol., 2022

Directly solving normalized cut for multi-view data.
Pattern Recognit., 2022

Creating synthetic minority class samples based on autoencoder extreme learning machine.
Pattern Recognit., 2022

Correction to: Recent advances in multiple criteria decision making techniques.
Int. J. Mach. Learn. Cybern., 2022

Recent advances in multiple criteria decision making techniques.
Int. J. Mach. Learn. Cybern., 2022

A new method to build the adaptive k-nearest neighbors similarity graph matrix for spectral clustering.
Neurocomputing, 2022

A novel dependency-oriented mixed-attribute data classification method.
Expert Syst. Appl., 2022

HSGAN: Reducing mode collapse in GANs by the latent code distance of homogeneous samples.
Comput. Vis. Image Underst., 2022

Auto-Encoding Independent Attribute Transformation for Naive Bayesian Classifier.
Proceedings of the International Joint Conference on Neural Networks, 2022

DenMG: Density-Based Member Generation for Ensemble Clustering.
Proceedings of the Workshop Proceedings of the 51st International Conference on Parallel Processing, 2022

A Dynamic Variational Framework for Open-World Node Classification in Structured Sequences.
Proceedings of the IEEE International Conference on Data Mining, 2022

Mobility-aware Seamless Virtual Function Migration in Deviceless Edge Computing Environments.
Proceedings of the 42nd IEEE International Conference on Distributed Computing Systems, 2022

A Novel Method to Create Synthetic Samples with Autoencoder Multi-layer Extreme Learning Machine.
Proceedings of the Database Systems for Advanced Applications. DASFAA 2022 International Workshops, 2022

2021
Selection of diverse features with a diverse regularization.
Pattern Recognit., 2021

An effective content-based event recommendation model.
Multim. Tools Appl., 2021

Adaptive discriminant analysis for semi-supervised feature selection.
Inf. Sci., 2021

Novel kernel density estimator based on ensemble unbiased cross-validation.
Inf. Sci., 2021

Improved I-nice clustering algorithm based on density peaks mechanism.
Inf. Sci., 2021

A new approximate method for mining frequent itemsets from big data.
Comput. Sci. Inf. Syst., 2021

Unsupervised Adaptation for High-Dimensional with Limited-Sample Data Classification Using Variational Autoencoder.
Comput. Informatics, 2021

BTGAN: Training GAN with Balanced Triplet Loss and Two-Branch Architecture.
Proceedings of the International Joint Conference on Neural Networks, 2021

A Compressed Hidden Naive Bayesian Classifier.
Proceedings of the International Joint Conference on Neural Networks, 2021

RSP-Hist: Approximate Histograms for Big Data Exploration on Hadoop Clusters.
Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021

A Two-Stage Missing Value Imputation Method Based on Autoencoder Neural Network.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

Random Sample Partition-Based Clustering Ensemble Algorithm for Big Data.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

2020
LABIN: Balanced Min Cut for Large-Scale Data.
IEEE Trans. Neural Networks Learn. Syst., 2020

Variable Weighting in Fuzzy k-Means Clustering to Determine the Number of Clusters.
IEEE Trans. Knowl. Data Eng., 2020

Enhanced Balanced Min Cut.
Int. J. Comput. Vis., 2020

Variational Autoencoder-Based Dimensionality Reduction for High-Dimensional Small-Sample Data Classification.
Int. J. Comput. Intell. Appl., 2020

An Asymptotic Statistical Learning Algorithm for Prediction of Key Trading Events.
IEEE Intell. Syst., 2020

A new approach to solve opinion dynamics on complex networks.
Expert Syst. Appl., 2020

Activeness and Loyalty Analysis in Event-Based Social Networks.
Entropy, 2020

A Robust k-Means Clustering Algorithm Based on Observation Point Mechanism.
Complex., 2020

Novel electricity pattern identification system based on improved I-nice algorithm.
Comput. Ind. Eng., 2020

A survey of data partitioning and sampling methods to support big data analysis.
Big Data Min. Anal., 2020

On quantum methods for machine learning problems part II: Quantum classification algorithms.
Big Data Min. Anal., 2020

On quantum methods for machine learning problems part I: Quantum tools.
Big Data Min. Anal., 2020

A hierarchical Gamma Mixture Model-based method for estimating the number of clusters in complex data.
Appl. Soft Comput., 2020

Distributed Data Strategies to Support Large-Scale Data Analysis Across Geo-Distributed Data Centers.
IEEE Access, 2020

Long and Short Term Risk Control for Online Portfolio Selection.
Proceedings of the Knowledge Science, Engineering and Management, 2020

Attribute Bagging-Based Extreme Learning Machine.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2020

Observation Points-Based Particle Swarm Optimization Algorithm.
Proceedings of the 7th IEEE International Conference on Cyber Security and Cloud Computing, 2020

Clustering Ensembles Based on Probability Density Function Estimation.
Proceedings of the 7th IEEE International Conference on Cyber Security and Cloud Computing, 2020

2019
Random Sample Partition: A Distributed Data Model for Big Data Analysis.
IEEE Trans. Ind. Informatics, 2019

Subspace Weighting Co-Clustering of Gene Expression Data.
IEEE ACM Trans. Comput. Biol. Bioinform., 2019

RRPlib: A spark library for representing HDFS blocks as a set of random sample data blocks.
Sci. Comput. Program., 2019

Semi-supervised Aspect-level Sentiment Classification Model based on Variational Autoencoder.
Knowl. Based Syst., 2019

A distributed data management system to support large-scale data analysis.
J. Syst. Softw., 2019

Exploring and cleaning big data with random sample data blocks.
J. Big Data, 2019

A new kernel density estimator based on the minimum entropy of data set.
Inf. Sci., 2019

Joint Optimization of Energy Consumption and Latency in Mobile Edge Computing for Internet of Things.
IEEE Internet Things J., 2019

Latent Feature Group Learning for High-Dimensional Data Clustering.
Inf., 2019

Machine Learning-Based Multi-Layer Multi-Hop Transmission Scheme for Dense Networks.
IEEE Commun. Lett., 2019

A Hierarchical Gamma Mixture Model-Based Method for Classification of High-Dimensional Data.
Entropy, 2019

Generate pairwise constraints from unlabeled data for semi-supervised clustering.
Data Knowl. Eng., 2019

Generative Neural Network based Spectrum Sharing using Linear Sum Assignment Problems.
CoRR, 2019

C3C: A New Static Content-Based Three-Level Web Cache.
IEEE Access, 2019

An Asymptotic Ensemble Learning Framework for Big Data Analysis.
IEEE Access, 2019

EAN: Event Attention Network for Stock Price Trend Prediction based on Sentimental Embedding.
Proceedings of the 11th ACM Conference on Web Science, 2019

Machine Learning Based Dynamic Cooperative Transmission Framework for IoUT Networks.
Proceedings of the 16th Annual IEEE International Conference on Sensing, 2019

A New Location-Based Topic Model for Event Attendees Recommendation.
Proceedings of the 2019 IEEE-RIVF International Conference on Computing and Communication Technologies, 2019

A New Approach for Approximately Mining Frequent Itemsets.
Proceedings of the Selected Papers of the XXI International Conference on Data Analytics and Management in Data Intensive Domains (DAMDID/RCDL 2019), 2019

Neural Network-Based Deep Encoding for Mixed-Attribute Data Classification.
Proceedings of the Trends and Applications in Knowledge Discovery and Data Mining, 2019

Efficiently Mining Maximal Diverse Frequent Itemsets.
Proceedings of the Database Systems for Advanced Applications, 2019

A Sampling-Based System for Approximate Big Data Analysis on Computing Clusters.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018
Local Adaptive Projection Framework for Feature Selection of Labeled and Unlabeled Data.
IEEE Trans. Neural Networks Learn. Syst., 2018

An Algorithm for Clustering Categorical Data With Set-Valued Features.
IEEE Trans. Neural Networks Learn. Syst., 2018

PurTreeClust: A Clustering Algorithm for Customer Segmentation from Massive Customer Transaction Data.
IEEE Trans. Knowl. Data Eng., 2018

TWCC: Automated Two-way Subspace Weighting Partitional Co-Clustering.
Pattern Recognit., 2018

Weakly supervised topic sentiment joint model with word embeddings.
Knowl. Based Syst., 2018

I-nice: A new approach for identifying the number of clusters and initial cluster centres.
Inf. Sci., 2018

CPLP: An algorithm for tracking the changes of power consumption patterns in load profile data over time.
Inf. Sci., 2018

A smart artificial bee colony algorithm with distance-fitness-based neighbor search and its application.
Future Gener. Comput. Syst., 2018

Determining the optimal temperature parameter for Softmax function in reinforcement learning.
Appl. Soft Comput., 2018

Random weight network-based fuzzy nonlinear regression for trapezoidal fuzzy number data.
Appl. Soft Comput., 2018

An efficient random forests algorithm for high dimensional data classification.
Adv. Data Anal. Classif., 2018

Cluster Survival Model of Concept Drift in Load Profile Data.
IEEE Access, 2018

Investigating Deep Reinforcement Learning Techniques in Personalized Dialogue Generation.
Proceedings of the 2018 SIAM International Conference on Data Mining, 2018

Particle Swarm Optimization-Based Weighted-Nadaraya-Watson Estimator.
Proceedings of the Trends and Applications in Knowledge Discovery and Data Mining, 2018

Spectral Clustering of Large-scale Data by Directly Solving Normalized Cut.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

A Two-Stage Data Processing Algorithm to Generate Random Sample Partitions for Big Data Analysis.
Proceedings of the Cloud Computing - CLOUD 2018, 2018

High-Dimensional Limited-Sample Biomedical Data Classification Using Variational Autoencoder.
Proceedings of the Data Mining - 16th Australasian Conference, AusDM 2018, Bahrurst, NSW, 2018

Slice_OP: Selecting Initial Cluster Centers Using Observation Points.
Proceedings of the Advanced Data Mining and Applications - 14th International Conference, 2018

2017
Ensemble subspace clustering of text data using two-level features.
Int. J. Mach. Learn. Cybern., 2017

Query ranking model for search engine query recommendation.
Int. J. Mach. Learn. Cybern., 2017

Fuzziness based semi-supervised learning approach for intrusion detection system.
Inf. Sci., 2017

Local PurTree Spectral Clustering for Massive Customer Transaction Data.
IEEE Intell. Syst., 2017

A Random Sample Partition Data Model for Big Data Analysis.
CoRR, 2017

k-mw-modes: An algorithm for clustering categorical matrix-object data.
Appl. Soft Comput., 2017

A fuzzy SV-k-modes algorithm for clustering categorical data with set-valued attributes.
Appl. Math. Comput., 2017

A New Static Web Caching Mechanism Based on Mutual Dependency Between Result Cache and Posting List Cache.
Proceedings of the Web Information Systems Engineering - WISE 2017, 2017

Self-adaptive Weighted Extreme Learning Machine for Imbalanced Classification Problems.
Proceedings of the Trends and Applications in Knowledge Discovery and Data Mining, 2017

Semi-supervised Feature Selection via Rescaled Linear Regression.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Scalable Normalized Cut with Improved Spectral Rotation.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Improving Generalization Capability of Extreme Learning Machine with Synthetic Instances Generation.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

A Self-Balanced Min-Cut Algorithm for Image Clustering.
Proceedings of the IEEE International Conference on Computer Vision, 2017

I-Sampling: A New Block-Based Sampling Method for Large-Scale Dataset.
Proceedings of the 2017 IEEE International Congress on Big Data, 2017

2016
Learning distributed word representation with multi-contextual mixed embedding.
Knowl. Based Syst., 2016

Fuzzy nonlinear regression analysis using a random weight network.
Inf. Sci., 2016

An incremental model on search engine query recommendation.
Neurocomputing, 2016

Incremental density-based ensemble clustering over evolving data streams.
Neurocomputing, 2016

Big data analytics on Apache Spark.
Int. J. Data Sci. Anal., 2016

Segmentation of Factories on Electricity Consumption Behaviors Using Load Profile Data.
IEEE Access, 2016

A frequency-based gene selection method with random forests for gene data analysis.
Proceedings of the 2016 IEEE RIVF International Conference on Computing & Communication Technologies, 2016

Imbalanced ELM Based on Normal Density Estimation for Binary-Class Classification.
Proceedings of the Trends and Applications in Knowledge Discovery and Data Mining, 2016

Stratified Over-Sampling Bagging Method for Random Forests on Imbalanced Data.
Proceedings of the Intelligence and Security Informatics - 11th Pacific Asia Workshop, 2016

PurTreeClust: A purchase tree clustering algorithm for large-scale customer transaction data.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

Empirical analysis of asymptotic ensemble learning for big data.
Proceedings of the 3rd IEEE/ACM International Conference on Big Data Computing, 2016

2015
Stratified feature sampling method for ensemble clustering of high dimensional data.
Pattern Recognit., 2015

Two-level quantile regression forests for bias correction in range prediction.
Mach. Learn., 2015

Dynamic non-parametric joint sentiment topic mixture model.
Knowl. Based Syst., 2015

Identifying and Analyzing Popular Phrases Multi-Dimensionally in Social Media Data.
Int. J. Data Warehous. Min., 2015

Recommending high-utility search engine queries via a query-recommending model.
Neurocomputing, 2015

Editorial: Uncertainty in learning from big data.
Fuzzy Sets Syst., 2015

Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests.
BMC Genom., 2015

A New Feature Sampling Method in Random Forests for Predicting High-Dimensional Data.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2015

Use Correlation Coefficients in Gaussian Process to Train Stable ELM Models.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2015

2014
A Novel Variable-order Markov Model for Clustering Categorical Sequences.
IEEE Trans. Knowl. Data Eng., 2014

Trend analysis of categorical data streams with a concept change method.
Inf. Sci., 2014

QRM: A Probabilistic Model for Search Engine Query Recommendation.
Proceedings of the Trends and Applications in Knowledge Discovery and Data Mining, 2014

Extensions to Quantile Regression Forests for Very High-Dimensional Data.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2014

Ensemble Clustering of High Dimensional Data with FastMap Projection.
Proceedings of the Trends and Applications in Knowledge Discovery and Data Mining, 2014

A LDA Feature Grouping Method for Subspace Clustering of Text Data.
Proceedings of the Intelligence and Security Informatics - Pacific Asia Workshop, 2014

Bias-corrected Quantile Regression Forests for high-dimensional data.
Proceedings of the 2014 International Conference on Machine Learning and Cybernetics, 2014

2013
TW-k-Means: Automated Two-Level Variable Weighting Clustering Algorithm for Multiview Data.
IEEE Trans. Knowl. Data Eng., 2013

Privacy Preserving Distributed DBSCAN Clustering.
Trans. Data Priv., 2013

Stratified sampling for feature subspace selection in random forests for high dimensional data.
Pattern Recognit., 2013

An ensemble of decision cluster crotches for classification of high dimensional data.
Knowl. Based Syst., 2013

Post-processing strategies for improving local gene expression pattern analysis.
Int. J. Data Min. Bioinform., 2013

A Concept-Drifting Detection Algorithm for Categorical Evolving Data.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2013

A hybrid optimization method for acceleration of building linear classification models.
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

A cross cluster-based collaborative filtering method for recommendation.
Proceedings of the IEEE International Conference on Information and Automation, 2013

2012
A feature group weighting method for subspace clustering of high-dimensional data.
Pattern Recognit., 2012

Topic oriented community detection through social objects and link analysis in social networks.
Knowl. Based Syst., 2012

Classifying Very High-Dimensional Data with Random Forests Built from Small Subspaces.
Int. J. Data Warehous. Min., 2012

Using a Variable Weighting k-Means Method to Build a Decision Cluster Classification Model.
Int. J. Pattern Recognit. Artif. Intell., 2012

Batch-Mode Active Learning with Semi-supervised Cluster Tree for Text Classification.
Proceedings of the 2012 IEEE/WIC/ACM International Conferences on Web Intelligence, 2012

Hybrid Random Forests: Advantages of Mixed Trees in Classifying Text Data.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2012

Scalable Random Forests for Massive Data.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2012

Multi-Layer Network for Influence Propagation over Microblog.
Proceedings of the Intelligence and Security Informatics - Pacific Asia Workshop, 2012

Scalable Subspace Logistic Regression Models for High Dimensional Data.
Proceedings of the Web Technologies and Applications - 14th Asia-Pacific Web Conference, 2012

2011
Clustering categorical data streams.
J. Comput. Methods Sci. Eng., 2011

Integrating constraints to support legally flexible business processes.
Inf. Syst. Frontiers, 2011

A Heuristic Algorithm for the Inner-City Multi-Drop: Container Loading Problem.
Int. J. Oper. Res. Inf. Syst., 2011

Margin-based ensemble classifier for protein fold recognition.
Expert Syst. Appl., 2011

High-Order Co-clustering Text Data on Semantics-Based Representation Model.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2011

Order batching and picking in a synchronized zone order picking system.
Proceedings of the 2011 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM), 2011

A New Markov Model for Clustering Categorical Sequences.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Rating: Privacy Preservation for Multiple Attributes with Different Sensitivity Requirements.
Proceedings of the Data Mining Workshops (ICDMW), 2011

2010
Knowledge-based vector space model for text clustering.
Knowl. Inf. Syst., 2010

A self-learning framework for services selection.
Int. J. Inf. Technol. Manag., 2010

Exploiting Word Cluster Information for Unsupervised Feature Selection.
Proceedings of the PRICAI 2010: Trends in Artificial Intelligence, 2010

Mining Trajectory Corridors Using Fréchet Distance and Meshing Grids.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2010

Minimum Spanning Tree Based Classification Model for Massive Data with MapReduce Implementation.
Proceedings of the ICDMW 2010, 2010

Fuzzy soft subspace clustering method for gene co-expression network analysis.
Proceedings of the 2010 IEEE International Conference on Bioinformatics and Biomedicine Workshops, 2010

CPLDP: An Efficient Large Dataset Processing System Built on Cloud Platform.
Proceedings of the Advanced Data Mining and Applications - 6th International Conference, 2010

2009
Soft Subspace Clustering for High-Dimensional Data.
Proceedings of the Encyclopedia of Data Warehousing and Mining, Second Edition (4 Volumes), 2009

Clustering Categorical Data with k-Modes.
Proceedings of the Encyclopedia of Data Warehousing and Mining, Second Edition (4 Volumes), 2009

SMART: a subspace clustering algorithm that automatically identifies the appropriate number of clusters.
Int. J. Data Min. Model. Manag., 2009

2008
CNP-based Implementation of Service-oriented Workflow Mapping in SHGWMS.
World Wide Web, 2008

Agglomerative Fuzzy K-Means Clustering Algorithm with Selection of Number of Clusters.
IEEE Trans. Knowl. Data Eng., 2008

Feature Weighting Random Forest for Detection of Hidden Web Search Interfaces.
Int. J. Comput. Linguistics Chin. Lang. Process., 2008

Fuzzy K-Means with Variable Weighting in High Dimensional Data Analysis.
Proceedings of the Ninth International Conference on Web-Age Information Management, 2008

A Changing Window Approach to Exploring Gene Expression Patterns.
Proceedings of the 2008 IEEE International Conference on Bioinformatics and Biomedicine, 2008

Building a Decision Cluster Classification Model for High Dimensional Data by a Variable Weighting k-Means Method.
Proceedings of the AI 2008: Advances in Artificial Intelligence, 2008

2007
An Entropy Weighting k-Means Algorithm for Subspace Clustering of High-Dimensional Sparse Data.
IEEE Trans. Knowl. Data Eng., 2007

On the Impact of Dissimilarity Measure in k-Modes Clustering Algorithm.
IEEE Trans. Pattern Anal. Mach. Intell., 2007

Adaptive scheduling for shared window joins over data streams.
Frontiers Comput. Sci. China, 2007

Learning classifier system ensemble and compact rule set.
Connect. Sci., 2007

A New Initialization Method for Clustering Categorical Data.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2007

PAKDD 2007 Industrial Track Workshop.
Proceedings of the Emerging Technologies in Knowledge Discovery and Data Mining, 2007

2006
Automatic Transaction Compensation for Reliable Grid Applications.
J. Comput. Sci. Technol., 2006

Ensemble Learning Classifier System and Compact Ruleset.
Proceedings of the Simulated Evolution and Learning, 6th International Conference, 2006

Neighborhood Density Method for Selecting Initial Cluster Centers in K-Means Clustering.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2006

A Fast Greedy Algorithm for Outlier Mining.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2006

Clustering of SNP Data with Application to Genomics.
Proceedings of the Workshops Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

SLF4SS: Facilitating Flexible Services Selection.
Proceedings of the 2006 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2006

<i>MFCRank</i>: A Web Ranking Algorithm Based on Correlation of Multiple Features.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2006

A Case-Based Data Mining Platform.
Proceedings of the Data Mining - Theory, Methodology, Techniques, and Applications, 2006

Supplier Categorization with <i>K</i>-Means Type Subspace Clustering.
Proceedings of the Frontiers of WWW Research and Development, 2006

2005
Automated Variable Weighting in k-Means Type Clustering.
IEEE Trans. Pattern Anal. Mach. Intell., 2005

Adaptive grid job scheduling with genetic algorithms.
Future Gener. Comput. Syst., 2005

FP-outlier: Frequent pattern based outlier detection.
Comput. Sci. Inf. Syst., 2005

On the Performance of Feature Weighting <i>K</i>-Means for Text Subspace Clustering.
Proceedings of the Advances in Web-Age Information Management, 2005

A Neighborhood-Based Clustering Algorithm.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2005

Subspace Clustering of Text Documents with Feature Weighting <i>K</i>-Means Algorithm.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2005

LCSE: Learning Classifier System Ensemble for Incremental Medical Instances.
Proceedings of the Learning Classifier Systems, International Workshops, 2005

Ontology-Based E-Catalog Matching for Integration of GDSN and EPCglobal Network.
Proceedings of the 2005 IEEE International Conference on e-Business Engineering (ICEBE 2005), 2005

Learning classifier system ensemble for data mining.
Proceedings of the Genetic and Evolutionary Computation Conference, 2005

ADDI: an agent-based extension to UDDI for supply chain management.
Proceedings of the Ninth International Conference on Computer Supported Cooperative Work in Design, 2005

2004
Web services: problems and future directions.
J. Web Semant., 2004

An optimization algorithm for clustering using weighted dissimilarity measures.
Pattern Recognit., 2004

Mining class outliers: concepts, algorithms and applications in CRM.
Expert Syst. Appl., 2004

Real-time transaction processing for autonomic Grid applications.
Eng. Appl. Artif. Intell., 2004

A data warehousing and data mining framework for web usage management.
Commun. Inf. Syst., 2004

Improved Email Classification through Enriched Feature Space.
Proceedings of the Advances in Web-Age Information Management: 5th International Conference, 2004

Mining Frequent Items in Spatio-temporal Databases.
Proceedings of the Advances in Web-Age Information Management: 5th International Conference, 2004

A Frequent Pattern Discovery Method for Outlier Detection.
Proceedings of the Advances in Web-Age Information Management: 5th International Conference, 2004

Mining Class Outliers: Concepts, Algorithms and Applications.
Proceedings of the Advances in Web-Age Information Management: 5th International Conference, 2004

A Categorized-Registry Model for Grid Resource Publication and Discovery Using Software Agents.
Proceedings of the Parallel and Distributed Computing: Applications and Technologies, 2004

Mining of Web-Page Visiting Patterns with Continuous-Time Markov Models.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2004

A Real-Time Transaction Approach for Grid Services: A Model and Algorithms.
Proceedings of the Network and Parallel Computing, IFIP International Conference, 2004

Enhanced Email Classification Based on Feature Space Enriching.
Proceedings of the Natural Language Processing and Information Systems, 2004

A Feature Weighting Approach to Building Classification Models by Interactive Clustering.
Proceedings of the Modeling Decisions for Artificial Intelligence, 2004

Petri-Net-Based Coordination Algorithms for Grid Transactions.
Proceedings of the Parallel and Distributed Processing and Applications, 2004

Service Selection in Dynamic Demand-Driven Web Services.
Proceedings of the IEEE International Conference on Web Services (ICWS'04), 2004

An Ontology-Based Model for Grid Resource Publication and Discovery.
Proceedings of the Grid and Cooperative Computing, 2004

On Improving Website Connectivity by Using Web-Log Data Streams.
Proceedings of the Database Systems for Advances Applications, 2004

Meta-game Equilibrium for Multi-agent Reinforcement Learning.
Proceedings of the AI 2004: Advances in Artificial Intelligence, 2004

iSurfer: A Focused Web Crawler Based on Incremental Learning from Positive Samples.
Proceedings of the Advanced Web Technologies and Applications, 2004

An Efficient Multidimensional Data Model for Web Usage Mining.
Proceedings of the Advanced Web Technologies and Applications, 2004

2003
A Data Cube Model for Prediction-Based Web Prefetching.
J. Intell. Inf. Syst., 2003

Data Mining and Case-Based Reasoning for Distance Learning.
Int. J. Distance Educ. Technol., 2003

A Note on K-modes Clustering.
J. Classif., 2003

Uni-Grid P&T: A Toolkit for Building Customizable Grid Portals.
Proceedings of the Web Services, 2003

Adaptive Job Scheduling for a Service Grid Using a Genetic Algorithm.
Proceedings of the Grid and Cooperative Computing, Second International Workshop, 2003

Statistical models for time sequences data mining.
Proceedings of the 2003 IEEE International Conference on Computational Intelligence for Financial Engineering, 2003

C3: A New Learning Scheme to Improve Classification of Rare Category Emails.
Proceedings of the AI 2003: Advances in Artificial Intelligence, 2003

2002
M-FastMap: A Modified FastMap Algorithm for Visual Cluster Validation in Data Mining.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2002

An Open Framework for Smart and Personalized Distance Learning.
Proceedings of the Advances in Web-Based Learning, First International Conference, 2002

2001
Patterns Discovery Based on Time-Series Decomposition.
Proceedings of the Knowledge Discovery and Data Mining, 2001

A Cube Model and Cluster Analysis for Web Access Sessions.
Proceedings of the WEBKDD 2001, 2001

An Empirical Study on the Visual Cluster Validation Method with Fastmap.
Proceedings of the Database Systems for Advanced Applications, Proceedings of the 7th International Conference on Database Systems for Advanced Applications (DASFAA 2001), 18-20 April 2001, 2001

2000
Data Mining in Disease Management - A Diabetes Case Study.
Proceedings of the PRICAI 2000, Topics in Artificial Intelligence, 6th Pacific Rim International Conference on Artificial Intelligence, Melbourne, Australia, August 28, 2000

A Visual Method of Cluster Validation with Fastmap.
Proceedings of the Knowledge Discovery and Data Mining, 2000

An Interactive Approach to Building Classification Models by Clustering and Cluster Validation.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2000

1999
A fuzzy k-modes algorithm for clustering categorical data.
IEEE Trans. Fuzzy Syst., 1999

Data-mining massive time series astronomical data: challenges, problems and solutions.
Inf. Softw. Technol., 1999

1998
Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values.
Data Min. Knowl. Discov., 1998

Data-Mining Massive Time Series Astronomical Data Sets - A Case Study.
Proceedings of the Research and Development in Knowledge Discovery and Data Mining, 1998

1997
A Fast Clustering Algorithm to Cluster Very Large Categorical Data Sets in Data Mining.
Proceedings of the Workshop on Research Issues on Data Mining and Knowledge Discovery, 1997

Mining the Knowledge Mine: The Hot Spots Methodology for Mining Large Real World Databases.
Proceedings of the Advanced Topics in Artificial Intelligence, 1997

Boosting Neural Networks in Real Worls Applications: An Empirical Study.
Proceedings of the Advanced Topics in Artificial Intelligence, 1997

1993
Neighborhood Query and Analysis with GeoSAL, a Spatial Database Language.
Proceedings of the Advances in Spatial Databases, 1993

1992
Solving Spatial Analysis Problems with GeoSAL, A Spatial Query Language.
Proceedings of the 6th Int. Working Conf. on Scientific and Statistical Database Management, 1992

1991
Geo-SAL: A Query Language for Spatial Data Analysis.
Proceedings of the Advances in Spatial Databases, 1991


  Loading...