Taghi M. Khoshgoftaar

Affiliations:
  • Florida Atlantic University, Boca Raton, Florida, USA


According to our database1, Taghi M. Khoshgoftaar authored at least 608 papers between 1989 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Feature selection strategies: a comparative analysis of SHAP-value and importance-based methods.
J. Big Data, December, 2024

Low-shot learning and class imbalance: a survey.
J. Big Data, December, 2024

Synthesizing class labels for highly imbalanced credit card fraud detection data.
J. Big Data, December, 2024

Blockchain meets machine learning: a survey.
J. Big Data, December, 2024

Data reduction techniques for highly imbalanced medicare Big Data.
J. Big Data, December, 2024

2023
Threshold optimization and random undersampling for imbalanced credit card data.
J. Big Data, December, 2023

Investigating the effectiveness of one-class and binary classification for fraud detection.
J. Big Data, December, 2023

Comparative analysis of binary and one-class classification techniques for credit card fraud data.
J. Big Data, December, 2023

Iterative cleaning and learning of big highly-imbalanced fraud data using unsupervised learning.
J. Big Data, December, 2023

An approach to application-layer DoS detection.
J. Big Data, December, 2023

Breast cancer prediction using gated attentive multimodal deep learning.
J. Big Data, December, 2023

Evaluating classifier performance with highly imbalanced Big Data.
J. Big Data, December, 2023

Explainable machine learning models for Medicare fraud detection.
J. Big Data, December, 2023

Exploring Maximum Tree Depth and Random Undersampling in Ensemble Trees to Optimize the Classification of Imbalanced Big Data.
SN Comput. Sci., September, 2023

Learning from Highly Imbalanced Big Data with Label Noise.
Int. J. Artif. Intell. Tools, August, 2023

Data-Centric AI for Healthcare Fraud Detection.
SN Comput. Sci., July, 2023

Using machine learning to identify patient characteristics to predict mortality of in-patients with COVID-19 in South Florida.
Frontiers Digit. Health, March, 2023

The effect of feature extraction and data sampling on credit card fraud detection.
J. Big Data, 2023

Improving Medicare Fraud Detection through Big Data Size Reduction Techniques.
Proceedings of the IEEE International Conference on Service-Oriented System Engineering, 2023

Enhancing Credit Card Fraud Detection Through a Novel Ensemble Feature Selection Technique.
Proceedings of the 24th IEEE International Conference on Information Reuse and Integration for Data Science, 2023

Assessing One-Class and Binary Classification Approaches for Identifying Medicare Fraud.
Proceedings of the 24th IEEE International Conference on Information Reuse and Integration for Data Science, 2023

Unsupervised Anomaly Detection of Class Imbalanced Cognition Data Using an Iterative Cleaning Method.
Proceedings of the 24th IEEE International Conference on Information Reuse and Integration for Data Science, 2023

One-Class Classifier Performance: Comparing Majority versus Minority Class Training.
Proceedings of the 35th IEEE International Conference on Tools with Artificial Intelligence, 2023

A Model-Agnostic Feature Selection Technique to Improve the Performance of One-Class Classifiers.
Proceedings of the 35th IEEE International Conference on Tools with Artificial Intelligence, 2023

Data Reduction to Improve the Performance of One-Class Classifiers on Highly Imbalanced Big Data.
Proceedings of the International Conference on Machine Learning and Applications, 2023

A Comparative Study of Model-Agnostic and Importance-Based Feature Selection Approaches.
Proceedings of the 5th IEEE International Conference on Cognitive Machine Intelligence, 2023

A Novel Approach to Synthesize Class Labels in Highly Imbalanced Large Data.
Proceedings of the 5th IEEE International Conference on Cognitive Machine Intelligence, 2023

2022
Encoding High-Dimensional Procedure Codes for Healthcare Fraud Detection.
SN Comput. Sci., 2022

Hyperparameter Tuning for Medicare Fraud Detection in Big Data.
SN Comput. Sci., 2022

A Survey on Classifying Big Data with Label Noise.
ACM J. Data Inf. Qual., 2022

A new feature popularity framework for detecting cyberattacks using popular features.
J. Big Data, 2022

The use of generative adversarial networks to alleviate class imbalance in tabular data: a survey.
J. Big Data, 2022

IoT information theft prediction using ensemble feature selection.
J. Big Data, 2022

The Effects of Random Undersampling for Big Data Medicare Fraud Detection.
Proceedings of the IEEE International Conference on Service-Oriented System Engineering, 2022

A Class-Imbalanced Study with Feature Extraction via PCA and Convolutional Autoencoder.
Proceedings of the 23rd IEEE International Conference on Information Reuse and Integration for Data Science, 2022

Healthcare Provider Summary Data for Fraud Classification.
Proceedings of the 23rd IEEE International Conference on Information Reuse and Integration for Data Science, 2022

Optimizing Ensemble Trees for Big Data Healthcare Fraud Detection.
Proceedings of the 23rd IEEE International Conference on Information Reuse and Integration for Data Science, 2022

Exploring Language-Interfaced Fine-Tuning for COVID-19 Patient Survival Classification.
Proceedings of the 34th IEEE International Conference on Tools with Artificial Intelligence, 2022

GANs for Class-Imbalanced Data: A Meta-Analysis of GitHub Projects.
Proceedings of the 34th IEEE International Conference on Tools with Artificial Intelligence, 2022

Evaluating Performance Metrics for Credit Card Fraud Classification.
Proceedings of the 34th IEEE International Conference on Tools with Artificial Intelligence, 2022

Cost-Sensitive Ensemble Learning for Highly Imbalanced Classification.
Proceedings of the 21st IEEE International Conference on Machine Learning and Applications, 2022

Informative Evaluation Metrics for Highly Imbalanced Big Data Classification.
Proceedings of the 21st IEEE International Conference on Machine Learning and Applications, 2022

Predicting the Severity of COVID-19 Respiratory Illness with Deep Learning.
Proceedings of the Thirty-Fifth International Florida Artificial Intelligence Research Society Conference, 2022

An Exploration of Consistency Learning with Data Augmentation.
Proceedings of the Thirty-Fifth International Florida Artificial Intelligence Research Society Conference, 2022

A Comparison of House Price Classification with Structured and Unstructured Text Data.
Proceedings of the Thirty-Fifth International Florida Artificial Intelligence Research Society Conference, 2022

A Comparative Approach to Threshold Optimization for Classifying Imbalanced Data.
Proceedings of the 8th IEEE International Conference on Collaboration and Internet Computing, 2022

A Novel Approach for Unsupervised Learning of Highly-Imbalanced Data.
Proceedings of the 4th IEEE International Conference on Cognitive Machine Intelligence, 2022

2021
Medical Provider Embeddings for Healthcare Fraud Detection.
SN Comput. Sci., 2021

Gradient Boosted Decision Tree Algorithms for Medicare Fraud Detection.
SN Comput. Sci., 2021

Detecting web attacks using random undersampling and ensemble learners.
J. Big Data, 2021

Modeling and tracking Covid-19 cases using Big Data analytics on HPCC system platformm.
J. Big Data, 2021

Text Data Augmentation for Deep Learning.
J. Big Data, 2021

Deep Learning applications for COVID-19.
J. Big Data, 2021

A literature review on one-class classification and its potential applications in big data.
J. Big Data, 2021

A reconstruction error-based framework for label noise detection.
J. Big Data, 2021

Detecting cybersecurity attacks across different network features and learners.
J. Big Data, 2021

A Review and Analysis of the Bot-IoT Dataset.
Proceedings of the 15th IEEE International Conference on Service-Oriented System Engineering, 2021

Predicting Traffic Incidents in Road Networks Using Vehicle Detector Data.
Proceedings of the 24th IEEE International Intelligent Transportation Systems Conference, 2021

Detecting Web Attacks in Severely Imbalanced Network Traffic Data.
Proceedings of the 22nd IEEE International Conference on Information Reuse and Integration for Data Science, 2021

Detecting Slow Application-Layer DoS Attacks With PCA.
Proceedings of the 22nd IEEE International Conference on Information Reuse and Integration for Data Science, 2021

Encoding Techniques for High-Cardinality Features and Ensemble Learners.
Proceedings of the 22nd IEEE International Conference on Information Reuse and Integration for Data Science, 2021

Impact of Hyperparameter Tuning in Classifying Highly Imbalanced Big Data.
Proceedings of the 22nd IEEE International Conference on Information Reuse and Integration for Data Science, 2021

Using Inductive Transfer Learning to Improve Hotel Review Spam Detection.
Proceedings of the 22nd IEEE International Conference on Information Reuse and Integration for Data Science, 2021

Investigating the Generalization of Image Classifiers with Augmented Test Sets.
Proceedings of the 33rd IEEE International Conference on Tools with Artificial Intelligence, 2021

Feature Extraction for Class Imbalance Using a Convolutional Autoencoder and Data Sampling.
Proceedings of the 33rd IEEE International Conference on Tools with Artificial Intelligence, 2021

The Effects of Class Label Noise on Highly-Imbalanced Big Data.
Proceedings of the 33rd IEEE International Conference on Tools with Artificial Intelligence, 2021

Output Thresholding for Ensemble Learners and Imbalanced Big Data.
Proceedings of the 33rd IEEE International Conference on Tools with Artificial Intelligence, 2021

Feature Popularity Between Different Web Attacks with Supervised Feature Selection Rankers.
Proceedings of the 20th IEEE International Conference on Machine Learning and Applications, 2021

KerasBERT: Modeling the Keras Language.
Proceedings of the 20th IEEE International Conference on Machine Learning and Applications, 2021

Detecting Information Theft Attacks in the Bot-IoT Dataset.
Proceedings of the 20th IEEE International Conference on Machine Learning and Applications, 2021

Robust Thresholding Strategies for Highly Imbalanced and Noisy Data.
Proceedings of the 20th IEEE International Conference on Machine Learning and Applications, 2021

Detecting SSH and FTP Brute Force Attacks in Big Data.
Proceedings of the 20th IEEE International Conference on Machine Learning and Applications, 2021

Detecting SQL Injection Web Attacks Using Ensemble Learners and Data Sampling.
Proceedings of the IEEE International Conference on Cyber Security and Resilience, 2021

IoT Reconnaissance Attack Classification with Random Undersampling and Ensemble Feature Selection.
Proceedings of the 7th IEEE International Conference on Collaboration and Internet Computing, 2021

An Easy-to-Classify Approach for the Bot-IoT Dataset.
Proceedings of the Third IEEE International Conference on Cognitive Machine Intelligence, 2021

Mitigating Class Imbalance for IoT Network Intrusion Detection: A Survey.
Proceedings of the Seventh IEEE International Conference on Big Data Computing Service and Applications, 2021

An Examination of Neural Networks on Cluster Computers.
Proceedings of the Seventh IEEE International Conference on Big Data Computing Service and Applications, 2021

Leveraging LightGBM for Categorical Big Data.
Proceedings of the Seventh IEEE International Conference on Big Data Computing Service and Applications, 2021

2020
Sample size determination for biomedical big data with limited labels.
Netw. Model. Anal. Health Informatics Bioinform., 2020

Survey on RNN and CRF models for de-identification of medical free text.
J. Big Data, 2020

Investigating the relationship between time and predictive model maintenance.
J. Big Data, 2020

A survey and analysis of intrusion detection models based on CSE-CIC-IDS2018 Big Data.
J. Big Data, 2020

Investigating class rarity in big data.
J. Big Data, 2020

CatBoost for big data: an interdisciplinary review.
J. Big Data, 2020

Survey on categorical data for neural networks.
J. Big Data, 2020

The Effects of Data Sampling with Deep Learning and Highly Imbalanced Big Data.
Inf. Syst. Frontiers, 2020

A study on rare fraud predictions with big Medicare claims fraud data.
Intell. Data Anal., 2020

Detection Methods of Slow Read DoS Using Full Packet Capture Data.
Proceedings of the 21st International Conference on Information Reuse and Integration for Data Science, 2020

Semantic Embeddings for Medical Providers and Fraud Detection.
Proceedings of the 21st International Conference on Information Reuse and Integration for Data Science, 2020

Medicare Fraud Detection using CatBoost.
Proceedings of the 21st International Conference on Information Reuse and Integration for Data Science, 2020

Accelerated Deep Learning on HPCC Systems.
Proceedings of the 19th IEEE International Conference on Machine Learning and Applications, 2020

Performance of CatBoost and XGBoost in Medicare Fraud Detection.
Proceedings of the 19th IEEE International Conference on Machine Learning and Applications, 2020

Evaluating The Number of Trainable Parameters on Deep Maxout and LReLU Networks for Visual Recognition.
Proceedings of the 19th IEEE International Conference on Machine Learning and Applications, 2020

A Short Survey of LSTM Models for De-identification of Medical Free Text.
Proceedings of the 6th IEEE International Conference on Collaboration and Internet Computing, 2020

Hcpcs2Vec: Healthcare Procedure Embeddings for Medicare Fraud Prediction.
Proceedings of the 6th IEEE International Conference on Collaboration and Internet Computing, 2020

Detecting Cybersecurity Attacks Using Different Network Features with LightGBM and XGBoost Learners.
Proceedings of the 2nd IEEE International Conference on Cognitive Machine Intelligence, 2020

2019
Melanoma risk modeling from limited positive samples.
Netw. Model. Anal. Health Informatics Bioinform., 2019

A survey on Image Data Augmentation for Deep Learning.
J. Big Data, 2019

A parallel and distributed stochastic gradient descent implementation using commodity clusters.
J. Big Data, 2019

Medicare fraud detection using neural networks.
J. Big Data, 2019

Survey on deep learning with class imbalance.
J. Big Data, 2019

Random forest implementation and optimization for Big Data analytics on LexisNexis's high performance computing cluster platform.
J. Big Data, 2019

The effects of class rarity on the evaluation of supervised healthcare fraud detection models.
J. Big Data, 2019

Examining characteristics of predictive models with imbalanced big data.
J. Big Data, 2019

Severely imbalanced Big Data challenges: investigating data sampling approaches.
J. Big Data, 2019

Evaluation of maxout activations in deep learning across several big data domains.
J. Big Data, 2019

Impact of class distribution on the detection of slow HTTP DoS attacks using Big Data.
J. Big Data, 2019

Extracting Knowledge from Technical Reports for the Valuation of West Texas Intermediate Crude Oil Futures.
Inf. Syst. Frontiers, 2019

Maxout Networks for Visual Recognition.
Int. J. Multim. Data Eng. Manag., 2019

Efficient learning from big data for cancer risk modeling: A case study with melanoma.
Comput. Biol. Medicine, 2019

Deep Learning and Data Sampling with Imbalanced Big Data.
Proceedings of the 20th IEEE International Conference on Information Reuse and Integration for Data Science, 2019

A Comparison of Performance Metrics with Severely Imbalanced Network Security Big Data.
Proceedings of the 20th IEEE International Conference on Information Reuse and Integration for Data Science, 2019

Deep Learning with Maxout Activations for Visual Recognition and Verification.
Proceedings of the 20th IEEE International Conference on Information Reuse and Integration for Data Science, 2019

Evaluating Model Predictive Performance: A Medicare Fraud Detection Case Study.
Proceedings of the 20th IEEE International Conference on Information Reuse and Integration for Data Science, 2019

Approximating Learning Curves for Imbalanced Big Data with Limited Labels.
Proceedings of the 31st IEEE International Conference on Tools with Artificial Intelligence, 2019

Threshold Based Optimization of Performance Metrics with Severely Imbalanced Big Security Data.
Proceedings of the 31st IEEE International Conference on Tools with Artificial Intelligence, 2019

A Study on Software Metric Selection for Software Fault Prediction.
Proceedings of the 18th IEEE International Conference On Machine Learning And Applications, 2019

Learning Curve Estimation with Large Imbalanced Datasets.
Proceedings of the 18th IEEE International Conference On Machine Learning And Applications, 2019

The Effect of Time on the Maintenance of a Predictive Model.
Proceedings of the 18th IEEE International Conference On Machine Learning And Applications, 2019

Deep Learning and Thresholding with Class-Imbalanced Big Data.
Proceedings of the 18th IEEE International Conference On Machine Learning And Applications, 2019

Investigation of Maxout Activations on Convolutional Neural Networks for Big Data Text Sentiment Analysis.
Proceedings of the Thirty-Second International Florida Artificial Intelligence Research Society Conference, 2019

Detecting Slow HTTP POST DoS Attacks Using Netflow Features.
Proceedings of the Thirty-Second International Florida Artificial Intelligence Research Society Conference, 2019

Differentiating between Educational Data Mining and Learning Analytics: A Bibliometric Approach.
Proceedings of the Joint Proceedings of the Workshops of the 12th International Conference on Educational Data Mining co-located with the 12th International Conference on Educational Data Mining, 2019

Investigating Random Undersampling and Feature Selection on Bioinformatics Big Data.
Proceedings of the IEEE Fifth International Conference on Big Data Computing Service and Applications, 2019

Maxout Neural Network for Big Data Medical Fraud Detection.
Proceedings of the IEEE Fifth International Conference on Big Data Computing Service and Applications, 2019

2018
Social media for polling and predicting United States election outcome.
Soc. Netw. Anal. Min., 2018

Big Data: Deep Learning for financial sentiment analysis.
J. Big Data, 2018

A survey on addressing high-class imbalance in big data.
J. Big Data, 2018

Big Data fraud detection using multiple medicare data sources.
J. Big Data, 2018

A Study of the Impact of Base Traditional Learners on Transfer Learning Algorithms.
Int. J. Artif. Intell. Tools, 2018

The effects of varying class distribution on learner behavior for medicare fraud detection with imbalanced big data.
Health Inf. Sci. Syst., 2018

A review of statistical and machine learning methods for modeling cancer risk using structured clinical data.
Artif. Intell. Medicine, 2018

Filter-Based Subset Selection for Easy, Moderate, and Hard Bioinformatics Data.
Proceedings of the 2018 IEEE International Conference on Information Reuse and Integration, 2018

Is Gene Selection Enough for Imbalanced Bioinformatics Data?
Proceedings of the 2018 IEEE International Conference on Information Reuse and Integration, 2018

Utilizing Netflow Data to Detect Slow Read Attacks.
Proceedings of the 2018 IEEE International Conference on Information Reuse and Integration, 2018

The Effects of Random Undersampling with Simulated Class Imbalance for Big Data.
Proceedings of the 2018 IEEE International Conference on Information Reuse and Integration, 2018

Identifying Medicare Provider Fraud with Unsupervised Machine Learning.
Proceedings of the 2018 IEEE International Conference on Information Reuse and Integration, 2018

Medicare Fraud Detection Using Random Forest with Class Imbalanced Big Data.
Proceedings of the 2018 IEEE International Conference on Information Reuse and Integration, 2018

A Survey of Medicare Data Processing and Integration for Fraud Detection.
Proceedings of the 2018 IEEE International Conference on Information Reuse and Integration, 2018

Building and Interpreting Risk Models from Imbalanced Clinical Data.
Proceedings of the IEEE 30th International Conference on Tools with Artificial Intelligence, 2018

Data Sampling Approaches with Severely Imbalanced Big Data for Medicare Fraud Detection.
Proceedings of the IEEE 30th International Conference on Tools with Artificial Intelligence, 2018

An Empirical Study on Class Rarity in Big Data.
Proceedings of the 17th IEEE International Conference on Machine Learning and Applications, 2018

Location-Based Twitter Sentiment Analysis for Predicting the U.S. 2016 Presidential Election.
Proceedings of the Thirty-First International Florida Artificial Intelligence Research Society Conference, 2018

Fraud Detection with a Limited Number of Known Fraudulent Medicare Providers.
Proceedings of the Thirty-First International Florida Artificial Intelligence Research Society Conference, 2018

The Detection of Medicare Fraud Using Machine Learning Methods with Excluded Provider Labels.
Proceedings of the Thirty-First International Florida Artificial Intelligence Research Society Conference, 2018

The Impact of Malicious Accounts on Political Tweet Sentiment.
Proceedings of the 4th IEEE International Conference on Collaboration and Internet Computing, 2018

Melanoma Risk Prediction with Structured Electronic Health Records.
Proceedings of the 2018 ACM International Conference on Bioinformatics, 2018

2017
Improving detection of untrustworthy online reviews using ensemble learners combined with feature selection.
Soc. Netw. Anal. Min., 2017

Reliability Evaluation Model of Component-Based Software Based on Complex Network Theory.
Qual. Reliab. Eng. Int., 2017

Improving deep neural network design with new text data representations.
J. Big Data, 2017

Large-scale distributed L-BFGS.
J. Big Data, 2017

A survey on heterogeneous transfer learning.
J. Big Data, 2017

Analysis of Transfer Learning Performance Measures.
Proceedings of the 2017 IEEE International Conference on Information Reuse and Integration, 2017

Modernizing Analytics for Melanoma with a Large-Scale Research Dataset.
Proceedings of the 2017 IEEE International Conference on Information Reuse and Integration, 2017

Extracting Knowledge from Technical Reports for the Valuation of West Texas Intermediate Crude Oil Futures.
Proceedings of the 2017 IEEE International Conference on Information Reuse and Integration, 2017

User Behavior Anomaly Detection for Application Layer DDoS Attacks.
Proceedings of the 2017 IEEE International Conference on Information Reuse and Integration, 2017

Using Weather and Playing Surface to Predict the Occurrence of Injury in Major League Soccer Games: A Case Study.
Proceedings of the 2017 IEEE International Conference on Information Reuse and Integration, 2017

Medical Provider Specialty Predictions for the Detection of Anomalous Medicare Insurance Claims.
Proceedings of the 2017 IEEE International Conference on Information Reuse and Integration, 2017

Estimating Outlier Score Probabilities.
Proceedings of the 2017 IEEE International Conference on Information Reuse and Integration, 2017

Evaluation of Transfer Learning Algorithms Using Different Base Learners.
Proceedings of the 29th IEEE International Conference on Tools with Artificial Intelligence, 2017

Training Convolutional Networks on Truncated Text.
Proceedings of the 29th IEEE International Conference on Tools with Artificial Intelligence, 2017

Comparing Transfer Learning and Traditional Learning Under Domain Class Imbalance.
Proceedings of the 16th IEEE International Conference on Machine Learning and Applications, 2017

Medicare Fraud Detection Using Machine Learning Methods.
Proceedings of the 16th IEEE International Conference on Machine Learning and Applications, 2017

Deep Neural Network Architecture for Character-Level Learning on Short Text.
Proceedings of the Thirtieth International Florida Artificial Intelligence Research Society Conference, 2017

A Text Mining Approach for Anomaly Detection in Application Layer DDoS Attacks.
Proceedings of the Thirtieth International Florida Artificial Intelligence Research Society Conference, 2017

Multivariate Anomaly Detection in Medicare using Model Residuals and Probabilistic Programming.
Proceedings of the Thirtieth International Florida Artificial Intelligence Research Society Conference, 2017

Detection of Phishing Webpages Using Heterogeneous Transfer Learning.
Proceedings of the 3rd IEEE International Conference on Collaboration and Internet Computing, 2017

Exploring the Effectiveness of Twitter at Polling the United States 2016 Presidential Election.
Proceedings of the 3rd IEEE International Conference on Collaboration and Internet Computing, 2017

A Review of Performance Evaluation on 2D Face Databases.
Proceedings of the Third IEEE International Conference on Big Data Computing Service and Applications, 2017

Predicting sentinel node status in melanoma from a real-world EHR dataset.
Proceedings of the 2017 IEEE International Conference on Bioinformatics and Biomedicine, 2017

2016
Is Data Sampling Required When Using Random Forest for Classification on Imbalanced Bioinformatics Data?
Proceedings of the Theoretical Information Reuse and Integration, 2016

A survey of transfer learning.
J. Big Data, 2016

The improved grey model based on particle swarm optimization algorithm for time series prediction.
Eng. Appl. Artif. Intell., 2016

Designing a Testing Framework for Transfer Learning Algorithms (Application Paper).
Proceedings of the 17th IEEE International Conference on Information Reuse and Integration, 2016

Predicting Cancer Relapse with Clinical Data: A Survey of Current Techniques.
Proceedings of the 17th IEEE International Conference on Information Reuse and Integration, 2016

Designing a Better Data Representation for Deep Neural Networks and Text Classification.
Proceedings of the 17th IEEE International Conference on Information Reuse and Integration, 2016

Cross-Domain Sentiment Analysis: An Empirical Investigation.
Proceedings of the 17th IEEE International Conference on Information Reuse and Integration, 2016

Investigating the Variation of Ensemble Size on Bagging-Based Classifier Performance in Imbalanced Bioinformatics Datasets.
Proceedings of the 17th IEEE International Conference on Information Reuse and Integration, 2016

A Novel Method for Fraudulent Medicare Claims Detection from Expected Payment Deviations (Application Paper).
Proceedings of the 17th IEEE International Conference on Information Reuse and Integration, 2016

An Investigation of Transfer Learning and Traditional Machine Learning Algorithms.
Proceedings of the 28th IEEE International Conference on Tools with Artificial Intelligence, 2016

Predicting Medical Provider Specialties to Detect Anomalous Insurance Claims.
Proceedings of the 28th IEEE International Conference on Tools with Artificial Intelligence, 2016

Investigating Transfer Learners for Robustness to Domain Class Imbalance.
Proceedings of the 15th IEEE International Conference on Machine Learning and Applications, 2016

An Investigation of Ensemble Techniques for Detection of Spam Reviews.
Proceedings of the 15th IEEE International Conference on Machine Learning and Applications, 2016

A Probabilistic Programming Approach for Outlier Detection in Healthcare Claims.
Proceedings of the 15th IEEE International Conference on Machine Learning and Applications, 2016

Enhancing Ensemble Learners with Data Sampling on High-Dimensional Imbalanced Tweet Sentiment Data.
Proceedings of the Twenty-Ninth International Florida Artificial Intelligence Research Society Conference, 2016

Necessity of Feature Selection when Augmenting Tweet Sentiment Feature Spaces with Emoticons.
Proceedings of the Twenty-Ninth International Florida Artificial Intelligence Research Society Conference, 2016

Comparing Approaches for Combining Data Sampling and Feature Selection to Address Key Data Quality Issues in Tweet Sentiment Analysis.
Proceedings of the Twenty-Ninth International Florida Artificial Intelligence Research Society Conference, 2016

RUDY Attack: Detection at the Network Level and Its Important Features.
Proceedings of the Twenty-Ninth International Florida Artificial Intelligence Research Society Conference, 2016

Reducing Feature Set Explosion to Facilitate Real-World Review Spam Detection.
Proceedings of the Twenty-Ninth International Florida Artificial Intelligence Research Society Conference, 2016

Integrating Multiple Data Sources to Enhance Sentiment Prediction.
Proceedings of the 2nd IEEE International Conference on Collaboration and Internet Computing, 2016

Transfer Learning Techniques.
Proceedings of the Big Data Technologies and Applications, 2016

Deep Learning Techniques in Big Data Analytics.
Proceedings of the Big Data Technologies and Applications, 2016

2015
Intrusion detection and Big Heterogeneous Data: a Survey.
J. Big Data, 2015

Deep learning applications and challenges in big data analytics.
J. Big Data, 2015

A survey of open source tools for machine learning with big data in the Hadoop ecosystem.
J. Big Data, 2015

Survey of review spam detection using machine learning techniques.
J. Big Data, 2015

On the Stability of Feature Selection Methods in Software Quality Prediction: An Empirical Investigation.
Int. J. Softw. Eng. Knowl. Eng., 2015

An Empirical Investigation on Wrapper-Based Feature Selection for Predicting Software Quality.
Int. J. Softw. Eng. Knowl. Eng., 2015

Aggregating Data Sampling with Feature Subset Selection to Address Skewed Software Defect Data.
Int. J. Softw. Eng. Knowl. Eng., 2015

Investigating Two Approaches for Adding Feature Ranking to Sampled Ensemble Learning for Software Quality Estimation.
Int. J. Softw. Eng. Knowl. Eng., 2015

Stability of Three Forms of Feature Selection Methods on Software Engineering Data.
Proceedings of the 27th International Conference on Software Engineering and Knowledge Engineering, 2015

Combining Feature Subset Selection and Data Sampling for Coping with Highly Imbalanced Software Data.
Proceedings of the 27th International Conference on Software Engineering and Knowledge Engineering, 2015

A Multi-dimensional Comparison of Toolkits for Machine Learning with Big Data.
Proceedings of the 2015 IEEE International Conference on Information Reuse and Integration, 2015

Using Random Undersampling to Alleviate Class Imbalance on Tweet Sentiment Data.
Proceedings of the 2015 IEEE International Conference on Information Reuse and Integration, 2015

Using Ensemble Learners to Improve Classifier Performance on Tweet Sentiment Data.
Proceedings of the 2015 IEEE International Conference on Information Reuse and Integration, 2015

Alterations to the Bootstrapping Process within Random Forest: A Case Study on Imbalanced Bioinformatics Data.
Proceedings of the 2015 IEEE International Conference on Information Reuse and Integration, 2015

Building an Effective Classification Model for Breast Cancer Patient Response Data.
Proceedings of the 2015 IEEE International Conference on Information Reuse and Integration, 2015

Observing the Effect of the Choice of Classifier on Bioinformatics Data with Varying Levels of Data Quality and Class Balance.
Proceedings of the 2015 IEEE International Conference on Information Reuse and Integration, 2015

Choosing an Appropriate Ensemble Classifier for Balanced Bioinformatics Data.
Proceedings of the 2015 IEEE International Conference on Information Reuse and Integration, 2015

The Effect of Data Sampling When Using Random Forest on Imbalanced Bioinformatics Data.
Proceedings of the 2015 IEEE International Conference on Information Reuse and Integration, 2015

A Survey of 2D Face Databases.
Proceedings of the 2015 IEEE International Conference on Information Reuse and Integration, 2015

Efficient Modeling of User-Entity Preference in Big Social Networks.
Proceedings of the 27th IEEE International Conference on Tools with Artificial Intelligence, 2015

Using Feature Selection in Combination with Ensemble Learning Techniques to Improve Tweet Sentiment Classification Performance.
Proceedings of the 27th IEEE International Conference on Tools with Artificial Intelligence, 2015

Ensemble vs. Data Sampling: Which Option Is Best Suited to Improve Classification Performance of Imbalanced Bioinformatics Data?
Proceedings of the 27th IEEE International Conference on Tools with Artificial Intelligence, 2015

The Effect of Dataset Size on Training Tweet Sentiment Classifiers.
Proceedings of the 14th IEEE International Conference on Machine Learning and Applications, 2015

Utilizing Ensemble, Data Sampling and Feature Selection Techniques for Improving Classification Performance on Tweet Sentiment Data.
Proceedings of the 14th IEEE International Conference on Machine Learning and Applications, 2015

Detection of SSH Brute Force Attacks Using Aggregated Netflow Data.
Proceedings of the 14th IEEE International Conference on Machine Learning and Applications, 2015

Investigating New Bootstrapping Approaches of Bagging Classifiers to Account for Class Imbalance in Bioinformatics Datasets.
Proceedings of the 14th IEEE International Conference on Machine Learning and Applications, 2015

Does the Inclusion of Data Sampling Improve the Performance of Boosting Algorithms on Imbalanced Bioinformatics Data?
Proceedings of the 14th IEEE International Conference on Machine Learning and Applications, 2015

A New Intrusion Detection Benchmarking System.
Proceedings of the Twenty-Eighth International Florida Artificial Intelligence Research Society Conference, 2015

Impact of Feature Selection Techniques for Tweet Sentiment Classification.
Proceedings of the Twenty-Eighth International Florida Artificial Intelligence Research Society Conference, 2015

Selecting the Appropriate Ensemble Learning Approach for Balanced Bioinformatics Data.
Proceedings of the Twenty-Eighth International Florida Artificial Intelligence Research Society Conference, 2015

2014
A review of data mining using big data in health informatics.
J. Big Data, 2014

A comparative study of iterative and non-iterative feature selection techniques for software defect prediction.
Inf. Syst. Frontiers, 2014

System regression test planning with a fuzzy expert system.
Inf. Sci., 2014

An empirical study of the classification performance of learners on imbalanced and noisy software quality data.
Inf. Sci., 2014

Software quality assessment using a multi-strategy classifier.
Inf. Sci., 2014

Incomplete-case nearest neighbor imputation in software measurement data.
Inf. Sci., 2014

The Use of Ensemble-Based Data Preprocessing Techniques for Software Defect Prediction.
Int. J. Softw. Eng. Knowl. Eng., 2014

Choosing the Best Classification Performance Metric for Wrapper-based Software Metric Selection for Defect Prediction.
Proceedings of the 26th International Conference on Software Engineering and Knowledge Engineering, 2014

Comparing Two Approaches for Adding Feature Ranking to Sampled Ensemble Learning for Software Quality Estimation.
Proceedings of the 26th International Conference on Software Engineering and Knowledge Engineering, 2014

Stability of filter- and wrapper-based software metric selection techniques.
Proceedings of the 15th IEEE International Conference on Information Reuse and Integration, 2014

Using feature selection and classification to build effective and efficient firewalls.
Proceedings of the 15th IEEE International Conference on Information Reuse and Integration, 2014

The effect of noise level and distribution on classification of easy gene microarray data.
Proceedings of the 15th IEEE International Conference on Information Reuse and Integration, 2014

How ranker and learner choice affects classification performance on noisy bioinformatics data.
Proceedings of the 15th IEEE International Conference on Information Reuse and Integration, 2014

Rotation invariant face recognition survey.
Proceedings of the 15th IEEE International Conference on Information Reuse and Integration, 2014

Improving software quality estimation by combining feature selection strategies with sampled ensemble learning.
Proceedings of the 15th IEEE International Conference on Information Reuse and Integration, 2014

Classification performance of three approaches for combining data sampling and gene selection on bioinformatics data.
Proceedings of the 15th IEEE International Conference on Information Reuse and Integration, 2014

Optimizing Wrapper-Based Feature Selection for Use on Bioinformatics Data.
Proceedings of the Twenty-Seventh International Florida Artificial Intelligence Research Society Conference, 2014

Combining Feature Selection and Ensemble Learning for Software Quality Estimation.
Proceedings of the Twenty-Seventh International Florida Artificial Intelligence Research Society Conference, 2014

Comparison of Data Sampling Approaches for Imbalanced Bioinformatics Data.
Proceedings of the Twenty-Seventh International Florida Artificial Intelligence Research Society Conference, 2014

A Session Based Approach for Aggregating Network Traffic Data - The SANTA Dataset.
Proceedings of the 2014 IEEE International Conference on Bioinformatics and Bioengineering, 2014

Network Traffic Prediction Models for Near- and Long-Term Predictions.
Proceedings of the 2014 IEEE International Conference on Bioinformatics and Bioengineering, 2014

Using Correlation-Based Feature Selection for a Diverse Collection of Bioinformatics Datasets.
Proceedings of the 2014 IEEE International Conference on Bioinformatics and Bioengineering, 2014

Evaluation of Wrapper-Based Feature Selection Using Hard, Moderate, and Easy Bioinformatics Data.
Proceedings of the 2014 IEEE International Conference on Bioinformatics and Bioengineering, 2014

Machine Learning for Detecting Brute Force Attacks at the Network Level.
Proceedings of the 2014 IEEE International Conference on Bioinformatics and Bioengineering, 2014

Effects of the Use of Boosting on Classification Performance of Imbalanced Bioinformatics Datasets.
Proceedings of the 2014 IEEE International Conference on Bioinformatics and Bioengineering, 2014

Select-Bagging: Effectively Combining Gene Selection and Bagging for Balanced Bioinformatics Data.
Proceedings of the 2014 IEEE International Conference on Bioinformatics and Bioengineering, 2014

Selecting the Appropriate Data Sampling Approach for Imbalanced and High-Dimensional Bioinformatics Datasets.
Proceedings of the 2014 IEEE International Conference on Bioinformatics and Bioengineering, 2014

2013
High Consequence Systems and Semantic Computing.
Int. J. Semantic Comput., 2013

Editorial.
Int. J. Artif. Intell. Tools, 2013

A Study of Software Metric Selection Techniques: stability Analysis and Defect Prediction Model Performance.
Int. J. Artif. Intell. Tools, 2013

Feature Selection for Optimization of Wavelet Packet Decomposition in Reliability Analysis of Systems.
Int. J. Artif. Intell. Tools, 2013

A Study on First Order Statistics-Based Feature Selection Techniques on Software Metric Data.
Proceedings of the 25th International Conference on Software Engineering and Knowledge Engineering, 2013

Overcoming Big Data Challenges.
Proceedings of the 25th International Conference on Software Engineering and Knowledge Engineering, 2013

Exploring Ensemble-Based Data Preprocessing Techniques for Software Quality Estimation.
Proceedings of the 25th International Conference on Software Engineering and Knowledge Engineering, 2013

Predicting susceptibility to social bots on Twitter.
Proceedings of the IEEE 14th International Conference on Information Reuse & Integration, 2013

Filter- and wrapper-based feature selection for predicting user interaction with Twitter bots.
Proceedings of the IEEE 14th International Conference on Information Reuse & Integration, 2013

The importance of performance metrics within wrapper feature selection.
Proceedings of the IEEE 14th International Conference on Information Reuse & Integration, 2013

Hidden dependencies between class imbalance and difficulty of learning for bioinformatics datasets.
Proceedings of the IEEE 14th International Conference on Information Reuse & Integration, 2013

The use of balance-aware subsampling for bioinformatics datasets.
Proceedings of the IEEE 14th International Conference on Information Reuse & Integration, 2013

Patient response datasets: Challenges and opportunities.
Proceedings of the IEEE 14th International Conference on Information Reuse & Integration, 2013

Feature list aggregation approaches for ensemble gene selection on patient response datasets.
Proceedings of the IEEE 14th International Conference on Information Reuse & Integration, 2013

A survey of stability analysis of feature subset selection techniques.
Proceedings of the IEEE 14th International Conference on Information Reuse & Integration, 2013

Gene selection stability's dependence on dataset difficulty.
Proceedings of the IEEE 14th International Conference on Information Reuse & Integration, 2013

Comparison of rank-based vs. score-based aggregation for ensemble gene selection.
Proceedings of the IEEE 14th International Conference on Information Reuse & Integration, 2013

Comparison of Two Frameworks for Measuring the Stability of Gene-Selection Techniques on Noisy Class-Imbalanced Data.
Proceedings of the 25th IEEE International Conference on Tools with Artificial Intelligence, 2013

Which Users Reply to and Interact with Twitter Social Bots?
Proceedings of the 25th IEEE International Conference on Tools with Artificial Intelligence, 2013

Should the Same Learners Be Used Both within Wrapper Feature Selection and for Building Classification Models?
Proceedings of the 25th IEEE International Conference on Tools with Artificial Intelligence, 2013

How the Choice of Wrapper Learner and Performance Metric Affects Subset Evaluation.
Proceedings of the 25th IEEE International Conference on Tools with Artificial Intelligence, 2013

Stability of Filter- and Wrapper-Based Feature Subset Selection.
Proceedings of the 25th IEEE International Conference on Tools with Artificial Intelligence, 2013

A Review of Ensemble Classification for DNA Microarrays Data.
Proceedings of the 25th IEEE International Conference on Tools with Artificial Intelligence, 2013

Maximizing Classification Performance for Patient Response Datasets.
Proceedings of the 25th IEEE International Conference on Tools with Artificial Intelligence, 2013

An Empirical Study on Wrapper-Based Feature Selection for Software Engineering Data.
Proceedings of the 12th International Conference on Machine Learning and Applications, 2013

Comparative Analysis on the Stability of Feature Selection Techniques Using Three Frameworks on Biological Datasets.
Proceedings of the 12th International Conference on Machine Learning and Applications, 2013

Comparison of Stability for Different Families of Filter-Based and Wrapper-Based Feature Selection.
Proceedings of the 12th International Conference on Machine Learning and Applications, 2013

Random Forest with 200 Selected Features: An Optimal Model for Bioinformatics Research.
Proceedings of the 12th International Conference on Machine Learning and Applications, 2013

Survey of Data Cleansing and Monitoring for Large-Scale Battery Backup Installations.
Proceedings of the 12th International Conference on Machine Learning and Applications, 2013

Contrasting Undersampled Boosting with Internal and External Feature Selection for Patient Response Datasets.
Proceedings of the 12th International Conference on Machine Learning and Applications, 2013

Survey of Clinical Data Mining Applications on Big Data in Health Informatics.
Proceedings of the 12th International Conference on Machine Learning and Applications, 2013

Improving Software Quality Estimation by Combining Boosting and Feature Selection.
Proceedings of the 12th International Conference on Machine Learning and Applications, 2013

Simplifying the Utilization of Machine Learning Techniques for Bioinformatics.
Proceedings of the 12th International Conference on Machine Learning and Applications, 2013

Ensemble Gene Selection Versus Single Gene Selection: Which Is Better?
Proceedings of the Twenty-Sixth International Florida Artificial Intelligence Research Society Conference, 2013

Classification Performance of Rank Aggregation Techniques for Ensemble Gene Selection.
Proceedings of the Twenty-Sixth International Florida Artificial Intelligence Research Society Conference, 2013

2012
Predicting high-risk program modules by selecting the right software measurements.
Softw. Qual. J., 2012

A System-Level Modeling Methodology for Performance-Driven Component Selection in Multicore Architectures.
IEEE Syst. J., 2012

Threshold-based feature selection techniques for high-dimensional bioinformatics data.
Netw. Model. Anal. Health Informatics Bioinform., 2012

An Empirical Study of Feature Ranking Techniques for Software Quality Prediction.
Int. J. Softw. Eng. Knowl. Eng., 2012

Software measurement data reduction using ensemble techniques.
Neurocomputing, 2012

Exploring filter-based feature selection techniques for software quality classification.
Int. J. Inf. Decis. Sci., 2012

Evaluation of the importance of data pre-processing order when combining feature selection and data sampling.
Int. J. Bus. Intell. Data Min., 2012

Measuring stability of feature ranking techniques: a noise-based approach.
Int. J. Bus. Intell. Data Min., 2012

An Empirical Study of Software Metric Selection Techniques for Defect Prediction.
Proceedings of the 24th International Conference on Software Engineering & Knowledge Engineering (SEKE'2012), 2012

Stability of Filter-Based Feature Selection Methods for Imbalanced Software Measurement Data.
Proceedings of the 24th International Conference on Software Engineering & Knowledge Engineering (SEKE'2012), 2012

A novel dataset-similarity-aware approach for evaluating stability of software metric selection techniques.
Proceedings of the IEEE 13th International Conference on Information Reuse & Integration, 2012

Machine prediction of personality from Facebook profiles.
Proceedings of the IEEE 13th International Conference on Information Reuse & Integration, 2012

An extensive comparison of feature ranking aggregation techniques in bioinformatics.
Proceedings of the IEEE 13th International Conference on Information Reuse & Integration, 2012

Impact of noise and data sampling on stability of feature ranking techniques for biological datasets.
Proceedings of the IEEE 13th International Conference on Information Reuse & Integration, 2012

Panel: Using information re-use and integration principles in big data.
Proceedings of the IEEE 13th International Conference on Information Reuse & Integration, 2012

Exploring an iterative feature selection technique for highly imbalanced data sets.
Proceedings of the IEEE 13th International Conference on Information Reuse & Integration, 2012

A review of the stability of feature selection techniques for bioinformatics data.
Proceedings of the IEEE 13th International Conference on Information Reuse & Integration, 2012

A Comparative Study on the Stability of Software Metric Selection Techniques.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

An Empirical Study on the Stability of Feature Selection for Imbalanced Software Engineering Data.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

Using Twitter Content to Predict Psychopathy.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

A New Fixed-Overlap Partitioning Algorithm for Determining Stability of Bioinformatics Gene Rankers.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

Mean Aggregation versus Robust Rank Aggregation for Ensemble Gene Selection.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

First Order Statistics Based Feature Selection: A Diverse and Powerful Family of Feature Seleciton Techniques.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

A Novel Noise-Resistant Boosting Algorithm for Class-Skewed Data.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

A Hybrid Approach to Coping with High Dimensionality and Class Imbalance for Software Defect Prediction.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

Applying Feature Selection to Short Time Wavelet Transformed Vibration Data for Reliability Analysis of an Ocean Turbine.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

Studying the Effect of Class Imbalance in Ocean Turbine Fault Data on Reliable State Detection.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

Decision Level Fusion of Wavelet Features for Ocean Turbine State Detection.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

Comparing Two New Gene Selection Ensemble Approaches with the Commonly-Used Approach.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

Determining the Number of Iterations Appropriate for Ensemble Gene Selection on Microarray Data.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

The Effect of Number of Iterations on Ensemble Gene Selection.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

Robustness of Threshold-Based Feature Rankers with Data Sampling on Noisy and Imbalanced Data.
Proceedings of the Twenty-Fifth International Florida Artificial Intelligence Research Society Conference, 2012

The effect of measurement approach and noise level on gene selection stability.
Proceedings of the 2012 IEEE International Conference on Bioinformatics and Biomedicine, 2012

Similarity analysis of feature ranking techniques on imbalanced DNA microarray datasets.
Proceedings of the 2012 IEEE International Conference on Bioinformatics and Biomedicine, 2012

2011
The use of decision trees for cost-sensitive classification: an empirical study in software quality prediction.
WIREs Data Mining Knowl. Discov., 2011

Ontology-Based Business Process Customization for Composite Web Services.
IEEE Trans. Syst. Man Cybern. Part A, 2011

Comparing Boosting and Bagging Techniques With Noisy and Imbalanced Data.
IEEE Trans. Syst. Man Cybern. Part A, 2011

Choosing software metrics for defect prediction: an investigation on feature selection techniques.
Softw. Pract. Exp., 2011

Evaluating the Impact of Data Quality on Sampling.
J. Inf. Knowl. Manag., 2011

Identification of microRNA biomarkers for cancer by combining multiple feature selection techniques.
J. Comput. Methods Sci. Eng., 2011

Metric Selection for Software Defect Prediction.
Int. J. Softw. Eng. Knowl. Eng., 2011

An exploration of learning when data is noisy and imbalanced.
Intell. Data Anal., 2011

An Empirical Study of Software Metrics Selection Using Support Vector Machine.
Proceedings of the 23rd International Conference on Software Engineering & Knowledge Engineering (SEKE'2011), 2011

A Comparative Study of Different Strategies for Predicting Software Quality.
Proceedings of the 23rd International Conference on Software Engineering & Knowledge Engineering (SEKE'2011), 2011

Software Defect Prediction for High-Dimensional and Class-Imbalanced Data.
Proceedings of the 23rd International Conference on Software Engineering & Knowledge Engineering (SEKE'2011), 2011

Using Classifier-Based Nominal Imputation to Improve Machine Learning.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2011

Measuring robustness of Feature Selection techniques on software engineering datasets.
Proceedings of the IEEE International Conference on Information Reuse and Integration, 2011

Fourier transforms for vibration analysis: A review and case study.
Proceedings of the IEEE International Conference on Information Reuse and Integration, 2011

Comparison of approaches to alleviate problems with high-dimensional and class-imbalanced data.
Proceedings of the IEEE International Conference on Information Reuse and Integration, 2011

A comparative evaluation of feature ranking methods for high dimensional bioinformatics data.
Proceedings of the IEEE International Conference on Information Reuse and Integration, 2011

A noise-based stability evaluation of threshold-based feature selection techniques.
Proceedings of the IEEE International Conference on Information Reuse and Integration, 2011

Measuring Stability of Threshold-Based Feature Selection Techniques.
Proceedings of the IEEE 23rd International Conference on Tools with Artificial Intelligence, 2011

Feature Selection for Vibration Sensor Data Transformed by a Streaming Wavelet Packet Decomposition.
Proceedings of the IEEE 23rd International Conference on Tools with Artificial Intelligence, 2011

Impact of Data Sampling on Stability of Feature Selection for Software Measurement Data.
Proceedings of the IEEE 23rd International Conference on Tools with Artificial Intelligence, 2011

Feature Selection on Dynamometer Data for Reliability Analysis.
Proceedings of the IEEE 23rd International Conference on Tools with Artificial Intelligence, 2011

Stability and Classification Performance of Feature Selection Techniques.
Proceedings of the 10th International Conference on Machine Learning and Applications and Workshops, 2011

Impact of Noise and Data Sampling on Stability of Feature Selection.
Proceedings of the 10th International Conference on Machine Learning and Applications and Workshops, 2011

Using Feature Selection to Determine Optimal Depth for Wavelet Packet Decomposition of Vibration Signals for Ocean System Reliability.
Proceedings of the 13th IEEE International Symposium on High-Assurance Systems Engineering, 2011

Ensemble Coordination for Discrete Event Control.
Proceedings of the 13th IEEE International Symposium on High-Assurance Systems Engineering, 2011

A Dynamometer for an Ocean Turbine Prototype: Reliability through Automated Monitoring.
Proceedings of the 13th IEEE International Symposium on High-Assurance Systems Engineering, 2011

How Many Software Metrics Should be Selected for Defect Prediction?
Proceedings of the Twenty-Fourth International Florida Artificial Intelligence Research Society Conference, 2011

Feature Level Sensor Fusion for Improved Fault Detection in MCM Systems for Ocean Turbines.
Proceedings of the Twenty-Fourth International Florida Artificial Intelligence Research Society Conference, 2011

Robustness of Filter-Based Feature Ranking: A Case Study.
Proceedings of the Twenty-Fourth International Florida Artificial Intelligence Research Society Conference, 2011

Stability Analysis of Feature Ranking Techniques on Biological Datasets.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2011

Random forest: A reliable tool for patient response prediction.
Proceedings of the 2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops, 2011

2010
RUSBoost: A Hybrid Approach to Alleviating Class Imbalance.
IEEE Trans. Syst. Man Cybern. Part A, 2010

Evolutionary Optimization of Software Quality Modeling with Multiple Repositories.
IEEE Trans. Software Eng., 2010

Supervised neural network modeling: an empirical investigation into learning from imbalanced data with labeling errors.
IEEE Trans. Neural Networks, 2010

Dynamic Two-phase Truncated Rayleigh Model for Release Date Prediction of Software.
J. Softw. Eng. Appl., 2010

An Empirical Evaluation of Repetitive Undersampling Techniques.
Int. J. Softw. Eng. Knowl. Eng., 2010

Evolutionary data analysis for the class imbalance problem.
Intell. Data Anal., 2010

Ensemble Feature Selection Technique for Software Quality Classification.
Proceedings of the 22nd International Conference on Software Engineering & Knowledge Engineering (SEKE'2010), Redwood City, San Francisco Bay, CA, USA, July 1, 2010

Software Engineering with Computational Intelligence and Machine Learning A Novel Software Metric Selection Technique Using the Area Under ROC Curves.
Proceedings of the 22nd International Conference on Software Engineering & Knowledge Engineering (SEKE'2010), Redwood City, San Francisco Bay, CA, USA, July 1, 2010

A comparative study of filter-based feature ranking techniques.
Proceedings of the IEEE International Conference on Information Reuse and Integration, 2010

Active learning with neural networks for intrusion detection.
Proceedings of the IEEE International Conference on Information Reuse and Integration, 2010

A novel feature selection technique for highly imbalanced data.
Proceedings of the IEEE International Conference on Information Reuse and Integration, 2010

Attribute Selection and Imbalanced Data: Problems in Software Defect Prediction.
Proceedings of the 22nd IEEE International Conference on Tools with Artificial Intelligence, 2010

A Comparative Study of Ensemble Feature Selection Techniques for Software Defect Prediction.
Proceedings of the Ninth International Conference on Machine Learning and Applications, 2010

A Novel Noise Filtering Algorithm for Imbalanced Data.
Proceedings of the Ninth International Conference on Machine Learning and Applications, 2010

Comparative Analysis of DNA Microarray Data through the Use of Feature Selection Techniques.
Proceedings of the Ninth International Conference on Machine Learning and Applications, 2010

Predicting Faults in High Assurance Software.
Proceedings of the 12th IEEE High Assurance Systems Engineering Symposium, 2010

A Comparative Study of Threshold-Based Feature Selection Techniques.
Proceedings of the 2010 IEEE International Conference on Granular Computing, 2010

An Evaluation of Sampling on Filter-Based Feature Selection Methods.
Proceedings of the Twenty-Third International Florida Artificial Intelligence Research Society Conference, 2010

An Empirical Study of Predictive Modeling Techniques of Software Quality.
Proceedings of the Bio-Inspired Models of Network, Information, and Computing Systems, 2010

2009
Count Models for Software Quality Estimation.
Proceedings of the Encyclopedia of Data Warehousing and Mining, Second Edition (4 Volumes), 2009

Improving Software-Quality Predictions With Data Sampling and Boosting.
IEEE Trans. Syst. Man Cybern. Part A, 2009

Empirical Case Studies in Attribute Noise Detection.
IEEE Trans. Syst. Man Cybern. Part C, 2009

Evolutionary Sampling and Software Quality Modeling of High-Assurance Systems.
IEEE Trans. Syst. Man Cybern. Part A, 2009

From Web Service Artifact to a Readable and Verifiable Model.
IEEE Trans. Serv. Comput., 2009

Software quality analysis by combining multiple projects and learners.
Softw. Qual. J., 2009

Identifying Learners Robust to Low Quality Data.
Informatica (Slovenia), 2009

Making an accurate classifier ensemble by voting on classifications from imputed learning sets.
Int. J. Inf. Decis. Sci., 2009

Hybrid sampling for imbalanced data.
Integr. Comput. Aided Eng., 2009

Knowledge discovery from imbalanced and noisy data.
Data Knowl. Eng., 2009

A Survey of Collaborative Filtering Techniques.
Adv. Artif. Intell., 2009

An Extendible Translation of BPEL to a Machine-verifiable Model.
Proceedings of the 21st International Conference on Software Engineering & Knowledge Engineering (SEKE'2009), 2009

Value-Based Software Quality Modeling.
Proceedings of the 21st International Conference on Software Engineering & Knowledge Engineering (SEKE'2009), 2009

A Novel Hybrid Search Algorithm for Feature Selection.
Proceedings of the 21st International Conference on Software Engineering & Knowledge Engineering (SEKE'2009), 2009

Aggregating Performance Metrics for Classifier Evaluation.
Proceedings of the IEEE International Conference on Information Reuse and Integration, 2009

An Empirical Comparison of Repetitive Undersampling Techniques.
Proceedings of the IEEE International Conference on Information Reuse and Integration, 2009

An Empirical Investigation of Filter Attribute Selection Techniques for Software Quality Classification.
Proceedings of the IEEE International Conference on Information Reuse and Integration, 2009

High-Dimensional Software Engineering Data and Feature Selection.
Proceedings of the ICTAI 2009, 2009

A Study on the Relationships of Classifier Performance Metrics.
Proceedings of the ICTAI 2009, 2009

Exploring Software Quality Classification with a Wrapper-Based Feature Ranking Technique.
Proceedings of the ICTAI 2009, 2009

An Empirical Study on Wrapper-Based Feature Ranking.
Proceedings of the ICTAI 2009, 2009

Feature Selection with Imbalanced Data for Software Defect Prediction.
Proceedings of the International Conference on Machine Learning and Applications, 2009

Wrapper-Based Feature Ranking for Software Engineering Metrics.
Proceedings of the International Conference on Machine Learning and Applications, 2009

Mining Data from Multiple Software Development Projects.
Proceedings of the ICDM Workshops 2009, 2009

Feature Selection with High-Dimensional Imbalanced Data.
Proceedings of the ICDM Workshops 2009, 2009

VipBoost: A More Accurate Boosting Algorithm.
Proceedings of the Twenty-Second International Florida Artificial Intelligence Research Society Conference, 2009

2008
Software Quality Modeling as a Reliability Tool.
Proceedings of the Wiley Encyclopedia of Computer Science and Engineering, 2008

Software Module Risk Analysis.
Proceedings of the Wiley Encyclopedia of Computer Science and Engineering, 2008

Imputation techniques for multivariate missingness in software measurement data.
Softw. Qual. J., 2008

A comprehensive empirical evaluation of missing value imputation in noisy software measurement data.
J. Syst. Softw., 2008

Collaborative Filtering for Multi-Class Data Using Bayesian Networks.
Int. J. Artif. Intell. Tools, 2008

Low-Effort Labeling of Network Events for Intrusion Detection in WLANs.
Int. J. Artif. Intell. Tools, 2008

Assuring Timeliness in an e-Science Service-Oriented Architecture.
Computer, 2008

Imputed Neighborhood Based Collaborative Filtering.
Proceedings of the 2008 IEEE / WIC / ACM International Conference on Web Intelligence, 2008

Toward Model Checking Web Services Over the Web.
Proceedings of the Twentieth International Conference on Software Engineering & Knowledge Engineering (SEKE'2008), 2008

On the Rarity of Fault-prone Modules in Knowledge-based Software Quality Modeling.
Proceedings of the Twentieth International Conference on Software Engineering & Knowledge Engineering (SEKE'2008), 2008

Analyzing the Impact of Attribute Noise on Software Quality Classification.
Proceedings of the Twentieth International Conference on Software Engineering & Knowledge Engineering (SEKE'2008), 2008

Imputation-boosted collaborative filtering using machine learning classifiers.
Proceedings of the 2008 ACM Symposium on Applied Computing (SAC), 2008

VCI predictors: Voting on classifications from imputed learning sets.
Proceedings of the IEEE International Conference on Information Reuse and Integration, 2008

Identifying learners robust to low quality data.
Proceedings of the IEEE International Conference on Information Reuse and Integration, 2008

Using Imputation Techniques to Help Learn Accurate Classifiers.
Proceedings of the 20th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2008), 2008

Addressing Class Imbalance in Non-binary Classification Problems.
Proceedings of the 20th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2008), 2008

Improving Learner Performance with Data Sampling and Boosting.
Proceedings of the 20th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2008), 2008

Resampling or Reweighting: A Comparison of Boosting Implementations.
Proceedings of the 20th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2008), 2008

VoB predictors: Voting on bagging classifications.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

RUSBoost: Improving classification performance when training data is skewed.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Comparison of Four Performance Metrics for Evaluating Sampling Techniques for Low Quality Class-Imbalanced Data.
Proceedings of the Seventh International Conference on Machine Learning and Applications, 2008

A Comparative Study of Data Sampling and Cost Sensitive Learning.
Proceedings of the Workshops Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

A Mixture Imputation-Boosted Collaborative Filter.
Proceedings of the Twenty-First International Florida Artificial Intelligence Research Society Conference, 2008

Contrast Pattern Mining with Gap Constraints for Peptide Folding Prediction.
Proceedings of the Twenty-First International Florida Artificial Intelligence Research Society Conference, 2008

Building Useful Models from Imbalanced Data with Sampling and Boosting.
Proceedings of the Twenty-First International Florida Artificial Intelligence Research Society Conference, 2008

Software quality modeling: The impact of class noise on the random forest classifier.
Proceedings of the IEEE Congress on Evolutionary Computation, 2008

2007
Software Quality Analysis of Unlabeled Program Modules With Semisupervised Clustering.
IEEE Trans. Syst. Man Cybern. Part A, 2007

A Multi-Objective Software Quality Classification Model Using Genetic Programming.
IEEE Trans. Reliab., 2007

Count Models for Software Quality Estimation.
IEEE Trans. Reliab., 2007

A Comprehensive Empirical Study of Count Models for Software Fault Prediction.
IEEE Trans. Reliab., 2007

Software quality estimation with limited fault data: a semi-supervised learning perspective.
Softw. Qual. J., 2007

Editorial: Special issue on mining low-quality data.
Knowl. Inf. Syst., 2007

The pairwise attribute noise detection algorithm.
Knowl. Inf. Syst., 2007

Improving Software Quality Prediction by Noise Filtering Techniques.
J. Comput. Sci. Technol., 2007

The multiple imputation quantitative noise corrector.
Intell. Data Anal., 2007

Hybrid Collaborative Filtering Algorithms Using a Mixture of Experts.
Proceedings of the 2007 IEEE / WIC / ACM International Conference on Web Intelligence, 2007

Learning from Software Quality Data with Class Imbalance and Noise.
Proceedings of the Nineteenth International Conference on Software Engineering & Knowledge Engineering (SEKE'2007), 2007

Rule-Based Multiple Object Tracking for Traffic Surveillance Using Collaborative Background Extraction.
Proceedings of the Advances in Visual Computing, Third International Symposium, 2007

A Progressive Edge-Based Stereo Correspondence Method.
Proceedings of the Advances in Visual Computing, Third International Symposium, 2007

An Empirical Study on Estimating Motions in Video Stabilization.
Proceedings of the IEEE International Conference on Information Reuse and Integration, 2007

Building a Novel GP-Based Software Quality Classifier Using Multiple Validation Datasets.
Proceedings of the IEEE International Conference on Information Reuse and Integration, 2007

An Empirical Study of the Noise Impact on Cost-Sensitive Learning.
Proceedings of the IJCAI 2007, 2007

Mining Data with Rare Events: A Case Study.
Proceedings of the 19th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2007), 2007

An Empirical Study of Learning from Imbalanced Data Using Random Forest.
Proceedings of the 19th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2007), 2007

Learning with limited minority class data.
Proceedings of the Sixth International Conference on Machine Learning and Applications, 2007

Using evolutionary sampling to mine imbalanced data.
Proceedings of the Sixth International Conference on Machine Learning and Applications, 2007

An application of a rule-based model in software quality classification.
Proceedings of the Sixth International Conference on Machine Learning and Applications, 2007

Experimental perspectives on learning from imbalanced data.
Proceedings of the Machine Learning, 2007

Arbitrarily-Shaped Window Based Stereo Matching using the Go-Light Optimization Algorithm.
Proceedings of the International Conference on Image Processing, 2007

Skewed Class Distributions and Mislabeled Examples.
Proceedings of the Workshops Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007

2006
Unsupervised multiscale color image segmentation based on MDL principle.
IEEE Trans. Image Process., 2006

An empirical study of predicting software faults with case-based reasoning.
Softw. Qual. J., 2006

Resource oriented selection of rule-based classification models: An empirical case study.
Softw. Qual. J., 2006

Detecting Noisy Instances with the Ensemble Filter: a Study in Software Quality Estimation.
Int. J. Softw. Eng. Knowl. Eng., 2006

Noise elimination with partitioning filter for software quality estimation.
Int. J. Comput. Appl. Technol., 2006

Indirect classification approaches: a comparative study in network intrusion detection.
Int. J. Comput. Appl. Technol., 2006

Determining noisy instances relative to attributes of interest.
Intell. Data Anal., 2006

Class noise detection using frequent itemsets.
Intell. Data Anal., 2006

Quality Problem in Software Measurement Data.
Adv. Comput., 2006

Polishing Noise in Continuous Software Measurement Data.
Proceedings of the Eighteenth International Conference on Software Engineering & Knowledge Engineering (SEKE'2006), 2006

Multiple Imputation of Software Measurement Data: A Case Study.
Proceedings of the Eighteenth International Conference on Software Engineering & Knowledge Engineering (SEKE'2006), 2006

Classification of ships in surveillance video.
Proceedings of the 2006 IEEE International Conference on Information Reuse and Integration, 2006

Developing an effective validation strategy for genetic programming models based on multiple datasets.
Proceedings of the 2006 IEEE International Conference on Information Reuse and Integration, 2006

Labeling network event records for intrusion detection in a Wireless LAN.
Proceedings of the 2006 IEEE International Conference on Information Reuse and Integration, 2006

Software quality imputation in the presence of noisy data.
Proceedings of the 2006 IEEE International Conference on Information Reuse and Integration, 2006

Noise correction using bayesian multiple imputation.
Proceedings of the 2006 IEEE International Conference on Information Reuse and Integration, 2006

Collaborative Filtering for Multi-class Data Using Belief Nets Algorithms.
Proceedings of the 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2006), 2006

A Hybrid Approach to Cleansing Software Measurement Data.
Proceedings of the 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2006), 2006

Assessment of a Multi-Strategy Classifier for an Embedded Software System.
Proceedings of the 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2006), 2006

A Comparison of Software Fault Imputation Procedures.
Proceedings of the Fifth International Conference on Machine Learning and Applications, 2006

2005
Resource-oriented software quality classification models.
J. Syst. Softw., 2005

Comparing software fault predictions of pure and zero-inflated Poisson regression models.
Int. J. Syst. Sci., 2005

Enhancing software quality estimation using ensemble-classifier based noise filtering.
Intell. Data Anal., 2005

Detecting noisy instances with the rule-based classification model.
Intell. Data Anal., 2005

Evaluating noise elimination techniques for software quality estimation.
Intell. Data Anal., 2005

Identifying noisy features with the Pairwise Attribute Noise Detection Algorithm.
Intell. Data Anal., 2005

Evaluating indirect and direct classification techniques for network intrusion detection.
Intell. Data Anal., 2005

Assessment of a New Three-Group Software Quality Classification Technique: An Empirical Case Study.
Empir. Softw. Eng., 2005

Application of fuzzy expert system in test case selection for system regression test.
Proceedings of the 2005 IEEE International Conference on Information Reuse and Integration, 2005

The partitioning- and rule-based filter for noise detection.
Proceedings of the 2005 IEEE International Conference on Information Reuse and Integration, 2005

Hierarchical indexing of ocean survey video by mean shift clustering and MDL principle.
Proceedings of the 2005 IEEE International Conference on Information Reuse and Integration, 2005

A Clustering Approach to Wireless Network Intrusion Detection.
Proceedings of the 17th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2005), 2005

Intrusion detection in wireless networks using clustering techniques with expert analysis.
Proceedings of the Fourth International Conference on Machine Learning and Applications, 2005

Identifying noise in an attribute of interest.
Proceedings of the Fourth International Conference on Machine Learning and Applications, 2005

Analyzing Software Quality with Limited Fault-Proneness Defect Data.
Proceedings of the Ninth IEEE International Symposium on High Assurance Systems Engineering (HASE 2005), 2005

2004
A multiobjective module-order model for software quality enhancement.
IEEE Trans. Evol. Comput., 2004

Identification of fuzzy models of software cost estimation.
Fuzzy Sets Syst., 2004

Analyzing Software Measurement Data with Clustering Techniques.
IEEE Intell. Syst., 2004

Comparative Assessment of Software Quality Classification Techniques: An Empirical Case Study.
Empir. Softw. Eng., 2004

Software quality estimation with case-based reasoning.
Adv. Comput., 2004

Multi-Objective Optimization by CBR GA-Optimizer for Module-Order Modeling.
Proceedings of the Sixteenth International Conference on Software Engineering & Knowledge Engineering (SEKE'2004), 2004

Noise Elimination with Ensemble-Classifier Filtering: A Case-Study in Software Quality Engineerin.
Proceedings of the Sixteenth International Conference on Software Engineering & Knowledge Engineering (SEKE'2004), 2004

The Necessity of Assuring Quality in Software Measurement Data.
Proceedings of the 10th IEEE International Software Metrics Symposium (METRICS 2004), 2004

Module-Order Modeling using an Evolutionary Multi-Objective Optimization Approach.
Proceedings of the 10th IEEE International Software Metrics Symposium (METRICS 2004), 2004

Rule-Based Noise Detection for Software Measurement Data.
Proceedings of the 2004 IEEE International Conference on Information Reuse and Integration, 2004

Generating Multiple Noise Elimination Filters with the Ensemble-Partitioning Filter.
Proceedings of the 2004 IEEE International Conference on Information Reuse and Integration, 2004

Noise Identification with the k-Means Algorithm.
Proceedings of the 16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2004), 2004

Semi-Supervised Learning for Software Quality Estimation.
Proceedings of the 16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2004), 2004

Efficient Image Segmentation by Mean Shift Clustering and MDL-Guided Region Merging.
Proceedings of the 16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2004), 2004

Unsupervised Learning for Expert-Based Software Quality Estimation.
Proceedings of the 8th IEEE International Symposium on High-Assurance Systems Engineering (HASE 2004), 2004

Reducing Overfitting in Genetic Programming Models for Software Quality Classification.
Proceedings of the 8th IEEE International Symposium on High-Assurance Systems Engineering (HASE 2004), 2004

Resource-Sensitive Intrusion Detection Models for Network Traffic.
Proceedings of the 8th IEEE International Symposium on High-Assurance Systems Engineering (HASE 2004), 2004

2003
Ordering Fault-Prone Software Modules.
Softw. Qual. J., 2003

Introduction to the Special Issue on Quality Engineering with Computational Intelligence.
Softw. Qual. J., 2003

Application of fuzzy expert systems in assessing operational risk of software.
Inf. Softw. Technol., 2003

Software Quality Classification Modeling Using the SPRINT Decision Tree Algorithm.
Int. J. Artif. Intell. Tools, 2003

Analogy-Based Practical Classification Rules for Software Quality Estimation.
Empir. Softw. Eng., 2003

Fault Prediction Modeling for Software Quality Estimation: Comparing Commonly Used Techniques.
Empir. Softw. Eng., 2003

Empirical Case Studies of Combining Software Quality Classification Models.
Proceedings of the 3rd International Conference on Quality Software (QSIC 2003), 2003

Application of an Attribute Selection Method to CBR-Based Software Quality Classification.
Proceedings of the 15th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2003), 2003

Genetic Programming-Based Decision Trees for Software Quality Classification.
Proceedings of the 15th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2003), 2003

Detecting Outliers Using Rule-Based Modeling for Improving CBR-Based Software Quality Classification Models.
Proceedings of the Case-Based Reasoning Research and Development, 2003

Building Decision Tree Software Quality Classification Models Using Genetic Programming.
Proceedings of the Genetic and Evolutionary Computation, 2003

2002
Using regression trees to classify fault-prone software modules.
IEEE Trans. Reliab., 2002

Predicting Fault-Prone Modules in Embedded Systems Using Analogy-Based Classification Models.
Int. J. Softw. Eng. Knowl. Eng., 2002

Uncertain Classification of Fault-Prone Software Modules.
Empir. Softw. Eng., 2002

Tree-Based Software Quality Estimation Models For Fault Prediction.
Proceedings of the 8th IEEE International Software Metrics Symposium (METRICS 2002), 2002

An Empirical Study of the Impact of Count Models Predictions on Module-Order Models.
Proceedings of the 8th IEEE International Software Metrics Symposium (METRICS 2002), 2002

Estimating Software Project Effort by Analogy Based on Linguistic Values.
Proceedings of the 8th IEEE International Software Metrics Symposium (METRICS 2002), 2002

Improving Usefulness of Software Quality Classification Models Based on Boolean Discriminant Functions.
Proceedings of the 13th International Symposium on Software Reliability Engineering (ISSRE 2002), 2002

Cost-Sensitive Boosting In Software Quality Modeling.
Proceedings of the 7th IEEE International Symposium on High-Assurance Systems Engineering (HASE 2002), 2002

Can neural networks be easily interpreted in software cost estimation?
Proceedings of the 2002 IEEE International Conference on Fuzzy Systems, 2002

2001
Data Mining of Software Development Databases.
Softw. Qual. J., 2001

Cost-Benefit Analysis of Software Quality Models.
Softw. Qual. J., 2001

Empirical Assessment of a Software Metric: The Information Content of Operators.
Softw. Qual. J., 2001

Controlling Overfitting in Classification-Tree Models of Software Quality.
Empir. Softw. Eng., 2001

Software Quality Prediction for High-Assurance Network Telecommunications Systems.
Comput. J., 2001

Controlling Overfitting in Software Quality Models: Experiments with Regression Trees and Classification.
Proceedings of the 7th IEEE International Software Metrics Symposium (METRICS 2001), 2001

Measuring Coupling and Cohesion of Software Modules: An Information-Theory Approach.
Proceedings of the 7th IEEE International Software Metrics Symposium (METRICS 2001), 2001

An Application of Zero-Inflated Poisson Regression for Software Fault Prediction .
Proceedings of the 12th International Symposium on Software Reliability Engineering (ISSRE 2001), 2001

Genetic Programming Model for Software Quality Classification.
Proceedings of the 6th IEEE International Symposium on High-Assurance Systems Engineering (HASE 2001), 2001

2000
Classification-tree models of software-quality over multiple releases.
IEEE Trans. Reliab., 2000

A practical classification-rule for software-quality models.
IEEE Trans. Reliab., 2000

Case-Based Software Quality Prediction.
Int. J. Softw. Eng. Knowl. Eng., 2000

Balancing Misclassification Rates in Classification-Tree Models of Software Quality.
Empir. Softw. Eng., 2000

Accuracy of software quality models over multiple releases.
Ann. Softw. Eng., 2000

Modeling Fault-Prone Modules of Subsystems.
Proceedings of the 11th International Symposium on Software Reliability Engineering (ISSRE 2000), 2000

Improving Tree-Based Models of Software Quality with Principal Components Analysis.
Proceedings of the 11th International Symposium on Software Reliability Engineering (ISSRE 2000), 2000

Modeling software quality: the Software Measurement Analysis and Reliability Toolkit.
Proceedings of the 12th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2000), 2000

Prediction of software faults using fuzzy nonlinear regression modeling.
Proceedings of the 5th IEEE International Symposium on High-Assurance Systems Engineering (HASE 2000), 2000

Using product, process, and execution metrics to predict fault-prone software modules with classification trees.
Proceedings of the 5th IEEE International Symposium on High-Assurance Systems Engineering (HASE 2000), 2000

1999
Which Software Modules have Faults which will be Discovered by Customers?
J. Softw. Maintenance Res. Pract., 1999

Using Classification Trees for Software Quality Models: Lessons Learned.
Int. J. Softw. Eng. Knowl. Eng., 1999

Data Mining for Predictors of Software Quality.
Int. J. Softw. Eng. Knowl. Eng., 1999

A Comparative Study of Ordering and Classification of Fault-Prone Software Modules.
Empir. Softw. Eng., 1999

Can Metrics and Models be Applied Across Multiple Releases or Projects?
Proceedings of the 6th IEEE International Software Metrics Symposium (METRICS 1999), 1999

Assessing Uncertain Predictions of Software Quality.
Proceedings of the 6th IEEE International Software Metrics Symposium (METRICS 1999), 1999

Measuring Coupling and Cohesion: An Information-Theory Approach.
Proceedings of the 6th IEEE International Software Metrics Symposium (METRICS 1999), 1999

Experience Paper: Preparing Measurements of Legacy Software for Predicting Operational Faults.
Proceedings of the 1999 International Conference on Software Maintenance, 1999

Predicting Fault-Prone Software Modules in Embedded Systems with Classification Trees.
Proceedings of the 4th IEEE International Symposium on High-Assurance Systems Engineering (HASE '99), 1999

Modelling software quality with GP.
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO 1999), 1999

Using Genetic Programming to Determine Software Quality.
Proceedings of the Twelfth International Florida Artificial Intelligence Research Society Conference, 1999

Application of a Usage Profile in Software Quality Models.
Proceedings of the 3rd European Conference on Software Maintenance and Reengineering (CSMR '99), 1999

1998
Classification of Fault-Prone Software Modules: Prior Probabilities, Costs, and Model Evaluation.
Empir. Softw. Eng., 1998

Using Process History to Predict Software Quality.
Computer, 1998

Predicting the order of fault-prone modules in legacy software.
Proceedings of the Ninth International Symposium on Software Reliability Engineering, 1998

Can a Software Quality Model Hit a Moving Target?
Proceedings of the 1998 International Conference on Software Maintenance, 1998

Hitting the Moving Target: Trials and Tribulations of Modeling Quality in Evolving Software Systems.
Proceedings of the 1998 International Conference on Software Maintenance, 1998

The Application of Fuzzy Enhanced Case-Based Reasoning for Identifying Fault-Prone Modules.
Proceedings of the 3rd IEEE International Symposium on High-Assurance Systems Engineering (HASE '98), 1998

1997
Application of neural networks to software quality modeling of a very large telecommunications system.
IEEE Trans. Neural Networks, 1997

An Information Theory-Based Approach to Quantifying the Contribution of a Software Metric.
J. Syst. Softw., 1997

The Impact of Costs of Misclassification on Software Quality Modeling.
Proceedings of the 4th IEEE International Software Metrics Symposium (METRICS 1997), 1997

Predicting fault-prone modules with case-based reasoning.
Proceedings of the Eighth International Symposium on Software Reliability Engineering, 1997

Evolutionary neural networks: a robust approach to software reliability problems.
Proceedings of the Eighth International Symposium on Software Reliability Engineering, 1997

Process Measures for Predicting Software Quality.
Proceedings of the 2nd High-Assurance Systems Engineering Workshop (HASE '97), 1997

1996
Using neural networks to predict software faults during testing.
IEEE Trans. Reliab., 1996

Analysis and differentiation of software system environments.
Softw. Qual. J., 1996

Early Quality Prediction: A Case Study in Telecommunications.
IEEE Softw., 1996

Emerald: Software Metrics and Models on the Desktop.
IEEE Softw., 1996

The impact of software evolution and reuse on software quality.
Empir. Softw. Eng., 1996

Detection of software modules with high debug code churn in a very large legacy system.
Proceedings of the Seventh International Symposium on Software Reliability Engineering, 1996

Integrating metrics and models for software risk assessment.
Proceedings of the Seventh International Symposium on Software Reliability Engineering, 1996

Using the genetic algorithm to build optimal neural networks for fault-prone module detection.
Proceedings of the Seventh International Symposium on Software Reliability Engineering, 1996

Detection of Fault-Prone Software Modules During a Spiral Life Cycle.
Proceedings of the 1996 International Conference on Software Maintenance (ICSM '96), 1996

A tree-based classification model for analysis of a military software system.
Proceedings of the 1st High-Assurance Systems Engineering Workshop (HASE '96), 1996

1995
Investigating ARIMA models of software system quality.
Softw. Qual. J., 1995

Performance Analysis of a Peer-to-Peer I/O Architecture in Video Server Environments.
Multim. Tools Appl., 1995

A Performance Analysis of an Object-Based I/O Architecture in a Video Server Environment.
Multim. Syst., 1995

A neural network approach for early detection of program modules having high risk in the maintenance phase.
J. Syst. Softw., 1995

Exploring the behaviour of neural network software quality models.
Softw. Eng. J., 1995

Application of Neural Networks for Predicting Program Faults.
Ann. Softw. Eng., 1995

An assessment of software quality in a C++ environment.
Proceedings of the Sixth International Symposium on Software Reliability Engineering, 1995

Detection of fault-prone program modules in a very large telecommunications system.
Proceedings of the Sixth International Symposium on Software Reliability Engineering, 1995

Detecting program modules with low testability.
Proceedings of the International Conference on Software Maintenance, 1995

Multivariate assessment of complex software systems: a comparative study.
Proceedings of the 1st IEEE International Conference on Engineering of Complex Computer Systems (ICECCS '95), 1995

1994
A Performance Analysis of Personal Computers in a Video Conferencing Environment.
Multim. Syst., 1994

Performance Analysis of Advanced I/O Architectures for PC-Based Video Servers.
Multim. Syst., 1994

Alternative approaches for the use of metrics to order programs by complexity.
J. Syst. Softw., 1994

A comparative study of pattern recognition techniques for quality evaluation of telecommunications software.
IEEE J. Sel. Areas Commun., 1994

A performance analysis of advanced I/O architectures for PC-based network file servers.
Distributed Syst. Eng., 1994

Modeling the Relationship Between Source Code Complexity and Maintenance Difficulty.
Computer, 1994

Software Metrics: Charting the Course - Guest Editors' Introduction.
Computer, 1994

Are the principal components of software complexity data stable across software products?
Proceedings of the 1994 IEEE 2nd International Software Metrics Symposium, 1994

On the impact of software product dissimilarity on software quality models.
Proceedings of the 5th International Symposium on Software Reliability Engineering, 1994

Canonical Modeling of Software Complexity and Fault Correction Activity.
Proceedings of the International Conference on Software Maintenance, 1994

Improving Code Churn Predictions During the System Test and Maintenance Phases.
Proceedings of the International Conference on Software Maintenance, 1994

1993
Measurement of data structure complexity.
J. Syst. Softw., 1993

A high-level performance analysis of the IBM subsystem control block (SCB) architecture.
Microprocess. Microprogramming, 1993

A Performance Analysis of the IBM Subsystem Control Block Architecture in a Video Conferencing Environment.
Proceedings of the First ACM International Conference on Multimedia '93, 1993

Dynamic system complexity.
Proceedings of the First International Software Metrics Symposium, 1993

A neural network modeling methodology for the detection of high-risk programs.
Proceedings of the Fourth International Symposium on Software Reliability Engineering, 1993

A Comparative Study of Predictive Models for Program Changes During System Testing and Maintenance.
Proceedings of the Conference on Software Maintenance, 1993

1992
The Detection of Fault-Prone Programs.
IEEE Trans. Software Eng., 1992

Predictive Modeling Techniques of Software Quality from Software Measures.
IEEE Trans. Software Eng., 1992

Measuring Dynamic Program Complexity.
IEEE Softw., 1992

A workload model for frame-based real-time applications on distributed systems.
J. Syst. Softw., 1992

A neural network approach for predicting software development faults.
Proceedings of the Third International Symposium on Software Reliability Engineering, 1992

Software measurement for the space shuttle HAL/S maintenance environment.
Proceedings of the Conference on Software Maintenance, 1992

1991
The use of software complexity metrics in software reliability modeling.
Proceedings of the Second International Symposium on Software Reliability Engineering, 1991

Software reliability model selection: a cast study.
Proceedings of the Second International Symposium on Software Reliability Engineering, 1991

1990
Applications of a relative complexity metric for software project management.
J. Syst. Softw., 1990

Predicting Software Development Errors Using Software Complexity Metrics.
IEEE J. Sel. Areas Commun., 1990

The lines of code metric as a predictor of program faults: a critical analysis.
Proceedings of the Fourteenth Annual International Computer Software and Applications Conference, 1990

1989
The Dimensionality of Program Complexity.
Proceedings of the 11th International Conference on Software Engineering, 1989


  Loading...