Reliability of predictions using hybrid Algorithms and its application to dimensionality reduction

BBOSA, FRANCIS FULLER

dc.contributor.author	BBOSA, FRANCIS FULLER
dc.date.accessioned	2022-09-22T08:07:53Z
dc.date.available	2022-09-22T08:07:53Z
dc.date.issued	2022-07-25
dc.identifier.uri	http://hdl.handle.net/10570/10810
dc.description	PhD thesis	en_US
dc.description.abstract	The reliability of predictions emanating from independent data mining techniques is a complex problem. This could be attributed to cross-cutting weaknesses of individual techniques such as collinearity due to high dimensionality of attributes in a dataset, biasedness due to underfitting and overfitting of data, noise accumulation due to outliers as well as failure to take into consideration class imbalance in imbalanced data and thus affecting the reliability of predictions emanating from these models. This study aimed at addressing this drawback by developing a hybrid data mining algorithm for predicting reliable classes. The decision tree and naïve Bayes classifiers were used to build a hybrid prediction algorithm. The decision tree was employed for important attribute extraction based on the C4.5 algorithm and its gain ratio values were used as input weights to construct a weighted naïve Bayesian classifier. The goodness of fit for all the data mining models was done using k-fold cross-validation based on a confusion matrix on previously untrained imbalanced data. Accuracy, F-measure and the Area under the Receiver Operating Characteristics curve (AUC) were the key performance metrics used to evaluate the generalizability of the hybrid model in comparison to the independent models. The results revealed that the proposed hybrid model outperformed the independent decision tree and naïve Bayes classifiers on all demonstration datasets respectively. Hence merging several independent homogeneous predictive data mining techniques may enhance the accuracy of the estimates leading to reliable predictions.	en_US
dc.language.iso	en	en_US
dc.subject	Data mining	en_US
dc.subject	Dimensionality	en_US
dc.subject	Algorithm	en_US
dc.title	Reliability of predictions using hybrid Algorithms and its application to dimensionality reduction	en_US
dc.type	Thesis	en_US

Files in this item

Name:: PhD Dissertation_Francis Bbosa.pdf
Size:: 2.573Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

School of Computing and Informatics Technology (CIT) Collection

Show simple item record