Enhanced k-nearest neighbours classification performance based on segmentation and imputation of missing data

dc.contributor.authorSaeed, Soobia
dc.date.accessioned2024-02-25T00:10:07Z
dc.date.available2024-02-25T00:10:07Z
dc.date.issued2022
dc.descriptionThesis (PhD. (Computer Science))
dc.description.abstractDiagnosing data or object classification for magnetic resonance images is important in image segmentation especially data which is less effective to be identified namely low-grade tumors or cerebrospinal fluid (CSF).The aim of this thesis is to address the aforementioned problems associated with missing data in MRI images and noisy of MRI images that required more processing times. This thesis focus on segmentation of brain tumor and CSF classification of fourdimensional MRI images. Three datasets called Light Field Database (LFD) with improved accuracy of images and increased resolution have been created. A hybrid k-nearest neighbours (k-NN) framework with time complexity that consists of three techniques namely GrabCut support vector machine (GCSVM) and scale invariant feature transform (SIFT), hidden Markov model of k-mean clustering (HMkC) and k-NN, and correlation matrices of discrete Fourier transform (CM-DFT) have been proposed. Firstly, GCSVM and SIFT technique is a combination of three methods namely the GrabCut, Support Vector Machine and Scale Invariant Feature Transform. This result of the technique is 99.9% for SVM accuracy, 4606 for GrabCut segmentation of Maximum Flow, 50625 and 50168 for Nodes of Image Pixel and edges respectively, and 2.29 seconds for computational time. For SIFT by using LFD dataset, the performance of distance value in the segmentation is 1.464, 1.215 and 1.23 for dataset-I, dataset-II, dataset-III respectively. Meanwhile, computational time for dataset-I, dataset-II and dataset-III is 1.47 seconds, 1.88 seconds, and 1.35 seconds respectively. Secondly, HMkC and k-NN resolves the classification problem using the Iterated Condition Mode (ICMM) with k-mean clustering algorithm and k-NN algorithm. The classification result of the technique for the accuracy, sensitivity, specificity and computational time is 99.83%, 99.99%, 99.8%, and 14.9 seconds respectively. Thirdly, CM-DFT technique resolves the missing data imputation problem by using cross correlation of lagged hybrid k-NN with DFT (Hk-NN-DFT) to enhance the MRI images. The technique generates the not a non-missing values in terms of multiplication of 1100-3000 and 99.84% for the accuracy of missing data in the image. The missing ratio result of imputed missing data in the images after retrieving the missing ratio of dataset-I, II, and III is 0.9815 with the 1.533 second of computational time. These three techniques are useful to improve the proposed hybrid k-NN framework to ensure that the classification of brain tumor (low grade tumors) and CSF in images is conducted easily.
dc.description.sponsorshipFaculty of Engineering - School of Computing
dc.identifier.urihttp://openscience.utm.my/handle/123456789/1018
dc.language.isoen
dc.publisherUniversiti Teknologi Malaysia
dc.subjectMagnetic resonance imaging—Diagnostic use
dc.subjectDiagnosis—Technological innovations
dc.subjectDiagnostic imaging—Research
dc.titleEnhanced k-nearest neighbours classification performance based on segmentation and imputation of missing data
dc.typeThesis
dc.typeDataset
Files
Original bundle
Now showing 1 - 4 of 4
Loading...
Thumbnail Image
Name:
SoobiaSaeedPSC2022_A.pdf
Size:
227.56 KB
Format:
Adobe Portable Document Format
Description:
COMPARATIVE ANALYSIS OF METHODS DISCUSSED
Loading...
Thumbnail Image
Name:
SoobiaSaeedPSC2022_B.pdf
Size:
119.61 KB
Format:
Adobe Portable Document Format
Description:
COMPARISON OF HYBRID MODEL AND MISSING DATA IMPUTATION
Loading...
Thumbnail Image
Name:
SoobiaSaeedPSC2022_C.pdf
Size:
129.18 KB
Format:
Adobe Portable Document Format
Description:
COMPARISON OF Hybrid SVM and Graph Cut Algorithm
Loading...
Thumbnail Image
Name:
SoobiaSaeedPSC2022_E.pdf
Size:
104.54 KB
Format:
Adobe Portable Document Format
Description:
STATISTICAL RESULTS OF OLD AND NEW PROPOSED LFD DATASETS
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: