Hybrid dragonfly algorithm with neighbourhood component analysis and gradient tree boosting for crime rates modelling

dc.contributor.authorKhairuddin, Alif Ridzuan
dc.date.accessioned2023-08-21T02:21:59Z
dc.date.available2023-08-21T02:21:59Z
dc.date.issued2021
dc.descriptionThesis (PhD. (Computer Science))
dc.description.abstractIn crime studies, crime rates time series prediction helps in strategic crime prevention formulation and decision making. Statistical models are commonly applied in predicting time series crime rates. However, the time series crime rates data are limited and mostly nonlinear. One limitation in the statistical models is that they are mainly linear and are only able to model linear relationships. Thus, this study proposed a time series crime prediction model that can handle nonlinear components as well as limited historical crime rates data. Recently, Artificial Intelligence (AI) models have been favoured as they are able to handle nonlinear and robust to small sample data components in crime rates. Hence, the proposed crime model implemented an artificial intelligence model namely Gradient Tree Boosting (GTB) in modelling the crime rates. The crime rates are modelled using the United States (US) annual crime rates of eight crime types with nine factors that influence the crime rates. Since GTB has no feature selection, this study proposed hybridisation of Neighbourhood Component Analysis (NCA) and GTB (NCA-GTB) in identifying significant factors that influence the crime rates. Also, it was found that both NCA and GTB are sensitive to input parameter. Thus, DA2-NCA-eGTB model was proposed to improve the NCA-GTB model. The DA2-NCA-eGTB model hybridised a metaheuristic optimisation algorithm namely Dragonfly Algorithm (DA) with NCA-GTB model to optimise NCA and GTB parameters. In addition, DA2-NCA-eGTB model also improved the accuracy of the NCA-GTB model by using Least Absolute Deviation (LAD) as the GTB loss function. The experimental result showed that DA2-NCA-eGTB model outperformed existing AI models in all eight modelled crime types. This was proven by the smaller values of Mean Absolute Percentage Error (MAPE), which was between 2.9195 and 18.7471. As a conclusion, the study showed that DA2-NCA-eGTB model is statistically significant in representing all crime types and it is able to handle the nonlinear component in limited crime rate data well.
dc.description.sponsorshipFaculty of Engineering - School of Computing
dc.identifier.citationNA
dc.identifier.issnNA
dc.identifier.urihttp://openscience.utm.my/handle/123456789/617
dc.language.isoen
dc.publisherUniversiti Teknologi Malaysia
dc.relation.ispartofseriesNA; NA
dc.subjectartificial intelligence
dc.titleHybrid dragonfly algorithm with neighbourhood component analysis and gradient tree boosting for crime rates modelling
dc.typeThesis
dc.typeDataset
Files
Original bundle
Now showing 1 - 5 of 5
Loading...
Thumbnail Image
Name:
AlifRidzuanKhairuddinPSC2021_A.pdf
Size:
47.99 KB
Format:
Adobe Portable Document Format
Description:
Annual Crime Rate Data for Each Crime Type
Loading...
Thumbnail Image
Name:
AlifRidzuanKhairuddinPSC2021_B.pdf
Size:
597.18 KB
Format:
Adobe Portable Document Format
Description:
Annual Data for Each Factor
Loading...
Thumbnail Image
Name:
AlifRidzuanKhairuddinPSC2021_D.pdf
Size:
1.48 MB
Format:
Adobe Portable Document Format
Description:
Scatter Diagram for Each Factor by Crime Rates
Loading...
Thumbnail Image
Name:
AlifRidzuanKhairuddinPSC2021_E.pdf
Size:
40.4 KB
Format:
Adobe Portable Document Format
Description:
Comparison of NCA-GTB and DA-NCA-GTB Model Rank of Factors by Crime Type
Loading...
Thumbnail Image
Name:
AliAhaniPSC2019_C.pdf
Size:
318.94 KB
Format:
Adobe Portable Document Format
Description:
Indicator reliability assessment after item deletion
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: