dc.contributor.author | Montshiwa, Tlhalitshi Volition | |
dc.contributor.author | Pulenyane, Malebogo | |
dc.date.accessioned | 2022-06-15T08:10:18Z | |
dc.date.available | 2022-06-15T08:10:18Z | |
dc.date.issued | 2020 | |
dc.identifier.citation | Montshiwa, T.V. and Pulenyane, M. 2020. A regression model for predicting the likelihood of reporting a crime based on the victim’s demographic variables and their perceptions towards the police. Statistics, Politics and Policy. 11(2) [https://doi.org/10.1515/spp-2020-0003] | en_US |
dc.identifier.issn | 2194-6299 | |
dc.identifier.issn | 2151-7509 (Online) | |
dc.identifier.uri | http://hdl.handle.net/10394/39234 | |
dc.identifier.uri | https://doi.org/10.1515/spp-2020-0003 | |
dc.description.abstract | Despite the growing criminal activities in South Africa,many victims still
do not report the crimes, therefore there was a need to understand the determinants
of the likelihood of reporting a crime in the country. Binary logistic regression is a
supervisedmachine learning algorithmthat can assist in predicting the likelihood of
reporting a crime but the selection of relevant variables to add in the model varies
from one author to the other. Selection of theoretically sound and statistically
relevant independent variables is key to achieving parsimonious multivariate
models. This study sought to test the efficiency of some commonly used variable
selection methods for logistic regression models in order to identify the most relevant
determinants of the likelihood of reporting a crime of housebreaking. The study
used 17 candidate variables such as the victims’ demographic variables and their
perceptions on the police. The multivariate model fitted using stepwise selection
was found to be a best fit for the data based on the lowest AIC, the highest classification
accuracy rate and the highest Area under the Receiver Operating Characteristic
curve. Themodel fitted using theHosmer-Lemeshow(H-L) algorithmwas the
worst fit for the data. The study revealed a limitation of the stepwise selection
method which is that this method may select different independent variables for
each unique set of randomly selected observations of the same dataset. The study
established a multivariate logistic regression model to predict the likelihood of a
victim reporting a crime of housebreaking and the determinants thereof. | en_US |
dc.language.iso | en | en_US |
dc.publisher | De Gruyter | en_US |
dc.title | A regression model for predicting the likelihood of reporting a crime based on the victim’s demographic variables and their perceptions towards the police | en_US |
dc.type | Article | en_US |
dc.contributor.researchID | 22297812 - Montshiwa, Volition Tlhalitshi | |
dc.contributor.researchID | 22388729 - Pulenyane, Malebogo | |