Applying machine learning to criminology: semi-parametric spatial-demographic Bayesian regression
Abstract Objectives This paper describes the use of machine learning techniques to implement a Bayesian approach to modelling the dependency between offence data and environmental factors such as demographic characteristics and spatial location. The main goal of this paper is to provide a fully prob...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Springer
2018-06-01
|
Series: | Security Informatics |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s13388-018-0030-x |
id |
doaj-art-3de768d8e53949e6a19c9d94b4b51250 |
---|---|
recordtype |
oai_dc |
spelling |
doaj-art-3de768d8e53949e6a19c9d94b4b512502018-08-15T22:56:41ZengSpringerSecurity Informatics2190-85322018-06-017111910.1186/s13388-018-0030-xApplying machine learning to criminology: semi-parametric spatial-demographic Bayesian regressionRoman Marchant0Sebastian Haan1Garner Clancey2Sally Cripps3Centre for Translational Data Science, The University of SydneySydney Informatics Hub, The University of SydneySydney Institute of Criminology, The University of SydneyCentre for Translational Data Science, The University of SydneyAbstract Objectives This paper describes the use of machine learning techniques to implement a Bayesian approach to modelling the dependency between offence data and environmental factors such as demographic characteristics and spatial location. The main goal of this paper is to provide a fully probabilistic approach to modelling crime which reflects all uncertainties in the prediction of offences as well as the uncertainties surrounding model parameters. Methods The proposed method is based on a Bayesian framework, with a Gaussian Process prior and MCMC, allowing uncertainties in prediction and inference to be quantified via the posterior distributions of interest. By using Bayesian updating, these predictions and inferences are dynamic in the sense that they change as new information becomes available. Results We applied the proposed methodology to particular offence data, such as domestic violence-related assaults, burglary and motor vehicle theft, in the state of New South Wales (NSW), Australia. Our results demonstrate the strength of the technique by validating the factors that are associated with high and low criminal activity, including bounds on the degree of the relation. Conclusions We argue that this fully probabilistic approach will improve prediction, in the sense that the uncertainties are more accurately quantified, with attendant benefits to policymakers and policing organisations seeking to deploy limited criminal justice resources to prevent and control crime. While limitations and areas for potential improvement are identified, the success of the Bayesian approach, implemented using machine learning techniques, in a criminological context represents an exciting development.http://link.springer.com/article/10.1186/s13388-018-0030-xSemi-parametric regressionCrime ratesMachine learningBayesian methodsGaussian process |
institution |
Open Data Bank |
collection |
Open Access Journals |
building |
Directory of Open Access Journals |
language |
English |
format |
Article |
author |
Roman Marchant Sebastian Haan Garner Clancey Sally Cripps |
spellingShingle |
Roman Marchant Sebastian Haan Garner Clancey Sally Cripps Applying machine learning to criminology: semi-parametric spatial-demographic Bayesian regression Security Informatics Semi-parametric regression Crime rates Machine learning Bayesian methods Gaussian process |
author_facet |
Roman Marchant Sebastian Haan Garner Clancey Sally Cripps |
author_sort |
Roman Marchant |
title |
Applying machine learning to criminology: semi-parametric spatial-demographic Bayesian regression |
title_short |
Applying machine learning to criminology: semi-parametric spatial-demographic Bayesian regression |
title_full |
Applying machine learning to criminology: semi-parametric spatial-demographic Bayesian regression |
title_fullStr |
Applying machine learning to criminology: semi-parametric spatial-demographic Bayesian regression |
title_full_unstemmed |
Applying machine learning to criminology: semi-parametric spatial-demographic Bayesian regression |
title_sort |
applying machine learning to criminology: semi-parametric spatial-demographic bayesian regression |
publisher |
Springer |
series |
Security Informatics |
issn |
2190-8532 |
publishDate |
2018-06-01 |
description |
Abstract Objectives This paper describes the use of machine learning techniques to implement a Bayesian approach to modelling the dependency between offence data and environmental factors such as demographic characteristics and spatial location. The main goal of this paper is to provide a fully probabilistic approach to modelling crime which reflects all uncertainties in the prediction of offences as well as the uncertainties surrounding model parameters. Methods The proposed method is based on a Bayesian framework, with a Gaussian Process prior and MCMC, allowing uncertainties in prediction and inference to be quantified via the posterior distributions of interest. By using Bayesian updating, these predictions and inferences are dynamic in the sense that they change as new information becomes available. Results We applied the proposed methodology to particular offence data, such as domestic violence-related assaults, burglary and motor vehicle theft, in the state of New South Wales (NSW), Australia. Our results demonstrate the strength of the technique by validating the factors that are associated with high and low criminal activity, including bounds on the degree of the relation. Conclusions We argue that this fully probabilistic approach will improve prediction, in the sense that the uncertainties are more accurately quantified, with attendant benefits to policymakers and policing organisations seeking to deploy limited criminal justice resources to prevent and control crime. While limitations and areas for potential improvement are identified, the success of the Bayesian approach, implemented using machine learning techniques, in a criminological context represents an exciting development. |
topic |
Semi-parametric regression Crime rates Machine learning Bayesian methods Gaussian process |
url |
http://link.springer.com/article/10.1186/s13388-018-0030-x |
_version_ |
1612702222875885568 |