Interpretation of nonlinear relationships between process variables by use of random forests

Better understanding of process phenomena is dependent on the interpretation of models capturing the relationships between the process variables. Although linear regression is used routinely in the mineral process industries for this purpose, it may not be useful where the relationships between vari...

Full description

Bibliographic Details
Main Authors: Auret, L., Aldrich, Chris
Format: Journal Article
Published: Elsevier 2012
Subjects:
Online Access:http://hdl.handle.net/20.500.11937/14782
_version_ 1848748715017240576
author Auret, L.
Aldrich, Chris
author_facet Auret, L.
Aldrich, Chris
author_sort Auret, L.
building Curtin Institutional Repository
collection Online Access
description Better understanding of process phenomena is dependent on the interpretation of models capturing the relationships between the process variables. Although linear regression is used routinely in the mineral process industries for this purpose, it may not be useful where the relationships between variables are nonlinear or complex. Under these circumstances, nonlinear methods, such as neural networks or decision trees can be used to develop reliable models, without necessarily giving any particular or explicit insight into the relationships between the process and the target variables. This is a major drawback in situations where such information would be very important, such as in fault identification or gaining a better understanding of the fundamentals of a process. In this paper, the use of variable importance measures and partial dependency plots generated by random forest models are proposed as a practical tool that can be used to surmount this problem. In particular, it is shown that important variables can be flagged by appropriate threshold generated by inclusion of dummy variables in the system. Moreover, the results of the study indicate that random forest models can reliably identify the influence of individual variables, even in the presence of high levels of additive noise. This would make it a useful tool in continuous process improvement and root cause analysis of abnormal process behaviour.
first_indexed 2025-11-14T07:09:26Z
format Journal Article
id curtin-20.500.11937-14782
institution Curtin University Malaysia
institution_category Local University
last_indexed 2025-11-14T07:09:26Z
publishDate 2012
publisher Elsevier
recordtype eprints
repository_type Digital Repository
spelling curtin-20.500.11937-147822017-09-13T16:08:12Z Interpretation of nonlinear relationships between process variables by use of random forests Auret, L. Aldrich, Chris Comminution Pyrometallurgy Modelling Better understanding of process phenomena is dependent on the interpretation of models capturing the relationships between the process variables. Although linear regression is used routinely in the mineral process industries for this purpose, it may not be useful where the relationships between variables are nonlinear or complex. Under these circumstances, nonlinear methods, such as neural networks or decision trees can be used to develop reliable models, without necessarily giving any particular or explicit insight into the relationships between the process and the target variables. This is a major drawback in situations where such information would be very important, such as in fault identification or gaining a better understanding of the fundamentals of a process. In this paper, the use of variable importance measures and partial dependency plots generated by random forest models are proposed as a practical tool that can be used to surmount this problem. In particular, it is shown that important variables can be flagged by appropriate threshold generated by inclusion of dummy variables in the system. Moreover, the results of the study indicate that random forest models can reliably identify the influence of individual variables, even in the presence of high levels of additive noise. This would make it a useful tool in continuous process improvement and root cause analysis of abnormal process behaviour. 2012 Journal Article http://hdl.handle.net/20.500.11937/14782 10.1016/j.mineng.2012.05.008 Elsevier restricted
spellingShingle Comminution
Pyrometallurgy
Modelling
Auret, L.
Aldrich, Chris
Interpretation of nonlinear relationships between process variables by use of random forests
title Interpretation of nonlinear relationships between process variables by use of random forests
title_full Interpretation of nonlinear relationships between process variables by use of random forests
title_fullStr Interpretation of nonlinear relationships between process variables by use of random forests
title_full_unstemmed Interpretation of nonlinear relationships between process variables by use of random forests
title_short Interpretation of nonlinear relationships between process variables by use of random forests
title_sort interpretation of nonlinear relationships between process variables by use of random forests
topic Comminution
Pyrometallurgy
Modelling
url http://hdl.handle.net/20.500.11937/14782