Comparing generalized additive neural networks with multilayer perceptrons
Goosen, Johannes Christiaan
MetadataShow full item record
In this dissertation, generalized additive neural networks (GANNs) and multilayer perceptrons (MLPs) are studied and compared as prediction techniques. MLPs are the most widely used type of artificial neural network (ANN), but are considered black boxes with regard to interpretability. There is currently no simple a priori method to determine the number of hidden neurons in each of the hidden layers of ANNs. Guidelines exist that are either heuristic or based on simulations that are derived from limited experiments. A modified version of the neural network construction with cross–validation samples (N2C2S) algorithm is therefore implemented and utilized to construct good MLP models. This algorithm enables the comparison with GANN models. GANNs are a relatively new type of ANN, based on the generalized additive model. The architecture of a GANN is less complex compared to MLPs and results can be interpreted with a graphical method, called the partial residual plot. A GANN consists of an input layer where each of the input nodes has its own MLP with one hidden layer. Originally, GANNs were constructed by interpreting partial residual plots. This method is time consuming and subjective, which may lead to the creation of suboptimal models. Consequently, an automated construction algorithm for GANNs was created and implemented in the SAS R statistical language. This system was called AutoGANN and is used to create good GANN models. A number of experiments are conducted on five publicly available data sets to gain insight into the similarities and differences between GANN and MLP models. The data sets include regression and classification tasks. In–sample model selection with the SBC model selection criterion and out–of–sample model selection with the average validation error as model selection criterion are performed. The models created are compared in terms of predictive accuracy, model complexity, comprehensibility, ease of construction and utility. The results show that the choice of model is highly dependent on the problem, as no single model always outperforms the other in terms of predictive accuracy. GANNs may be suggested for problems where interpretability of the results is important. The time taken to construct good MLP models by the modified N2C2S algorithm may be shorter than the time to build good GANN models by the automated construction algorithm
- ETD@PUK 
Showing items related by title, author, creator and subject.
Structural vascular disease in Africans: performance of ethnic-specific waist circumference cut points using logistic regression and neural network analyses: the SABPA study Botha, J.; De Ridder, J.H.; Potgieter, J.C.; Steyn, H.S.; Malan, L. (Thieme, 2013)A recently proposed model for waist circumference cut points (RPWC), driven by increased blood pressure, was demonstrated in an African population. We therefore aimed to validate the RPWC by comparing the RPWC and the Joint ...
Du Toit, Tiny; Kruger, Hennie (Tadbir Operational Research Group, 2014)Traditionally spam has been considered as an inconvenience requiring workers to sift through and delete large numbers of e-mail messages per day. However, new developments and the Internet have dramatically transformed the ...
Schabort, Cornelis Johannes; Neomagus, Hendrik Willem Johannes P. (Gulf Publishing Company, 2013)The Natref Oil Refinery in Sasolburg, South Africa, experienced refractory problems in the combustion chamber of the Sulphur Recovery Unit (SRU) since start-up (Figure 1). Even though the SRU was at all times operated ...