
dc.contributor.author: Haasbroek, Daniël G.
dc.contributor.author: Davel, Marelie H.
dc.date.accessioned: 2021-02-26T14:01:08Z
dc.date.available: 2021-02-26T14:01:08Z
dc.date.issued: 2020
dc.identifier.isbn: 978-0-620-89373-2
dc.identifier.uri: http://hdl.handle.net/10394/36796
dc.description.abstract: Each node in a neural network is trained to activate for a specific region in the input domain. Any training samples that fall within this region are therefore implicitly clustered together. Recent work has highlighted the importance of these clusters during the training process but has not yet investigated their evolution during training. Towards this goal, we train several ReLU-activated MLPs on a simple classification task (MNIST) and show that a consistent training process emerges: (1) sample clusters initially increase in size and then decrease as training progresses, (2) the size of sample clusters in the first layer decreases more rapidly than in deeper layers, (3) binary node activations, especially of nodes in deeper layers, become more sensitive to class membership as training progresses, (4) individual nodes remain poor predictors of class membership, even if accurate when applied as a group. We report on the detail of these findings and interpret them from the perspective of a high-dimensional clustering process. [en_US]
dc.language.iso: en [en_US]
dc.publisher: Southern African Conference for Artificial Intelligence Research [en_US]
dc.subject: Neural networks [en_US]
dc.subject: Generalization [en_US]
dc.subject: Clustering [en_US]
dc.title: Exploring neural network training dynamics through binary node activations [en_US]
dc.type: Article [en_US]
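
The abstract above refers to binary node activations and per-node sample clusters. As a concrete illustration, the following is a minimal sketch (in PyTorch, which the record does not specify) of how such quantities could be extracted from a ReLU MLP; the layer sizes, the random stand-in data, and the single-node majority-class probe are illustrative assumptions, not the authors' exact protocol.

# Minimal sketch (not the authors' code): extracting binary node activations
# from a ReLU MLP. In the sense used in the abstract, a node's "sample
# cluster" is the set of samples for which that node is active (output > 0).
import torch
import torch.nn as nn

torch.manual_seed(0)

# Illustrative stand-in for the MNIST MLPs described in the abstract.
mlp = nn.Sequential(
    nn.Linear(784, 128), nn.ReLU(),
    nn.Linear(128, 64), nn.ReLU(),
    nn.Linear(64, 10),
)

def binary_activations(model, x):
    """Return one boolean matrix per hidden layer:
    entry [i, j] is True iff node j is active for sample i."""
    patterns = []
    h = x
    for layer in model:
        h = layer(h)
        if isinstance(layer, nn.ReLU):
            patterns.append(h > 0)
    return patterns

# Random batch standing in for MNIST samples (784 = 28 * 28 pixels).
x = torch.randn(256, 784)
y = torch.randint(0, 10, (256,))  # stand-in class labels

with torch.no_grad():
    patterns = binary_activations(mlp, x)

for depth, p in enumerate(patterns, start=1):
    # Cluster size per node: how many samples each node is active for.
    cluster_sizes = p.sum(dim=0)
    print(f"layer {depth}: mean cluster size "
          f"{cluster_sizes.float().mean():.1f} of {len(x)} samples")

# A crude per-node class-sensitivity probe: treat one node's binary
# activation as a one-feature classifier and check the majority-class
# fraction in each activation state (illustrative only).
node = patterns[-1][:, 0]  # deepest hidden layer, node 0
for state in (True, False):
    labels = y[node == state]
    if len(labels):
        top = labels.mode().values.item()
        frac = (labels == top).float().mean().item()
        print(f"node 0, state {int(state)}: majority class {top} ({frac:.2f})")

Applied to checkpoints saved across training, the same extraction would let one trace how cluster sizes and per-node class sensitivity evolve, which is the kind of measurement the abstract describes.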

