Analysis of Hidden Representations by Greedy Clustering

Rudy Setiono; Huan Liu

doi:10.1080/095400998116567

Analysis of Hidden Representations by Greedy Clustering

Rudy Setiono, Huan Liu

Research output: Contribution to journal › Article › peer-review

5 Scopus citations

Abstract

The hidden layer of backpropagation neural networks (NNs) holds the key to the networks' success in solving pattern classification problems. The units in the hidden layer encapsulate the network's internal representations of the outside world described by the input data. In this paper, the hidden representations of trained networks are investigated by means of a simple greedy clustering algorithm. This clustering algorithm is applied to networks that have been trained to solve well-known problems: the monks problems, the 5-bit parity problem and the contiguity problem. The results from applying the algorithm to problems with known concepts provide us with a better understanding of NN learning. These results also explain why NNs achieve higher predictive accuracy than that of decision-tree methods. The results of this study can be readily applied to rule extraction from NNs. Production rules are extracted for the parity and the monks problems, as well as for a benchmark data set: Pima Indian diabetes diagnosis. The extracted rules from the Pima Indian diabetes data set compare favorably with rules extracted from ARTMAP NNs in terms of predictive accuracy and simplicity.

Original language	English (US)
Pages (from-to)	21-42
Number of pages	22
Journal	Connection Science
Volume	10
Issue number	1
DOIs	https://doi.org/10.1080/095400998116567
State	Published - Mar 1998
Externally published	Yes

Keywords

Backpropagation neural network
Clustering
Hidden representation
Pruning
Rule extraction

ASJC Scopus subject areas

Software
Human-Computer Interaction
Artificial Intelligence

Access to Document

10.1080/095400998116567

Cite this

@article{1e16ec6ca5c947b88ba243b0c69256e6,

title = "Analysis of Hidden Representations by Greedy Clustering",

abstract = "The hidden layer of backpropagation neural networks (NNs) holds the key to the networks' success in solving pattern classification problems. The units in the hidden layer encapsulate the network's internal representations of the outside world described by the input data. In this paper, the hidden representations of trained networks are investigated by means of a simple greedy clustering algorithm. This clustering algorithm is applied to networks that have been trained to solve well-known problems: the monks problems, the 5-bit parity problem and the contiguity problem. The results from applying the algorithm to problems with known concepts provide us with a better understanding of NN learning. These results also explain why NNs achieve higher predictive accuracy than that of decision-tree methods. The results of this study can be readily applied to rule extraction from NNs. Production rules are extracted for the parity and the monks problems, as well as for a benchmark data set: Pima Indian diabetes diagnosis. The extracted rules from the Pima Indian diabetes data set compare favorably with rules extracted from ARTMAP NNs in terms of predictive accuracy and simplicity.",

keywords = "Backpropagation neural network, Clustering, Hidden representation, Pruning, Rule extraction",

author = "Rudy Setiono and Huan Liu",

year = "1998",

month = mar,

doi = "10.1080/095400998116567",

language = "English (US)",

volume = "10",

pages = "21--42",

journal = "Connection Science",

issn = "0954-0091",

publisher = "Taylor and Francis AS",

number = "1",

}

TY - JOUR

T1 - Analysis of Hidden Representations by Greedy Clustering

AU - Setiono, Rudy

AU - Liu, Huan

PY - 1998/3

Y1 - 1998/3

N2 - The hidden layer of backpropagation neural networks (NNs) holds the key to the networks' success in solving pattern classification problems. The units in the hidden layer encapsulate the network's internal representations of the outside world described by the input data. In this paper, the hidden representations of trained networks are investigated by means of a simple greedy clustering algorithm. This clustering algorithm is applied to networks that have been trained to solve well-known problems: the monks problems, the 5-bit parity problem and the contiguity problem. The results from applying the algorithm to problems with known concepts provide us with a better understanding of NN learning. These results also explain why NNs achieve higher predictive accuracy than that of decision-tree methods. The results of this study can be readily applied to rule extraction from NNs. Production rules are extracted for the parity and the monks problems, as well as for a benchmark data set: Pima Indian diabetes diagnosis. The extracted rules from the Pima Indian diabetes data set compare favorably with rules extracted from ARTMAP NNs in terms of predictive accuracy and simplicity.

AB - The hidden layer of backpropagation neural networks (NNs) holds the key to the networks' success in solving pattern classification problems. The units in the hidden layer encapsulate the network's internal representations of the outside world described by the input data. In this paper, the hidden representations of trained networks are investigated by means of a simple greedy clustering algorithm. This clustering algorithm is applied to networks that have been trained to solve well-known problems: the monks problems, the 5-bit parity problem and the contiguity problem. The results from applying the algorithm to problems with known concepts provide us with a better understanding of NN learning. These results also explain why NNs achieve higher predictive accuracy than that of decision-tree methods. The results of this study can be readily applied to rule extraction from NNs. Production rules are extracted for the parity and the monks problems, as well as for a benchmark data set: Pima Indian diabetes diagnosis. The extracted rules from the Pima Indian diabetes data set compare favorably with rules extracted from ARTMAP NNs in terms of predictive accuracy and simplicity.

KW - Backpropagation neural network

KW - Clustering

KW - Hidden representation

KW - Pruning

KW - Rule extraction

UR - http://www.scopus.com/inward/record.url?scp=0032399390&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0032399390&partnerID=8YFLogxK

U2 - 10.1080/095400998116567

DO - 10.1080/095400998116567

M3 - Article

AN - SCOPUS:0032399390

SN - 0954-0091

VL - 10

SP - 21

EP - 42

JO - Connection Science

JF - Connection Science

IS - 1

ER -

Analysis of Hidden Representations by Greedy Clustering

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this