Application of a Genetic Algorithm to Variable Selection in Fuzzy Clustering

In order to group the observations of a data set into a given number of clusters, an ?optimal? subset out of a greater number of explanatory variables is to be selected. The problem is approached by maximizing a quality measure under certain restrictions that are supposed to keep the subset most representative of the whole data. The restrictions may either be set manually, or generated from the data. A genetic optimization algorithm is developed to solve this problem. The procedure is then applied to a data set describing features of sub-districts of the city of Dortmund, Germany, to detect different social milieus and investigate the variables making up the differences between these.

MoreLess

Year of publication:	2004
Authors:	Röver, Christian ; Szepannek, Gero
Institutions:	Institut für Wirtschafts- und Sozialstatistik, Universität Dortmund

More details

Extent:	application/pdf
Series:	Technical Reports.
Type of publication:	Book / Working Paper
Language:	English
Notes:	Number 2004,76
Source:	RePEc - Research Papers in Economics

Persistent link: https://www.econbiz.de/10009219830