XMeans
Package
weka.clusterers
Synopsis
Cluster data using the X-means algorithm.
X-Means is K-Means extended by an Improve-Structure part In this part of the algorithm the centers are attempted to be split in its region. The decision between the children of each center and itself is done comparing the BIC-values of the two structures.
For more information see:
Dan Pelleg, Andrew W. Moore: X-means: Extending K-means with Efficient Estimation of the Number of Clusters. In: Seventeenth International Conference on Machine Learning, 727-734, 2000.
Available in Weka 3.6.x - 3.7.1. Available via the package management system for Weka >= 3.7.2 (XMeans)
Options
The table below describes the options available for XMeans.
Option |
Description |
---|---|
KDTree |
The KDTree to use. |
binValue |
Set the value that represents true in the new attributes. |
cutOffFactor |
the cut-off factor to use |
debugLevel |
The debug level to use. |
debugVectorsFile |
The file containing the debug vectors (only for debugging!). |
distanceF |
The distance function to use. |
inputCenterFile |
The file to read the list of centers from. |
maxIterations |
the maximum number of iterations to perform |
maxKMeans |
the maximum number of iterations to perform in KMeans |
maxKMeansForChildren |
the maximum number of iterations KMeans that is performed on the child centers |
maxNumClusters |
set maximum number of clusters |
minNumClusters |
set minimum number of clusters |
outputCenterFile |
The file to write the list of centers to. |
seed |
The random number seed to be used. |
useKDTree |
Whether to use the KDTree. |
Capabilities
The table below describes the capabilites of XMeans.
Capability |
Supported |
---|---|
Class |
No class |
Attributes |
Missing values, Date attributes, Numeric attributes |
Min # of instances |
1 |