MiniBatchKMeans Class |
Namespace: Accord.MachineLearning
The MiniBatchKMeans type exposes the following members.
Name | Description | |
---|---|---|
MiniBatchKMeans(Int32, Int32) |
Initializes a new instance of KMeans algorithm
| |
MiniBatchKMeans(Int32, Int32, IDistanceDouble) |
Initializes a new instance of Mini-Batch K-Means algorithm
|
Name | Description | |
---|---|---|
BatchSize |
Gets or sets the size of batches.
| |
Centroids |
Gets or sets the cluster centroids.
(Inherited from KMeans.) | |
Clusters |
Gets the clusters found by K-means.
(Inherited from KMeans.) | |
ComputeCovariances |
Gets or sets whether covariance matrices for the clusters should
be computed at the end of an iteration. Default is true.
(Inherited from KMeans.) | |
ComputeError |
Gets or sets whether the clustering distortion error (the
average distance between all data points and the cluster
centroids) should be computed at the end of the algorithm.
The result will be stored in Error. Default is true.
(Inherited from KMeans.) | |
Dimension |
Gets the dimensionality of the data space.
(Inherited from KMeans.) | |
Distance |
Gets or sets the distance function used
as a distance metric between data points.
(Inherited from KMeans.) | |
Error |
Gets the cluster distortion error after the
last call to this class' Compute methods.
(Inherited from KMeans.) | |
InitializationBatchSize |
Gets or sets the size of the batch used during initialization.
| |
Iterations |
Gets the number of iterations performed in the
last call to this class' Compute methods.
(Inherited from KMeans.) | |
K |
Gets the number of clusters.
(Inherited from KMeans.) | |
Labels |
Gets the labels assigned for each data point in the last
call to Learn(Double, Double).
| |
MaxIterations |
Gets or sets the maximum number of iterations to
be performed by the method. If set to zero, no
iteration limit will be imposed. Default is 0.
(Inherited from KMeans.) | |
NumberOfInitializations |
Gets or sets the number of different initializations of the centroids.
| |
ParallelOptions |
Gets or sets the parallelization options for this algorithm.
(Inherited from ParallelLearningBase.) | |
Token |
Gets or sets a cancellation token that can be used
to cancel the algorithm while it is running.
(Inherited from ParallelLearningBase.) | |
Tolerance |
Gets or sets the relative convergence threshold
for stopping the algorithm. Default is 1e-5.
(Inherited from KMeans.) | |
UseSeeding |
Gets or sets the strategy used to initialize the
centroids of the clustering algorithm. Default is
KMeansPlusPlus.
(Inherited from KMeans.) |
Name | Description | |
---|---|---|
Compute(Double) | Obsolete.
Divides the input data into K clusters.
(Inherited from KMeans.) | |
Compute(Double, Double) | Obsolete.
Divides the input data into K clusters.
(Inherited from KMeans.) | |
Compute(Double, Double) | Obsolete.
Divides the input data into K clusters.
(Inherited from KMeans.) | |
ComputeInformation(Double) |
Computes the information about each cluster (covariance, proportions and error).
(Inherited from KMeans.) | |
ComputeInformation(Double, Int32) |
Computes the information about each cluster (covariance, proportions and error).
(Inherited from KMeans.) | |
converged |
Determines if the algorithm has converged by comparing the
centroids between two consecutive iterations.
(Inherited from KMeans.) | |
Equals | Determines whether the specified object is equal to the current object. (Inherited from Object.) | |
Finalize | Allows an object to try to free resources and perform other cleanup operations before it is reclaimed by garbage collection. (Inherited from Object.) | |
GetHashCode | Serves as the default hash function. (Inherited from Object.) | |
GetType | Gets the Type of the current instance. (Inherited from Object.) | |
Learn |
Learns a model that can map the given inputs to the desired outputs.
(Overrides KMeansLearn(Double, Double).) | |
MemberwiseClone | Creates a shallow copy of the current Object. (Inherited from Object.) | |
Randomize |
Randomizes the clusters inside a dataset.
(Inherited from KMeans.) | |
ToString | Returns a string that represents the current object. (Inherited from Object.) |
Name | Description | |
---|---|---|
HasMethod |
Checks whether an object implements a method with the given name.
(Defined by ExtensionMethods.) | |
IsEqual |
Compares two objects for equality, performing an elementwise
comparison if the elements are vectors or matrices.
(Defined by Matrix.) | |
To(Type) | Overloaded.
Converts an object into another type, irrespective of whether
the conversion can be done at compile time or not. This can be
used to convert generic types to numeric types during runtime.
(Defined by ExtensionMethods.) | |
ToT | Overloaded.
Converts an object into another type, irrespective of whether
the conversion can be done at compile time or not. This can be
used to convert generic types to numeric types during runtime.
(Defined by ExtensionMethods.) |
The Mini-Batch K-Means clustering algorithm is a modification of the K-Means algorithm.
In each iteration, it uses only a portion of data to update the cluster centroids with the gradient step. The subsets of data are called mini-batches and are randomly sampled from the whole dataset in each iteration.
Mini-Batch K-Means is faster than k-means for large datasets since batching reduces computational time of the algorithm.
The algorithm is composed of the following steps:
References:
Accord.Math.Random.Generator.Seed = 0; // Declare some observations double[][] observations = { new double[] { -5, -2, -1 }, new double[] { -5, -5, -6 }, new double[] { 2, 1, 1 }, new double[] { 1, 1, 2 }, new double[] { 1, 2, 2 }, new double[] { 3, 1, 2 }, new double[] { 11, 5, 4 }, new double[] { 15, 5, 6 }, new double[] { 10, 5, 6 }, }; // Create a new Mini-Batch K-Means algorithm MiniBatchKMeans mbkmeans = new MiniBatchKMeans(k: 3, batchSize: 2); // Compute and retrieve the data centroids var clusters = mbkmeans.Learn(observations); // Use the centroids to parition all the data int[] labels = clusters.Decide(observations);