Clustering
NeuronDB provides multiple clustering algorithms for unsupervised learning.
K-Means
Partition data into k clusters:
CREATE TEMP TABLE kmeans_model AS SELECT train_kmeans_model_id('data_table', 'features', 3, 100) AS model_id;
Mini-batch K-Means
Faster version for large datasets:
SELECT train_minibatch_kmeans('data_table', 'features', 3, 100) AS model_id;
DBSCAN
Density-based clustering:
SELECT train_dbscan('data_table', 'features', 0.5, 5) AS model_id;
GMM (Gaussian Mixture Model)
Probabilistic clustering:
CREATE TEMP TABLE gmm_model AS SELECT train_gmm_model_id('data_table', 'features', 3) AS model_id;
Hierarchical Clustering
SELECT train_hierarchical_clustering('data_table', 'features', 3) AS model_id;
Learn More
For detailed documentation on clustering algorithms, choosing parameters, evaluating clusters, and visualization, visit:
Related Topics
- Dimensionality Reduction - Reduce dimensions before clustering
- Quality Metrics - Evaluate clustering quality