clustering

Here are 1,890 public repositories matching this topic...

biolab / orange3

🍊 📊 💡 Orange: Interactive data analysis

visualization python data-science machine-learning data-mining random-forest clustering numpy scikit-learn regression pandas data-visualization classification scipy orange plotting decision-trees visual-programming orange3

Updated Jun 16, 2025
Python

dedupeio / dedupe

Star

🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

python clustering dedupe record-linkage python-library entity-resolution datamade dedupe-library de-duplicating

Updated Nov 25, 2024
Python

embeddings-benchmark / mteb

Star

MTEB: Massive Text Embedding Benchmark

benchmark information-retrieval retrieval text-classification clustering sts semantic-search reranking text-embedding sgpt neural-search sentence-transformers sbert multilingual-nlp bitext-mining mteb

Updated Jun 23, 2025
Python

benedekrozemberczki / awesome-community-detection

Sponsor

Star

A curated list of community detection research papers with implementations.

Updated Mar 16, 2024
Python

nomic-ai / nomic

Star

Interact, analyze and structure massive text, image, embedding, audio and video datasets

python clustering text embeddings topic-modeling duplicate-detection unstructured-data

Updated Jun 13, 2025
Python

google / uis-rnn

Star

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

machine-learning clustering supervised-learning speaker-recognition speaker-diarization supervised-clustering uis-rnn

Updated Sep 25, 2024
Python

JustGlowing / minisom

Star

🔴 MiniSom is a minimalistic implementation of the Self Organizing Maps

machine-learning clustering som neural-networks dimensionality-reduction outlier-detection kohonen unsupervised-learning manifold-learning self-organizing-map vector-quantization

Updated Apr 7, 2025
Python

A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/classification/clustering/forecasting/anomaly detection/cleaning on incomplete industrial (irregularly-sampled) multivariate TS with NaN missing values

machine-learning data-mining deep-learning time-series clustering pytorch forecasting classification imputation generation anomaly-detection missing-values

Updated May 29, 2025
Python

wvangansbeke / Unsupervised-Classification

Star

SCAN: Learning to Classify Images without Labels, incl. SimCLR. [ECCV 2020]

clustering image-classification representation-learning unsupervised-learning moco self-supervised-learning simclr eccv2020 eccv-2020 contrastive-learning

Updated Jul 27, 2023
Python

parthsarthi03 / raptor

Star

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

machine-learning framework retrieval clustering language-model agents rag vector-database llm retrieval-augmented-generation

Updated Sep 3, 2024
Python

annoviko / pyclustering

Star

pyclustering is a Python, C++ data mining library.

python c-plus-plus data-science machine-learning data-mining algorithms clustering python3 neural-networks oscillatory-networks

Updated Feb 25, 2024
Python

wannesm / dtaidistance

Sponsor

Star

Time series distances: Dynamic Time Warping (fast DTW implementation in C)

python c timeseries clustering dtw dynamic-time-warping distance-measure

Updated Jun 18, 2025
Python

unum-cloud / uform

Star

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Updated Jun 21, 2025
Python

tensorflow / similarity

Star

TensorFlow Similarity is a python package focused on making similarity learning quick and easy.

python machine-learning deep-learning clustering tensorflow nearest-neighbor-search metric-learning cosine-similarity nearest-neighbors unsupervised-learning knn similarity-search similarity-learning simclr contrastive-learning simsiam barlow-twins simclr2

Updated May 6, 2024
Python

scikit-multilearn / scikit-multilearn

Star

A scikit-learn based module for multi-label et. al. classification

machine-learning clustering scikit-learn classification partitioning label-prediction scikit multi-label scikit-multilearn

Updated Feb 1, 2024
Python

yueliu1999 / Awesome-Deep-Graph-Clustering

Star

Awesome Deep Graph Clustering is a collection of SOTA, novel deep graph clustering methods (papers, codes, and datasets).

machine-learning data-mining deep-learning clustering surveys representation-learning data-mining-algorithms network-embedding graph-convolutional-networks gcn graph-embedding graph-neural-networks self-supervised-learning deep-clustering graphclustering

Updated May 6, 2025
Python

benedekrozemberczki / ClusterGCN

Sponsor

Star

A PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks" (KDD 2019).

Updated Nov 6, 2022
Python

loicland / superpoint_graph

Star

Large-scale Point Cloud Semantic Segmentation with Superpoint Graphs

semantic clustering point-cloud pytorch lidar segmentation partition semantic-segmentation large-scale ply-files superpoint-graphs

Updated Jul 19, 2023
Python

logpai / Drain3

Star

A robust streaming log template miner based on the Drain algorithm

machine-learning log clustering observability log-clustering drain anomaly-detection aiops template-mining

Updated Feb 4, 2025
Python

yahoo / lopq

Star

Training of Locally Optimized Product Quantization (LOPQ) models for approximate nearest neighbor search of high dimensional data in Python and Spark.

spark clustering nearest-neighbor-search product-quantization lopq

Updated Apr 14, 2019
Python

Improve this page

Add a description, image, and links to the clustering topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the clustering topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

clustering

Here are 1,890 public repositories matching this topic...

biolab / orange3

dedupeio / dedupe

embeddings-benchmark / mteb

benedekrozemberczki / awesome-community-detection

nomic-ai / nomic

google / uis-rnn

JustGlowing / minisom

WenjieDu / PyPOTS

wvangansbeke / Unsupervised-Classification

parthsarthi03 / raptor

annoviko / pyclustering

wannesm / dtaidistance

unum-cloud / uform

tensorflow / similarity

scikit-multilearn / scikit-multilearn

yueliu1999 / Awesome-Deep-Graph-Clustering

benedekrozemberczki / ClusterGCN

loicland / superpoint_graph

logpai / Drain3

yahoo / lopq

Improve this page

Add this topic to your repo