Cluster analysis, or clustering, is a statistical technique for grouping a set of objects or data points so that those in the same group (called a cluster) are more similar to each other than to those in other clusters. It is a central task in data mining and knowledge discovery, and is widely used in machine learning, pattern recognition, image analysis, bioinformatics, and other fields that analyze large data sets.
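
To make the idea of grouping by similarity concrete, the following is a minimal sketch using k-means, one of many clustering methods and not one named in the text above. The function name `kmeans`, the sample points, the choice of Euclidean distance as the similarity measure, and the naive seeding with the first `k` points are all assumptions made for illustration.

```python
from math import dist  # Euclidean distance, available in Python 3.8+


def kmeans(points, k, iterations=10):
    """Group points into k clusters with a basic k-means loop (illustrative only)."""
    # Assumption: seed the centroids with the first k points (deterministic but naive).
    centroids = [tuple(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins the cluster with the nearest centroid.
        labels = [min(range(k), key=lambda c: dist(p, centroids[c])) for p in points]
        # Update step: move each centroid to the mean of its current members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster ends up empty
                centroids[c] = tuple(sum(axis) / len(members) for axis in zip(*members))
    return labels, centroids


# Hypothetical sample data: two visibly separated groups of 2-D points.
points = [(1.0, 1.0), (9.0, 9.0), (1.5, 2.0), (8.0, 8.5), (1.0, 0.5), (9.5, 8.0)]
labels, centroids = kmeans(points, k=2)
print(labels)  # → [0, 1, 0, 1, 0, 1]
```

Points that receive the same label belong to the same cluster: here the three points near (1, 1) share one label and the three near (9, 9) share the other. Other methods, such as hierarchical or density-based clustering, define "similar" differently and may group the same data in other ways.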