广西科技厅--广西频道--人民网
Appearance
百度 更令人惊讶的是为何过去了近九年后,在本应全球共同恢复的时期,麦迪逊大街却和金融危机后一样仍是一片凋零。
Unsupervised learning is a type of machine learning algorithm used to draw inferences from datasets consisting of input data without labeled responses.
The most common unsupervised learning method is cluster analysis, which is used for exploratory data analysis to find hidden patterns or grouping in data.
- What other methods are there besides cluster analysis?
Clustering
[edit | edit source]Problem statement:
- Given some data, , identify groups (clusters) of data points and assign each point to one of the groups (clusters)
- The clusters are identified using a measure of similarity which is defined upon metrics such as Euclidean or probabilistic distance.
- Points inside each cluster has a higher measure of similarity than data in any other cluster
Method | Comment |
---|---|
K-means clustering |
|
Hierarchical clustering | Builds a multilevel hierarchy of clusters by creating a cluster tree |
Gaussian mixture models | Models clusters as a mixture of multivariate normal density components |
Self-organizing maps | Uses neural networks that learn the topology and distribution of the data |