谱聚类python代码_python中的谱聚类图

没有太多光谱聚类的经验，只是按照文档进行(结果请跳到最后！)以下内容：

代码：import numpy as np

import networkx as nx

from sklearn.cluster import SpectralClustering

from sklearn import metrics

np.random.seed(1)

# Get your mentioned graph

G = nx.karate_club_graph()

# Get ground-truth: club-labels -> transform to 0/1 np-array

# (possible overcomplicated networkx usage here)

gt_dict = nx.get_node_attributes(G, 'club')

gt = [gt_dict[i] for i in G.nodes()]

gt = np.array([0 if i == 'Mr. Hi' else 1 for i in gt])

# Get adjacency-matrix as numpy-array

adj_mat = nx.to_numpy_matrix(G)

print('ground truth')

print(gt)

# Cluster

sc = SpectralClustering(2, affinity='precomputed', n_init=100)

sc.fit(adj_mat)

# Compare ground-truth and clustering-results

print('spectral clustering')

print(sc.labels_)

print('just for better-visualization: invert clusters (permutation)')

print(np.abs(sc.labels_ - 1))

# Calculate some clustering metrics

print(metrics.adjusted_rand_score(gt, sc.labels_))

print(metrics.adjusted_mutual_info_score(gt, sc.labels_))

输出：ground truth

[0 0 0 0 0 0 0 0 0 1 0 0 0 0 1 1 0 0 1 0 1 0 1 1 1 1 1 1 1 1 1 1 1 1]

spectral clustering

[1 1 0 1 1 1 1 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0]

just for better-visualization: invert clusters (permutation)

[0 0 1 0 0 0 0 1 1 1 0 1 1 1 1 1 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1]

0.204094758281

0.271689477828

总体思路：

介绍here中的数据和任务：The nodes in the graph represent the 34 members in a college Karate club. (Zachary is a sociologist, and he was one of the members.) An edge between two nodes indicates that the two members spent significant time together outside normal club meetings. The dataset is interesting because while Zachary was collecting his data, there was a dispute in the Karate club, and it split into two factions: one led by “Mr. Hi”, and one led by “John A”. It turns out that using only the connectivity information (the edges), it is possible to recover the two factions.

使用sklearn&spectral集群解决此问题：If affinity is the adjacency matrix of a graph, this method can be used to find normalized graph cuts.

This将规范化图切割描述为：Find two disjoint partitions A and B of the vertices V of a graph, so

that A ∪ B = V and A ∩ B = ∅

Given a similarity measure w(i,j) between two vertices (e.g. identity

when they are connected) a cut value (and its normalized version) is defined as:

cut(A, B) = SUM u in A, v in B: w(u, v)

...

we seek the minimization of disassociation

between the groups A and B and the maximization of the association

within each group

听起来不错。因此，我们创建邻接矩阵(nx.to_numpy_matrix(G))，并将参数affinity设置为预计算的(因为邻接矩阵是我们预计算的相似性度量)。Alternatively, using precomputed, a user-provided affinity matrix can be used.

编辑：虽然对此不熟悉，但我查找了要调整的The strategy to use to assign labels in the embedding space. There are two ways to assign labels after the laplacian embedding. k-means can be applied and is a popular choice. But it can also be sensitive to initialization. Discretization is another approach which is less sensitive to random initialization.

所以尝试不那么敏感的方法：sc = SpectralClustering(2, affinity='precomputed', n_init=100, assign_labels='discretize')

输出：ground truth

[0 0 0 0 0 0 0 0 0 1 0 0 0 0 1 1 0 0 1 0 1 0 1 1 1 1 1 1 1 1 1 1 1 1]

spectral clustering

[0 0 1 0 0 0 0 0 1 1 0 0 0 0 1 1 0 0 1 0 1 0 1 1 1 1 1 1 1 1 1 1 1 1]

just for better-visualization: invert clusters (permutation)

[1 1 0 1 1 1 1 1 0 0 1 1 1 1 0 0 1 1 0 1 0 1 0 0 0 0 0 0 0 0 0 0 0 0]

0.771725032425

0.722546051351

这是一个非常符合实际的事实！

谱聚类python代码_python中的谱聚类图相关推荐

层次聚类python实现_Python机器学习——Agglomerative层次聚类
层次聚类(hierarchical clustering)可在不同层次上对数据集进行划分,形成树状的聚类结构.AggregativeClustering是一种常用的层次聚类算法. 其原理是:最初将每个 ...
谱聚类python代码_Python 谱聚类算法从零开始
谱聚类算法是一种常用的无监督机器学习算法,其性能优于其他聚类方法. 此外,谱聚类实现起来非常简单,并且可以通过标准线性代数方法有效地求解. 在谱聚类算法中,根据数据点之间的相似性而不是k-均值中的绝对 ...
js如何运行python代码_python中执行javascript代码
python中执行javascript代码: 1.安装相应的库,我使用的是PyV8 2.import PyV8 ctxt = PyV8.JSContext() ctxt.enter() func = ...
层次聚类python代码_python实现层次聚类
BAFIMINARMTO BA0662877255412996 FI6620295468268400 MI8772950754564138 NA2554687540219869 RM412268564 ...
支持向量机python代码_Python中的支持向量机SVM的使用（有实例）
除了在Matlab中使用PRTools工具箱中的svm算法,Python中一样可以使用支持向量机做分类.因为Python中的sklearn库也集成了SVM算法,本文的运行环境是Pycharm. 一.导 ...
谱聚类Python代码详解
谱聚类算法步骤整体来说,谱聚类算法要做的就是先求出相似性矩阵,然后对该矩阵归一化运算,之后求前个特征向量,最后运用K-means算法分类. 实际上,谱聚类要做的事情其实就是将高维度的数据,以特征向量 ...
python字符集_PYTHON 中的字符集
Python中的字符编码是个老生常谈的话题,今天来梳理一下相关知识,希望给其他人些许帮助. Python2的默认编码是ASCII,不能识别中文字符,需要显式指定字符编码:Python3的默认编码 ...
用Python代码实现视频转gif动图
下面是一个使用 Python 代码实现视频转 gif 动图的简单示例: import imageio# 读取视频文件 video = imageio.get_reader('input.mp4')# ...
python层次聚类_python中做层次聚类，使用scipy.cluster.hierarchy.fclusterdata方法 | 学步园...
python机器学习包里面的cluster提供了很多聚类但是没有看明白ward_tree的返回值代表了什么含义,遂决定寻找别的实现方式. 经过查找,发现scipy.cluster.hierarchy ...

谱聚类python代码_python中的谱聚类图

谱聚类python代码_python中的谱聚类图相关推荐

最新文章

热门文章