Matplotlib 离散颜色条
- 2025-01-20 09:07:00
- admin 原创
- 123
问题描述:
我正在尝试为 matplotlib 中的散点图制作离散颜色条
我有 x、y 数据,并且每个点都有一个整数标签值,我想用唯一的颜色来表示,例如
plt.scatter(x, y, c=tag)
通常标签是 0-20 之间的整数,但具体范围可能会改变
到目前为止我只是使用默认设置,例如
plt.colorbar()
这给出了连续的颜色范围。理想情况下,我想要一组 n 种离散颜色(本例中 n=20)。更好的方法是让标签值为 0 以产生灰色,1-20 产生彩色。
我找到了一些“食谱”脚本,但它们非常复杂,我不认为它们是解决看似简单问题的正确方法
解决方案 1:
您可以使用 BoundaryNorm 作为散点图的规范化器,轻松创建自定义离散颜色条。古怪之处(在我的方法中)是将 0 显示为灰色。
对于图像,我经常使用 cmap.set_bad() 并将我的数据转换为 numpy 掩码数组。这样更容易使 0 变成灰色,但我无法让它与散点图或自定义 cmap 一起使用。
另外,您可以从头开始制作自己的 cmap,或者读出现有的 cmap 并覆盖一些特定的条目。
import numpy as np
import matplotlib as mpl
import matplotlib.pylab as plt
fig, ax = plt.subplots(1, 1, figsize=(6, 6)) # setup the plot
x = np.random.rand(20) # define the data
y = np.random.rand(20) # define the data
tag = np.random.randint(0, 20, 20)
tag[10:12] = 0 # make sure there are some 0 values to show up as grey
cmap = plt.cm.jet # define the colormap
# extract all colors from the .jet map
cmaplist = [cmap(i) for i in range(cmap.N)]
# force the first color entry to be grey
cmaplist[0] = (.5, .5, .5, 1.0)
# create the new map
cmap = mpl.colors.LinearSegmentedColormap.from_list(
'Custom cmap', cmaplist, cmap.N)
# define the bins and normalize
bounds = np.linspace(0, 20, 21)
norm = mpl.colors.BoundaryNorm(bounds, cmap.N)
# make the scatter
scat = ax.scatter(x, y, c=tag, s=np.random.randint(100, 500, 20),
cmap=cmap, norm=norm)
# create a second axes for the colorbar
ax2 = fig.add_axes([0.95, 0.1, 0.03, 0.8])
cb = plt.colorbar.ColorbarBase(ax2, cmap=cmap, norm=norm,
spacing='proportional', ticks=bounds, boundaries=bounds, format='%1i')
ax.set_title('Well defined discrete colors')
ax2.set_ylabel('Very custom cbar [-]', size=12)
我个人认为,有 20 种不同的颜色,读取具体值有点困难,但这当然取决于你。
解决方案 2:
您可以按照下面的示例或文档中新添加的示例进行操作
#!/usr/bin/env python
"""
Use a pcolor or imshow with a custom colormap to make a contour plot.
Since this example was initially written, a proper contour routine was
added to matplotlib - see contour_demo.py and
http://matplotlib.sf.net/matplotlib.pylab.html#-contour.
"""
from pylab import *
delta = 0.01
x = arange(-3.0, 3.0, delta)
y = arange(-3.0, 3.0, delta)
X,Y = meshgrid(x, y)
Z1 = bivariate_normal(X, Y, 1.0, 1.0, 0.0, 0.0)
Z2 = bivariate_normal(X, Y, 1.5, 0.5, 1, 1)
Z = Z2 - Z1 # difference of Gaussians
cmap = cm.get_cmap('PiYG', 11) # 11 discrete colors
im = imshow(Z, cmap=cmap, interpolation='bilinear',
vmax=abs(Z).max(), vmin=-abs(Z).max())
axis('off')
colorbar()
show()
生成以下图像:
解决方案 3:
上述答案都很好,只是它们在颜色条上没有正确的标记位置。我喜欢将标记放在颜色的中间,这样数字 -> 颜色的映射会更清晰。您可以通过更改 matshow 调用的限制来解决此问题:
import matplotlib.pyplot as plt
import numpy as np
def discrete_matshow(data):
# get discrete colormap
cmap = plt.get_cmap('RdBu', np.max(data) - np.min(data) + 1)
# set limits .5 outside true range
mat = plt.matshow(data, cmap=cmap, vmin=np.min(data) - 0.5,
vmax=np.max(data) + 0.5)
# tell the colorbar to tick at integers
cax = plt.colorbar(mat, ticks=np.arange(np.min(data), np.max(data) + 1))
# generate data
a = np.random.randint(1, 9, size=(10, 10))
discrete_matshow(a)
解决方案 4:
要设置高于或低于颜色图范围的值,您需要使用颜色图的set_over
和方法。如果您想标记特定值,请屏蔽它(即创建一个屏蔽数组),然后使用方法。(查看基本颜色图类的文档:http ://matplotlib.org/api/colors_api.html#matplotlib.colors.Colormap )set_under
`set_bad`
听起来你想要这样的东西:
import matplotlib.pyplot as plt
import numpy as np
# Generate some data
x, y, z = np.random.random((3, 30))
z = z * 20 + 0.1
# Set some values in z to 0...
z[:5] = 0
cmap = plt.get_cmap('jet', 20)
cmap.set_under('gray')
fig, ax = plt.subplots()
cax = ax.scatter(x, y, c=z, s=100, cmap=cmap, vmin=0.1, vmax=z.max())
fig.colorbar(cax, extend='min')
plt.show()
解决方案 5:
这个主题已经被很好地涵盖了,但我想添加一些更具体的内容:我想确保某个值将映射到该颜色(而不是任何颜色)。
这并不复杂,但因为我花了一些时间,它可能会帮助其他人避免像我一样浪费太多时间:)
import numpy as np
import matplotlib
from matplotlib import pyplot as plt
from matplotlib.colors import ListedColormap
# Let's design a dummy land use field
A = np.reshape([7, 2, 13, 7, 2, 2], (2, 3))
vals = np.unique(A)
# Let's also design our color mapping: 1s should be plotted in blue, 2s in red, etc...
col_dict = {1: "blue",
2: "red",
13: "orange",
7: "green"}
# We create a colormar from our list of colors
cm = ListedColormap([col_dict[x] for x in col_dict.keys()])
# Let's also define the description of each category : 1 (blue) is Sea; 2 (red) is burnt, etc... Order should be respected here ! Or using another dict maybe could help.
labels = np.array(["Sea", "City", "Sand", "Forest"])
len_lab = len(labels)
# prepare normalizer
# Prepare bins for the normalizer
norm_bins = np.sort([*col_dict.keys()]) + 0.5
norm_bins = np.insert(norm_bins, 0, np.min(norm_bins) - 1.0)
print(norm_bins)
# Make normalizer and formatter
norm = matplotlib.colors.BoundaryNorm(norm_bins, len_lab, clip=True)
fmt = matplotlib.ticker.FuncFormatter(lambda x, pos: labels[norm(x)])
# Plot our figure
fig, ax = plt.subplots()
im = ax.imshow(A, cmap=cm, norm=norm)
diff = norm_bins[1:] - norm_bins[:-1]
tickz = norm_bins[:-1] + diff / 2
cb = fig.colorbar(im, format=fmt, ticks=tickz)
plt.show()
解决方案 6:
我一直在研究这些想法,以下是我的一点看法。它避免了调用BoundaryNorm
以及将 指定为和 的norm
参数。但是,我发现没有办法消除对 的冗长调用。scatter
`colorbar`matplotlib.colors.LinearSegmentedColormap.from_list
背景是,matplotlib 提供了所谓的定性颜色图,旨在用于离散数据。Set1
例如,有 9 种易于区分的颜色,tab20
可用于 20 种颜色。有了这些地图,就可以自然地使用它们的前 n 种颜色来为具有 n 个类别的散点图着色,如下例所示。该示例还生成了一个带有适当标记的 n 种离散颜色的颜色条。
import matplotlib, numpy as np, matplotlib.pyplot as plt
n = 5
from_list = matplotlib.colors.LinearSegmentedColormap.from_list
cm = from_list(None, plt.cm.Set1(range(0,n)), n)
x = np.arange(99)
y = x % 11
z = x % n
plt.scatter(x, y, c=z, cmap=cm)
plt.clim(-0.5, n-0.5)
cb = plt.colorbar(ticks=range(0,n), label='Group')
cb.ax.tick_params(length=0)
生成下图。n
调用中的Set1
指定该颜色图的第一个颜色,调用中的n
最后一个
指定构造一个带有颜色的图(默认值为 256)。为了将设置为默认颜色图,我发现有必要给它命名并注册它,即:n
`from_listn
cm`plt.set_cmap
cm = from_list('Set15', plt.cm.Set1(range(0,n)), n)
plt.cm.register_cmap(None, cm)
plt.set_cmap(cm)
...
plt.scatter(x, y, c=z)
解决方案 7:
@Enzoupi 的回答有很多好东西。我把它分解了一下,看看是什么。这是我的注释版本。所有功劳都归功于他们。
import numpy as np
import matplotlib
from matplotlib import pyplot as plt
from matplotlib.colors import ListedColormap
# Let's design a dummy land use field
A = np.reshape([7, 2, 13, 7, 2, 2], (2, 3))
# Let's also design our color mapping: 1s should be plotted in blue, 2s in red, etc...
col_dict = {1: "blue",
2: "red",
13: "orange",
7: "green"}
# We create a colormar from our list of colors
cm = ListedColormap(colors=list(col_dict.values()))
# Note the colormap `cm` has no information as to the values we want each color
# to represent... to do that, we have to normalize the image.
# We create bins by adding 0.5 to each value (this assumes no two values are
# less than 0.5 apart)
norm_bins = np.sort([*col_dict.keys()]) + 0.5
# We must also add a bin at the bottom; doesn't matter really where as long as
# it's below the minimum value. This is because `BoundaryNorm` needs bins on
# either side of a value to map that value to a particular color.
norm_bins = np.insert(norm_bins, 0, np.min(norm_bins) - 1.0) # add one below the minimum
norm = matplotlib.colors.BoundaryNorm(norm_bins, len(col_dict), clip=True)
# Let's also define the description of each category : 1 (blue) is Sea; 2 (red) is burnt, etc...
# Order should be respected here ! Or using another dict maybe could help.
labels = ["Sea", "City", "Sand", "Forest"]
# We need a tick formatter that takes the value `x` and maps it to a label.
# We use the normalizer we created to take land use values and convert to the
# color index, and then use that to pick from our label list.
fmt = matplotlib.ticker.FuncFormatter(lambda x, pos: labels[norm(x)])
# But, as-is, the ticks will be at the boundary limits, which will appear to
# show incorrect labels. So we must also put the ticks at the center of each bin
# in the normalizer.
diff = norm_bins[1:] - norm_bins[:-1]
ticks = norm_bins[:-1] + diff / 2
# So in summary:
# Plot `A` using a list of colors `cm`. Normalize the values in `A` using `norm`
# (so that discontinous values map to the correct colors)
im = plt.imshow(A, cmap=cm, norm=norm)
# Add a colorbar which positions ticks at the center of each color band and
# formats them so they're labeled according to the meaning of the value.
plt.colorbar(im, format=fmt, ticks=ticks)
plt.show()
解决方案 8:
我认为您可能想看看colors.ListedColormap来生成您的颜色图,或者如果您只需要一个静态颜色图,我一直在开发一个可能有帮助的应用程序。
解决方案 9:
我的用例类似,但有一些额外的要求:
应该将任意整数列表映射到任意颜色。
不应假设整数是连续的、排序的或在数据中表示的。
感谢@Enzoupi 提供一个起点,我对其进行了重构,使其更具可重用性,并进行了调试,以正确地将颜色映射到整数。
import numpy as np
import matplotlib as mpl
import pylab as plt
class Label:
def __init__(self, integer, color, label):
self.integer = integer
self.color = color
self.label = label
class DiscreteColormapGenerator:
def __init__(self, labels):
self.labels = labels
self.labels.sort(key=lambda x: x.integer)
def get_colormap(self):
return mpl.colors.ListedColormap([l.color for l in self.labels])
def get_normalizer(self):
bins = [l.integer for l in self.labels]
bins = [self.labels[0].integer - 1] + bins + [self.labels[-1].integer + 1]
bins = np.array(bins[0:-1] + bins[1:]).reshape(2, -1).mean(0)
return mpl.colors.BoundaryNorm(bins, len(self.labels), clip=True)
def get_legend_patches(self):
return [mpl.patches.Patch(color=l.color, label=l.label) for l in labels]
if __name__ == "__main__":
labels = [
Label(1, "blue", "Sea"),
Label(2, "red", "City"),
Label(13, "orange", "Sand"),
Label(7, "green", "Forest"),
]
cmpr = DiscreteColormapGenerator(labels)
cmpr.get_normalizer()
A = np.reshape([7, 2, 13, 7, 2, 2], (2, 3))
fig, ax = plt.subplots()
im = ax.imshow(A, cmap=cmpr.get_colormap(), norm=cmpr.get_normalizer())
fig.legend(handles=cmpr.get_legend_patches())
plt.show()
扫码咨询,免费领取项目管理大礼包!