Scatter plot with different text at each data point

Question

I am trying to make a scatter plot and annotate data points with different numbers from a list. So, for example, I want to plot y vs x and annotate with corresponding numbers from n.

import matplotlib.pyplot as plt

y = [2.56422, 3.77284, 3.52623, 3.51468, 3.02199]
x = [0.15, 0.3, 0.45, 0.6, 0.75]
n = [58, 651, 393, 203, 123]
fig = plt.figure()
ax = fig.add_subplot(111)
ax.scatter(x, y, marker='o')

Any ideas?

You can also get a scatter plot with tooltip labels on hover using the mpld3 library: mpld3.github.io/examples/scatter_tooltip.html
– Claude COULOMBE, Commented May 20, 2019 at 2:17

borgr · Accepted Answer · 2024-02-14 20:52:43Z

821

I'm not aware of any plotting method that takes arrays or lists of labels, but you could call annotate() while iterating over the values in n.

import matplotlib.pyplot as plt
x = [0.15, 0.3, 0.45, 0.6, 0.75]
y = [2.56422, 3.77284, 3.52623, 3.51468, 3.02199]
n = [58, 651, 393, 203, 123]

fig, ax = plt.subplots()
ax.scatter(x, y)

for i, txt in enumerate(n):
    ax.annotate(txt, (x[i], y[i]))

There are a lot of formatting options for annotate(); see the matplotlib website.
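As a minimal sketch of those formatting options (not part of the original answer), the loop below shifts each label away from its marker using xytext together with textcoords="offset points", both documented annotate() parameters. The 5-point offset and the font size are arbitrary values chosen for illustration.

```python
import matplotlib.pyplot as plt

x = [0.15, 0.3, 0.45, 0.6, 0.75]
y = [2.56422, 3.77284, 3.52623, 3.51468, 3.02199]
n = [58, 651, 393, 203, 123]

fig, ax = plt.subplots()
ax.scatter(x, y)

# zip pairs each label with its coordinates, so no index bookkeeping is needed.
labels = []
for xi, yi, txt in zip(x, y, n):
    label = ax.annotate(
        str(txt),
        xy=(xi, yi),                 # the data point being labeled
        xytext=(5, 5),               # shift the text 5 points right and 5 points up
        textcoords="offset points",  # interpret xytext as an offset from xy, in points
        fontsize=9,                  # arbitrary size for illustration
    )
    labels.append(label)
```

Because the offset is given in points rather than data units, the gap between marker and label should stay the same when the axes are rescaled. Call plt.show() or fig.savefig(...) afterwards to view the result.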

edited Feb 14, 2024 at 20:52

borgr


answered Jan 21, 2013 at 7:47

Rutger Kassies


8 Comments

ijoseph Over a year ago

Works well on top of Seaborn regplots without too much disruption, too.

Rachel Over a year ago

@Rutger I use a pandas dataframe and I somehow get a KeyError, so I guess a dict() object is expected? Is there any other way to label the data using enumerate, annotate and a pandas data frame?

aviator Over a year ago

For points that happen to be very close, is there any way to offset the annotations and draw lines pointing from the data points to the labels in order to nicely separate the otherwise overlapping labels?

Rutger Kassies Over a year ago

@aviator, not built-in unfortunately. But see for example this using networkx's layout engine: stackoverflow.com/a/34697108/1755432

Rutger Kassies Over a year ago

@Ben, yes, the annotate function has an xytext=(x, y) keyword that allows specifying the location of the text label. The default is the same as the point xy=(x, y). For example: ax.annotate(txt, xy=(x[i], y[i]), xytext=(x[i]+0.1, y[i]+0.1)). That also allows drawing lines or arrows between the two locations. More info at: matplotlib.org/3.5.0/tutorials/text/annotations.html
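Regarding the pandas KeyError raised in the comments above: one plausible cause (an assumption, since the commenter's code is not shown) is that df["x"][i] looks rows up by index label, and after filtering the labels no longer run 0, 1, 2, ... The sketch below uses a hypothetical DataFrame with made-up column names and iterates with itertuples, which does not depend on the index.

```python
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical DataFrame; the column names are assumptions for illustration.
df = pd.DataFrame({
    "x": [0.15, 0.3, 0.45, 0.6, 0.75],
    "y": [2.56422, 3.77284, 3.52623, 3.51468, 3.02199],
    "n": [58, 651, 393, 203, 123],
})

# Filtering keeps index labels 1, 2, 3, so subset["x"][0] would raise a KeyError.
subset = df[df["n"] > 150]

fig, ax = plt.subplots()
ax.scatter(subset["x"], subset["y"])

# itertuples yields one row at a time, independent of the index labels.
for row in subset.itertuples(index=False):
    ax.annotate(str(row.n), (row.x, row.y))
```

Positional access with subset["x"].iloc[i] inside an enumerate loop would be another way to avoid the label lookup.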


borgr · Accepted Answer · 2024-02-14 20:53:45Z

71

In case anyone tries to apply the above solution by calling plt.scatter() directly instead of plt.subplots(), here is what happened when I tried running the following code:

import matplotlib.pyplot as plt
x = [0.15, 0.3, 0.45, 0.6, 0.75]
y = [2.56422, 3.77284, 3.52623, 3.51468, 3.02199]
n = [58, 651, 393, 203, 123]

fig, ax = plt.scatter(x, y)

for i, txt in enumerate(n):
    ax.annotate(txt, (x[i], y[i]))

But I ran into an error stating "cannot unpack non-iterable PathCollection object", pointing specifically at the line fig, ax = plt.scatter(x, y).

I eventually solved the error using the following code

import matplotlib.pyplot as plt
plt.scatter(x, y)

for i, txt in enumerate(n):
    plt.annotate(txt, (x[i], y[i]))

I didn't expect there to be a difference between .scatter() and .subplots(). I should have known better.

edited Feb 14, 2024 at 20:53

borgr


answered Mar 28, 2019 at 2:11

Heather Claxton


2 Comments

Brandon Over a year ago

I'm using this exact same code in one of my scripts (the second block here), but I'm met with an error message saying "IndexError: index 1 is out of bounds for axis 0 with size 1", which is referring to "txt" in the annotate function. Any idea why this is happening?

Alperino Over a year ago

That's because plt.scatter does not create a Figure and an Axes like plt.subplots() does; it returns a PathCollection containing the scatter points. You are supposed to create the figure and axes beforehand.
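To illustrate the point about return values, here is a small sketch (the variable names points and is_collection are illustrative, not from the thread) that creates the Figure and Axes first and then checks what scatter hands back.

```python
import matplotlib.pyplot as plt
from matplotlib.collections import PathCollection

x = [0.15, 0.3, 0.45, 0.6, 0.75]
y = [2.56422, 3.77284, 3.52623, 3.51468, 3.02199]
n = [58, 651, 393, 203, 123]

fig, ax = plt.subplots()      # creates the Figure and the Axes
points = ax.scatter(x, y)     # returns the collection of markers, not (fig, ax)

for xi, yi, txt in zip(x, y, n):
    ax.annotate(str(txt), (xi, yi))

# scatter returns a single object, which is why "fig, ax = plt.scatter(...)" cannot unpack.
is_collection = isinstance(points, PathCollection)
```

If only the pyplot interface has been used, plt.gca() returns the current Axes, which is the object plt.annotate draws on.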

Joop · Accepted Answer · 2022-03-11 15:34:22Z

46

In versions earlier than matplotlib 2.0, ax.scatter is not necessary to plot text without markers. In version 2.0 and later, you'll need ax.scatter to set the proper axis range and markers for the text.

import matplotlib.pyplot as plt
y = [2.56422, 3.77284, 3.52623, 3.51468, 3.02199]
z = [0.15, 0.3, 0.45, 0.6, 0.75]
n = [58, 651, 393, 203, 123]

fig, ax = plt.subplots()
# Needed in matplotlib 2.0+: annotate() alone does not rescale the axes
ax.scatter(z, y)

for i, txt in enumerate(n):
    ax.annotate(txt, (z[i], y[i]))

And in this link you can find an example in 3d.
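The linked 3D example is not reproduced here. As a rough sketch of the same idea, assuming a recent matplotlib where add_subplot(projection="3d") works without extra imports, labels can be placed with the 3D axes' text method. The third coordinate below is made-up data for illustration.

```python
import matplotlib.pyplot as plt

xs = [0.15, 0.3, 0.45, 0.6, 0.75]
ys = [2.56422, 3.77284, 3.52623, 3.51468, 3.02199]
zs = [1.0, 2.0, 3.0, 4.0, 5.0]   # hypothetical third coordinate
n = [58, 651, 393, 203, 123]

fig = plt.figure()
ax = fig.add_subplot(projection="3d")
ax.scatter(xs, ys, zs)

# On 3D axes, text takes x, y, z and then the label string.
for xi, yi, zi, txt in zip(xs, ys, zs, n):
    ax.text(xi, yi, zi, str(txt))
```

Older matplotlib releases may need "from mpl_toolkits.mplot3d import Axes3D" before the 3d projection is available.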

edited Mar 11, 2022 at 15:34

Joop


answered May 15, 2016 at 19:21

rafaelvalle


2 Comments

Levine Over a year ago

This is awesome! Thanks for sharing this solution. Can you also share what the proper code is to set the size of the figure? Implementations such as plt.figure(figsize=(20,10)) aren't working as expected, in that invoking this code doesn't actually change the size of the image. Looking forward to your assistance. Thanks!

rafaelvalle Over a year ago

fig, ax = plt.subplots(figsize=(20,10))

Kamal El-Saaid · Accepted Answer · 2020-08-15 13:04:13Z

35

You may also use pyplot.text (see here).

import matplotlib.pyplot as plt
import numpy as np

def plot_embeddings(M_reduced, word2Ind, words):
    """
        Plot in a scatterplot the embeddings of the words specified in the list "words".
        Include a label next to each point.
    """
    for word in words:
        x, y = M_reduced[word2Ind[word]]
        plt.scatter(x, y, marker='x', color='red')
        plt.text(x+.03, y+.03, word, fontsize=9)
    plt.show()

M_reduced_plot_test = np.array([[1, 1], [-1, -1], [1, -1], [-1, 1], [0, 0]])
word2Ind_plot_test = {'test1': 0, 'test2': 1, 'test3': 2, 'test4': 3, 'test5': 4}
words = ['test1', 'test2', 'test3', 'test4', 'test5']
plot_embeddings(M_reduced_plot_test, word2Ind_plot_test, words)
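A possible variant of the function above, offered as a sketch rather than the answerer's method: draw all markers with a single scatter call and use the loop only for the text. The name plot_embeddings_once and the returned (fig, ax) pair are choices made for this illustration.

```python
import matplotlib.pyplot as plt
import numpy as np

def plot_embeddings_once(M_reduced, word2Ind, words):
    """Scatter all requested words in one call, then label each point."""
    idx = [word2Ind[w] for w in words]
    pts = np.asarray(M_reduced)[idx]            # shape (len(words), 2)
    fig, ax = plt.subplots()
    ax.scatter(pts[:, 0], pts[:, 1], marker="x", color="red")
    for (px, py), word in zip(pts, words):
        ax.text(px + 0.03, py + 0.03, word, fontsize=9)
    return fig, ax

M = np.array([[1, 1], [-1, -1], [1, -1], [-1, 1], [0, 0]])
w2i = {"test1": 0, "test2": 1, "test3": 2, "test4": 3, "test5": 4}
fig, ax = plot_embeddings_once(M, w2i, ["test1", "test3", "test5"])
```

Returning the figure and axes leaves it to the caller to decide whether to show or save the plot.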

edited Aug 15, 2020 at 13:04

Kamal El-Saaid


answered Jan 24, 2019 at 14:21

irudyak


Anwarvic · Accepted Answer · 2020-05-19 21:26:20Z

I would love to add that you can even use arrows and text boxes to annotate the labels. Here is what I mean:

import matplotlib.pyplot as plt


y = [2.56422, 3.77284, 3.52623, 3.51468, 3.02199]
z = [0.15, 0.3, 0.45, 0.6, 0.75]
n = [58, 651, 393, 203, 123]

fig, ax = plt.subplots()
ax.scatter(z, y)

ax.annotate(n[0], (z[0], y[0]), xytext=(z[0]+0.05, y[0]+0.3), 
    arrowprops=dict(facecolor='red', shrink=0.05))

ax.annotate(n[1], (z[1], y[1]), xytext=(z[1]-0.05, y[1]-0.3), 
    arrowprops = dict(  arrowstyle="->",
                        connectionstyle="angle3,angleA=0,angleB=-90"))

ax.annotate(n[2], (z[2], y[2]), xytext=(z[2]-0.05, y[2]-0.3), 
    arrowprops = dict(arrowstyle="wedge,tail_width=0.5", alpha=0.1))

ax.annotate(n[3], (z[3], y[3]), xytext=(z[3]+0.05, y[3]-0.2), 
    arrowprops = dict(arrowstyle="fancy"))

ax.annotate(n[4], (z[4], y[4]), xytext=(z[4]-0.1, y[4]-0.2),
    bbox=dict(boxstyle="round", alpha=0.1), 
    arrowprops = dict(arrowstyle="simple"))

plt.show()

This generates a scatter plot in which each of the five points is annotated with a different arrow or text-box style.

hamflow · Accepted Answer · 2022-10-12 07:59:24Z

17

For a limited set of values, matplotlib is fine. But when you have lots of values, the labels start to overlap other data points, and with limited space you can't simply omit them. In that case it's better to use a plot you can zoom in and out of.

Using plotly

import plotly.express as px

# Built-in sample dataset, filtered to one year and one continent
df = px.data.gapminder().query("year==2007 and continent=='Americas'")

fig = px.scatter(df, x="gdpPercap", y="lifeExp", text="country", log_x=True, size_max=100, color="lifeExp")
fig.update_traces(textposition='top center')
fig.update_layout(title_text='Life Expectancy', title_x=0.5)
fig.show()

edited Oct 12, 2022 at 7:59

hamflow


answered Jul 15, 2020 at 23:52

bigbounty


2 Comments

Saraha Over a year ago

what are you using here for inline zooming? It's not mpld3, is it?

mins Over a year ago

imho, an animation at this speed adds nothing, a carefully designed fixed image would be less frustrating.

William Miller · Accepted Answer · 2020-04-07 05:36:55Z

14

Python 3.6+:

import matplotlib.pyplot as plt
coordinates = [('a',1,2), ('b',3,4), ('c',5,6)]
for label, x, y in coordinates:
    plt.annotate(label, (x, y))

edited Apr 7, 2020 at 5:36

William Miller


answered Jan 6, 2020 at 15:40

palash


1 Comment

Mad Physicist Over a year ago

At that point, why not do coordinates = [('a',(1,2)), ('b',(3,4)), ('c',(5,6))] and plt.annotate(*x)?
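The suggestion in this comment can be sketched as follows. The scatter call is an addition here so that the axes cover the labeled points; it is not part of the comment.

```python
import matplotlib.pyplot as plt

# Each entry is (label, (x, y)), matching annotate's first two positional arguments.
coordinates = [("a", (1, 2)), ("b", (3, 4)), ("c", (5, 6))]

fig, ax = plt.subplots()
ax.scatter([xy[0] for _, xy in coordinates], [xy[1] for _, xy in coordinates])

for item in coordinates:
    ax.annotate(*item)   # same as ax.annotate(item[0], item[1])
```

Unpacking with * works because the label and the coordinate pair are annotate's first two positional parameters.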

Uzzal Podder · Accepted Answer · 2020-09-06 19:23:29Z

4

This might be useful when you need to annotate points individually at different times (that is, not in a single for loop):

ax = plt.gca()
ax.annotate('your_label', (x, y))

where x and y are your target coordinates, of type float or int.

answered Sep 6, 2020 at 19:23

Uzzal Podder


andor kesselman · Accepted Answer · 2019-12-03 08:13:38Z

3

As a one-liner using a list comprehension and numpy:

[ax.annotate(row[0], (row[1], row[2])) for row in np.array([n, x, y]).T]

The setup is the same as in Rutger's answer.

answered Dec 3, 2019 at 8:13

andor kesselman


2 Comments

Mad Physicist Over a year ago

Instead of a list comprehension, which creates a list of unwanted values, use something like deque(..., maxlen=0).

alparslan mimaroğlu Over a year ago

or use a regular for loop like a normal person. List comprehension is amazing and powerful but it should not be used in this situation
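For comparison, a sketch of the plain loop the commenters recommend, using zip over the three lists. One side effect of the numpy one-liner worth noting: stacking the integer labels with the float coordinates promotes the labels to floats, so they would typically render with a trailing ".0". The variable stacked_label exists only to demonstrate that.

```python
import matplotlib.pyplot as plt
import numpy as np

x = [0.15, 0.3, 0.45, 0.6, 0.75]
y = [2.56422, 3.77284, 3.52623, 3.51468, 3.02199]
n = [58, 651, 393, 203, 123]

fig, ax = plt.subplots()
ax.scatter(x, y)

# Stacking n with the float coordinates promotes the labels to floats.
stacked_label = np.array([n, x, y]).T[0][0]

# The plain loop keeps each label as the original int and builds no throwaway list.
for txt, xi, yi in zip(n, x, y):
    ax.annotate(str(txt), (xi, yi))
```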
