

A key for bubble sizes serves the same purpose as those tick marks for the third variable. It is fairly easy to evaluate and compare values based on horizontal or vertical lengths and positions, thanks to the tick marks on the axes. Include a legendĪs another tip, it’s recommended to include a legend or other key on your plot to show how different bubble sizes correspond with values of your third variable. Reducing bubble size can help provide some physical separation between points, but doing so will also make it more difficult to read values from bubble sizes. If there appears to be too much overplotting, then it might be worth thinking about a way to summarize the data or choose a different chart type to represent your data. There aren’t any hard guidelines for whether a dataset is appropriate for a bubble chart or not, but it’s a point to be aware of when creating a bubble chart. Without transparency, the smaller data point would not have been visible against the larger ones. This overlapping also means that there are limitations to the number of data points that can be plotted while keeping a plot readable. Limit number of points to plotīubble charts are commonly drawn with transparency on points since overlaps are a much easier occurrence than when all points are a small size. Many visualization tools will automatically match value to area, but be careful of those cases where value is matched to diameter or radius instead. In the same scenario as above, a point with twice the value of another point should have sqrt(2) = 1.41 times the diameter or radius so that its area is twice the smaller point’s.ĭepending on how you are creating your bubble chart, you may need to scale your data to account for how data values are mapped to point sizes. Instead, make sure that the bubbles’ areas correspond with the third variable’s values. When this kind of scaling is performed, a point with twice the value of another point will end up with four times the area, making its value look much larger than is actually warranted. One easy mistake that can be made is to scale the points’ diameters or radii to the third variable’s values. Best practices for using a bubble chart Scale bubble area by value

One point will be plotted for each row in the table. Two columns will correspond with the horizontal and vertical positions of each point, while the third will indicate each point’s size. Example of data structure avg_points_againstĪ bubble chart is created from a data table with three columns.
#Bi linear scatter plot series#
While it is easier to get the specific win counts for each team from this series of plots, the relationship between all three variables is not as clearly stated as in the bubble chart. The three scatter plots above show the same data as the original example bubble chart. It would require multiple two-variable scatter plots in order to gain the same number of insights even then, inferring a three-way relationship between data points will not be as direct as in a bubble chart. Z), as well as an overall three-way comparison. In a single bubble chart, we can make three different pairwise comparisons (X vs. However, the addition of marker size as a dimension allows for the comparison between three variables rather than just two. Like the scatter plot, a bubble chart is primarily used to depict and show relationships between numeric variables. This is a completely different chart type that will be discussed briefly towards the end of the article. The name “bubble chart” is sometimes used to refer to a different chart type, the packed circle chart. Instead, the main takeaway from the plot comes from the third variable: as teams score more points and allow fewer points from their opponents (towards the upper left), they will earn more victories, as one might naturally expect. (Ties are worth half a win.)įrom the plot, we can see that there is a lot more variance in points scored by teams than by their opponents, but there’s no particularly strong correlation between the two. Each bubble’s size indicates the number of wins earned by each team, with larger bubbles corresponding to higher win rates. A bubble’s horizontal position notes the average points scored against that team each game, and the vertical position notes the average points scored by that team each game. Each bubble represents a single team’s performance. The example bubble chart above depicts the points scored per game by teams in the regular season of the National Football League in 2018. Each dot in a bubble chart corresponds with a single data point, and the variables’ values for each point are indicated by horizontal position, vertical position, and dot size. A bubble chart (aka bubble plot) is an extension of the scatter plot used to look at relationships between three numeric variables.
