Friday, 11 March 2022

6 Nations 2021 Data Visualisation Project - Summary

In total, there were 213 point-scoring moments. Wales had the most with 46, Italy had the least with 16. The ranking by point-scoring momements doesn't quite match the final ranking of the teams. Wales and Italy are top and bottom yes, but Ireland "underperformed" the points they scored and France overperformed theirs. 

178 players were present for at least one point scoring moment. Wales used the most players with 34, England used the least with 27 players (you can't say Eddie Jones doesn't know his team). I was expecting France to be the team that used the most players since France were the team with a COVID outbreak.

The point-scoring moments were reasonably well spread out in time, with interesting peaks at 18 and 65 minutes.

  uyOCxG.png 

No minute had a point scored by all team, but several had points scored by 4/6 teams - including the 18th but not the 56th minute.

  uyOUKc.png 
(Sorry about the slight colour clash) 

The percentage version of the previous: uyOXyW.png 

Comparing who scored the points for the 6 teams is interesting: uyOnc1.png 

All the teams are dependent on kickers, as expected. It's possible that Scotland's is less skewed towards Finn Russell than it might be otherwise because of the game time Stuart Hogg got, due to injury or Russell being sent off. 

Ireland are that badly skewed towards Sexton even though Sexton missed a match. This does not bode well for when he retires

Players who were on the pitch at the same time: uyOdPf.png 

This is the players who were on the pitch for more than 28 point scoring moments each. Yes, Rees-Zammit and Tipuric stand out even in this field.

  uyhdox.png 

Showing which players are on the pitch for point-scoring moments just highlights that the ones on for most are those who don't get swapped off. 

I tried to make a dendrogram of all the teams. To make the labels readable, you need to reduce it to only those players on the pitch for at least 25 point-scoring moments. uyhyuN.png 

Unfortunately this removes all the Italian players. 

To include any Italian players, you have to drop the cut-off to at least 13 point-scoring moments. 

The same is true for the heat maps. Below is one where the labels are legible (all players were present for at least 25 point-scoring moments) uyX7hD.png 

Below is the one that includes any Italian players. uyXVPe.png 

Comparing the heatmaps for the different teams is interesting: uyiVos.png 

They're such different patterns. I mean, there's some similarity between all of them but, for instance, the Scottish one is much less dense, while the colouring for "on the pitch at the same time" is tightly packed in one corner for Wales. For Ireland it's packed, but less tightly. England is packed, but more in the middle-ish. I'm reasonably sure I've seen a carpet with the same pattern as France's and Italy's would make a good start for an artwork. 

Conclusion: This has definitely been worth doing, not just from a "learning how bits of R and json work" point of view. Visualising the data has revealed some interesting patterns, which I think reflects things about the teams. It would be interesting to see how things like this look in sports with rolling subs (odd that there's a Rugby League World Cup this year that might fill that gap ;) ) 

Other possible future work could include doing the same thing but looking at which players are on the pitch when points are conceded (although that could be unfair to Italy), or looking at attacking play again, but in the 2022 Six Nations, because there's only been one coaching change in the interim (Italy) and it'll be interesting to see how many of the patterns repeat - Ireland's reliance on Sexton, the very centralised England core, France not scoring (many) late tries. This analysis may be affected by England's enforced change of kicker due to Owen Farrell's injury.

Saturday, 26 February 2022

6 Nations 2021 Data Visualisation Project - Wales

Team: Wales 

Number of point scoring occasions: 46 

Number of players present for at least 1: 34 

Who scored the points? 
  u9NCbq.png 
Wales were another team where the kickers scored most of the points, but tries were reasonably evenly spread. 

When were the points scored? u9HTPe.png 
Wales's points were well spread over time 

Who scored when? 
u9VAAY.png 

Percentage version of the previous u9VXs1.png 

The spread of point scorers is borne out when the points per minute chart is coloured in by scorer. 

Who was present for more than one point scoring opportunity? u9VREs.png 

Louis Rees-Zammit and Justin Tipuric were the only Wales players present for all point-scoring moments. 

u9VJJe.png 

The "who was on the pitch when for points" line chart shows Wyn Jones, Tomas Francis, Josh Navidi, Dan Biggar, Adam Beard and Jonathan Davies were players who were taken off before the end of matches. Certainly the times for Dan Biggar might explain why Callum Sheedy got so many points.

Who was on the pitch at the same time when points were scored?

Dendrogram
u9fB3W.png
Heatmap
u9fcLN.png 

What's clear from the heatmap that wasn't clear from the dendrogram is that the Welsh team was very much Josh Adams, Dan Biggar, Adam Beard, Gareth Davies, Jonathan Davies (no relation), Louis Rees-Zammit, Justin Tipuric, Talupe Falatau, Alun Wyn Jones, Liam Williams, Wyn Jones (also no relation), Tomas Francis, Ken Owens, George North and Josh Navidi, with everyone else being brought on only if one of those needed to be taken off.

Matrix

   u9fR2b.png 

The matrix view doesn't show the central fifteen as well as the heatmap, because it shows not just the "core" team, but also the players most likely to be subbed on because Wales scored a lot of points after minute ~65 when the substitutions occurred. 

This is a good example of why producing all three types of diagram is useful. Patterns may be more obvious in one view that the other two, and which of the three shows the patterns most clearly may change based on the underlying data. 

(Files for this: json - https://github.com/fulltimesportsfan/6-Nations-2021/blob/main/Walespoints.json?raw=true 
Rproj - https://github.com/fulltimesportsfan/6-Nations-2021/blob/main/Six%20Nations%202021%20Wales.R?raw=true )

Wednesday, 23 February 2022

6 Nations 2021 Data Visualisation Project - Scotland

Team: Scotland 

Number of point scoring occasions: 39 

Number of players present for at least 1: 35 

Who scored the points? 
  ufgpHm.png 

Scotland's points were quite evenly spread between Finn Russell and Stuart Hogg, this is partly because of Finn Russell's injuries/yellow cards, but it also shows that they have two attacking prongs, which I think is part of why they did better in 2021. 

When were the points scored? ufcfnG.png 

Scotland's points are remarkably evenly spread out through out the 80 + extra time minutes. 

  Who scored when?
ufg2lD.png 

 The point scored by Russell and Hogg are also reasonably distributed by time, neither of them was "the one that scored at the start of the match" or "the one that finished matches off".

ufgPNe.png 

Percentage version of the previous 

Who was present for more than one point scoring opportunity? ufgZAb.png 

No Scotland player was present for all the points. Those present for the most were Stuart Hogg and Duhan van der Merwe who were present for 39/41 points. ufqxQz.png 

Scotland probably shows the best what I wanted to describe with this visualisation, so you can see that Matt Fagerson, Rory Sutherland, George Turner, Darcy Graham, Sam Johnson, Sam Skinner and Finn Russell were the players subbed off (I think in Russell's case, to prevent him picking up yet more yellow cards). 

Who was on the pitch at the same time when points were scored?

Dendrogram
ufqFZJ.png 

There does look to be a clump of Hogg, van der Merwe, the two Fagersons, Jamie Ritchie, Sutherland, Turner, Ali Price, Russell, Chris Harris (no, not that one) and Cameron Redpath (who makes me feel ancient) and then everyone else.

Heatmap
   ufqcsQ.png 

The number and the mix and match nature of the Scotland team is born about by most of the heatmap being very pale, with only really Hogg, van der Merwe and Hamish Watson always being together.

Matrix
   ufqgUx.png 

That's also borne out by the matrix view, which has a lot of players on it (23), but the previously identified central chunk with lots of players sticking out from it. 

(Files for this: json - https://github.com/fulltimesportsfan/6-Nations-2021/blob/main/Scotlandpoints.json?raw=true 

Rproj - https://github.com/fulltimesportsfan/6-Nations-2021/blob/main/Six%20Nations%202021%20Scotland.R?raw=true )

Friday, 18 February 2022

6 Nations 2021 Data Visualisation Project - Italy

Team: Italy 

Number of point scoring occasions: 16 

Number of players present for at least 1: 28 

Who scored the points? 
u7Z15O.png 

Not unexpectedly, the kickers. 

When were the points scored? u7Zio2.png 

Italy's points tend to be scored in the first 20 minutes of each half. Possibly this is related to the terrible phenomenon of "Italy are in the game for 60 minutes and then it all falls to pieces". 

Who scored when? 
  u7Zt7x.png 

(Sorry about the colour clash between Garbisi and Ioane) 

Who was present for more than one point scoring opportunity? u7ZMZN.png 

Only Ioane was on the pitch for all Italian points scored, then there's a clump of 6 players (Negri, Garbisi, Lamaro, Bigi, Meyer and Brex) who were on the pitch for 14/16 points scored.

u7ZUVs.png 

Other than Riccioni, there's no-one who is a clear "front row 50th minute sub off", possibly because players who were subs in one match were starters in the next. 

Who was on the pitch at the same time when points were scored? 

Dendogram
u7ZDk8.png 
Heatmap 
  u7ZnAQ.png 

The Italian heatmap ... is a Rorshach blot that looks like a beetle. Again, this possible reflects the same thing as the "when players played" diagram, because players who were starters in one match were subs in the next. 

Matrix 
  u7Zzsq.png 

As well as Riccioni sticking out (also seen in other diagrams), Lovotti, Cannone and Fischetti do, and I think they're all front rowers. (Double checks) Lovotti, Riccioni and Fischetti are props, but Cannone is a lock so I can't explain that. I will discuss with the subject matter expert and amend if I get any more information. 

(Files for this: json - https://github.com/fulltimesportsfan/6-Nations-2021/blob/main/Italypoints.json?raw=true Rproj - https://github.com/fulltimesportsfan/6-Nations-2021/blob/main/Six%20Nations%202021%20Italy.R?raw=true )

Sunday, 13 February 2022

6 Nations 2021 Data Visualisation Project - Ireland

Team: Ireland 

Number of point scoring occasions: 41 

Number of players present for at least 1: 32 

Who scored the points? 
Johnny Sexton. 

  uRqXXQ.png 

No really, as an Ireland fan, I am not at all terrified by how dependent Ireland are on a now 37 year old Johnny Sexton. And that domination is with him missing a game due to concussion protocols. 

 When were the points scored? uRqi0x.png 
A lot of Ireland's points were scored in minutes 30-50. 

Who scored when? 
  uRqA81.png 

And now the percentage view. uRq61n.png 

This view makes it even more clear how dependent Ireland are on Sexton. 

Who was present for more than one point scoring opportunity? uRqCLs.png 

Only Hugo Keenan was present for all of Ireland's points. uRqtvN.png 

Because Ireland's team were more mix and match there is less of a "and these are players who swap for each other", except for Josh van der Flier and Tadhg Furlong. 

  Who was on the pitch at the same time when points were scored?

Dendrogram
uRqhSJ.png
Heatmap
uRqEyj.png 

The mix and match nature of the Irish team shows up in the heatmap, with there being sort of a 8 core, followed by another bunch of 8, then a less used 8, then the least used 8 (including Peter O'Mahony who was banned for 3 games after that sending off). There's also more of a weaving of colours.

Matrix
  uRqU32.png 

Again, the matrix view supports the information from the others, that Ireland swapped a lot of players in and out, with a lot of players in it as co-occurring, even though they missed several matches. 

(Files for this: 
json - https://github.com/fulltimesportsfan/6-Nations-2021/blob/main/Irelandpoints.json?raw=true Rproj - https://github.com/fulltimesportsfan/6-Nations-2021/blob/main/Six%20Nations%202021%20Ireland.R?raw=true )

Thursday, 10 February 2022

6 Nations 2021 Data Visualisation Project - France

Yes, the delay was partly making sure I didn't mispell Jalibert's name as Jalabert. Jaja forever. 

Team: France 

Number of point scoring occasions: 39 

Number of players present for at least 1 point scoring moment: 28 

Who scored the points?
uXULz1.png 

The points scorers are more evenly spread for France, but that might be because Jalibert was injured for a match and a half. Never the less, 3 non-kicking players scored 3 times, and another 2 scored twice.

 When were the points scored? uXUPZj.png 

France like scoring points in minutes 7-11, with 4 sets of points scored then. It's also interesting how early most of the points are with most coming before the 60th minute - only 5/39 coming after then. 

Who scored when? Sorry about the colour clash between Penaud and Jalibert. uXU2QG.png 

It's interesting that the points when Ntamack took over kicking came later in matches, even in the match he started rather than coming on as a substitute for Jalibert. 

This is the % version of the same diagram - uXUkoc.png 

Who was present for more than one point scoring opportunity? uXUJuW.png 

Charles Ollivon and Brice Dulin were on the pitch every time France scored. Gregory Alldritt, Gael Fickou and Antoine Dupont only missed one. 

The French version of the time on the pitch for the players (below) clearly shows the players who play in roles that lead to being subbed off (Haouas, Baille, Willemse and Marchand). It also shows the fact that Jalibert's points all come early really well too.

uXUdks.png 

Who was on the pitch at the same time when points were scored? 

Dendrogram
  uXU9Nz.png
Heatmap

uXUWlJ.png 

France don't have quite the same "two clear sets of players" as England, with France it's more a core of Ollivon, Dulin, Alldritt, Dupont and Fickou who can either play with Group 1 (Vakatawa, Villiere, Le Roux, Haouas, Baille, Willemse, Cretin, Jalibet, Penaud, Vincent, Thomas and Marchand) or Group 2 (Toafifenua, Jelonch, Ntamack, Rebbadj). It also nicely shows players who don't play together, probably because of positional overlap (Vakatawa and Villiere and Le Roux).

Matrix

  uXUZMn.png 

The matrix agrees with the heatmap, with the addition of Pierre Bourgarit sticking out. 

Files for this: 

json - https://github.com/fulltimesportsfan/6-Nations-2021/commit/e5afc284ab723602f3b446b0b6276f6e940095ef?raw=true 

Rproj - https://github.com/fulltimesportsfan/6-Nations-2021/commit/e5afc284ab723602f3b446b0b6276f6e940095ef?raw=true