The group stage is over, the Round of 32 is set โ and the model has changed its mind. โฝ๏ธ
When I ran the numbers before kickoff the model had a bold, market-contrarian opinion: ๐ช๐ธ Spain to win it at 35.3%, ๐ฆ๐ท Argentina second at 23.0% and both dramatically underpriced by the bookies โ Spain’s +19.3pp edge over Polymarket was the strongest disagreement between maths and money I’d ever put on this blog. I promised I’d come back on the other side of the trophy ceremony to find out whether the model deserved its confidence.
This is not that piece. Consider it extra time: an unscheduled period I hadn’t planned to write, called because seventy-two matches of actual football have done what ten million simulations couldn’t. ๐ช๐ธ Spain are no longer the favourite. ๐ฆ๐ท Argentina are. Before we get to why, let’s do the honest thing and grade the homework.
Where the model was right, and where it face-planted
Elo prices the long run. It does not price two good weeks or two bad ones, which is exactly what a World Cup group stage is โ so the gap between what the model expected in June and what actually happened is where the story lives.
And it lives loudest in the exact fixture I built the goal model around. Back in part 2c my worked example was ๐ช๐ธ Spain vs ๐จ๐ป Cape Verde โ a 579-point Elo gap so lopsided the Poisson had to be floored just to give ๐จ๐ป Cape Verde a pulse. The model gave ๐ช๐ธ Spain 94.1% to win, ๐จ๐ป Cape Verde 1.0%, and a most-likely scoreline of 3โ0. The actual result: 0โ0. ๐จ๐ป Cape Verde held the biggest paper favourite of the entire group stage to a goalless draw, qualified from Group H anyway, and turned my tidy little example into the biggest upset of the tournament so far. That floor of 0.25 expected goals I spent a whole paragraph justifying? It earned its keep.
Some of the pre-tournament reads aged badly:
| Team | Pre-tournament qualify | Reality |
|---|---|---|
| ๐บ๐พ Uruguay | 95.0% | Out |
| ๐น๐ท Turkey | 86.8% | Out |
| ๐จ๐ฟ Czech Republic | 76.8% | Out |
Group H produced two of the tournament’s headline inversions at once: ๐บ๐พ Uruguay out and ๐จ๐ป Cape Verde in โ a near-perfect reversal of what the ratings expected from ๐ช๐ธ Spain’s group.
And some aged rather well:
| Team | Pre-tournament qualify | Reality |
|---|---|---|
| ๐ฌ๐ญ Ghana | 7.9% | Through |
| ๐จ๐ป Cape Verde | 24.3% | Through |
| ๐จ๐ฉ DR Congo | 25.4% | Through |
There’s no single lesson here beyond the obvious one the snooker write-ups kept circling: a rating integrated over years of competitive results is a statement about the long run, and a 48-team knockout that turns on three matches per side will always hand you a fistful of upsets the rating never saw coming. ๐ฌ๐ญ Ghana going from a 7.9% no-hoper to a certainty is not the model being wrong โ it’s the model being a prior, and the football being the evidence.
The bracket, for real this time
The group stage is done, so instead of simulating 72 matches and then the knockouts, the engine only needs to simulate forward from the Round of 32 โ much like the snooker simulations that always started from a fixed draw.
Here is the left-hand panel of that animation, frozen for inspection:
And the shape is unkind to ๐ช๐ธ Spain. ๐ฆ๐ท Argentina and ๐ช๐ธ Spain are still in opposite halves โ they cannot meet before the final โ but ๐ซ๐ท France, a market co-favourite all along whose form the group stage duly confirmed, have landed in ๐ช๐ธ Spain’s half. The top half is now a two-heavyweight pile-up: ๐ช๐ธ Spain reach the final from there about 40% of the time and ๐ซ๐ท France about 38%, which means one of them most likely knocks the other out in the semis. ๐ฆ๐ท Argentina, by contrast, has the run of a far softer bottom half โ 55% to reach the final, with ๐ด๓ ง๓ ข๓ ฅ๓ ฎ๓ ง๓ ฟ England next at 14%.
๐ช๐ธ Spain’s modal path to the final: ๐ฉ๐ฟ Algeria (1780), ๐ต๐น Portugal (1990), ๐ง๐ช Belgium (1884), ๐ซ๐ท France (2123). ๐ฆ๐ท Argentina’s: ๐จ๐ป Cape Verde (1622), ๐ฆ๐บ Australia (1800), ๐จ๐ด Colombia (2004), ๐ด๓ ง๓ ข๓ ฅ๓ ฎ๓ ง๓ ฟ England (2038). Both enter the knockouts at an identical Elo of 21441 โ the bracket alone turns that into 31.8% for ๐ฆ๐ท Argentina and 24.7% for ๐ช๐ธ Spain.
The effect on the full title race:
| Team | Pre-tournament | Now | ฮ |
|---|---|---|---|
| ๐ฆ๐ท Argentina | 23.0% | 31.8% | +8.8pp |
| ๐ช๐ธ Spain | 35.3% | 24.7% | โ10.6pp |
| ๐ซ๐ท France | 12.7% | 22.2% | +9.5pp |
| ๐ด๓ ง๓ ข๓ ฅ๓ ฎ๓ ง๓ ฟ England | 6.0% | 5.7% | โ0.3pp |
| ๐จ๐ด Colombia | 3.3% | 3.9% | +0.6pp |
| ๐ง๐ท Brazil | 3.9% | 3.3% | โ0.6pp |
| ๐ต๐น Portugal | 3.5% | 1.8% | โ1.7pp |
| ๐ณ๐ฑ Netherlands | 2.0% | 2.5% | +0.5pp |
The non-obvious move: ๐ซ๐ท France, a distant third at 12.7% before kickoff, have surged to 22.2% โ close enough to ๐ช๐ธ Spain’s 24.7% that second place is a genuine coin flip.
Model vs market, take two
Disclaimer: This section discusses betting odds for the purpose of statistical comparison and analysis. It is not intended to promote gambling or serve as betting advice. Please gamble responsibly and be aware of your local laws and age restrictions.
The spine of part 2c was a number: ๐ช๐ธ Spain’s +19.3pp edge over Polymarket, the model screaming value where the market shrugged. The obvious question for a follow-up is whether that disagreement resolved โ and if so, who blinked.
Both did. They’re converging.2
| Team | Model now | Market now | Edge now | (Edge pre-tournament) |
|---|---|---|---|---|
| ๐ฆ๐ท Argentina | 31.8% | 22.0% | +9.8pp | (+14.7pp) |
| ๐ช๐ธ Spain | 24.7% | 10.5% | +14.2pp | (+19.3pp) |
| ๐ซ๐ท France | 22.2% | 22.9% | โ0.7pp | (โ2.9pp) |
| ๐ด๓ ง๓ ข๓ ฅ๓ ฎ๓ ง๓ ฟ England | 5.7% | 10.4% | โ4.7pp | (โ4.5pp) |
| ๐จ๐ด Colombia | 3.9% | 2.4% | +1.5pp | (+1.5pp) |
| ๐ง๐ท Brazil | 3.3% | 5.6% | โ2.4pp | (โ4.4pp) |
| ๐ณ๐ฑ Netherlands | 2.5% | 4.2% | โ1.7pp | (โ1.9pp) |
| ๐ต๐น Portugal | 1.8% | 4.9% | โ3.1pp | (โ6.7pp) |
And the same picture plotted log-log, so the disagreements stand out:
The most famous edge โ ๐ช๐ธ Spain’s +19.3pp โ has shrunk to +14.2pp. Not because the model held firm while the market caught up: both moved. The model cooled on ๐ช๐ธ Spain (35.3% โ 24.7%); the market went further still (16.0% โ 10.5%). ๐ฆ๐ท Argentina is the mirror: here the market did the catching up. Polymarket nearly tripled their implied chance from 8.3% to 22.0% while the model climbed more modestly (23.0% โ 31.8%), narrowing the gap from +14.7pp to +9.8pp โ but leaving ๐ฆ๐ท Argentina as the clearest model-vs-market disagreement that remains.
Further down, the pattern from the snooker pieces survives intact: ๐ด๓ ง๓ ข๓ ฅ๓ ฎ๓ ง๓ ฟ England, ๐ง๐ท Brazil and ๐ต๐น Portugal all carry a market premium the rating won’t pay โ the established-name, knows-how-to-win-a-tournament tax that betting crowds price into famous shirts and Elo simply can’t see. International football has its Ronnies too.
The format is a disgrace, and now I can prove it
Allow me a paragraph of editorial โ I think I’ve earned it. We just played seventy-two group matches to send sixteen teams home, and bar ๐จ๐ป Cape Verde’s opening-match ambush almost all of it went to script. Two genuine surprises in two weeks โ ๐บ๐พ Uruguay and ๐น๐ท Turkey, both odds-on to qualify, both out โ and everyone else who was meant to advance, advanced. Two weeks of football to confirm what the ratings already knew in June.
It was also less competitive than a World Cup should be, and for once I can put a number on the grumble. A blow-out โ a win by three goals or more โ landed in exactly a quarter of this group stage’s matches. Across the seven tournaments of the 32-team era, 1998 to 2022, that rate averaged 14.6% and never once cleared 21%:
| Edition | Group-stage blow-outs (โฅ3 GD) |
|---|---|
| 1998 | 16.7% |
| 2002 | 12.5% |
| 2006 | 14.6% |
| 2010 | 10.4% |
| 2014 | 20.8% |
| 2018 | 16.7% |
| 2022 | 10.4% |
| 2026 | 25.0% |
More than double the 2022 rate of 10.4%, and clear of every previous edition by a street. Invite sixteen more teams who have no business on the same pitch as the elite, and the scoreboards say exactly that.3
The tie-break ladder did its own damage. ๐ฉ๐ฟ Algeria and ๐ฆ๐น Austria met in their final group match knowing any result would qualify them both โ they played last, with perfect information on everything that mattered. Before a ball was kicked, the potential Disgrace of Kansas City was already being written about โ an echo of the Disgrace of Gijรณn, in which ๐ฉ๐ช West Germany and ๐ฆ๐น Austria quietly played out the result that suited them both. But 2026’s version was worse: ๐ฉ๐ฟ Algeria had no incentive to win. A win promoted them to Group J runners-up and a Round-of-32 tie against ๐ช๐ธ Spain; a draw kept them third and facing ๐จ๐ญ Switzerland instead. Winning was actively harmful.
The match ended 3โ3 in stoppage time: Mahrez struck in the 90+3 to make it 3-2 before Sasa Kalajdzic equalised in the 90+5 with the last kick of the game. If it was staged, it was staged perfectly โ nobody can claim collusion after a finish that dramatic. But from the 60th minute, played at 2โ2, both teams were comfortable: the crowd whistled, Laimer was spotted doing stretches, neither side showed any urgency to find a third. You cannot distinguish honest fatigue from rational quiet when the format has already made both look identical.4
The teams are not to blame. If the rules reward not trying to win, rational competitors will not try to win โ that is what competing means. The fault is FIFA’s for designing rules that punish the very instinct they are supposed to reward. Good game design makes the rational move the exciting one; here it made restraint the winning strategy. ๐ด๓ ง๓ ข๓ ณ๓ ฃ๓ ด๓ ฟ Scotland and ๐ฐ๐ท South Korea paid the price โ playing out their fate three days before ๐ฉ๐ฟ Algeria kicked off, with no idea how many points would be enough or which R32 match they were competing for, while ๐ฉ๐ฟ Algeria stepped onto the pitch knowing every other group’s outcome and exactly what a draw meant for their bracket path. That is not a level playing field. I do not know what you call a governing body that designs a system this broken forty-four years after showing the world what broken looks like, but I know it is not incompetence.
And the bracket it spat out is a lottery. A seeded draw exists to keep the best sides apart until the business end; this one does the reverse. ๐ณ๐ฑ Netherlands won Group F; ๐ฒ๐ฆ Morocco came through Group C โ ๐ง๐ท Brazil’s group โ to qualify. Their Round-of-32 reward is each other. ๐ต๐น Portugal vs ๐ญ๐ท Croatia in the same round is a tie most tournaments would call a quarter-final. The likely Round of 16 brings ๐ช๐ธ Spain vs ๐ต๐น Portugal and ๐ซ๐ท France vs ๐ฉ๐ช Germany โ fixtures a sane draw saves for the semi-finals.
So after two weeks and seventy-two matches, the tournament finally starts in earnest โ with the strongest thirty-two teams, single elimination.
For the record: I loathe FIFA’s greed and I think 48 teams is a format only an accountant could love. But โ and it costs me something to admit it โ the bloat does make the simulation more fun: more teams, more bracket chaos, more for the model to get spectacularly wrong. Some consolation for this data geek. I am already, grimly, looking forward to the inevitable 64-team edition.
Final whistle โฝ
The model has eaten its own headline โ and then some. ๐ช๐ธ Spain were probably going to win this thing; now it’s ๐ฆ๐ท Argentina out front, ๐ซ๐ท France gatecrashing the podium, and the market โ for once โ shuffled most of the way toward agreeing. The +19.3pp argument I picked with the bookies in June is half-settled, amicably, with both sides having moved toward the middle.
The bracket has a sense of humour. ๐จ๐ป Cape Verde โ the team I used in part 2c to illustrate the goal floor, so lightly rated the Poisson needed a minimum just to give them a credible expected-goal figure โ held ๐ช๐ธ Spain to 0-0, qualified from Group H, and drew ๐ฆ๐ท Argentina in the Round of 32. The model has done its sums. Wichtig is’ auf’m Platz, ๐จ๐ป Cape Verde.
Extra time is up. I said in part 2c I’d be back on the other side of the trophy ceremony to find out whether the model deserved its confidence โ that piece arrives on 19 July, the penalty shootout. Thirty-one matches stand between here and the only result that actually grades the homework.
All the code, data snapshots and figures for this article live on GitLab.
As before, ratings come from eloratings.net, the established World Football Elo Ratings, refreshed after every international. The snapshot used for this article was taken at 2026-06-28T05:00:05Z. ↩︎
Decimal odds quote the total return per unit stake including the stake, so fair odds for probability \(p\) are exactly \(1/p\). Summed across all 48 contracts the market-implied probabilities exceed 100% โ the overround is the house’s cut. Polymarket’s is about 3%; a typical sportsbook is closer to 5โ8%. The house always wins. ↩︎
Blow-out rates are computed from the martj42 international-results dataset, taking each 32-team World Cup’s group stage to be its first 48 matches (8 groups of 4). The 2026 figure is over all 72 group matches. ↩︎
๐ฉ๐ฟ Algeria’s incentive was simply to avoid winning โ a draw was perfectly optimal. ๐ฆ๐น Austria’s position was potentially even more perverse. Had ๐บ๐ฟ Uzbekistan and ๐จ๐ฉ Congo drawn in Group K, that group’s third-placed team would have finished with just two points; ๐ฆ๐น Austria’s three pre-match points would then have guaranteed them a top-eight finish among all third-placed teams even after a one-goal defeat โ lose by two and they might have been out. In that scenario, losing by a single goal was ๐ฆ๐น Austria’s rational play: hand ๐ฉ๐ฟ Algeria the runner-up slot (and ๐ช๐ธ Spain), and collect ๐จ๐ญ Switzerland instead. A competition worth billions had written rules under which conceding goals on purpose was the winning strategy โ good game design makes sure such scenarios cannot arise. ↩︎