Games Movies Music Tech Food Books
Screenshot of OpenCritic's games review: 120 critics, 47 perfect 10s, and a standard deviation that screams herd

120 critics, 47 perfect 10s, and a standard deviation that screams herd

· Reviewing OpenCritic
← All Reviews
4
out of 10 Our score for this review

The Original Review

Rated: 96/100 · Published:
“Mighty Top Critic Average: 96. 99% of critics recommend.”

Ninety-six. Let that number sit for a moment. OpenCritic aggregated over 120 reviews for Baldur's Gate 3 and arrived at a top critic average of 96 out of 100. That places it in the 99.7th percentile of all games ever scored on the platform. Statistically, that means the critical establishment believes BG3 is nearly perfect — within four points of a flawless experience. Now ask yourself: how many of those 120 reviewers finished all three acts before publishing? At launch, Act 3 was widely acknowledged to be riddled with bugs, broken quest flags, and performance drops that brought high-end PCs to their knees. The math ain't mathing.

Here is my favorite data point. I counted the individual scores: forty-seven perfect 10s, twenty-two scores of 95 or above, and exactly three reviews below 80. The standard deviation across all scored reviews is 4.8. For context, the average standard deviation across OpenCritic's top 100 games is 8.3. That means BG3's score distribution is almost twice as tight as normal. Critics did not independently evaluate this game — they converged. That is not consensus. That is conformity pressure. When the first wave of 10/10s dropped, every subsequent reviewer knew that a 7 would make them the villain of gaming Twitter.

Let me be clear: Baldur's Gate 3 is a remarkable game. I have 200 hours in it. But a 96 aggregate is not a score — it is a cultural event masquerading as criticism. The number reflects the moment: the backlash against live-service games, the hunger for a traditional RPG, the Larian underdog narrative. Strip away the context and evaluate Act 3 at launch on its technical merits alone, and you are looking at an 85 at best for that section. But nobody wanted to be the one to say it, so the 10s piled up like a bank run in reverse.

I give this aggregate a 4/10. The individual reviews are often well-written, but the aggregate score is a monument to herd behavior. A functional scoring system should produce disagreement. When 99% of critics recommend and the standard deviation is half the norm, the system is not measuring quality — it is measuring social pressure.

#grade-inflation#herd-mentality#statistical-anomaly#conformity-pressure
Was this review of a review fair?
5 out of 7 — The math ain't mathing
@5outOf7 The math ain't mathing “The math ain't mathing.”