A new measure of tournament toughness: Harmonic Field Strength

... The optimum exponent is 0.77, meaning the optimum influence of the 2nd player is 59% rather then 50% of the top player and the optimum influence of the 10th player is one sixth rather than one tenth that of the top player.
It seems like using half-integers in the series (1.5, 2.5, 3.5...) will give nearly the same change of influence. Is there a theoretical reason to believe that an exponent rather than an offset gives a better result?

Does the HFS let me know, "This tournament is as tough to cash in as if every player I play has a rating equal to the HFS"?
 
[...]

Does the HFS let me know, "This tournament is as tough to cash in as if every player I play has a rating equal to the HFS"?
Seems pretty close to that. Here are some results from simulations (Dream64 plus several other majors), showing the HFS (with 0.77 exponent), the player in the field with rating closest to that, and that fraction of the time that player finished top 16 in 64-player tournament in the simulations). You can see it is around a third of the time.
1760698543824.png
 
[...] Is there a theoretical reason to believe that an exponent rather than an offset gives a better result?
[..]
I know of none, but comparison with simulations shows the HFS with optimized exponent works quite well.

Here's what we did.

A "simulation" means playing out a tournament where each match winner is determined by flipping a bent coin in the computer. Knowing the rating difference between the players and that it is a race to 11 (what we used) tells us how much to bend each coin.

We did 100,000 64-player SE simulations for each of several tournament fields (Florida Open US Open, Peri, etc.) with random draw, and for each tournament we assigned points to finish places of of 1, 2, 4, 8, 16, or 32 for R32 finish, R16, QF, SF, runnerup and winner. Then for each player we compute an average number of points per tournament. So this tracks typical $$ distribution pretty well.

The first tournament we'll call THE BEAST: the field is the top 64 players in world by Fargo Rating.

And because THE BEAST is the hardest tournament field, all players will earn fewer points in THE BEAST than in any real-world field. Take YAPP, for instance, because he is in every field. If YAPP or any given player earns more points in a tournament, that's what we mean by a weaker tournament field. YAPP earns 2.86 points per tournament in THE BEAST, and he earns 5.43 points per tournament in the China Open, close to twice as many, i.e., a ratio of close to 2. There is a ratio like this for all 24 players who played both in the China Open and in THE BEAST, and the geometric mean of those those ratios is our tournament difficulty factor. For China Open that factor really is 2.0. We express that 2.0 in "rating speak," meaning we call the China Open 100 points in some sense below THE BEAST. Rating-speak (converting a ratio to a rating gap) allows a cleaner comparison to Harmonic Field Strength, which already lives in that rating-type world.

If the brute-force tournament difficulty factor and HFS really are measuring the same thing (what we want), there should be a nice straight line relationship between them. A weaker nice-to-have is that the fields are ordered the same way in terms of difficulty (field strength).

Tuning that exponent is really like finding the sweet spot for that top dogs vs depth competition.

This is a good straight line, meaning HFS/0.77 is an easy way to assess and compare tournament field strengths for pro tournaments without doing extensive simulation.
1760728867412.png
 

Attachments

  • 1760728844031.png
    1760728844031.png
    5.7 KB · Views: 10
Back
Top