- R_Silvian wrote:
- !!!
???
If you read the results in the right way, it all fits very well:
Fishtest plays self-play tests only. And the latest progression test does indeed show nice progress compared to Stockfish 16:
https://github.com/official-stockfish/Stockfish/wiki/Regression-Tests
The progress is +19 Elo here.
Now let's look at the individual head-to-head result of Stockfish 231231 versus Stockfish 16 in my test run:
https://www.sp-cc.de/files/programs.dat
Because this is a (small) self-play test, too:
1000 (+285,=489,-226), 53.0 %
means: Stockfish 231231 scored 53.0% in its 1000 head-to-head games versus Stockfish 16. And a 53.0% score means:
+21 Elo
So the self-play Elo results fit very well... And the complete rating-list result of Stockfish 231231 in my UHO-Top15 Ratinglist is not self-play Elo.
So what we can learn here is that self-play Elo is not the same as rating-list Elo, even though both use UHO openings.
And we can learn that reading results carefully, and in the right way, can make some fiction very real.
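For anyone who wants to check the 53.0% to +21 Elo step: here is a small Python sketch, assuming the usual logistic Elo model (Elo difference = 400 * log10(score / (1 - score))). The function names are only mine for illustration, and this gives a plain point estimate without error bars, so it is not necessarily how the rating-list tool or Fishtest computes its numbers.

```python
import math


def score_from_wdl(wins: int, draws: int, losses: int) -> float:
    """Score fraction of a match: a win counts 1, a draw 0.5, a loss 0."""
    games = wins + draws + losses
    return (wins + 0.5 * draws) / games


def elo_from_score(score: float) -> float:
    """Elo difference implied by a score fraction under the logistic model.

    Only defined for 0 < score < 1 (a 0% or 100% score has no finite Elo).
    """
    if not 0.0 < score < 1.0:
        raise ValueError("score must be strictly between 0 and 1")
    return 400.0 * math.log10(score / (1.0 - score))


if __name__ == "__main__":
    # Head-to-head result quoted above: 1000 (+285,=489,-226)
    score = score_from_wdl(285, 489, 226)
    print(f"score = {score:.1%}")                    # 53.0% after rounding
    print(f"Elo   = {elo_from_score(score):+.0f}")   # +21 after rounding
```

With only 1000 games the statistical error on such a result is still considerable, so the close match with the +19 Elo from Fishtest should be read as "consistent", not as an exact confirmation.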