https://rebel7775.wixsite.com/rebel/adrl-blitzADRL stands for - Anti Draw Rating List.
Less draws between top engines.
Using the challenging TCEC positions composed by
Jeroen Noomen and
GM Matthew Sadler where there is always something to play for.
- Code:
-
# PLAYER : RATING ERROR POINTS PLAYED (%) CFS(%) W D L D(%)
1 Stockfish-16 : 3746.5 22.2 408.0 520 78 100 301 214 5 41
2 Berserk-11.1 : 3554.2 16.6 296.0 520 57 79 171 250 99 48
3 Komodo-Dragon-2.5 : 3546.4 12.7 291.0 520 56 96 150 282 88 54
4 RubiChess-20230918 : 3520.9 21.7 274.5 520 53 74 133 283 104 54
5 Rebel-Dev : 3513.2 6.7 269.5 520 52 57 131 277 112 53
6 CST-2.0 : 3510.9 21.4 268.0 520 52 98 130 276 114 53
7 Koivisto-9.0 : 3475.6 23.3 245.0 520 47 54 106 278 136 53
8 Ethereal-14.00-NNUE : 3474.0 20.9 244.0 520 47 65 105 278 137 53
9 Clover-6.0 : 3467.1 18.7 239.5 520 46 77 92 295 133 57
10 Rebel-EAS : 3460.2 19.6 235.0 520 45 57 104 262 154 50
11 Caissa-1.13.1 : 3457.8 13.5 233.5 520 45 100 106 255 159 49
12 Revenge-3.0 : 3429.9 9.6 215.5 520 41 83 90 251 179 48
13 Igel-3.5.0 : 3422.0 10.9 210.5 520 40 54 77 267 176 51
14 rofChade-3.1 : 3421.2 10.0 210.0 520 40 --- 80 260 180 50
Total Games : 3640
White Wins : 1569 (43.1%)
Black Wins : 207 (5.7%)
Draws : 1864 (51.2%)
Remarks1. Playing robin rounds only, meaning every engine plays against every engine.
2. Draw rate dropped to 51.2%
3. More games will be played but as I have noticed not much has changed after 150 games each.
4. Stockfish 16 massively profited, the elo difference with number two is
192 elo. As for a wild guess, Stockfish has a much bigger net based on more than 100 billion positions, see
here.
5. Within a couple of days I will start a second ADRL based on short time control 40/120 and it is expected the elo gap between Stockfish and its runner-up will shrink.
6.
By no means this is criticism on other rating lists who play from more balanced positions. The ADRL just sends a message there are alternatives to the ever increasing draw rate between top engines. Such as Stefan Pohl excellent initiative with his
UHO rating list also does, much lower draw rates.