Posts : 160 Join date : 2022-03-01 Location : Berlin
Subject: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Wed Oct 16, 2024 1:57 pm
The testruns of Rebel EAS 2.0 and Rebel Extreme are finished. Only for my full Ratinglist, because both are too weak to enter the UHO-Top15 ratinglist.
I find these results of the 2 new Rebels quite disappointing, especially when comparing them with Velvet 8 (normal/risky): Rebel Extreme gained 66000 EAS-points compared to Rebel EAS 2.0, but lost -164 Celo. Velvet 8 risky (EAS:201351) gained 104000 EAS-points compared to Velvet 8 normal(EAS:96753) (=doubled the EAS-score!), but lost only -18 Celo !!!
Posts : 2608 Join date : 2020-11-17 Location : Netherlands
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Wed Oct 16, 2024 8:52 pm
Thank for testing the 2 new Rebel's in the first place.
Contrary to you I am not disappointed at all.
Rebel-EAS-2.0 in comparison with 16.3 doubled its EAS score with hardly any elo loss, it's a signal I am on the right track.
Rebel-Extreme, second after Patricia and about 200 elo stronger, perfect start.
I have enough ideas left for both concepts.
Admin Admin
Posts : 2608 Join date : 2020-11-17 Location : Netherlands
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Wed Oct 16, 2024 9:09 pm
BTW, in your test you let Rebel play against 2 x Patricia, that means all 3 lose EAS points, see my post elsewhere. If you remove the games against the 2 x Patricia's I am pretty sure the EAS will be higher.
Admin Admin
Posts : 2608 Join date : 2020-11-17 Location : Netherlands
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Wed Oct 16, 2024 9:21 pm
BTW-2, I can't find the games of the 2 Rebel's on your website, do I need new glasses?
pohl4711
Posts : 160 Join date : 2022-03-01 Location : Berlin
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Thu Oct 17, 2024 7:12 am
Admin wrote:
BTW, in your test you let Rebel play against 2 x Patricia, that means all 3 lose EAS points, see my post elsewhere. If you remove the games against the 2 x Patricia's I am pretty sure the EAS will be higher.
??? Completely wrong. Of course, these aggressive engines are never tested against each other in my testings (sorry, that I did not mention this earlier):
Posts : 160 Join date : 2022-03-01 Location : Berlin
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Thu Oct 17, 2024 7:18 am
Admin wrote:
Thank for testing the 2 new Rebel's in the first place.
Contrary to you I am not disappointed at all.
Rebel-EAS-2.0 in comparison with 16.3 doubled its EAS score with hardly any elo loss, it's a signal I am on the right track.
Rebel-Extreme, second after Patricia and about 200 elo stronger, perfect start.
I have enough ideas left for both concepts.
OK, Rebel EAS 2.0 compared with Rebel 16.3 looks better, indeed. But, what M.Honert (Velvet) achieved is very, very impressing. He lost less than -20 Celo and gained 100k EAS-points.
The Patricia author just claimed on discord, the new Patricia dev is around +200 Elo stronger than V3.1 (would mean the same strength as Rebel Extreme), without loosing EAS (still around 400k). Of course, this is only a claim. But in the past, his predictions fitted quite well to my following testings.
pohl4711
Posts : 160 Join date : 2022-03-01 Location : Berlin
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Thu Oct 17, 2024 7:23 am
On discord (Stockfish engines dev) the Viridithas author made this, could be interesting for you, I presume. Each yellow X is one engine of my full EAS-ratinglist. Great to see, that EAS becomes interesting for more and more people in our computerchess-bubble. Especially interesting, because Viridithas has a very low EAS-scoring...
Current Pareto frontier for engine aggressiveness / strength (good news: Rebel Extreme is one of the engines on the border-line (but only until the new Patricia is released?!?), but Rebel EAS 2 is not. So, the work must go on...)
Admin Admin
Posts : 2608 Join date : 2020-11-17 Location : Netherlands
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Thu Oct 17, 2024 9:12 am
pohl4711 wrote:
Admin wrote:
BTW, in your test you let Rebel play against 2 x Patricia, that means all 3 lose EAS points, see my post elsewhere. If you remove the games against the 2 x Patricia's I am pretty sure the EAS will be higher.
??? Completely wrong. Of course, these aggressive engines are never tested against each other in my testings (sorry, that I did not mention this earlier):
Yes you did say that, ok, no problem.
Admin Admin
Posts : 2608 Join date : 2020-11-17 Location : Netherlands
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Thu Oct 17, 2024 9:16 am
pohl4711 wrote:
Admin wrote:
BTW-2, I can't find the games of the 2 Rebel's on your website, do I need new glasses?
Only in my full Ratinglist. Too weak for my UHO-Top15 Ratinglist:
Can you please upload the games of the 2 Rebel's for me, it's important for my evaluation.
Admin Admin
Posts : 2608 Join date : 2020-11-17 Location : Netherlands
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Thu Oct 17, 2024 9:17 am
pohl4711 wrote:
Admin wrote:
Thank for testing the 2 new Rebel's in the first place.
Contrary to you I am not disappointed at all.
Rebel-EAS-2.0 in comparison with 16.3 doubled its EAS score with hardly any elo loss, it's a signal I am on the right track.
Rebel-Extreme, second after Patricia and about 200 elo stronger, perfect start.
I have enough ideas left for both concepts.
OK, Rebel EAS 2.0 compared with Rebel 16.3 looks better, indeed. But, what M.Honert (Velvet) achieved is very, very impressing. He lost less than -20 Celo and gained 100k EAS-points.
The Patricia author just claimed on discord, the new Patricia dev is around +200 Elo stronger than V3.1 (would mean the same strength as Rebel Extreme), without loosing EAS (still around 400k). Of course, this is only a claim. But in the past, his predictions fitted quite well to my following testings.
Wow........
pohl4711
Posts : 160 Join date : 2022-03-01 Location : Berlin
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Thu Oct 17, 2024 10:13 am
Admin wrote:
Can you please upload the games of the 2 Rebel's for me, it's important for my evaluation.
(In the UHO_Full_Ratinglist zip-file, the games are included, too, but without any comments/evals)
pohl4711
Posts : 160 Join date : 2022-03-01 Location : Berlin
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Thu Oct 17, 2024 10:18 am
Admin wrote:
pohl4711 wrote:
Admin wrote:
BTW, in your test you let Rebel play against 2 x Patricia, that means all 3 lose EAS points, see my post elsewhere. If you remove the games against the 2 x Patricia's I am pretty sure the EAS will be higher.
??? Completely wrong. Of course, these aggressive engines are never tested against each other in my testings (sorry, that I did not mention this earlier):
Yes you did say that, ok, no problem.
And (additionally), both new Rebels played the exact same opponents as Patricia 3.0/3.1. So the results are perfectly comparable. New Patricia will play the same opponents, too, whenever she will be released.
Admin Admin
Posts : 2608 Join date : 2020-11-17 Location : Netherlands
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Thu Oct 17, 2024 10:53 am
pohl4711 wrote:
Admin wrote:
pohl4711 wrote:
Admin wrote:
BTW, in your test you let Rebel play against 2 x Patricia, that means all 3 lose EAS points, see my post elsewhere. If you remove the games against the 2 x Patricia's I am pretty sure the EAS will be higher.
??? Completely wrong. Of course, these aggressive engines are never tested against each other in my testings (sorry, that I did not mention this earlier):
Yes you did say that, ok, no problem.
And (additionally), both new Rebels played the exact same opponents as Patricia 3.0/3.1. So the results are perfectly comparable. New Patricia will play the same opponents, too, whenever she will be released.
What springs to mind is that you have chosen an almost exact elo pool as I use, very nice. Of course your time control (as opposed to mine : bullet) as expected lowers the EAS, but fortunately not too much. The same applies for playing games till mate, while I don't, has an effect on the game length and thus also on the EAS.
Good testing, thank you.
pohl4711 likes this post
pohl4711
Posts : 160 Join date : 2022-03-01 Location : Berlin
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Thu Oct 17, 2024 1:01 pm
Admin wrote:
What springs to mind is that you have chosen an almost exact elo pool as I use, very nice. Of course your time control (as opposed to mine : bullet) as expected lowers the EAS, but fortunately not too much. The same applies for playing games till mate, while I don't, has an effect on the game length and thus also on the EAS.
Good testing, thank you.
This could be interesting for you and for your testing and for comparison:
But Alexander uses only the old HCE of Stockfish (up to V11), and allows to change some of the HCE evaluation-parameters in an avatar-file...
Use UCI option "Avatar File" to chose the avatar-file Additionally set UCI option "High Tal" to true (default false)
alew 100 percent setting was made by Peter Martan, the others by Stefan Pohl (SPCC)
The alew setting sets some of the eval-parameters (which all have 100 as default) down:
alew 100 percent: Some parameters are dropped from 100 to 50 and some from 100 to 70.
alew 90 percent: Some parameters are dropped from 100 to 55 and some from 100 to 73 (-10% less drop, compared to alew 100 percent)
alew 80 percent: Some parameters are dropped from 100 to 60 and some from 100 to 76 (-20% less drop, compared to alew 100 percent)
alew 70 percent: Some parameters are dropped from 100 to 65 and some from 100 to 79 (-30% less drop, compared to alew 100 percent)
alew 100 percent is the weakest, but most aggressive setting. The lower the percent-number of the other alew-settings, the higher the strength, but the lower the aggressiveness.
Alexander 2.0 alew 100 percent is around -130 Celo weaker than Stockfish final HCE and around the strength of Revenge 1.0.
So, here you have a superaggressive not-neural-HCE-(Stockfish-)Engine, which can be adjusted in aggressiveness and strength, just by choosing one of the 4 avatar-files !!!
I started a testrun of Alexander 2 alew 100 (same 14 opponents as for the Rebel-testruns and Patricia-testruns), and after 4000 games, the EAS-Score is nearly 400.000 (!!!)
So, right now, the Pareto chart would look like this (if the results are stable until the testrun is finished):
pohl4711
Posts : 160 Join date : 2022-03-01 Location : Berlin
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Fri Oct 18, 2024 8:02 am
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Fri Oct 18, 2024 9:21 am
pohl4711 wrote:
The testruns of Rebel EAS 2.0 and Rebel Extreme are finished. Only for my full Ratinglist, because both are too weak to enter the UHO-Top15 ratinglist.
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Fri Oct 18, 2024 10:13 am
pohl4711 wrote:
So, all back to start (Alexander 2.0):
Thanks for testing "my" .avt- setting, told Andrea Manzo about a bug in Alexander Santiago you noticed, he answered he fixed one (a bug) already, but better wait till I've tried a new one download, he won't have much time at the moment of his own, regards Peter.
Edit: Signature in forum here seems not to work for me
Edit, edit: A first match between Alexander Santiago (re-downloaded just after starting this posting) and 2.0 in CuteChess GUI with default settings of both, 3'+1" single thread, UHO 2024 900-990cp 6mvs, ran smooth without any disconnets nor losses on time:
Score of AlexanderSantiago vs Alexander2.0: 280 - 63 - 157 [0.717] Elo difference: 161.5 +/- 26.5, LOS: 100.0 %, DrawRatio: 31.4 % 500 of 500 games finished.
Can send you the games by mail, if you want me to, but I guess, they aren't of any special interest, are they?
Ghppn likes this post
pohl4711
Posts : 160 Join date : 2022-03-01 Location : Berlin
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Fri Oct 18, 2024 12:51 pm
peter wrote:
pohl4711 wrote:
So, all back to start (Alexander 2.0):
Thanks for testing "my" .avt- setting, told Andrea Manzo about a bug in Alexander Santiago you noticed, he answered he fixed one (a bug) already, but better wait till I've tried a new one download, he won't have much time at the moment of his own, regards Peter.
Edit: Signature in forum here seems not to work for me
Edit, edit: A first match between Alexander Santiago (re-downloaded just after starting this posting) and 2.0 in CuteChess GUI with default settings of both, 3'+1" single thread, UHO 2024 900-990cp 6mvs, ran smooth without any disconnets nor losses on time:
Score of AlexanderSantiago vs Alexander2.0: 280 - 63 - 157 [0.717] Elo difference: 161.5 +/- 26.5, LOS: 100.0 %, DrawRatio: 31.4 % 500 of 500 games finished.
Can send you the games by mail, if you want me to, but I guess, they aren't of any special interest, are they?
I redownloaded the binaries, they are the same like this morning, no change. So still buggy.
The games looked quite normal, but all games were lost (except 1 draw)
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Fri Oct 18, 2024 1:56 pm
Sorry, Stefan, only "bug" I found so far, is the name of some of the binaries in download reads Aleander instead of Alexander, e.g. this is so as for the bmi2- compile I used, but I simply renamed that (which I wouldn't have had to neither probably, because started in console the version- name was ok anyhow). Didn't have more than the 500 games quoted above, but as told, no bug nor unexpected weakness to be seen there for me. Caissa 1.12 maybe is too strong an opponent for a non- NNUE- engine like Alexander in handicap- mode anyhow still, but that I don't have to tell to you, I guess
One difference between your match and mine is the alew.avt you used and I didn't (this time), wanting to see the default- setting play against the same as for 2.0. Theoretically Santiago could have problems wit alew.avt, will give that a try now too.
Let me know, if you want to get my .pgn, if I yet find any bug in Alexander Santiago in follow- up tests, I'll report it too.
Ghppn likes this post
peter
Posts : 24 Join date : 2022-10-23
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Fri Oct 18, 2024 5:35 pm
To see, if the avatar- setting causes trouble, I had another one match Alexander Santiago- Alexander 2.0, this time both engines using the alew.avt- setting, again 3'+1" single thread, UHO 2024 900-990cp 6mvs:
Score of SantiagoAlew vs Alex2Alew: 161 - 79 - 260 [0.582] Elo difference: 57.5 +/- 21.1, LOS: 100.0 %, DrawRatio: 52.0 % 500 of 500 games finished.
And with EAS- tool:
Code:
bad avg.win Rank EAS-Score sacs shorts draws moves Engine/player ------------------------------------------------------------------- 1 59380 06.83% 10.56% 20.77% 74 SantiagoAlew 2 45200 05.06% 15.19% 24.23% 76 Alex2Alew ------------------------------------------------------------------- *** Average length of all won games: 75 moves
Ghppn likes this post
Admin Admin
Posts : 2608 Join date : 2020-11-17 Location : Netherlands
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Fri Oct 18, 2024 7:06 pm
What is Alexander Santiago?
Does it play in the upcoming ICGA tournament?
Ghppn likes this post
Dio
Posts : 222 Join date : 2021-08-28
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Fri Oct 18, 2024 7:25 pm
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished Fri Oct 18, 2024 7:32 pm
No, that's ShashChess (playing in Santiago). As Stefan wrote, Alexander is the "little brother" of ShashChess from same author (Andrea Manzo), not using NNUE but HCE and meant for human players to have (by UCI- Elo or) many eval- parameters adaptable as handicap- modes of special playing styles.
Edit: didn't see, you had edited your answer from yes to yes and no, while I was already typing too
Ghppn likes this post
Sponsored content
Subject: Re: SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished
SPCC: Testruns of Rebel EAS 2 and Rebel Extreme finished