Posts : 1851 Join date : 2020-11-17 Location : Netherlands
Subject: STS re-re-re-re-re-visited Sat Oct 22, 2022 12:34 pm
For STS lovers -
STS stands for Strategic Test Suite. From the CPW we read -
Strategic Test Suite, (STS) a series of themed test suites by Dann Corbit and Swaminathan Natarajan, designed to evaluate chess engine's long term understanding of strategical and positional concepts. More recently, the positions were revised and converted by Ferdinand Mosca to be used by an analysis tool, MEA.
......
We made an effort to give STS a major update by re-analyzing the 1500 positions with nowadays strongest engines, Stockfish 15 and Lc0. And with MEA you can produce your own alternative rating list. See [ SF15 ] [ LC0 ] or as text file [ SF15 ] [ LC0 ].
So I only changed names of engine- binaries (newly stored in folder engines, there they do work if started with double click) and the engine- names for output.
Can somebody help?
Thanks in advance! Peter.
Dio
Posts : 152 Join date : 2021-08-28
Subject: Re: STS re-re-re-re-re-visited Sun Oct 23, 2022 2:32 pm
set Name="Blue Marlin 15.3"
Admin and peter like this post
peter
Posts : 13 Join date : 2022-10-23
Subject: Re: STS re-re-re-re-re-visited Sun Oct 23, 2022 2:45 pm
Hey there! Great, now it did work!
Code:
EPD : epd\sts-sf15.epd Time : 100ms Solving Max Total Time Hash Engine Score Used Time Found Pos Time Score Rate ms Mb Cpu CCRL 1 Stockfish 220911 13942 00:04:10.0 1210 1500 00:00:00.0 15000 92.9% 100 64 30 0 2 Blue Marlin 15.3 13834 00:04:10.0 1208 1500 00:00:00.0 15000 92.2% 100 64 30 0
Created with MEA by Ferdinand Mosca
Thanks a lot for prompt help, regards
Dio likes this post
Admin Admin
Posts : 1851 Join date : 2020-11-17 Location : Netherlands
Subject: Re: STS re-re-re-re-re-visited Sun Oct 23, 2022 8:15 pm
peter wrote:
Hey there! Great, now it did work!
Code:
EPD : epd\sts-sf15.epd Time : 100ms Solving Max Total Time Hash Engine Score Used Time Found Pos Time Score Rate ms Mb Cpu CCRL 1 Stockfish 220911 13942 00:04:10.0 1210 1500 00:00:00.0 15000 92.9% 100 64 30 0 2 Blue Marlin 15.3 13834 00:04:10.0 1208 1500 00:00:00.0 15000 92.2% 100 64 30 0
EPD : epd\sts-sf15.epd Time : 100ms Max Total Time Hash Engine Score Found Pos ELO Score Rate ms Mb Cpu 1 Koivisto 8.0 12248 1031 1500 3676 15000 81.7% 100 64 1
Solving time out -> Elo rating in.
Dio likes this post
peter
Posts : 13 Join date : 2022-10-23
Subject: Re: STS re-re-re-re-re-visited Sun Oct 23, 2022 9:39 pm
EPD : epd\sts-sf15.epd Time : 100ms Max Total Time Hash Engine Score Found Pos ELO Score Rate ms Mb Cpu 1 Koivisto 8.0 12248 1031 1500 3676 15000 81.7% 100 64 1
Solving time out -> Elo rating in.
Thanks for info and link, regards
peter
Posts : 13 Join date : 2022-10-23
Subject: Re: STS re-re-re-re-re-visited Wed Oct 26, 2022 9:30 am
One more question, please. How can I edit the list of results? Having deleted some of the old runs, they get restored by next one new run again. Even tried to rename the mea_reults.txt- file and create a new empty one, it doesn't matter, old entries appear in new file after next run again. Guess it has something to do with directory epd_out, but even having renamed this one and stored a new empty one with old name too, yet in table of results always all old ones are shown again after next one new run.
Thanks in advance again, regards
Admin Admin
Posts : 1851 Join date : 2020-11-17 Location : Netherlands
Subject: Re: STS re-re-re-re-re-visited Wed Oct 26, 2022 10:05 am
From the website -
Delete engines - results are kept in the file mea-results.csv and here you can make changes or delete engines.
make-html.bat - double click for new html or text output, for instance if you have made changes to mea-results.csv.
peter likes this post
peter
Posts : 13 Join date : 2022-10-23
Subject: Re: STS re-re-re-re-re-visited Wed Oct 26, 2022 11:06 am
Very prompt answer again, thank you very much!
I had tried editing of .csv- file already too but failed to store it in correct format then. Editor made a .txt- file out of it, so therefore it didn't work.
Having corrected that and having make-html.bat run then again everything's fine now. Best thanks
texium
Posts : 39 Join date : 2022-07-19
Subject: Re: STS re-re-re-re-re-visited Wed Nov 09, 2022 11:45 pm
The site is gone, doesn't show up on site news, and the page is blank.
Admin Admin
Posts : 1851 Join date : 2020-11-17 Location : Netherlands
Subject: Re: STS re-re-re-re-re-visited Thu Nov 10, 2022 12:05 am
Reason is my provider is overzealous adding new functions to the web editor and has a habit to introduce bugs with it. As a result the website recently was completely ruined showing the contents of the main page also on every page. Had to install an older backup and sts fell of it.
Last edited by Admin on Mon Mar 06, 2023 4:32 pm; edited 1 time in total
Ghppn likes this post
texium
Posts : 39 Join date : 2022-07-19
Subject: Re: STS re-re-re-re-re-visited Thu Nov 10, 2022 5:06 pm
okay, thanks for addressing it.
peter
Posts : 13 Join date : 2022-10-23
Subject: Re: STS re-re-re-re-re-visited Tue Jan 03, 2023 7:59 am
How about that: These 594 "best" (not too easy) positions out of 1500 of STS
The 594 from first link can be used as single best move positions too, the most unbalaned Anti Draw Openings could serve such as well if evaluated with Frank Schubert's EloStatTS instead of MEA I guess, just for being not as eval- dependent of the single position and engine. Positions with more then single best move just with multiple solutions evaluated "positive" and with (instead of STS- points) time- dependent mini- matches for each postion and engine- pairing, see description of program here:
Wouldn't that with about 1000 positions mixed up like that out of opening, midgame and endgame make a nearer to game- playing positional test suite maybe?
Happy_2023_regards
fsanders
Posts : 14 Join date : 2023-01-10
Subject: Re: STS re-re-re-re-re-visited Tue Jan 10, 2023 10:08 am
I like the MEA tool fpr position testing. One thing I dont understand, how are the numbers on the html table calculated? I mean if I have 1000 püsitiond and the max points are 10, the max result should be 10.000. But I get a score of lets say 16.915. What does that mean and how is it calculated. Are ther any explanations in the net?
Admin Admin
Posts : 1851 Join date : 2020-11-17 Location : Netherlands
Subject: Re: STS re-re-re-re-re-visited Tue Jan 10, 2023 10:20 am
fsanders wrote:
I like the MEA tool fpr position testing. One thing I dont understand, how are the numbers on the html table calculated? I mean if I have 1000 püsitiond and the max points are 10, the max result should be 10.000. But I get a score of lets say 16.915. What does that mean and how is it calculated. Are ther any explanations in the net?
Can you post about 100 of your epd's ?
fsanders
Posts : 14 Join date : 2023-01-10
Subject: Re: STS re-re-re-re-re-visited Tue Jan 10, 2023 11:04 am
100 was just an example, but I can post a table that shows what I mean:
Positions 621, found 374 score 10.261
Admin Admin
Posts : 1851 Join date : 2020-11-17 Location : Netherlands
Subject: Re: STS re-re-re-re-re-visited Tue Jan 10, 2023 11:15 am
The points assigned to moves aren't always from 1 to 10, any values go.
18.360 / 621 = 30
So I think the epd file you tested all best moves are rewarded with 30 points and other moves less.
That's why I asked to post some of the epds.
fsanders
Posts : 14 Join date : 2023-01-10
Subject: Re: STS re-re-re-re-re-visited Tue Jan 10, 2023 3:16 pm
The positions are a subset of STS, I think there is no best move more than 10.
Subject: Re: STS re-re-re-re-re-visited Tue Jan 10, 2023 8:22 pm
Here the output with the latest version
Admin Admin
Posts : 1851 Join date : 2020-11-17 Location : Netherlands
Subject: Re: STS re-re-re-re-re-visited Tue Jan 10, 2023 9:59 pm
From the download I first ran the official 1500 positions, then the 38 you posted, I get :
As you can see 38 positions and maximum points = 380, as it should.
fsanders
Posts : 14 Join date : 2023-01-10
Subject: Re: STS re-re-re-re-re-visited Tue Jan 10, 2023 11:22 pm
Thanks, I will download again, do a new install and try again. (As you can see the header of the table is really different(used time, score rate,...), no idea why)