Posts : 1254 Join date : 2020-11-17 Location : France
Subject: Update on Rebel-15-with-ChrisW-NNUE dev-v2.2 Wed Mar 16, 2022 12:09 am
Rebel-15-with-ChrisW-NNUE dev-v2.2, bit of a mouthful, but that's its name at the moment.
First off, it works. I've been playing test games, so far against the Toga4-with-prior-NNUE, and keeping tabs on progress.
The new network inference engine is faster, and able to handle all the prior nets. Tests with the new inference engine vs the old inference engine, both inside Toga4, showed a 6%, 7% increase in nps (old version was getting 0.91 Mnps on my test machine, the new getting 0.99 Mnps). They ran for a few thousand test games at 40/10, W=935, L=807, D=2153 =52%. Then got the new larger net architecture going, I don't have a net from Ed yet, so tested with a few of my own nets. Nps is down, of course, net is twice the size, at 0.7 Mnps. Tried with three nets, 40/10, results varied between 54% and 56%. My other usual test is to grab a few hundred of the first PGNS, load them into Scid and count the relative wins and losses before move 50. Well, the batch I looked at, had something like 25 wins for the new net before it lost one game, which was about move 60, I think. That's a lot of Tal for a 56% overall run. Not sure where that comes from, no doubt Ed will discover it, but that's pretty good. Then tried a net at 40/40, which is running now, showing 57% with the early win rate down a little. I'ld imagine that an Ed-trained net with Ed-tuning tweaks will be able to improve on that, so I'll send him the code in the morning and over to Ed.
Admin, Mclane, adminx, matejst, Damir Desevac and Dio like this post
Chris Whittington
Posts : 1254 Join date : 2020-11-17 Location : France
Subject: Re: Update on Rebel-15-with-ChrisW-NNUE dev-v2.2 Wed Mar 16, 2022 2:08 am
Posts : 3024 Join date : 2020-11-17 Age : 57 Location : United States of Europe, Germany, Ruhr area
Subject: Re: Update on Rebel-15-with-ChrisW-NNUE dev-v2.2 Wed Mar 16, 2022 3:52 pm
This is a new Tal ?
Chris Whittington
Posts : 1254 Join date : 2020-11-17 Location : France
Subject: Re: Update on Rebel-15-with-ChrisW-NNUE dev-v2.2 Wed Mar 16, 2022 5:43 pm
Mclane wrote:
This is a new Tal ?
No. It’s an intermediate stage towards producing a releasable Rebel-15. Both Ed and I train our own nets and write our own search and there are no plans for releasing an Ed search with Chris net, nor a Chris search with an Ed net.
That said, I think Ed is getting some good results already and some good games. I guess he might publish games or even run it on GRL as a private engine, dunno, all up to him. My part would only be fixing net bugs or maybe suggesting some more net enhancements.
matejst and Dio like this post
Admin Admin
Posts : 2609 Join date : 2020-11-17 Location : Netherlands
Subject: Re: Update on Rebel-15-with-ChrisW-NNUE dev-v2.2 Wed Mar 16, 2022 7:34 pm
Mclane wrote:
This is a new Tal ?
Nope, what Chris said.
Chris created new net structures which (for instance) allows me to upgrade the existing Rebel 14.2 net and gain elo. We exchange nets for the benefit of both as in this case.
Mclane and matejst like this post
TheSelfImprover
Posts : 3116 Join date : 2020-11-18
Subject: Re: Update on Rebel-15-with-ChrisW-NNUE dev-v2.2 Wed Mar 16, 2022 9:14 pm
This is a high standard of play IMO. It looks simple, but I really struggled to understand several of the moves. This is in contrast to Mclane's 8-bit tournament, where I felt I understood most of the moves easily.
Admin and Mclane like this post
Mclane
Posts : 3024 Join date : 2020-11-17 Age : 57 Location : United States of Europe, Germany, Ruhr area
Subject: Re: Update on Rebel-15-with-ChrisW-NNUE dev-v2.2 Wed Mar 16, 2022 10:49 pm
Yes. 8 bit is easy for us. 32 or 64 bit is difficult for our brains. I once tried to visit chris in france. I was not succesful although i saw his letterbox in front of me. With me were my son and my father. 3 czubs. Maybe too much.
TheSelfImprover likes this post
Mclane
Posts : 3024 Join date : 2020-11-17 Age : 57 Location : United States of Europe, Germany, Ruhr area
Subject: Re: Update on Rebel-15-with-ChrisW-NNUE dev-v2.2 Thu Mar 17, 2022 10:34 pm
When will there be a new CSTAL with new gambit book.
Mclane
Posts : 3024 Join date : 2020-11-17 Age : 57 Location : United States of Europe, Germany, Ruhr area
Subject: Re: Update on Rebel-15-with-ChrisW-NNUE dev-v2.2 Mon Mar 21, 2022 9:46 pm
Any news?
Admin Admin
Posts : 2609 Join date : 2020-11-17 Location : Netherlands
Subject: Re: Update on Rebel-15-with-ChrisW-NNUE dev-v2.2 Mon Mar 21, 2022 11:34 pm
Working on a new neural net, so far small improvement, too little for release. The obvious way to improve is to add more positions so I am now busy doubling the size from 1.5 billion to 3 billion and that takes a lot of computer time even when you have 72 active threads available for the job.
TheSelfImprover and matejst like this post
adminx
Posts : 115 Join date : 2020-11-26 Location : Acworth, GA (USA)
Subject: Re: Update on Rebel-15-with-ChrisW-NNUE dev-v2.2 Wed Apr 06, 2022 9:50 am
Chris,
I really look forward to seeing the end product of CSTal. In the mean time I am enjoying viewing the games of the 10 minute version.
[pgn] [Event "HIARCS 15.0-CSTal match (2 games, 15 minutes per game plus 3 seconds per move)"] [Site "?"] [Date "2022.04.06"] [Round "2"] [White "CSTal"] [Black "HIARCS 15.0"] [Result "1/2-1/2"] [EventDate "2022.04.06"] [ECO "E12"]
Posts : 1254 Join date : 2020-11-17 Location : France
Subject: Re: Update on Rebel-15-with-ChrisW-NNUE dev-v2.2 Wed Apr 06, 2022 10:26 am
adminx wrote:
Chris,
I really look forward to seeing the end product of CSTal.
Yes, me too. I spent the last week fighting the Microsoft compiler which doesn’t like humungous sized neural nets.
In the mean time I am enjoying viewing the games of the 10 minute version.
That’s just a little fun version. I’m surprised it can play at all actually.
[pgn] [Event "HIARCS 15.0-CSTal match (2 games, 15 minutes per game plus 3 seconds per move)"] [Site "?"] [Date "2022.04.06"] [Round "2"] [White "CSTal"] [Black "HIARCS 15.0"] [Result "1/2-1/2"] [EventDate "2022.04.06"] [ECO "E12"]
1.d4 Nf6 2.Nf3 b6 3.c4 Bb7 4.Nc3 e6 5.a3 Ne4 6.Nxe4 Bxe4 7.Nd2 Bg6 8.g3 Nc6 9.e3 e5 10.d5 Na5 11.h4 h6 12.e4 {[%emt 0:00:22] [%eval 54,19]} Â 12...Bc5 {[%emt 0:00:56] [%eval 9,24]} Â 13.Bd3 {[%emt 0:00:19] [%eval 77,19]} Â 13...O-O {[%emt 0:00:31] [%eval 0,22]} Â 14.g4 {[%emt 0:00:21] [%eval 45,20]} Â 14...f6 {[%emt 0:00:47] [%eval 50,22]} Â 15.Bc2 {[%emt 0:00:30] [%eval 38,20]} Â 15...c6 {[%emt 0:00:23] [%eval 4,21]} Â 16.b4 {[%emt 0:00:26] [%eval 38,22]} Â 16...Bd4 {[%emt 0:00:21] [%eval 17,23]} Â 17.Rb1 {[%emt 0:00:26] [%eval 20,22]} Â 17...Nb7 {[%emt 0:00:14] [%eval 10,23]} Â 18.h5 {[%emt 0:00:28] [%eval 17,21]} Â 18...Bh7 {[%emt 0:00:20] [%eval 6,23]} Â 19.Nf3 {[%emt 0:00:23] [%eval 27,20]} Â 19...c5 {[%emt 0:00:21] [%eval 24,22]} Â 20.O-O {[%emt 0:00:17] [%eval 32,19]} Â 20...Nd6 {[%emt 0:00:20] [%eval 28,23]} Â 21.Qe2 {[%emt 0:00:22] [%eval 34,20]} Â 21...Qc7 {[%emt 0:00:15] [%eval 19,22]} Â 22.Rd1 {[%emt 0:00:25] [%eval 8,19]} Â 22...Rfe8 {[%emt 0:01:16] [%eval 16,24]} Â 23.Bd3 {[%emt 0:00:25] [%eval 33,19]} Â 23...Nf7 {[%emt 0:00:22] [%eval 26,23]} Â 24.bxc5 {[%emt 0:00:26] [%eval 46,18]} Â 24...Bxc5 {[%emt 0:00:22] [%eval 13,24]} Â 25.Nh4 {[%emt 0:00:17] [%eval 51,18]} Â 25...d6 {[%emt 0:00:36] [%eval 15,25]} Â 26.Kh2 {[%emt 0:00:16] [%eval 51,18]} Â 26...Ng5 {[%emt 0:00:46] [%eval 3,26]} Â 27.Rf1 {[%emt 0:00:19] [%eval 50,18]} Â 27...Rac8 {[%emt 0:00:17] [%eval 7,23]} Â 28.f3 {[%emt 0:00:22] [%eval 51,17]} Â 28...Qd7 {[%emt 0:00:24] [%eval 14,22]} Â 29.Bc2 {[%emt 0:00:24] [%eval 51,18]} Â 29...Bd4 {[%emt 0:00:25] [%eval 20,22]} Â 30.Rb4 {[%emt 0:00:29] [%eval 64,19]} Â 30...Rf8 {[%emt 0:00:38] [%eval 21,23]} Â 31.Ba4 {[%emt 0:00:21] [%eval 72,19]} Â 31...Qe7 {[%emt 0:00:29] [%eval 31,24]} Â 32.Bb5 {[%emt 0:00:15] [%eval 97,19]} Â 32...Rc7 {[%emt 0:00:32] [%eval 24,24]} Â 33.Kg2 {[%emt 0:00:19] [%eval 99,17]} Â 33...Qd8 {[%emt 0:00:19] [%eval 32,22]} Â 34.Ba6 {[%emt 0:00:12] [%eval 97,18]} Â 34...Bc5 {[%emt 0:00:35] [%eval 35,25]} Â 35.Rb1 {[%emt 0:00:13] [%eval 97,19]} Â 35...Qd7 {[%emt 0:00:14] [%eval 38,23]} Â 36.Rh1 {[%emt 0:00:17] [%eval 101,19]} Â 36...Bd4 {[%emt 0:00:10] [%eval 40,21]} Â 37.Re1 {[%emt 0:00:13] [%eval 98,19]} Â 37...Bc3 {[%emt 0:00:19] [%eval 37,24]} Â 38.Rd1 {[%emt 0:00:17] [%eval 93,19]} Â 38...Bd4 {[%emt 0:00:10] [%eval 42,22]} Â 39.Rh1 {[%emt 0:00:19] [%eval 96,19]} Â 39...Nf7 {[%emt 0:00:10] [%eval 46,22]} Â 40.Qd3 {[%emt 0:00:10] [%eval 109,19]} Â 40...Nd8 {[%emt 0:00:11] [%eval 29,22]} Â 41.Rd1 {[%emt 0:00:19] [%eval 104,20]} Â 41...Nb7 {[%emt 0:00:09] [%eval 21,20]} Â 42.Bxb7 {[%emt 0:00:11] [%eval 100,22]} Â 42...Rxb7 {[%emt 0:00:05] [%eval 19,22]} Â 43.Be3 {[%emt 0:00:09] [%eval 84,22]} Â 43...Bxe3 {[%emt 0:00:08] [%eval 0,23]} Â 44.Qxe3 {[%emt 0:00:03] [%eval 69,21]} Â 44...Rc8 {[%emt 0:00:06] [%eval 0,24]} Â 45.Rdc1 {[%emt 0:00:09] [%eval 67,22]} Â 45...Rbc7 {[%emt 0:00:05] [%eval 0,23]} Â 46.Rb4 {[%emt 0:00:14] [%eval 57,22]} Â 46...Rc5 {[%emt 0:00:06] [%eval 0,24]} Â 47.a4 {[%emt 0:00:14] [%eval 52,20]} Â 47...Kh8 {[%emt 0:00:06] [%eval 0,22]} Â 48.Qe2 {[%emt 0:00:21] [%eval 49,20]} Â 48...Qd8 {[%emt 0:00:08] [%eval 0,24]} Â 49.Qf2 {[%emt 0:00:20] [%eval 32,19]} Â 49...f5 {[%emt 0:00:06] [%eval 0,22]} Â 50.exf5 {[%emt 0:00:10] [%eval 55,19]} Â 50...Rxd5 {[%emt 0:00:06] [%eval 0,23]} Â 51.cxd5 {[%emt 0:00:08] [%eval 110,20]} Â 51...Rxc1 {[%emt 0:00:06] [%eval 0,25]} Â 52.Ng6+ {[%emt 0:00:11] [%eval 110,21]} Â 52...Bxg6 {[%emt 0:00:08] [%eval 0,21]} Â 53.hxg6 {[%emt 0:00:10] [%eval 137,20]} Â 53...Qc8 {[%emt 0:00:06] [%eval 0,24]} Â 54.Rb2 {[%emt 0:00:09] [%eval 143,21]} Â 54...Rc5 {[%emt 0:00:05] [%eval 0,25]} Â 55.Qh4 {[%emt 0:00:09] [%eval 146,20]} Â 55...Qf8 {[%emt 0:00:06] [%eval 0,25]} Â 56.Rb5 {[%emt 0:00:07] [%eval 167,19]} Â 56...Kg8 {[%emt 0:00:06] [%eval 0,22]} Â 57.Kg3 {[%emt 0:00:07] [%eval 187,20]} Â 57...Rc1 {[%emt 0:00:04] [%eval 0,20]} Â 58.a5 {[%emt 0:00:06] [%eval 225,20]} Â 58...Rc5 {[%emt 0:00:04] [%eval 0,23]} Â 59.Rxc5 {[%emt 0:00:09] [%eval 464,23]} Â 59...bxc5 {[%emt 0:00:05] [%eval 0,25]} Â 60.Qh2 {[%emt 0:00:10] [%eval 503,23]} Â 60...Qb8 {[%emt 0:00:04] [%eval 0,28]} Â 61.Qe2 {[%emt 0:00:10] [%eval 121,22]} Â 61...Qb7 {[%emt 0:00:04] [%eval 0,29]} Â 62.Qd3 {[%emt 0:00:12] [%eval 109,21]} Â 62...Kf8 {[%emt 0:00:04] [%eval 0,28]} Â 63.Kh3 {[%emt 0:00:16] [%eval 48,21]} Â 63...Kg8 {[%emt 0:00:04] [%eval 0,28]} Â 64.a6 {[%emt 0:00:10] [%eval 12,20]} Â 64...Qb8 {[%emt 0:00:04] [%eval 0,28]} Â 65.Qe2 {[%emt 0:00:10] [%eval 14,20]} Â 65...Qb3 {[%emt 0:00:05] [%eval 0,25]} Â 66.Kg2 {[%emt 0:00:09] [%eval 5,21]} Â 66...Kf8 {[%emt 0:00:05] [%eval 0,27]} Â 67.Qd2 {[%emt 0:00:08] [%eval 4,21]} Â 67...Qb4 {[%emt 0:00:05] [%eval 0,26]} Â 68.Qc1 {[%emt 0:00:05] [%eval 5,20]} Â 68...Ke7 {[%emt 0:00:06] [%eval 0,21]} Â 69.g5 {[%emt 0:00:08] [%eval 16,21]} Â 69...Qf4 {[%emt 0:00:04] [%eval 0,23]} Â 70.f6+ {[%emt 0:00:06] [%eval 0,22]} Â 70...Kf8 {[%emt 0:00:05] [%eval 0,25]} Â 71.Qb1 {[%emt 0:00:07] [%eval 0,23]} Â 71...Qxg5+ {[%emt 0:00:05] [%eval 0,27]} Â 72.Kh3 {[%emt 0:00:07] [%eval 0,25]} Â 72...gxf6 {[%emt 0:00:05] [%eval 0,27]} Â 73.Qb8+ {[%emt 0:00:06] [%eval 0,24]} Â 73...Kg7 {[%emt 0:00:04] [%eval 0,28]} Â 74.Qxa7+ {[%emt 0:00:07] [%eval 0,25]} Â 74...Kxg6 {[%emt 0:00:03] [%eval 0,27]} Â 75.Qd7 {[%emt 0:00:06] [%eval 0,25]} Â 75...Qc1 {[%emt 0:00:04] [%eval 0,27]} Â 76.a7 {[%emt 0:00:08] [%eval 0,24]} Â 76...Qh1+ {[%emt 0:00:04] [%eval 0,32]} Â 77.Kg3 {[%emt 0:00:04] [%eval 0,27]} Â 77...Qe1+ {[%emt 0:00:04] [%eval 0,30]} Â 78.Kh3 {[%emt 0:00:07] [%eval 0,28]} Â 78...Qf1+ {[%emt 0:00:04] [%eval 0,32]} Â 79.Kh4 {[%emt 0:00:06] [%eval 0,27]} Â 79...Qf2+ {[%emt 0:00:04] [%eval 0,28]} Â 80.Kh3 {[%emt 0:00:07] [%eval 0,27]} Â 80...Qxf3+ {[%emt 0:00:03] [%eval 0,28]} Â 81.Kh2 {[%emt 0:00:07] [%eval 0,27]} Â 81...Qf4+ {[%emt 0:00:04] [%eval 0,28]} Â 82.Kh3 {[%emt 0:00:07] [%eval 0,28]} Â 82...Qe3+ {[%emt 0:00:04] [%eval 0,28]} Â 83.Kh4 {[%emt 0:00:06] [%eval 0,28]} Â 83...Qf2+ {[%emt 0:00:04] [%eval 0,30]} Â 84.Kh3 {[%emt 0:00:04] [%eval 0,29]} Â 84...Qf3+ {[%emt 0:00:04] [%eval 0,31]} Â 85.Kh2 {[%emt 0:00:00] [%eval 0,63]} Â 85...Qf4+ {[%emt 0:00:04] [%eval 0,30]} Â 86.Kh3 {[%emt 0:00:05] [%eval 0,29]} Â 86...Qf3+ {[%emt 0:00:25] [%eval 0,37]} Â 1/2-1/2 [/pgn]
matejst
Posts : 612 Join date : 2020-11-26
Subject: Re: Update on Rebel-15-with-ChrisW-NNUE dev-v2.2 Thu Apr 07, 2022 2:24 am
In general, the main handicap of the CSTal/Rebel versions so far is the search. There is something that Marvin's search does well when paired with an aggressive evaluation function: somehow to find tactics. Yesterday I was analyzing a Sicilian position and while Toga Rebel gave a huge advantage to white, it did not see a mate in six (I can't remember how much time I spent on the position), something Marvin Rebel found immediately. I then tried CSTal 10mn, same problem.
I don't have a clue how the search/eval work together, but time after time I noticed the same patterns: Rebel/CSTal would sacrifice for initiative, and then miss a winning attack; or, in defense, simply succumb to a frontal attack that could have been prevented. In other words, there is no awareness of danger.
I already wrote that CSTal was a strange mixture of Karpov and Kasparov -- prevention and aggressiveness. In complicated opening positions it plays probably better than Toga Rebel. It seems to better understand tempi (e.g., it won't play Be3 before exf4) and space.
Chris Whittington
Posts : 1254 Join date : 2020-11-17 Location : France
In general, the main handicap of the CSTal/Rebel versions so far is the search. There is something that Marvin's search does well when paired with an aggressive evaluation function: somehow to find tactics. Yesterday I was analyzing a Sicilian position and while Toga Rebel gave a huge advantage to white, it did not see a mate in six (I can't remember how much time I spent on the position), something Marvin Rebel found immediately. I then tried CSTal 10mn, same problem.
I don't have a clue how the search/eval work together, but time after time I noticed the same patterns: Rebel/CSTal would sacrifice for initiative, and then miss a winning attack; or, in defense, simply succumb to a frontal attack that could have been prevented. In other words, there is no awareness of danger.
I already wrote that CSTal was a strange mixture of Karpov and Kasparov -- prevention and aggressiveness. In complicated opening positions it plays probably better than Toga Rebel. It seems to better understand tempi (e.g., it won't play Be3 before exf4) and space.
It's difficult to comment because we don't know really how the evaluation works either. There will be 'holes' in NN knowledge where it simply misevaluates, its the same with all nets, deeper search compensates, and the good "understanding" of an NN where there are no holes makes for an overall better system.
Any news? I was hoping for a Rebel 15 with a slightly improved Rebel net (the few glaring problems already discussed here solved), and a similar engine with a mature CSTal net, and some search improvements.
Chris Whittington
Posts : 1254 Join date : 2020-11-17 Location : France
Any news? I was hoping for a Rebel 15 with a slightly improved Rebel net (the few glaring problems already discussed here solved), and a similar engine with a mature CSTal net, and some search improvements.
Ed is building a net at the moment, with larger number of training positions and (I think) one the enhancements to the architecture. He is also experimenting with which positions go into the training set and which not. There really is a great deal of experimenting and testing involved, and only so much hardware to do it on. Actually, Ed could do with somebody donating him a PC (doesn’t need to be latest thing) with maybe about 4 cores and a GPU, even a 1060 is enough, believe it or not, for NN training, that’s his bottleneck.
I have a spare laptop we could use, but it is a AMD A8 with Radeon graphics, I think. So it is not much, but if I can help, I will gladly do it. Just tell me what.
Admin Admin
Posts : 2609 Join date : 2020-11-17 Location : Netherlands
I think I have about 50 elo by now but not happy yet, there must be more. Besides I am not 100% sure if it is still possible to release a SSE version. NNUE eval (by far) works best with AVX2 and new NNUE architectures developed by Chris at some point in the future may no longer work for SSE computers as they become an obstacle for progress.
I hope that Rebel 15 will be SSE compatible, no hard guarantee, but if so there will be also a Rodent NN SSE.
I have a spare laptop we could use, but it is a AMD A8 with Radeon graphics, I think. So it is not much, but if I can help, I will gladly do it. Just tell me what.
Laptops are probably not advised for this type of NN training because the training software is so thoroughly optimised (we are using PyTorch) that it continuously hammers both the GPU and CPU and that generates a lot of heat. training can last several days. Heat dissipation in laptops is not terribly good, in general. There’s quite possibly issues with farming out the training elsewhere, for example, I am currently working with 12 billion training positions and that is targeted to grow plenty more. That’s roughly 500 or 600 Gb of data, it can be compressed and I guess we could change data format and use the SF data format (I think that’s 2-3 bytes per position) but even so that’s ~30Gb for 12B positions. Completely impractical for me on rural French internet and probably not much better in practice with any “home” internet.
50 Elo is a huge increase at this level, this would bring Rebel 15.0 roughly to the level of Fritz 18NN or the new Wasp 5.50. A Rebel 15.0 would also be a nice Easter present. , thanks for your work Ed.
50 Elo is a huge increase at this level, this would bring Rebel 15.0 roughly to the level of Fritz 18NN or the new Wasp 5.50. A Rebel 15.0 would also be a nice Easter present. , thanks for your work Ed.
I second this...
Dio likes this post
Sponsored content
Subject: Re: Update on Rebel-15-with-ChrisW-NNUE dev-v2.2