Lc0 question

Subject: Lc0 question Fri Nov 27, 2020 12:38 pm

In ProDeo 3.0 I have analyzed the main book with SF12, but as noted elsewhere, the SF12 analysis procedure switches from normal to NNUE evaluation after each position, so the scores are not in sync. So I am in the process of using Lc0 instead.

Currently (as an investigation) I have tried 2 nets.

1. A 128x10 net (size 6.21 MB), depth=7, maxtime 1000 ms.

2. A 384x30 net (size 130 MB), depth=5, maxtime 1000 ms.

Both use about the same time to finish.

The question is: which net performs best for (say) the first 10 moves of a game?

Posts : 880 Join date : 2020-11-25 Location : USA

Hello Ed,

I have played many thousands of games testing Lc0. All else being equal, the bigger nets perform best.
The best Lc0 nets are found here:
https://github.com/jhorthos/lczero-training/wiki/Leela-Training
The J94-80 net is new and most likely the best. I have also tested the other nets listed there, and they are among the best as well.

The J92 nets are about equal in strength, and they are zero-trained.

Posts : 37 Join date : 2020-11-17

Admin wrote: The question is: which net performs best for (say) the first 10 moves of a game?

The first ten moves are mostly positional, and at about 1 s/move with an RTX GPU the 384x30 nets clearly win. I would choose the last zero net, J92-330.

Subject: Re: Lc0 question Mon Nov 30, 2020 9:59 am

I analyzed the first 10 moves of the ProDeo book with Lc0; the rest remains SF12, and I am getting much better results. Amazing.
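The split described above (Lc0 for the first 10 moves, SF12 for the rest) can be sketched as a small selection rule. This is only an illustration under stated assumptions: it assumes each book position is available as a FEN string, and the function name and the engine labels "lc0" and "sf12" are hypothetical, not ProDeo's actual mechanism.

```python
def engine_for_position(fen: str, lc0_moves: int = 10) -> str:
    """Pick an engine label for a book position (hypothetical helper).

    The sixth FEN field is the fullmove number, which starts at 1 and
    increases after each Black move.
    """
    fullmove = int(fen.split()[5])
    # Positions within the first `lc0_moves` full moves go to Lc0,
    # everything later stays with SF12.
    return "lc0" if fullmove <= lc0_moves else "sf12"


start = "rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w KQkq - 0 1"
print(engine_for_position(start))
```

The cut-off is a parameter so the boundary can be moved without touching the rest of the analysis loop.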

Posts : 1254 Join date : 2020-11-17 Location : France

Admin wrote: I analyzed the first 10 moves of the ProDeo book with Lc0; the rest remains SF12, and I am getting much better results. Amazing.

Not sure if you tried it, but when I analysed positions en masse with Lc0, it was possible to launch several processes without affecting nps too badly, which is odd because the processes all share the same GPU. My 2080 is in a PC with only six cores, so I have no idea whether your 20-core machine would scale.
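A minimal sketch of how a batch of positions might be divided among several engine processes. It only shows the partitioning step; starting each Lc0 process and feeding it its share over UCI is left out, and the function name is hypothetical.

```python
def split_round_robin(positions, n_workers):
    """Deal positions out to n_workers lists, one position at a time."""
    if n_workers < 1:
        raise ValueError("n_workers must be at least 1")
    # Worker i gets positions i, i + n_workers, i + 2 * n_workers, ...
    return [positions[i::n_workers] for i in range(n_workers)]


chunks = split_round_robin(["pos%d" % i for i in range(7)], 3)
for worker_id, chunk in enumerate(chunks):
    print(worker_id, chunk)
```

Dealing round-robin keeps the shares within one position of each other, so no process finishes long before the rest when the time per position is fixed.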

Subject: Re: Lc0 question Mon Nov 30, 2020 10:52 am

I tried that too with my 1060; the system froze.

I can do 20 (or so) threads with the CPU edition, but the 1060 is faster.

Posts : 1254 Join date : 2020-11-17 Location : France

Admin wrote: I tried that too with my 1060; the system froze.

I can do 20 (or so) threads with the CPU edition, but the 1060 is faster.

Will it work for two processes? That should give the same nps but twice as many games.
For N processes there will be a bottleneck caused by wanting too much GPU RAM; the 1060 is obviously not optimal. I think you can tell Lc0 to limit its GPU RAM usage.

Subject: Re: Lc0 question Mon Nov 30, 2020 1:33 pm

It won't matter; unlike your 2080, my 1060 is at 100% GPU utilization.

Posts : 37 Join date : 2020-11-17

Chris Whittington wrote:

Admin wrote: I analyzed the first 10 moves of the ProDeo book with Lc0; the rest remains SF12, and I am getting much better results. Amazing.

Not sure if you tried it, but when I analysed positions en masse with Lc0, it was possible to launch several processes without affecting nps too badly, which is odd because the processes all share the same GPU. My 2080 is in a PC with only six cores, so I have no idea whether your 20-core machine would scale.

Hmm, that's strange. My GPU utilization with a single Lc0 instance on an RTX 2070 is 95-99% whenever the time used per move or analysis is more than several dozen milliseconds, that is, most of the time. So I never use multiple instances of Lc0.

Posts : 880 Join date : 2020-11-25 Location : USA

Chris Whittington wrote:

Admin wrote: I analyzed the first 10 moves of the ProDeo book with Lc0; the rest remains SF12, and I am getting much better results. Amazing.

Not sure if you tried it, but when I analysed positions en masse with Lc0, it was possible to launch several processes without affecting nps too badly, which is odd because the processes all share the same GPU. My 2080 is in a PC with only six cores, so I have no idea whether your 20-core machine would scale.

Yes. I do know why this is true. You also see this behavior when running a test match between two Lc0 engines and nets with Ponder turned on. It is crazy.
