ProDeo
Would you like to react to this message? Create an account in a few clicks or log in to continue.
ProDeo

Computer Chess
 
HomeHome  CalendarCalendar  Latest imagesLatest images  FAQFAQ  SearchSearch  MemberlistMemberlist  UsergroupsUsergroups  RegisterRegister  Log in  

 

 GPU speed benchmark

Go down 
4 posters
Go to page : 1, 2  Next
AuthorMessage
Admin
Admin
Admin


Posts : 2528
Join date : 2020-11-17
Location : Netherlands

GPU speed benchmark Empty
PostSubject: GPU speed benchmark   GPU speed benchmark EmptySat May 01, 2021 5:41 pm

Please run this little benchmark. I will maintain the results.

Start Lc0 v27 (or a very recent version) from the command line by typing "go movetime 10000". Lc0 will run for 10 seconds. Then contribute the GPU, NPS and NN size 128, 256 or 384.

My 1060 6Gb
NN 128 - nps = 19.443
NN 256 - nps =  3.736
NN 384 - nps =  1.147
Back to top Go down
http://rebel13.nl/
Mclane

Mclane


Posts : 2921
Join date : 2020-11-17
Age : 57
Location : United States of Europe, Germany, Ruhr area

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySat May 01, 2021 8:41 pm

Mine says go is no command

„Unknown command line argument: go
Why not benchmark ?!
Also if we all use different nets the whole relation makes no sense.

I did benchmark with the amd RX470 , this is DX12, the nvidias use CUDA.


Total time 351347
Nps 727
Net j92-100

Total time 359207
Nps 694
Net j94-130


Last edited by Mclane on Sat May 01, 2021 9:05 pm; edited 2 times in total
Back to top Go down
http://www.thorstenczub.de
Admin
Admin
Admin


Posts : 2528
Join date : 2020-11-17
Location : Netherlands

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySat May 01, 2021 8:50 pm

Mclane wrote:
Mine says go is no command
Why not benchmark ?!
Also if we all use different nets the whole relation makes no sense.

1. First start Lc0, then go movetime 10000

2. These benchmarks on internet are not to be trusted when it comes to chess.

3. Your 1060 should give similar NPS to mine.


Back to top Go down
http://rebel13.nl/
Mclane

Mclane


Posts : 2921
Join date : 2020-11-17
Age : 57
Location : United States of Europe, Germany, Ruhr area

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySat May 01, 2021 9:14 pm

Admin wrote:
Mclane wrote:
Mine says go is no command
Why not benchmark ?!
Also if we all use different nets the whole relation makes no sense.

1. First start Lc0, then go movetime 10000

2. These benchmarks on internet are not to be trusted when it comes to chess.

3. Your 1060 should give similar NPS to mine.


I see. Mistake was to start lc0 directly with the command. You do 2 steps. Starting lc0 and then go movetime.

Amd rx470 (4GB) (Xeon 6 core) 627 NPS with j94-130
Nvidia 1060 (3 GB) (on the phenom II X6 core). 970 NPS j94-130
Nvidia 2070 (8 gb) (ryzen9 with 8 cores) 9021 NPS j94-130
Amd rx570 (8GB) (ryzen5 with 6 cores) 4710 NPS j94-130


Last edited by Mclane on Sat May 01, 2021 10:27 pm; edited 1 time in total
Back to top Go down
http://www.thorstenczub.de
mwyoung

mwyoung


Posts : 880
Join date : 2020-11-25
Location : USA

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySun May 02, 2021 4:25 am

Lc0 v27 running on a RTX 2080ti

NN 128 - 155,314 nps
NN 192 - 71,118 nps
NN 256 - 39,198 nps
NN 320 - 20,518 nps
NN 384 - 15,769 nps
NN 512 - 8,097 nps
Back to top Go down
Admin
Admin
Admin


Posts : 2528
Join date : 2020-11-17
Location : Netherlands

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySun May 02, 2021 12:04 pm

Mclane wrote:
Admin wrote:
Mclane wrote:
Mine says go is no command
Why not benchmark ?!
Also if we all use different nets the whole relation makes no sense.

1. First start Lc0, then go movetime 10000

2. These benchmarks on internet are not to be trusted when it comes to chess.

3. Your 1060 should give similar NPS to mine.


I see. Mistake was to start lc0 directly with the command. You do 2 steps. Starting lc0 and then go movetime.

Amd rx470 (4GB) (Xeon 6 core) 627 NPS with j94-130
Nvidia 1060 (3 GB) (on the phenom II X6 core). 970 NPS j94-130
Nvidia 2070 (8 gb) (ryzen9 with 8 cores) 9021 NPS j94-130
Amd rx570 (8GB) (ryzen5 with 6 cores) 4710 NPS j94-130

Numbers make sense since j94-130 is a typical 384x30 net.
Back to top Go down
http://rebel13.nl/
Admin
Admin
Admin


Posts : 2528
Join date : 2020-11-17
Location : Netherlands

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySun May 02, 2021 12:09 pm

mwyoung wrote:
Lc0 v27 running on a RTX 2080ti

NN 128 - 155,314 nps
NN 192 - 71,118 nps
NN 256 - 39,198 nps
NN 320 - 20,518 nps
NN 384 - 15,769 nps
NN 512 - 8,097 nps

Great!

And so that means that my poor 1060 runs 39198/3736 = 10,49197002141328 times slower  GPU speed benchmark 203c
Back to top Go down
http://rebel13.nl/
Mclane

Mclane


Posts : 2921
Join date : 2020-11-17
Age : 57
Location : United States of Europe, Germany, Ruhr area

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySun May 02, 2021 12:40 pm

There are years between these graphical cards.
Important is that the gpu fits to the cpu
Back to top Go down
http://www.thorstenczub.de
Admin
Admin
Admin


Posts : 2528
Join date : 2020-11-17
Location : Netherlands

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySun May 02, 2021 2:00 pm

Mclane wrote:
There are years between these graphical cards.
Important is that the gpu fits to the cpu

But it never can fit, apples and oranges. I think the CCRL folks found an elegant compromise, on the 40/2 only NN engines on RTX-2080 versus PC on 8 cores, and no NN engines on 40/15. Elegant, but it does not say anything about who is stronger. It's based on the assumption that SF is equal in strength to Lc0 in this configuration. A fair rating list can only be done on equal hardware.
Back to top Go down
http://rebel13.nl/
Mclane

Mclane


Posts : 2921
Join date : 2020-11-17
Age : 57
Location : United States of Europe, Germany, Ruhr area

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySun May 02, 2021 2:06 pm

Yes but even my 2070 lc0 is stronger then Stockfish 13 on 8 cores.
Back to top Go down
http://www.thorstenczub.de
mwyoung

mwyoung


Posts : 880
Join date : 2020-11-25
Location : USA

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySun May 02, 2021 2:27 pm

Admin wrote:
Mclane wrote:
There are years between these graphical cards.
Important is that the gpu fits to the cpu

But it never can fit, apples and oranges. I think the CCRL folks found an elegant compromise, on the 40/2 only NN engines on RTX-2080 versus PC on 8 cores, and no NN engines on 40/15. Elegant, but it does not say anything about who is stronger. It's based on the assumption that SF is equal in strength to Lc0 in this configuration. A fair rating list can only be done on equal hardware.

It can never be fair. What is fair. When Lc0 wins, or has the some cost in hardware, or has the same Lc0 ratio as the AZ match. I just rate Lc0 as Lc0 on x hardware. CPU or GPU with the net name. And rate Lc0 like a standalone dedicated chess computer.

The problem for Lc0 is you must rate the hardware and the net. As both are major factors in the overall rating of the group.

For example "CCRL Rating - 5. Lc0 0.26.3 t40-1541 RTX2080 3652 +16 −16 59.7% −61.5 62.4% 1094"

CCRL rating is misleading, as the t40 net is weak compared to the better nets.

The other issue is time controls. As Lc0 plays much better with more time. So playing only blitz is also misleading.
Back to top Go down
mwyoung

mwyoung


Posts : 880
Join date : 2020-11-25
Location : USA

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySun May 02, 2021 2:29 pm

Mclane wrote:
Yes but even my 2070 lc0 is stronger then Stockfish 13 on 8 cores.

At what time control and net. I had Stockfish beat my RTX with 1 core at fast time controls.
Back to top Go down
mwyoung

mwyoung


Posts : 880
Join date : 2020-11-25
Location : USA

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySun May 02, 2021 2:54 pm

Admin wrote:
mwyoung wrote:
Lc0 v27 running on a RTX 2080ti

NN 128 - 155,314 nps
NN 192 - 71,118 nps
NN 256 - 39,198 nps
NN 320 - 20,518 nps
NN 384 - 15,769 nps
NN 512 - 8,097 nps

Great!

And so that means that my poor 1060 runs 39198/3736 = 10,49197002141328 times slower  GPU speed benchmark 203c

Lc0 runs fast on tensor cores fp-16. And even a cheap RTX will work great. If you can find one.

And what is great about Lc0 is you can keep your GTX, and run Lc0 with any two cards. If you upgrade.
Back to top Go down
Dann Corbit




Posts : 188
Join date : 2020-11-26

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptyFri May 28, 2021 9:52 pm

I have 2x 2080 Super GPUs.
I used the net 68002 (one of the 384 nets, which tests strongest for my hardware).
I activated the tablebase files, because I always do that.
I set the threads higher than you are supposed to but I find it causes no problems ever, including with speed.

203629 nodes in 9.325 seconds = 21837 NPS

C:\Users\dannc\chess\lc0\lc0-27-cuda>lc0
_
| _ | |
|_ |_ |_| v0.27.0 built Feb 21 2021
Detected 32 core(s) and 64 thread(s) in 1 group(s).
Group 0 has 32 core(s) and 64 thread(s).
setoption name SyzygyPath value D:\syzygy
setoption name BackendOptions value backend=cuda-fp16,(gpu=0),(gpu=1)
setoption name Backend value multiplexing
setoption name WeightsFile value C:\Users\dannc\chess\lc0\net\68002.pb.gz
setoption name Threads value 8
go movetime 10000
Loading Syzygy tablebases from D:\syzygy
Found 746 WDL, 0 DTM and 740 DTZ tablebase files.
Loading weights file from: C:\Users\dannc\chess\lc0\net\68002.pb.gz
Creating backend [multiplexing]...
Creating backend [cuda-fp16]...
CUDA Runtime version: 11.1.0
Latest version of CUDA supported by the driver: 11.2.0
GPU: GeForce RTX 2080 SUPER
GPU memory: 8 Gb
GPU clock frequency: 1815 MHz
GPU compute capability: 7.5
Creating backend [cuda-fp16]...
CUDA Runtime version: 11.1.0
Latest version of CUDA supported by the driver: 11.2.0
GPU: GeForce RTX 2080 SUPER
GPU memory: 8 Gb
GPU clock frequency: 1815 MHz
GPU compute capability: 7.5
info depth 1 seldepth 2 time 2840 nodes 7 score cp 5 nps 636 tbhits 0 pv c2c4 e7e5
info depth 2 seldepth 3 time 2851 nodes 28 score cp 5 nps 1272 tbhits 0 pv g2g3 d7d5 g1f3
info depth 2 seldepth 4 time 2862 nodes 39 score cp 3 nps 1218 tbhits 0 pv e2e3 e7e6 d2d4 g8f6
info depth 3 seldepth 4 time 2862 nodes 102 score cp 6 nps 3187 tbhits 0 pv c2c4 e7e5 b1c3 g8f6
info depth 3 seldepth 5 time 2874 nodes 185 score cp 6 nps 4204 tbhits 0 pv c2c4 e7e5 g2g3 g8f6 f1g2
info depth 3 seldepth 6 time 2891 nodes 312 score cp 6 nps 5032 tbhits 0 pv c2c4 e7e5 g1f3 e5e4 f3d4
info depth 4 seldepth 6 time 2898 nodes 441 score cp 6 nps 6485 tbhits 0 pv c2c4 e7e5 g2g3 d7d5 c4d5 g8f6
info depth 4 seldepth 7 time 2920 nodes 566 score cp 6 nps 6219 tbhits 0 pv c2c4 e7e5 g2g3 g8f6 f1g2 d7d5 c4d5
info depth 4 seldepth 8 time 2943 nodes 944 score cp 6 nps 8353 tbhits 0 pv c2c4 e7e5 g2g3 g8f6 f1g2 d7d5 c4d5
info depth 5 seldepth 9 time 2974 nodes 1595 score cp 6 nps 11000 tbhits 0 pv c2c4 e7e5 g2g3 d7d5 c4d5 g8f6 d1a4 c7c6 f1g2
info depth 5 seldepth 10 time 3054 nodes 3232 score cp 6 nps 14428 tbhits 0 pv c2c4 e7e5 g2g3 g8f6 f1g2 d7d5 c4d5 f6d5 g1f3
info depth 5 seldepth 11 time 3109 nodes 4583 score cp 6 nps 16367 tbhits 0 pv c2c4 e7e5 g2g3 d7d5 c4d5 g8f6 f1g2 f6d5 g1f3 b8c6
info depth 5 seldepth 12 time 3187 nodes 6481 score cp 6 nps 18103 tbhits 0 pv c2c4 e7e5 g2g3 d7d5 c4d5 g8f6 d1a4 c7c6 f1g2 b7b5 a4b3
info depth 6 seldepth 12 time 3217 nodes 7130 score cp 6 nps 18423 tbhits 0 pv c2c4 e7e5 g2g3 d7d5 c4d5 g8f6 f1g2 f6d5 g1f3 b8c6 e1g1
info depth 6 seldepth 13 time 3258 nodes 8102 score cp 6 nps 18929 tbhits 0 pv c2c4 e7e5 g2g3 d7d5 c4d5 g8f6 d1a4 c7c6 f1g2 f6d5 b1c3
info depth 6 seldepth 14 time 3352 nodes 10397 score cp 6 nps 19917 tbhits 0 pv c2c4 e7e5 g2g3 g8f6 f1g2 d7d5 c4d5 f6d5 g1f3 b8c6 e1g1
info depth 6 seldepth 15 time 3441 nodes 12855 score cp 6 nps 21004 tbhits 0 pv c2c4 e7e5 g2g3 d7d5 c4d5 g8f6 f1g2 f6d5 g1f3 b8c6 e1g1 f8e7 d2d4
info depth 6 seldepth 16 time 3590 nodes 17085 score cp 6 nps 22480 tbhits 0 pv c2c4 e7e5 g2g3 d7d5 c4d5 g8f6 f1g2 f6d5 g1f3 b8c6 e1g1 d5b6 a2a3 f8e7
info depth 6 seldepth 17 time 3976 nodes 27787 score cp 6 nps 24246 tbhits 0 pv c2c4 e7e5 g2g3 d7d5 c4d5 g8f6 f1g2 f6d5 g1f3 b8c6 e1g1 d5b6 b2b3 f8e7 c1b2
info depth 7 seldepth 17 time 4032 nodes 29046 score cp 6 nps 24144 tbhits 0 pv c2c4 e7e5 g2g3 d7d5 c4d5 g8f6 f1g2 f6d5 g1f3 b8c6 e1g1 d5b6 b2b3 f8e7 c1b2
info depth 7 seldepth 18 time 4180 nodes 33014 score cp 6 nps 24436 tbhits 0 pv c2c4 e7e5 g2g3 d7d5 c4d5 g8f6 f1g2 f6d5 g1f3 b8c6 e1g1 d5b6 b2b3 f8e7 c1b2
info depth 7 seldepth 19 time 4270 nodes 35889 score cp 6 nps 24922 tbhits 0 pv c2c4 e7e5 g2g3 d7d5 c4d5 g8f6 f1g2 f6d5 g1f3 b8c6 e1g1 d5b6 b2b3 f8e7 c1b2
info depth 7 seldepth 19 time 4319 nodes 37121 score cp 6 nps 24930 tbhits 0 pv d2d4 d7d5 c2c4 e7e6 b1c3 c7c5 c4d5 e6d5 g1f3 b8c6 g2g3 g8f6 f1g2 f8e7 e1g1 e8g8
info depth 7 seldepth 20 time 4439 nodes 40720 score cp 6 nps 25291 tbhits 0 pv d2d4 d7d5 c2c4 e7e6 b1c3 c7c5 c4d5 e6d5 g1f3 b8c6 g2g3 f8e7 d4c5 d5d4 c3a4 b7b5
info depth 7 seldepth 21 time 4559 nodes 44334 score cp 6 nps 25626 tbhits 0 pv d2d4 d7d5 c2c4 e7e6 b1c3 c7c5 c4d5 e6d5 g1f3 b8c6 g2g3 g8f6 f1g2 f8e7 e1g1 e8g8
info depth 7 seldepth 21 time 4650 nodes 47004 score cp 6 nps 25812 tbhits 0 pv c2c4 e7e5 g2g3 d7d5 c4d5 g8f6 f1g2 f6d5 g1f3 b8c6 e1g1 d5b6 b2b3 f8e7 c1b2 f7f6
info depth 7 seldepth 22 time 4770 nodes 51064 score cp 6 nps 26308 tbhits 0 pv c2c4 e7e5 g2g3 d7d5 c4d5 g8f6 f1g2 f6d5 g1f3 b8c6 e1g1 d5b6 b2b3 f8e7 c1b2 f7f6
info depth 7 seldepth 22 time 5039 nodes 59642 score cp 6 nps 26987 tbhits 0 pv d2d4 d7d5 c2c4 e7e6 b1c3 c7c5 c4d5 e6d5 g1f3 b8c6 g2g3 f8e7 d4c5 d5d4 c3a4 b7b5 c5b6
info depth 7 seldepth 23 time 5249 nodes 66242 score cp 6 nps 27384 tbhits 0 pv d2d4 d7d5 c2c4 e7e6 b1c3 c7c5 c4d5 e6d5 g1f3 b8c6 g2g3 f8e7 f1g2 g8f6 e1g1 e8g8 c1g5
info depth 7 seldepth 24 time 5467 nodes 73554 score cp 6 nps 27882 tbhits 0 pv d2d4 d7d5 c2c4 e7e6 b1c3 c7c5 c4d5 e6d5 g1f3 b8c6 g2g3 f8e7 d4c5 d5d4 c3a4 b7b5 c5b6 a7b6
info depth 7 seldepth 25 time 5698 nodes 81074 score cp 6 nps 28258 tbhits 0 pv d2d4 d7d5 c2c4 e7e6 b1c3 c7c5 c4d5 e6d5 g1f3 b8c6 g2g3 f8e7 d4c5 d5d4 c3a4 b7b5 c5b6 a7b6 b2b3
info depth 7 seldepth 26 time 5839 nodes 85747 score cp 6 nps 28496 tbhits 0 pv d2d4 d7d5 c2c4 e7e6 b1c3 c7c5 c4d5 e6d5 g1f3 b8c6 g2g3 f8e7 d4c5 d5d4 c3a4 b7b5 c5b6 a7b6 b2b3
info depth 7 seldepth 27 time 5955 nodes 89582 score cp 6 nps 28666 tbhits 0 pv d2d4 d7d5 c2c4 e7e6 b1c3 c7c5 c4d5 e6d5 g1f3 b8c6 g2g3 f8e7 d4c5 d5d4 c3a4 b7b5 c5b6 a7b6 b2b3 b6b5
info depth 7 seldepth 28 time 6181 nodes 96684 score cp 6 nps 28843 tbhits 0 pv d2d4 d7d5 c2c4 e7e6 b1c3 c7c5 c4d5 e6d5 g1f3 b8c6 g2g3 f8e7 d4c5 d5d4 c3a4 b7b5 c5b6 a7b6 b2b3 b6b5
info depth 8 seldepth 28 time 6212 nodes 97649 score cp 6 nps 28864 tbhits 0 pv d2d4 d7d5 c2c4 e7e6 b1c3 c7c5 c4d5 e6d5 g1f3 b8c6 g2g3 f8e7 d4c5 d5d4 c3a4 b7b5 c5b6 a7b6 b2b3 b6b5
info depth 8 seldepth 29 time 7033 nodes 126489 score cp 6 nps 30094 tbhits 0 pv d2d4 d7d5 c2c4 e7e6 b1c3 c7c5 c4d5 e6d5 g1f3 b8c6 g2g3 f8e7 d4c5 d5d4 c3a4 b7b5 c5b6 a7b6 c1d2 c8d7 a1c1
info depth 8 seldepth 30 time 8623 nodes 180158 score cp 6 nps 31099 tbhits 0 pv d2d4 d7d5 c2c4 e7e6 b1c3 c7c5 c4d5 e6d5 g1f3 b8c6 g2g3 f8e7 d4c5 d5d4 c3a4 b7b5 c5b6 a7b6 b2b3 b6b5 a4b2 g8f6 f1g2 c8f5
info depth 8 seldepth 30 time 9325 nodes 203629 score cp 6 nps 31346 tbhits 0 pv d2d4 d7d5 c2c4 e7e6 b1c3 c7c5 c4d5 e6d5 g1f3 b8c6 g2g3 f8e7 d4c5 d5d4 c3a4 b7b5 c5b6 a7b6 b2b3 b6b5 a4b2 g8f6 f1g2 c8f5 e1g1
bestmove d2d4 ponder d7d5

TheSelfImprover likes this post

Back to top Go down
Dann Corbit




Posts : 188
Join date : 2020-11-26

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptyFri May 28, 2021 9:55 pm

I notice that the LC0 output claims 31346 NPS, so it must be accumulating all of the nodes from all of the previous plies, whereas I used the numbers from only the last ply.
So I think that the LC0 output is correct.

Admin likes this post

Back to top Go down
Dann Corbit




Posts : 188
Join date : 2020-11-26

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptyFri May 28, 2021 9:58 pm

mwyoung wrote:
Lc0 v27 running on a RTX 2080ti

NN 128 - 155,314 nps
NN 192 - 71,118 nps
NN 256 - 39,198 nps
NN 320 - 20,518 nps
NN 384 - 15,769 nps
NN 512 - 8,097 nps

Where do we find 512 nets? I would like to test them.
Back to top Go down
Admin
Admin
Admin


Posts : 2528
Join date : 2020-11-17
Location : Netherlands

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptyFri May 28, 2021 10:07 pm

When you have 2 GPU's can Lc0 profit from that, like a CPU can use 2 threads?
Back to top Go down
http://rebel13.nl/
Dann Corbit




Posts : 188
Join date : 2020-11-26

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptyFri May 28, 2021 10:19 pm

Admin wrote:
When you have 2 GPU's can Lc0 profit from that, like a CPU can use 2 threads?
That is right. But it has a limit, I think. Last I saw, three GPUs was the maximum that could be utilized effectively.
But as hardware advances things like that change, so I do not know if it is still true.
I sure bought mine at the right time.
It would cost me three or four times as much if I bought them today.
Back to top Go down
Dann Corbit




Posts : 188
Join date : 2020-11-26

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptyFri May 28, 2021 10:24 pm

Admin wrote:
When you have 2 GPU's can Lc0 profit from that, like a CPU can use 2 threads?
This is where I told LC0 to use both of my GPUs:

setoption name BackendOptions value backend=cuda-fp16,(gpu=0),(gpu=1)
setoption name Backend value multiplexing

The first setoption tells it what kind of math to use (cuda-fp16) and which GPUs to use (gpu=0),(gpu=1).
The second command tells it how to share the work (multiplexing).

I don't really understand the different backend values, but those bench best for me.
I have read a document on how to set the parameters (it is old) but it does not really explain anything.

Back to top Go down
mwyoung

mwyoung


Posts : 880
Join date : 2020-11-25
Location : USA

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySat May 29, 2021 12:42 am

Dann Corbit wrote:
mwyoung wrote:
Lc0 v27 running on a RTX 2080ti

NN 128 - 155,314 nps
NN 192 - 71,118 nps
NN 256 - 39,198 nps
NN 320 - 20,518 nps
NN 384 - 15,769 nps
NN 512 - 8,097 nps

Where do we find 512 nets?  I would like to test them.

Hello Dann,

I only know of one. Here is the link.

https://www.comp.nus.edu.sg/~sergio-v/test/512x40/
Back to top Go down
Dann Corbit




Posts : 188
Join date : 2020-11-26

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySat May 29, 2021 2:21 am

mwyoung wrote:
Dann Corbit wrote:
mwyoung wrote:
Lc0 v27 running on a RTX 2080ti

NN 128 - 155,314 nps
NN 192 - 71,118 nps
NN 256 - 39,198 nps
NN 320 - 20,518 nps
NN 384 - 15,769 nps
NN 512 - 8,097 nps

Where do we find 512 nets?  I would like to test them.

Hello Dann,

I only know of one. Here is the link.

https://www.comp.nus.edu.sg/~sergio-v/test/512x40/
Thanks, I am running a contest with it against 68002 at one minute base plus one second to get a rough idea of what we have.
Back to top Go down
Dann Corbit




Posts : 188
Join date : 2020-11-26

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySat May 29, 2021 6:58 am

Avert your eyes if you are not the sort who enjoys watching baby seals being clubbed to death by angry Canadians:
Code:

    Program        Elo    +   -   Games   Score   Av.Op.  Draws
  1 Lc0-68002   : 3311   60  52    80    78.1 %   3089   43.8 %
  2 Lc0-512       : 3089   52  60    80    21.9 %   3311   43.8 %
I planned a much longer run, but well, enough bloodshed.

So now the question remains:
Was it the net or the card?

I guess it takes at least twice as long to train a double wide net.
But whatever the case, this net is no match for 68002.
Back to top Go down
mwyoung

mwyoung


Posts : 880
Join date : 2020-11-25
Location : USA

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySat May 29, 2021 12:14 pm

Dann Corbit wrote:
Avert your eyes if you are not the sort who enjoys watching baby seals being clubbed to death by angry Canadians:
Code:

    Program        Elo    +   -   Games   Score   Av.Op.  Draws
  1 Lc0-68002   : 3311   60  52    80    78.1 %   3089   43.8 %
  2 Lc0-512       : 3089   52  60    80    21.9 %   3311   43.8 %
I planned a much longer run, but well, enough bloodshed.

So now the question remains:
Was it the net or the card?

I guess it takes at least twice as long to train a double wide net.
But whatever the case, this net is no match for 68002.

Are you turning on the multi-gather option? This seems to help NPS in many positions.
I ran some game in 12 hour time control games with the 512 net against Stockfish 13. I played 4 games and it draw all 4 games.

So it can play ok, but needs time. And my guess is the net was not fully trained.
Back to top Go down
Dann Corbit




Posts : 188
Join date : 2020-11-26

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySat May 29, 2021 9:30 pm

mwyoung wrote:

Are you turning on the multi-gather option?
Probably not. I do not know what most of the settings are for.
I have multiplex selected for the administration of the two cards, because I measured that method fastest, but I do not know anything about multi-gather or how to set it.
Back to top Go down
mwyoung

mwyoung


Posts : 880
Join date : 2020-11-25
Location : USA

GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark EmptySun May 30, 2021 2:08 am

I would turn on Multi-Gather. I am testing it now. And it looks like a must option to have turned ON.
In version Lc0 v28. The option will be turned on by default.





Back to top Go down
Sponsored content





GPU speed benchmark Empty
PostSubject: Re: GPU speed benchmark   GPU speed benchmark Empty

Back to top Go down
 
GPU speed benchmark
Back to top 
Page 1 of 2Go to page : 1, 2  Next

Permissions in this forum:You cannot reply to topics in this forum
ProDeo :: Computer Chess-
Jump to: