
Here's another one: it's easy and my program finds it within 1 second (and less than 0.5 million nodes)
Yes Rein Damy needs also less than 1 second. The kind of combination is here very similar to the previous one isn't it?Rein Halbersma wrote:
Here's another one: it's easy and my program finds it within 1 second (and less than 0.5 million nodes)
I did not remember it, but after you showed the other position by Sijbrands, this one and the previous one popped up in my head again. This one is from GM Scholma IIRC, and it was published almost 20 years ago in Sijbrands's magazine "Dammen".TAILLE wrote:Yes Rein Damy needs also less than 1 second. The kind of combination is here very similar to the previous one isn't it?Rein Halbersma wrote:
Here's another one: it's easy and my program finds it within 1 second (and less than 0.5 million nodes)
BTW I easily recognize this position I proposed myself on this forum some time ago
Oops I am not able to really answer your questions because we have obviously different definitions for the depth of a tree or for a node/leaf. Essentially, this is the consequence of several factors:Rein Halbersma wrote:My program solves this position after search of 13 ply and a tree of 7.5 million nodes (= separate calls to search() function) and 7 seconds. My effective branching factor is around 3.38 for this position. What are some numbers for your search?TAILLE wrote:Rein,
This problem seems far easier. I do not have the fractions of second for my time measure, butRein Halbersma wrote:
Gerard, another challenge for your new algorithm: how long does it take to find the winning move?
Damy resolves the problem (34-30, 50-44) in less than 2 seconds. Do you have similar measure?
After 0.02 sec Damage finds the right move ( 7 Ply search)Hi,
After quite a very long time I have just managed to reach a quite stable new search algorithm for Damy.
Can you tell me how many time your program needs to resolve the following Sijbrands composition?
White to move : +1
[
Damage finds the right Move and Score after 0.08 sec (Ply 10).Gerard, another challenge for your new algorithm: how long does it take to find the winning move?![]()
Damage finds the right move after 0.02 sec Ply 8 and around 30K nodes.Here's another one: it's easy and my program finds it within 1 second (and less than 0.5 million nodes)
Code: Select all
Date W L D U P P%
16-sep-2012 7 3 148 0 162 51,3%
28-sep-2012 14 7 137 0 165 52,2%
1-Oct-2012 4 11 143 0 151 47,8%
10-oct-2012 2 8 148 0 152 48,1%
13-oct-2012 5 11 141 1 151 48,1%
16-oct-2002 4 11 143 0 151 47,8%
19-Oct-2012 3 9 146 0 152 48,1%
23-Oct-2012 5 9 143 1 153 48,7%
Try and run the first 2 matches through BayesELo, compute rating difference, confidence interval and likelihood of superiority. Then repeat with all matches. You'll be surprised how big the rating uncertainty is from short (~300) matches with differences in the 10-15 ELO range.BertTuyt wrote:I didn't post for some time the reason i was playing Engine matches
After 2 won matches, i started to test IID, which was not a success.
Then I restarted with the old source (at least that was what I thought) to get a feeling for statistics.
But for an unknown reason Damage was not able to win a 158 games match anymore
Also the other matches revealed little differences
See below 8 Engine Matches.
I don't have a good explanation yet?
Based on code comparison , I could not find any clue so far.
What I also cant imagine is that the first 2 results were statistical fluctuations, or that Kingsrow sometimes has a bad day (or days), or that during initialization something weird can happen with Kingsrow with a lower strength as a result. Maybe Ed has more experience.
Anyway, the difference is not so dramatic, but I'm puzzled.
BertCode: Select all
Date W L D U P P% 16-sep-2012 7 3 148 0 162 51,3% 28-sep-2012 14 7 137 0 165 52,2% 1-Oct-2012 4 11 143 0 151 47,8% 10-oct-2012 2 8 148 0 152 48,1% 13-oct-2012 5 11 141 1 151 48,1% 16-oct-2002 4 11 143 0 151 47,8% 19-Oct-2012 3 9 146 0 152 48,1% 23-Oct-2012 5 9 143 1 153 48,7%
Michel,MichelG wrote:Statistics is hard
In fact, if you see any result anywhere that says statement X was proven with 95% (2 sigma) chance, then statement X is probably not true. Sadly this happens in a lot of fields, even in cancer research, because of the misuse of statistics.
Reins reasoning that there is only a very small chance of such a fluke is the right answer to the wrong question. Try answering this one:
how big is a chance, when playing a match of games that you end at an self-chosen point, gives a result that is remarkable?
I won't do the math, but it's goiing to be fairly big.
The point is that a 95% significant statistic result does not mean that there is a 95% chance that something is true. Looking at bert's table for instance, i don't think you can conclude that the version of match 1 & 2 outperformed the version in the later matches.Rein Halbersma wrote:
Michel,
What exactly are you trying to say? How would you propose to test for engine improvements? Do you see any role for statistics there?
Rein
Code: Select all
Date W L D U P P%
16-sep-2012 7 3 148 0 162 51,3%
28-sep-2012 14 7 137 0 165 52,2%
1-Oct-2012 4 11 143 0 151 47,8%
10-oct-2012 2 8 148 0 152 48,1%
13-oct-2012 5 11 141 1 151 48,1%
16-oct-2002 4 11 143 0 151 47,8%
19-Oct-2012 3 9 146 0 152 48,1%
23-Oct-2012 5 9 143 1 153 48,7%
25-Oct-2012 2 5 150 1 154 49,0%
28-Oct-2012 5 8 145 0 155 49,1%
30-Oct-2012 3 5 150 0 156 49,4%
Bert,BertTuyt wrote: After 2 won matches, i started to test IID, which was not a success.
Then I restarted with the old source (at least that was what I thought) to get a feeling for statistics.