Just a quick follow-up and progress-to-date on the discussion some of us were having concerning a technique for speeding up sims by not shuffling each shoe. The following sims were run with a modified version of ET Fan's PowerSim which initializes 250 arrays of dealing sequences and uses each once before reshuffling the shoe. (The first array is initialized with the numbers 1 through the total number of cards, and each successive sequence array is a copy of the previous, which is then shuffled using the same random-swapping techinque as shuffing the shoe. These arrays then determine the sequence that cards are dealt from the shoe, rather than using 1, 2, 3,... each time. After dealing from each of the 250 sequences once, the shoe itself is shuffled and the process repeated.)
As I previously reported, this techique cuts the PowerSim execution times by 40% to 45%. The question is, does the techique produce useful results?
Each of the following summarized result lines represents 5 billion rounds simmed, but the stats were collected each one million rounds, so there were 5000 sets of data points in each sim. Stats were collected for the IBA, the per-hand variance, the number of doubles, splits, player BJs, and dealer BJs. The purpose of analyzing 5000 one-million-round results was to compare the means and standard deviations of the 5000 samples to determine if the standard shuffle and the "fast shuffle" had similar statistical characteristics. (For example, if the standard deviations were higher using the fast shuffle, that would indicate that the true significance of the samples was less than expected, even if the means happened to be similar.) Sims were run for 1-, 2-, 6- and 8-deck games using the same rules and strategy files (75% pen, S17 DAS DOA, using generic strategy, except for the 2-deck game, for which ET Fan had a custom strategy for H17 DAS so I used that). In the chart, PS is the original PowerSim and PSx2 is the fast-shuffle version:
----- IBA ------- -- VARIANCE --- -- DOUBLES --- -- SPLITS --- - PLAYER BJ - - DEALER BJ -
Mean StdDev Mean StdDev Mean StdDev Mean StdDev Mean StdDev Mean StdDev
-------- -------- ------- ------- -------- ----- ------- ----- ------- ----- ------- -----
1-DECK PS -0.001956 0.001132 1.31111 0.00117 102146.8 302.2 20717.1 152.8 47944.4 199.0 47945.2 208.5
PSx2 -0.001962 0.001132 1.31109 0.00115 102136.2 296.2 20719.4 149.6 47942.7 204.3 47947.8 203.8
2-DECKS PS -0.004509 0.001149 1.35176 0.00129 112250.9 313.2 25867.0 177.9 47620.6 204.8 47617.5 205.2
PSx2 -0.004569 0.001145 1.35173 0.00125 112239.5 306.7 25868.6 174.1 47623.1 202.9 47622.4 200.1
6-DECKS PS -0.004252 0.001129 1.33212 0.00128 103974.1 300.4 27864.9 182.3 47426.0 202.7 47424.8 204.2
PSx2 -0.004257 0.001139 1.33213 0.00129 103971.3 297.3 27869.3 184.7 47426.2 203.5 47423.7 201.6
8-DECKS PS -0.004508 0.001145 1.33329 0.00129 104065.3 300.3 28239.6 184.3 47400.7 202.6 47407.3 202.2
PSx2 -0.004465 0.001137 1.33330 0.00131 104064.3 300.4 28241.3 185.1 47405.5 202.2 47407.4 204.3
In general, I believe the results were pretty good; i.e. I believe that if given a third set of data using one or the other sim, I think it would be hard to tell which it was. (But I do intend to try that experiment.)
Another test of the results is to see if the 5000 data sets seem to follow a normal distribution. If so, then about 68% of the results should fall within one StdDev of the mean and about 95% should fall within two StdDevs. Each of the data sets easily meet that criteria when compared to there own means and StdDevs, but what if we take the means and StdDevs of the original PowerSim as truly representative of the poplulation, and check to see if the 5000 fast shuffle samples also meet that criteria? The following chart shows what percentage of the PowerSimX2 samples fall within 1 and 2 StdDevs of the PowerSim means:
IBA DBLS SPLS P-BJ D-BJ
----- ----- ----- ----- -----
1-DECK 1 SD 0.687 0.693 0.688 0.668 0.696
2 SD 0.954 0.956 0.956 0.948 0.959
1-DECKS 1 SD 0.689 0.694 0.692 0.685 0.694
2 SD 0.955 0.960 0.961 0.957 0.961
1-DECKS 1 SD 0.684 0.694 0.682 0.680 0.688
2 SD 0.952 0.954 0.951 0.954 0.960
1-DECKS 1 SD 0.680 0.683 0.686 0.680 0.677
2 SD 0.958 0.953 0.951 0.955 0.954
Again, I believe the results are pretty good, but I'm completely open to criticism of the analysis and suggestions for further tests.