LabVIEW

cancel
Showing results for 
Search instead for 
Did you mean: 

Riffle is Biased!

Solved!
Go to solution

As the unofficial chairman of the National Riffle Association, I was seriously ticked off this morning to discover a major flaw in the LV implementation of the good, old Fisher Yates shuffle.  Fortunately, when it matters to me, I never use code that I can not trace back to the source, so I have always used my own implementations.  But myself, and one or perhaps two others, have tried to raise awareness of this potentially nifty VI.

 

I have included a VI which repeatedly Riffles an array of integers 0..9.  It then calculates the distribution of final position for each value as well as the distribution of values for each index in the riffled array.  This should be flat, but notice the bias near the edges.  For reference I have added a simple implementation of the FY shuffle, notice the lack of bias in the results.  Smiley Mad 

 

Other than license protection (Riffle is FDS+), why is the implementation of such a standard algorithm hiding in a CLFN?  Why is the algorithm, like many others not referenced?

 

Tested in LV9, LV10 on Windoze.

 

RiffleDistribution.pngFY Distribution.png

Message 1 of 21
(5,620 Views)

@Darin.K wrote:

Other than license protection (Riffle is FDS+), why is the implementation of such a standard algorithm hiding in a CLFN?


  • Allows reuse of the code in other products (such as CVI)?
  • Speed (an old VI, so you would have to test it in an old version to confirm)?

 


___________________
Try to take over the world!
0 Kudos
Message 2 of 21
(5,600 Views)

tst wrote:
  • Allows reuse of the code in other products (such as CVI)?
  • Speed (an old VI, so you would have to test it in an old version to confirm)?

 


I'll buy speed considering Ramp still uses a CLFN as well.  I'd like to see more G and less C from NI, although refurbishing older VIs is probably not a path to more sales.  Maybe I'll OpenG a VI or two one of these days.

0 Kudos
Message 3 of 21
(5,594 Views)

@Darin.K wrote:

As the unofficial chairman of the National Riffle Association,[...]


"I'll give up my Signal Operation VIs when someone pries my cold dead fingers from around their connector panes." ~ Charlton "CrossCorrelation" Heston

Jim
You're entirely bonkers. But I'll tell you a secret. All the best people are. ~ Alice
For he does not know what will happen; So who can tell him when it will occur? Eccl. 8:7

Message 4 of 21
(5,575 Views)

During some afternoon coffee, it was not too hard to figure out the blunder as it is one of the oldest ones in the book.  The algorithm being used by the Riffle VI takes each index in the array and swaps it with a randomly chosen element.  Nice and simple, right?  Unfortunately, in this case the method is generating 10^10 combinations when we all know that there are 10! = 3,6280,800 possible permutations.  Since 10^10 is not divisible by 10!, not all configurations are going to be equally represented, therefore bias.

 

The Fisher-Yates method generates the correct number of combinations so they are all equally likely.

 

The help for Riffle claims that both indices for the swap are chosen randomly, this would be even worse.

 

Choose 'Mod Riffle' to compare the method I described to the Riffle results.

Message 5 of 21
(5,566 Views)

It almost looks like a saddle function as the top surface.

What happens if you go beyond 9 say, to 99.

0 Kudos
Message 6 of 21
(5,565 Views)

@Darin.K wrote:

The help for Riffle claims that both indices for the swap are chosen randomly, this would be even worse.


I don't know anything about the riffle algorithm, but doing that should result in ~14% of the elements never been moved. I'm not sure what effect that would have on the distibution. You can see some details here - http://lavag.org/topic/3238-randomise-array/page__view__findpost__p__15997


___________________
Try to take over the world!
0 Kudos
Message 7 of 21
(5,524 Views)

Just some links to other related discussions:

 

Now available for download: "Randomize 1D Array.vi" that accepts any array type

Randomize Array

 

And yes, I agree this does not look good and should be fixed.

0 Kudos
Message 8 of 21
(5,522 Views)

@tst wrote:

@Darin.K wrote:

The help for Riffle claims that both indices for the swap are chosen randomly, this would be even worse.


I don't know anything about the riffle algorithm, but doing that should result in ~14% of the elements never been moved. I'm not sure what effect that would have on the distibution. You can see some details here - http://lavag.org/topic/3238-randomise-array/page__view__findpost__p__15997


I guess I did not leave it in the second test VI, but as you suspected choosing two random indices results in the original array being the most probable configuration. The graph shows a ridge along the diagonal with a value of roughly 14%.
0 Kudos
Message 9 of 21
(5,476 Views)

Form same casual benchmarking, your FY-shuffle beats riffle by about 25% if made into a subVI (debugging disabled). Curioulsy, it is only faster if either inlined or fully reentrant. It almost looks like a reentrant version is inlined even if not specifically so configured.

 

I have never seen a subVI that changes speed by a factor of two between reentrant and non-reentrant. I guess the code is small enough that the compiler decides to inline the reentrant version even if not secifically inlined. Maybe I am completely off. 😉

0 Kudos
Message 10 of 21
(5,472 Views)