Algorithm to generate top N% of permutations with most dissimilarities

Let us consider a set of data with, say, 10 values. We need to estimate its characterstics by Monte Carlo method, that is with a large number of randomly generated permutated sets.

If we'd consider generation of all possible permutations, that would give too big number, almost impractical - even for 10 values n! is 3628800. We can generate a (relatively small) subset of permutations randomly, but this does not take into account that permutations are not equally "distinct".

Here we come to the question: is there an algorithm of permutations generation which would count permutations' dissimilarity and produce most noticable permutations first (so that we can stop it after, say, 1000 cycles)?

I don't have specific requirements on the "similarity" measure, so if anything like a distance, information gain, entropy, Spearman, etc. are involved, that's would be ok.

Currently I'm doing one random permutation like so (pseudocode):

void permutate(const int n, int &out[])
{
  for(int k = 0; k < n; ++k) out[k] = k;
  for(int k = 0; k < n - 1; ++k)
  {
    int i1 = rand() % (n - k - 1) + 1;
    swap(out, k, k + i1);
  }
}

and run this predefined number of times.

Here the probability of swapping neighbouring values is the same as swapping distant values.

asked Jan 4, 2024 at 17:44

Stan

8,77810 gold badges63 silver badges107 bronze badges

This sounds to me like a set cover problem.

500 - Internal Server Error
– 500 - Internal Server Error

2024-01-04 17:57:14 +00:00
Commented Jan 4, 2024 at 17:57
The first thing that comes to mind is to generate permutations iteratively, but at every iteration, you forbid element i from being at position j if it has already been in position j in a previously generated matching. Keep iterating until no permutation exists that satisfies this constraint.

Stef
– Stef

2024-01-04 18:03:27 +00:00
Commented Jan 4, 2024 at 18:03
en.wikipedia.org/wiki/Curse_of_dimensionality is your friend. It is hard for random permutations to be close to each other in any meaningful way. I'd actually worry more that an attempt to avoid it will cause some subtle structural features to get replicated.

btilly
– btilly

2024-01-04 18:20:07 +00:00
Commented Jan 4, 2024 at 18:20

Add a comment |

0 Your Answer

Sign up or log in

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.

Collectives™ on Stack Overflow

Algorithm to generate top N% of permutations with most dissimilarities

0

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

0

Know someone who can answer? Share a link to this question via email, Twitter, or Facebook.

Your Answer

Sign up or log in

Post as a guest