Comment by kridsdale1

Comment by kridsdale1 a year ago

The systems I’ve use pre-allocate users effectively randomly an arm by hashing their user id or equivalent.

ivalm a year ago

To make sure user id U doesn’t always end up in eg control group it’s useful to concatenate the id with experiment uuid.

Reply View 0 replies

ryan-duve a year ago

How do you handle different users having different numbers of trials when calculating the "click through rate" described in the article?

Reply View 0 replies

s1mplicissimus a year ago

careful when doing that though! i've seen some big eyes when people assumed IDs to be uniform randomly distributed and suddenly their "test group" was 15% instead of the intended 1%. better generate a truely random value using your languages favorite crypto functions and be able to work with it without fear of busting production

Reply View 7 replies

np_tedious a year ago

The user ID is non uniform after hash and mod? How?

Reply View | 6 replies
- lern_too_spel a year ago
  
  If you mod by anything other than a power of two, it won't be. https://lemire.me/blog/2019/06/06/nearly-divisionless-random...
  
  Reply View | 4 replies
  
  np_tedious a year ago
  
  That article is mostly about speed. The following seems like the one thing that might be relevant:
  > Naively, you could take the random integer and compute the remainder of the division by the size of the interval. It works because the remainder of the division by D is always smaller than D. Yet it introduces a statistical bias
  That's all it says. Is the point here just that 2^31 % 17 is not zero, so 1,2,3 are potentially happening slightly more than 15,16? If so, this is not terribly important
  
  Reply View | 3 replies
- s1mplicissimus a year ago
  
  additional to the other excellent comments they will become non-uniform once you start deleting records. that will break all hopes you might have had in modulo and percentages being reliable partitions because the "holes" in your ID space could be maximally bad for whatever usecase you thought up.
  
  Reply View | 0 replies

hinkley a year ago

Just make sure you do the hash right so you don’t end up with cursed user IDs like EverQuest.

Reply View 0 replies