Wednesday, 12 May 2010

pr.probability - Minimum Hamming Distance Distribution in a Random Subset of Binary Vectors+

Select K random binary vectors Yi of length m uniformly at random.



Let the following collection of random variables be defined: Xi,j=w(YioplusYj) where w(cdot) denotes the Hamming weight of a binary vector, i.e., the number of the nonzero coordinates in its argument. Define Dmin(Y1,ldots,YK) as the smallest of the Xi,j for ineqj.



Thus we have n=C(K,2)=K(K1)/2 non-independent random variables Xi,j with support {0,1,ldots,m} and individual distribution Bin(m,1/2). It seems to me that the random variables Xi,j will be s-wise negatively correlated (for s large enough) if distances between pairs chosen from a subcollection of Yi1,Yi2,ldots,Yiv where (v<K) tincreases then the distances between Yij and the remaining vectors will tend to decrease. Take s=v+1.



It is possible to get a bound on the following quantity. Fix w an integer less than m/2. The Hamming sphere of radius w has "volume", i.e., contains Vw(m)=sumws=0C(m,s) vectors and we approximately have to first order in the exponent
Vw(m)=2mH((w+1)/2)


where H(cdot) is the binary entropy function. Then, for a random uniform choice of the Yi for i=1,2,ldots,K it is clear that if the Hamming spheres centred at these vectors are disjoint then the minimum distance is at least 2w+1 thus
Pr[Dmingeq2w+1]leqfrac(2mV)2mfrac(2m2V)2mcdotsfrac(2m(K1)V)2m



where V=Vw(m). This means that, by replacing each fraction of the form (1x) by exp(x) where x>0 but small, we get the approximate upper bound
Pr[Dmingeq2w+1]leqexpleft[K(K1)V2/(2m+1right] which then expresses this upper bound in terms of the entropy function, which is nice. Unfortunately this upper bound is quite loose.



I will be happy with any pointers to literature or any other suggestions.

No comments:

Post a Comment