Wednesday, 19 October 2011

pr.probability - sum of order statistics

The inequality E(SK)geqE(Si) holds.



To avoid any doubt, let me be more specific. Let Y1,Y2,...,YN be a collection of random variables, and write X1geqX2geq...geqXN for their reordering in non-increasing order.



Suppose K<N is fixed and let SK be the sum of the K largest of the random variables, that is SK=X1+...+XK.



Let R be a random variable taking values in 0,1,...,N which is independent of the random variables Yi. The independence from the Yi is important of course (this is how I interpret your "based on some criteria". If the R is allowed to depend on the realisation of the Yi then all sorts of different behaviours are possible).



Now let SR be the sum of the R largest of the random variables, that is SR=X1+...+XR. (In your notation this is Si).



Suppose that ER=K. Then I claim that ESRleqESK, with equality iff R=K with probability 1. (Unless the Yi are somehow degenerate, in which case equality can occur in other cases as well).



Proof: Write pk=P(Rgeqk) for k=1,2,...,N. We have sumpk=ER=K.



Also



SR=sumNk=1XkI(Rgeqk)



so



ESR=sumNk=1P(Rgeqk)EXk=sumNk=1pkEXk.



(Here we used the independence of R from the Xi).



Consider maximising this sum subject to the constraints that sumpk=K and that
1geqp1geqp2geqp3geq....



Since the terms EXk are decreasing in k,
the maximum is achieved when pk=1 for kleqK and pk=0 for k>K.
(Provided the Yi are not degenerate, the terms EXk are strictly decreasing,
and this is the only way to achieve the maximum. If not, the maximum may be achieved in some other cases too).



That is, the maximum value of ESR occurs precisely if R is equal to K with probability 1.



It doesn't matter whether the Yi are identically distributed, and also they don't need to be independent. However, it is important that R is independent of the Yi.

No comments:

Post a Comment