probability distributions - Can the sum of two independent r.v.'s with convex support be uniformly distributed?

Monday, 15 February 2016

probability distributions - Can the sum of two independent r.v.'s with convex support be uniformly distributed?

Is it possible to prove that the sum of two independent r.v.'s $X$ and $Y$ with convex support cannot be uniformly distributed on an interval $[a,b]$ , with $a < b$ ?

(Let us rule out the trivial case where $X$ is degenerate and $Y$ is uniform.) Note that $X$ and $Y$ need not be identically distributed.

I have a strong intuition supporting a negative answer because commutativity gets $X+Y$ to "pile up" away from the tails, but I cannot find a way to pin this down to an impossibility proof.

Related questions on Math.SE:

1) Can the sum of two independent identical random variables be uniform? and Can the sum of two i.i.d. random variables be uniformly distributed? answer negatively under the additional assumption that $X$ and $Y$ are identically distributed.

2) in Can sum of two random variables be uniformly distributed @GEdgar answers positively under the assumption that the two independent r.v.'s $X$ and $Y$ have non-convex support.

Answer

Your intuition is correct : the answer is negative. Here is a proof.

Suppose, by contradiction, that $X$ and $Y$ are independent, non-degenerate variables with convex support, say $[c,d]$ and $[e,f]$ , and $X+Y$ is uniform on $[c+e,d+f]$ . Replacing $(X,Y)$ with $(X-c,Y-e)$ , we may assume without loss that $c=e=0$ . Interchanging $X$ and $Y$ if necessary, we may further assume that $d\leq f$ . If $d

At this point, we then have $X,Y$ independent with support exactly equal to $[0,1]$ , and $X+Y$ distributed uniformly on $[0,2]$ . Now, define the following events :

$\begin{array}{cccl} X_1= \lbrace X\in[0,\frac{1}{3}] \rbrace, & X_2= \lbrace X\in(\frac{1}{3},\frac{2}{3}] \rbrace, & X_3= \lbrace X\in(\frac{2}{3},1] \rbrace, \\ Y_1= \lbrace Y\in[0,\frac{1}{3}] \rbrace, & Y_2= \lbrace Y\in(\frac{1}{3},\frac{2}{3}] \rbrace, & Y_3= \lbrace Y\in(\frac{2}{3},1] \rbrace, & \end{array}$

$P_k=\cup_{i+j=k}X_i\times Y_j, Q_l=\cup_{k=2}^l P_k, L(x)=\lbrace X+Y \leq x \rbrace$

We then have the inclusions :

$L\big(\frac{l-1}{3}\big) \subseteq Q_l \subseteq L\big(\frac{l}{3}\big)$

Putting $q_l=P(Q_l)$ , we deduce

$\frac{l-1}{6}\leq q_l \leq \frac{l}{6} \tag{1}$

Next, put $x_k=P(X_k), y_k=P(Y_k), p_k=P(P_k)$ . It follows from

(1) that the following number is nonnegative :

$s=(1-x_1)\big(q_2-\frac{1}{6}\big)+x_1\big(\frac{2}{3}-q_4\big)+x_1x_2y_2 \tag{2}$

Clearly $q_2=x_1y_1$ . And using $x_3=1-(x_1+x_2)$ and $y_3=1-(y_1+y_2)$ , we find that

$q_4 =x_1y_1+x_2y_1+x_1y_2+x_1y_3+x_2y_2+x_3y_1=x_1+y_1+x_2y_2-x_1y_1 \tag{3}$

Reinjecting this back into (2), we obtain

$s=\bigg(x_1-\frac{1}{2}\bigg)\bigg(\frac{1}{3}-x_1\bigg) \tag{4}$

It follows that $x_1\in[\frac{1}{3},\frac{1}{2}]$ . Using natural symmetries of the problem, it follows that $x_3,y_1$ and $y_3$ are also in $[\frac{1}{3},\frac{1}{2}]$ .

Next, consider

$\begin{array}{clc} t_1 &=& (x_1x_3-x_2^2)\big(\frac{5}{6}-q_5\big)+x_3x_2\big(\frac{2}{3}-q_4\big)+x_3^2\big(q_3-\frac{1}{3}\big) \\ t_2 &=& (x_1x_3-x_2^2)\big(q_2-\frac{1}{6}\big)+x_1x_2\big(q_3-\frac{1}{3}\big)+ x_1^2\big(\frac{2}{3}-q_4\big) \\ t &=& t_1+t_2 \end{array}$

Note that since $x_1x_3 \geq \frac{1}{9} \geq x_2^2$ , the numbers $t_1,t_2$ and $t$ are all nonnegative. Simplifying $t$ as we did for $s$ , we obtain :

$t=-\frac{x_2}{4}(1+x_2)(\frac{1}{3}-x_2)-\frac{3(x_3-x_1)^2}{4}\bigg(\frac{4}{9}-x_2\bigg) \tag{5}$

In (5) above we have a nonnegative LHS and a nonpositive RHS. Note also that $x_2>0$ because the support of $X$ is $[0,1]$ . This forces $x_2=\frac{1}{3}$ and

$(x_3-x_1)^2=0$ , so that $x_1=x_2=x_3=\frac{1}{3}$ , and similarly $y_1=y_2=y_3=\frac{1}{3}$ . But then $q_2=\frac{1}{9}$ contradicts (1). This finishes the proof.

UPDATE: Since (5) was questioned in the comments, I append a computer verification of it.

                  GP/PARI CALCULATOR Version 2.7.4 (released)
          i386 running darwin (x86-64/GMP-6.0.0 kernel) 64-bit version
compiled: Sep 27 2015, Apple LLVM version 6.0 (clang-600.0.57) (based on LLVM 3.5svn)
                            threading engine: single
                 (readline v6.3 enabled, extended help enabled)


                     Copyright (C) 2000-2015 The PARI Group

PARI/GP is free software, covered by the GNU General Public License, and comes 
WITHOUT ANY WARRANTY WHATSOEVER.

Type ? for help, \q to quit.
Type ?12 for how to get moral (and possibly technical) support.

parisize = 8000000, primelimit = 500000
? x3=1-(x1+x2)

%1 = -x1 + (-x2 + 1)
? y3=1-(y1+y2)
%2 = -y1 + (-y2 + 1)
? 
? x(k)=if((1<=k)&&(k<=3),eval(Str("x",k)),0)
%3 = (k)->if((1<=k)&&(k<=3),eval(Str("x",k)),0)
? y(k)=if((1<=k)&&(k<=3),eval(Str("y",k)),0)
%4 = (k)->if((1<=k)&&(k<=3),eval(Str("y",k)),0)
? 
? p(k)=sum(j=1,k,x(j)*y(k-j))

%5 = (k)->sum(j=1,k,x(j)*y(k-j))
? 
? q(l)=sum(k=2,l,p(k))
%6 = (l)->sum(k=2,l,p(k))
? 
? s_term=(1-x1)*(q(2)-(1/6))+x1*((2/3)-q(4))+x1*x2*y2
%7 = -x1^2 + 5/6*x1 - 1/6
? 
? check1=s_term-(x1-(1/3))*((1/2)-x1)
%8 = 0

? 
? t1_term=(x1*x3-(x2^2))*((5/6)-q(5))+x3*x2*((2/3)-q(4))+(x3^2)*(q(3)-(1/3))
%9 = x1^3 + (3*x2 - 13/6)*x1^2 + (3*x2^2 - 25/6*x2 + 3/2)*x1 + (x2^3 - 11/6*x2^2 + 4/3*x2 - 1/3)
? t2_term=(x1*x3-(x2^2))*(q(2)-(1/6))+x1*x2*(q(3)-(1/3))+(x1^2)*((2/3)-q(4))
%10 = -x1^3 + 5/6*x1^2 + (-1/6*x2 - 1/6)*x1 + 1/6*x2^2
? t_term=t1_term+t2_term
%11 = (3*x2 - 4/3)*x1^2 + (3*x2^2 - 13/3*x2 + 4/3)*x1 + (x2^3 - 5/3*x2^2 + 4/3*x2 - 1/3)
? 
? easy_part=(x2/4)*(x2+1)*((1/3)-x2)
%12 = -1/4*x2^3 - 1/6*x2^2 + 1/12*x2

? hard_part=(3/4)*((x3-x1)^2)*((4/9)-x2)
%13 = (-3*x2 + 4/3)*x1^2 + (-3*x2^2 + 13/3*x2 - 4/3)*x1 + (-3/4*x2^3 + 11/6*x2^2 - 17/12*x2 + 1/3)
? 
? check2=(t_term+(easy_part+hard_part))
%14 = 0

Blog

Monday, 15 February 2016