Le théorème de Bayes tient-il pour les attentes?

18

Est-il vrai que pour deux variables aléatoires et , $A$ $B$

E (A ∣ B) = E (B ∣ A) \frac{E (A)}{E (B)} ?

$E(A\mid B)=E(B\mid A)\frac{E(A)}{E(B)}?$

bayesian mathematical-statistics

— tomka
source

3

Hmm ... Je ne pense pas que ces deux côtés soient équivalents

— Jon

6

Comme indiqué dans les réponses, la question est probablement dénuée de sens en raison de l'intégration de variables aléatoires d'un côté qui sont les variables de conditionnement de l'autre.

— Xi'an

25

\begin{matrix} (1) & E [A ∣ B] \overset{?}{=} E [B ∣ A] \frac{E [A]}{E [B]} \end{matrix}

$E[A\mid B] \stackrel{?}= E[B\mid A]\frac{E[A]}{E[B]} \tag 1$ Le résultat conjecturé

(1)

$(1)$ est trivialement vrai pourindépendantsvariables aléatoires

A

$A$ et

B

$B$ avec des moyens non nuls.

Si , alors le côté droit de implique une division par et donc n'a pas de sens. Notez que si et $E[B]=0$ $(1)$ $0$ $(1)$ $A$ $B$ soient indépendants ou non n'est pas pertinent.

En général , ne s'applique pas aux variables aléatoires dépendantes mais des exemples spécifiques de et dépendants satisfaisant peuvent être trouvés. Notez que nous devons continuer d'insister sur le fait que , sinon le côté droit de n'a pas de sens. Gardez à l'esprit que est une variable aléatoire qui se trouve être une fonction de la variable aléatoire , disons $(1)$ $A$ $B$ $(1)$ $E[B]\neq 0$ $(1)$ $E[A\mid B]$ $B$ tandis que est unevariable aléatoirequi estfonctionde la variable aléatoire , disons . Donc, revient à demander si $g(B)$ $E[B\mid A]$ $A$ $h(A)$ $(1)$

peut être une vraie affirmation, et évidemment la réponse est quene peut pas être un multiple de

\begin{matrix} (2) & g (B) \overset{?}{=} h (A) \frac{E [A]}{E [B]} \end{matrix}

$g(B)\stackrel{?}= h(A)\frac{E[A]}{E[B]} \tag 2$

g (B)

$g(B)$

h (A)

$h(A)$ en général.

À ma connaissance, il n'y a que deux cas particuliers où peut tenir. $(1)$

Comme noté ci - dessus, pour indépendants des variables aléatoires et , et sont dégénérées variables aléatoires (appelées constantes par des gens statistiquement illettrés) que l' égalité et , respectivement, et donc si , nous avons l'égalité en $A$ $B$ $g(B)$ $h(A)$ $E[A]$ $E[B]$ $E[B]\neq 0$ $(1)$ .
À l'autre extrémité du spectre de l'indépendance, supposons que où est une fonction inversible et donc et sont des variables aléatoires entièrement dépendantes. Dans ce cas, $A=g(B)$ $g(\cdot)$ $A=g(B)$ $B=g^{-1}(A)$ et ainsi devient
$E [A ∣ B] = g (B), E [B ∣ A] = g^{- 1} (A) = g^{- 1} (g (B)) = B$ $E[A\mid B] = g(B), \quad E[B\mid A] = g^{-1}(A) = g^{-1}(g(B)) = B$ $(1)$ qui tient exactement lorsqueoùpeut être n'importe quel nombre réel non nul. Ainsi,est valable chaque fois queest un multiple scalaire de, et bien sûrdoit être non nul (cf.la réponse de Michael Hardy). Le développement ci-dessus montre quedoit être unefonctionlinéaireet que fonctionne $g (B) \overset{?}{=} B \frac{E [A]}{E [B]}$ $g(B)\stackrel{?}= B\frac{E[A]}{E[B]}$ $g(x) = \alpha x$ $\alpha$ $(1)$ $A$ $B$ $E[B]$ $g(x)$ $(1)$ ne peut pas tenir pour affine avec . Cependant, notez qu'Alecos Papadopolous dans sa réponse et ses commentairesaffirmeensuiteque si est unevariable aléatoirenormaleavec une moyenne non nulle, alors pourles valeursspécifiquesde et qu'il fournit, et satisfont . À mon avis, son exemple est incorrect. $g(x) = \alpha x + \beta$ $\beta \neq 0$ $B$ $\alpha$ $\beta\neq 0$ $A=\alpha B+\beta$ $B$ $(1)$

Dans un commentaire sur cette réponse, Huber a suggéré de considérer l'égalité conjecturée symétrique qui bien sûr est toujours valable pour les variables aléatoires indépendantes quelles que soient les valeurs de et et pour les multiples scalaires également. Bien sûr, plus trivialement, tout

\begin{matrix} (3) & E [A ∣ B] E [B] \overset{?}{=} E [B ∣ A] E [A] \end{matrix}

$E[A\mid B]E[B] \stackrel{?}=E[B\mid A]E[A]\tag{3}$

E [A]

$E[A]$

E [B]

$E[B]$

A = α B

$A = \alpha B$

(3)

$(3)$ vaut pour les variables aléatoires de moyenne nulle

et

(indépendantes ou dépendantes, multiples scalaires ou non; peu importe!):

est suffisant pour l'égalité dans

. Ainsi,

pourrait ne pas être aussi intéressant que

comme sujet de discussion.

A

$A$

B

$B$

E [A] = E [B] = 0

$E[A]=E[B]=0$

(3)

$(3)$

(3)

$(3)$

(1)

$(1)$

— Dilip Sarwate
source

9

+1. Pour être généreuse, la question pourrait être interprétée comme demandant si

, où la question de la division par zéro disparaît.

E (A | B) E (B) = E (B | A) E (A)

$E(A|B)E(B)=E(B|A)E(A)$

— whuber

1

@whuber Merci. Ma modification répond à la question plus générale de savoir s'il est possible d'avoir

.

E [A ∣ B] E [B] = E [B ∣ A] E [A]

$E[A\mid B]E[B]=E[B\mid A]E[A]$

— Dilip Sarwate

11

The result is untrue in general, let us see that in a simple example. Let $X \mid P=p$ have a binomial distribution with parameters $n,p$ and $P$ have the beta distrubution with parameters $(\alpha, \beta)$ , that is, a bayesian model with conjugate prior. Now just calculate the two sides of your formula, the left hand side is $\DeclareMathOperator{\E}{\mathbb{E}} \E X \mid P = nP$ , while the right hand side is

E (P ∣ X) \frac{E X}{E P} = \frac{α + X}{n + α + β} \frac{α / (α + β)}{n α / (α + β)}

$\E( P\mid X) \frac{\E X}{\E P} = \frac{\alpha+X}{n+\alpha+\beta} \frac{\alpha/(\alpha+\beta)}{n\alpha/(\alpha+\beta)}$ and those are certainly not equal.

— kjetil b halvorsen
source

2

The conditional expected value of a random variable $A$ given the event that $B=b$ is a number that depends on what number $b$ is. So call it $h(b).$ Then the conditional expected value $\operatorname{E}(A\mid B)$ is $h(B),$ a random variable whose value is completely determined by the value of the random variable $B$ . Thus $\operatorname{E}(A\mid B)$ is a function of $B$ and $\operatorname{E}(B\mid A)$ is a function of $A$ .

The quotient $\operatorname{E}(A)/\operatorname{E}(B)$ is just a number.

So one side of your proposed equality is determined by $A$ and the other by $B$ , so they cannot generally be equal.

(Perhaps I should add that they can be equal in the trivial case when the values of $A$ and $B$ determine each other, as when for example, $A = \alpha B, \alpha \neq 0$ and $E[B]\neq 0$ , when

E [A ∣ B] = α B = E [B ∣ A] \cdot α = E [B ∣ A] \frac{α E [B]}{E [B]} = E [B ∣ A] \frac{E [A]}{E [B]} .

$E[A\mid B] = \alpha B = E[B\mid A]\cdot\alpha = E[B\mid A]\frac{\alpha E[B]}{E[B]} = E[B\mid A]\frac{E[A]}{E[B]}.$ But functions equal to each other only at a few points are not equal.)

— Michael Hardy
source

You mean they are not necessarily equal? I mean they CAN be equal?

— BCLC

1

@BCLC : They are equal only in trivial cases. And two functions equal to each other at some points and not at others are not equal.

— Michael Hardy

2

"But only in that trivial case can they be equal" (emphasis added) is not quite correct. Consider independent

A

$A$ and

B

$B$ with

E [B] \neq 0

$E[B]\neq 0$ . Then,

E [A ∣ B] = E [A]

$E[A\mid B] = E[A]$ while

E [B ∣ A] = E [B]

$E[B\mid A] = E[B]$ and so

E [B ∣ A] \frac{E [A]}{E [B]} = E [B] \frac{E [A]}{E [B]} = E [A] = E [A ∣ B] .

$E[B\mid A] \frac{E[A]}{E[B]} = E[B]\frac{E[A]}{E[B]} = E[A] = E[A\mid B].$

— Dilip Sarwate

@DilipSarwate I was about to say that haha!

— BCLC

I edited your answer to add a few details for the case you pointed out. Please roll back if you don't like the changes.

— Dilip Sarwate

-1

The expression certainly does not hold in general. For the fun of it, I show below that if $A$ and $B$ follow jointly a bivariate normal distribution, and have non-zero means, the result will hold if the two variables are linear functions of each other and have the same coefficient of variation (the ratio of standard deviation over mean) in absolute terms.

For jointly normals we have

E (A ∣ B) = μ_{A} + ρ \frac{σ_{A}}{σ_{B}} (B - μ_{B})

$\operatorname{E}(A \mid B) = \mu_A + \rho \frac{\sigma_A}{\sigma_B}(B - \mu_B)$

and we want to impose

μ_{A} + ρ \frac{σ_{A}}{σ_{B}} (B - μ_{B}) = [μ_{B} + ρ \frac{σ_{B}}{σ_{A}} (A - μ_{A})] \frac{μ_{A}}{μ_{B}}

$\mu_A + \rho \frac{\sigma_A}{\sigma_B}(B - \mu_B) = \left[\mu_B + \rho \frac{\sigma_B}{\sigma_A}(A - \mu_A)\right]\frac{\mu_A}{\mu_B}$

⟹ μ_{A} + ρ \frac{σ_{A}}{σ_{B}} (B - μ_{B}) = μ_{A} + ρ \frac{σ_{B}}{σ_{A}} \frac{μ_{A}}{μ_{B}} (A - μ_{A})

$\implies \mu_A + \rho \frac{\sigma_A}{\sigma_B}(B - \mu_B) = \mu_A + \rho \frac{\sigma_B}{\sigma_A}\frac{\mu_A}{\mu_B}(A - \mu_A)$

Simplify $\mu_A$ and then $\rho$ , and re-arrange to get

B = μ_{B} + \frac{σ_{B}^{2}}{σ_{A}^{2}} \frac{μ_{A}}{μ_{B}} (A - μ_{A})

$B = \mu_B +\frac{\sigma^2_B}{\sigma^2_A}\frac{\mu_A}{\mu_B}(A - \mu_A)$

So this is the linear relationship that must hold between the two variables (so they are certainly dependent, with correlation coefficient equal to unity in absolute terms) in order to get the desired equality. What it implies?

First, it must also be satisfied that

E (B) \equiv μ_{B} = μ_{B} + \frac{σ_{B}^{2}}{σ_{A}^{2}} \frac{μ_{A}}{μ_{B}} (E (A) - μ_{A}) ⟹ μ_{B} = μ_{B}

$E(B) \equiv \mu_B = \mu_B+\frac{\sigma^2_B}{\sigma^2_A}\frac{\mu_A}{\mu_B}(E(A) - \mu_A) \implies \mu_B = \mu_B$

so no other restirction is imposed on the mean of $B$ ( or of $A$ ) except of them being non-zero. Also a relation for the variance must be satisfied,

Var (B) \equiv σ_{B}^{2} = {(\frac{σ_{B}^{2}}{σ_{A}^{2}} \frac{μ_{A}}{μ_{B}})}^{2} Var (A)

$\operatorname{Var}(B) \equiv \sigma^2_B = \left(\frac{\sigma^2_B}{\sigma^2_A}\frac{\mu_A}{\mu_B}\right)^2\operatorname{Var}(A)$

⟹ {(σ_{A}^{2})}^{2} σ_{B}^{2} = {(σ_{B}^{2})}^{2} σ_{A}^{2} {(\frac{μ_{A}}{μ_{B}})}^{2}

$\implies \left(\sigma^2_A\right)^2\sigma^2_B = \left(\sigma^2_B\right)^2\sigma^2_A\left(\frac{\mu_A}{\mu_B}\right)^2$

⟹ {(\frac{σ_{A}}{μ_{A}})}^{2} = {(\frac{σ_{B}}{μ_{B}})}^{2} ⟹ ({cv}_{A})^{2} = ({cv}_{B})^{2}

$\implies \left(\frac{\sigma_A}{\mu_A}\right)^2 = \left(\frac{\sigma_B}{\mu_B}\right)^2 \implies (\text{cv}_A)^2 = (\text{cv}_B)^2$

⟹ | {cv}_{A} | = | {cv}_{B} |

$\implies |\text{cv}_A| = |\text{cv}_B|$

which was to be shown.

Note that equality of the coefficient of variation in absolute terms, allows the variables to have different variances, and also, one to have positive mean and the other negative.

— Alecos Papadopoulos
source

1

Isn't this a convoluted way to

A = α B

$A = \alpha B$ where

α

$\alpha$ is some scalar?

— Matthew Gunn

1

@MatthewGunn Your comment is right on target. Normality has nothing to do with the matter. For random variables

A

$A$ and

B

$B$ such that

A = α B

$A = \alpha B$ ,

E [A ∣ B] = α B = A

$E[A\mid B] = \alpha B = A$ and similarly,

E [B ∣ A] = B

$E[B\mid A] = B$ . Consequently, assuming that

E [B] \neq 0

$E[B]\neq 0$ ,

E [A ∣ B] = α B = E [B ∣ A] \cdot α = E [B ∣ A] \frac{α E [B]}{E [B]} = E [B ∣ A] \frac{E [A]}{E [B]} .

$E[A\mid B] = \alpha B = E[B\mid A]\cdot\alpha = E[B\mid A]\frac{\alpha E[B]}{E[B]} = E[B\mid A]\frac{E[A]}{E[B]}.$ No normality, no

| c v_{A} | = | c v_{B} |

$|cv_A|=|cv_B|$ etc, and actually just a rehash of a comment in Michael Hardy's answer.

— Dilip Sarwate

If you write \text{Var} instaed of \operatorname{Var} then you'll see

a Var X

$a\text{Var}X$ and

a Var (X)

$a\text{Var}(X)$ instead of

a Var X

$a\operatorname{Var}X$ and

a Var (X) .

$a\operatorname{Var}(X).$ That's why the latter is standard usage.

— Michael Hardy

@MatthewGun It seems to me that providing answers that contain specific examples is considered valuable content in this site. So yes, when a random variable is an affine function of another, and they are jointly normal with non-zero means, then one needs to have equal coefficients of variation, while, also there are no restrictions on the means of these rv's. On the other hand, when a random variable is just a linear function of another, the relation holds always. So no my answer is not a convoluted way to say

A = a B

$A=aB$ . (cc:@DilipSarwate)

— Alecos Papadopoulos

2

If

B

$B$ is a non-normal random variable with

E [B] = μ_{B} \neq 0

$E[B]=\mu_B\neq 0$ and

A = c B + d

$A=c B+d$ (and so

B = \frac{A - d}{c}

$B=\frac{A-d}{c}$ ), then

E [A ∣ B] = c B + d = A, E [B ∣ A] = \frac{A - d}{c} = B .

$E[A\mid B]=cB+d=A, E[B\mid A]=\frac{A-d}{c}=B.$ Now, if we want to have

E [A ∣ B] = c B + d

$E[A\mid B]=cB+d$ to equal

E [B ∣ A] \cdot \frac{μ_{A}}{μ_{B}} = B \cdot \frac{μ_{A}}{μ_{B}}

$E[B\mid A]\cdot\frac{\mu_A}{\mu_B} =B\cdot\frac{\mu_A}{\mu_B}$ , it must be that

c B + d = B \cdot \frac{μ_{A}}{μ_{B}} ⟹ d = 0, c = \frac{μ_{A}}{μ_{B}}

$cB+d=B\cdot\frac{\mu_A}{\mu_B}\implies d=0,c=\frac{\mu_A}{\mu_B}$ and so

A = c B = \frac{μ_{A}}{μ_{B}} B

$A=cB=\frac{\mu_A}{\mu_B}B$ . So, for nonnormal

B

$B$ , the OP's conjectured result holds if

A = c B

$A=cB$ but not if

A = c B + d, d \neq 0

$A=cB+d, d\neq 0$ .Of course, as you have proved, the result holds for normal random variables if

A = c B + d, d \neq 0

$A=cB+d, d\neq 0$ .

— Dilip Sarwate