Expectation is the average value of a random variable, variance measures dispersion around the mean, and covariance captures joint variability of two variables. These are the core moments used across probability and statistics.
\(\mathbb{E}[X] = \sum_x x\,p_X(x)\); linearity: \(\mathbb{E}[aX+bY]=a\mathbb{E}[X]+b\mathbb{E}[Y]\).\(\mathrm{Var}(X)=\mathbb{E}[X^2]-\mathbb{E}[X]^2\); scaling: \(\mathrm{Var}(aX+b)=a^2\mathrm{Var}(X)\).\(\mathrm{Cov}(X,Y)=\mathbb{E}[XY]-\mathbb{E}[X]\mathbb{E}[Y]\); independence ⇒ covariance 0 (but not conversely).\(\mathrm{Var}(\sum_i X_i)=\sum_i \mathrm{Var}(X_i) + 2\sum_{i. \(\rho_{X,Y}=\frac{\mathrm{Cov}(X,Y)}{\sigma_X\sigma_Y}\), in [−1,1].\(\mathbb{E}[X]=\mathbb{E}[\,\mathbb{E}[X\mid Y]~]\), \(\mathrm{Var}(X)=\mathbb{E}[\mathrm{Var}(X\mid Y)] + \mathrm{Var}(\mathbb{E}[X\mid Y])\).\(\mathbb{E}[\mathbf{1}_A]=\mathbb{P}(A)\); useful for counting arguments.\(\mathbb{E}[\sum_{i=1}^n X_i]\) without independence.\(\mathrm{Var}(X)=\mathbb{E}[(X-\mathbb{E}[X])^2]\) equals \(\mathbb{E}[X^2]-\mathbb{E}[X]^2\).\(\mathrm{Var}(X+Y)=\mathrm{Var}(X)+\mathrm{Var}(Y)+2\mathrm{Cov}(X,Y)\) and specialize to independent variables.