Statistics-Chapter1

Basic Concepts

(P3-P8) Sample Spaces and Events Ω sample space
ω outcome (point or element)
A event (subset of Ω)
A complement of A (not A)
A ∪ B union (A or B)
A ∩ B or AB intersection (A and B)
A − B set difference (ω in A but not in B)
A ⊂ B set inclusion
null event (always false)
Ω true event (always true)

Indicator Function :

  1. switch set concept to digital concept, like One-Hot and Mask concept in ML.
  2. the connection between probability and expectation.

Two Interpretations of Probability (P6)

  • The Frequentist View
  • Degree of beliefs(The Bayesian View)

Three Axioms (P5)

  1. P(A) ≥ 0 for every A
  2. P(Ω) = 1
  3. If A1, A2, … are disjoint then

Properties (P6-7) One can derive many properties of P from axioms

  • P(⌀) = 0
  • A ⊂ B  ⇒  P(A) ≤ P(B)
  • 0 ≤ P(A) ≤ 1
  • P(Ac) = 1 − P(A)
  • A ∩ B = ⌀  ⇒  P(A ∪ B) = P(A) + P(B)

Lemma(容斥原理): For any events A and B , P(A ∪ B) = P(A) + P(B) − P(AB)

Theorem (Continuity of Probabilities): If An → A , then P(An) → P(A) as n → ∞ . (Use Axiom3 to prove)

uniform probability distribution: To count number of points in an event, we can use combinational methods: The binomial coefficient "n choose k" is Properties: ,


Bayes

Independent Events (P8) Two events A and B are independent if P(AB) = P(A)P(B) A set of events {Ai : i ∈ I} is independent if P (⋂i ∈ JAi) = ∏i ∈ JP(Ai) for every finite subset J of I.

Conditional Probability (P10) If P(B) > 0 then the conditional probability of A given B is 1.13 Example. A medical test for a disease D has outcomes + and - . (其实就是混淆矩阵 Confusion Matrix)

D Dc
+ .009 (真阳性TP) .099(假阳性 FP)
- .001 (假阴性FN) .891 (真阴性TN)

如果前提有病/没病,能正确检测的概率是 90%(Recall)
但检测结果为阳性,真正得病的概率只有 8% (Precision) 这里原因是因为类别不平衡,健康人群基数太大。即使误报率(FP)只有 9.9%, 误报的数量淹没的真实病人的数量。所以不能忽视先验 (Prior) 而只看 Likelihood。

Lemma(P11) : For any events A and B with P(B) > 0 , P(AB) = P(A ∣ B)P(B) = P(B ∣ A)P(A)

The Law of Total Probability(P12) : Let {A1, …, Ak} be a partition of Ω . Then

Bayes' Theorem : (P12) , 可以看出 P(Ai ∣ B) ∝ P(B ∣ Ai) P(Ai). 上面那道例题也可套用 Bayes 解