Let us consider two random variables
read as
Note that the equivalency between
The three sentences above mean the same.
Let us consider three random variables
read as
There is also another way of saying this, which will come useful in probabilistic graphical models:
This is to say the joint probability of
The story behind this expression is: if I already know
The story behind this expression is: if I already know
Note that the equivalency between
We have defined marginal and conditional independence. Then we want to read independence in the language of probabilistic graphical models. The following table covers six very fundamental graphical structure and the independence relationship they imply.
Probabilistic Graphical Model | Marginal Independence | Conditional Independence | |
---|---|---|---|
1 | |||
2 | |||
3 descendant | |||
4 descendant | |||
5 common parent | |||
6 collision |
The first five lines of the following table make intuitive sense. The one graphical structure where people make mistakes the most (okay, where I make mistakes the most) is the "collision":
Probabilistic graphical models in practice are much larger than just three random variables. If we face a large graph and are asked whether
Find all trails that link
Judge whether any of these trails are active. For a trail to be active, we need to check the nodes on the trail and see:
For collision nodes (like
For non-collision nodes, they are not observed.
If there exist one or more active trails,
We discriminate between collision nodes and non-collision nodes because only the collision node has different independence property among line 3-6 in our table. For probability influence to flow, we need to activate collision by observing them and avoid cutting caused by observing a non-collision node.
The point I tend to forget is that observing a descendant of a collision node also activates collision. This is because observing a node's descendant is indirectly observing the node itself. We used line 3,4, and 6 in the table in this reasoning, making this quite subtle.
The Market blanket of a random variable
The Markov boundary of a random variable
All parents of
All children of
All the other parents of the children of
It is obvious that we need all parents and children of
It seems like a mouthful to say "other parents of the children of
Usually, visualization is more intuitive than texts, tables, and equations. However, I find probabilistic graphical models are not that kind of visualization. They are not intuitive and easy to misunderstand for me. What is more intuitive is actually the textual story behind it. Then, I realized probabilistic graphical models are not graph visualization. They are models!
Therefore, I now think of probabilistic graphical models as yet another mathematical notation. Like any other funny-looking math notations, they are not intuitive at first sight and it takes practice to get them right. So, kindly grant ourselves patience. 🙃