soft-clustering

This forum is inactive. Browsing/searching possible.

Connect your moderator Slack workspace to receive post notifications:

Hey, from this equation, it looks like the probability that xn is in cluster k ( (\pi)k) doesn't depend on n, which means that all the data points have the same probability distribution to be in every cluster, shouldn't it be ( (\pi)kn) ? which means that every point has it's own vector ( (\pi)n) which expresses the probability that this specific point to be in each cluster? otherwise it is the same for all of them,
Thanks,

7 Jan '22 ·

anonymous

Top comment

You should understand \(\pi_k\) as the relative size of the k-th cluster. The probability of a data point \(x_n\) being in cluster k is \(P(x_n\mid z_n=k)\) which is modeled as a gaussian distribution. One very important point is that \(z_n\) is not observed, only \(x_n\) is, so to decide to which cluster a data point \(x_n\) belongs, you should compute the posterior \(P(z_n\mid x_n)\).

7 Jan '22 · 1 ·

el mahdi chayti

Page 1 of 1

Add comment

How to style: strictly use the or click here. E.g., \(\alpha + \beta\) gives (inline) \(\alpha + \beta\). No \(\LaTeX\) preview (yet).