Key points
- EM: An iterative technique to estimate probability models for data with missing components or information
- By iteratively “completing” the data and re-estimating the parameters
- PCA: Is actually a generative model for Gaussian data
- Data lie close to a linear manifold, with orthogonal noise
- A linear autoencoder! (see the sketch after this list)
- Factor Analysis: Also a generative model for Gaussian data
- Data lie close to a linear manifold
- Like PCA, but without directional constraints on the noise (not necessarily orthogonal)
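The "PCA is a linear autoencoder" point can be made concrete with a short sketch: projecting centered data onto the top principal directions (encode) and mapping back with the same directions (decode) is a linear autoencoder with tied weights. The data, array names, and dimensions below are illustrative assumptions, not taken from the notes.

```python
import numpy as np

# Illustrative data lying near a 2-D linear manifold in 5-D, with small noise.
rng = np.random.default_rng(0)
Z = rng.normal(size=(500, 2))                      # latent points on the manifold
W_true = rng.normal(size=(2, 5))
X = Z @ W_true + 0.1 * rng.normal(size=(500, 5))   # observed data

# PCA via SVD of the centered data; rows of Vt are the principal directions.
Xc = X - X.mean(axis=0)
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
W = Vt[:2]                                         # top-2 principal directions

# Read as a linear autoencoder: encode with W, decode with W^T (tied weights).
code = Xc @ W.T       # encoder: project onto the principal subspace
recon = code @ W      # decoder: map back to data space
print(np.mean((Xc - recon) ** 2))                  # small reconstruction error
```

Minimizing this reconstruction error over all rank-2 linear encode/decode pairs recovers exactly the PCA subspace, which is the sense in which PCA is a linear autoencoder.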
Generative models
Learning a generative model
- You are given some set of observed data $X = \{x\}$
- You choose a model $P(x; \theta)$ for the distribution of $x$
- $\theta$ are the parameters of the model
- Estimate $\theta$ such that $P(x; \theta)$ best “fits” the observations $X = \{x\}$ (see the sketch after this list)
- How to define “best fits”?
- Maximum likelihood!
- Assumption: The data you have observed are very typical of the process
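As a minimal sketch of what "estimate $\theta$ so that $P(x; \theta)$ best fits the observations" means under maximum likelihood, consider a 1-D Gaussian model with $\theta = (\mu, \sigma^2)$; the data and variable names below are illustrative assumptions.

```python
import numpy as np

# Hypothetical observed data X = {x}.
rng = np.random.default_rng(0)
X = rng.normal(loc=2.0, scale=1.5, size=500)

# Maximum-likelihood estimates of theta = (mu, sigma^2) for a Gaussian P(x; theta):
# the parameter values that maximize the log-likelihood of the observations.
mu_ml = X.mean()
var_ml = X.var()          # ML variance uses 1/N (not the unbiased 1/(N-1))

log_likelihood = np.sum(-0.5 * np.log(2 * np.pi * var_ml)
                        - (X - mu_ml) ** 2 / (2 * var_ml))
print(mu_ml, var_ml, log_likelihood)
```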
EM algorithm
- Tackles the problem of missing data or information in model estimation
- Let $o$ be the observed data and $h$ the hidden (missing) data; then

$$\log P(o) = \log \sum_{h} P(h, o) = \log \sum_{h} Q(h) \frac{P(h, o)}{Q(h)}$$

- The logarithm is concave, so by Jensen's inequality

$$\log \sum_{h} Q(h) \frac{P(h, o)}{Q(h)} \geq \sum_{h} Q(h) \log \frac{P(h, o)}{Q(h)}$$
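As a concrete instance of iteratively "completing" the data and re-estimating parameters, here is a minimal sketch of EM for a two-component 1-D Gaussian mixture, where the hidden variable $h$ is the component that generated each observation: the E-step sets $Q(h) = P(h \mid o)$, and the M-step maximizes $\sum_{h} Q(h) \log P(h, o)$ over the parameters. The mixture model and all names below are illustrative assumptions, not taken from the notes.

```python
import numpy as np

rng = np.random.default_rng(1)
# Illustrative observed data o: samples from a mixture of two 1-D Gaussians.
o = np.concatenate([rng.normal(-2.0, 1.0, 300), rng.normal(3.0, 1.0, 200)])

# Initial parameters theta = (weights, means, variances) for K = 2 components.
w = np.array([0.5, 0.5])
mu = np.array([-1.0, 1.0])
var = np.array([1.0, 1.0])

for _ in range(50):
    # E-step: "complete" the data by setting Q(h) = P(h | o; theta),
    # the posterior responsibility of each component for each observation.
    dens = (w / np.sqrt(2 * np.pi * var)
            * np.exp(-(o[:, None] - mu) ** 2 / (2 * var)))   # shape (N, 2)
    Q = dens / dens.sum(axis=1, keepdims=True)

    # M-step: re-estimate theta by maximizing sum_h Q(h) log P(h, o; theta).
    Nk = Q.sum(axis=0)
    w = Nk / len(o)
    mu = (Q * o[:, None]).sum(axis=0) / Nk
    var = (Q * (o[:, None] - mu) ** 2).sum(axis=0) / Nk

print(w, mu, var)   # estimates move toward the generating parameters
```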