Deeplearninganditsapplicationstosignalandinformationprocessing资源-CSDN下载

deep

learning

5星 · 超过95%的资源需积分: 9 167 浏览量 2013-10-26 06:14:05 上传评论收藏 378KB PDF 举报

资源推荐

资源详情

资源评论

[

exploratory

DSP

]

oday, signal processing

research has a significantly

widened its scope compared

with just a few years ago [4],

and machine learning has

been an important technical area of the

signal processing society. Since 2006,

deep learning—a new area of machine

learning research—has emerged [7],

impacting a wide range of signal and

information processing work within the

traditional and the new, widened scopes.

Various workshops, such as the 2009

ICML Workshop on Learning Feature

Hierarchies; the 2008 NIPS Deep

Learning Workshop: Foundations and

Future Directions; and the 2009 NIPS

Workshop on Deep Learning for Speech

Recognition and Related Applications as

well as an upcoming special issue on deep

learning for speech and language process-

ing in IEEE Transactions on Audio,

Speech, and Language Processing (2010)

have been devoted exclusively to deep

learning and its applications to classical

signal processing areas. We have also seen

the government sponsor research on deep

learning (e.g., the DARPA deep learning

program, available at http://www.darpa.

mil/ipto/solicit/baa/BAA-09-40_PIP.pdf).

The purpose of this article is to intro-

duce the readers to the emerging technol-

ogies enabled by deep learning and to

review the research work conducted in

this area that is of direct relevance to sig-

nal processing. We also point out, in our

view, the future research directions that

may attract interests of and require efforts

from more signal processing researchers

and practitioners in this emerging area

for advancing signal and information pro-

cessing technology and applications.

INTRODUCTION TO DEEP LEARNING

Many traditional machine learning and

signal processing techniques exploit shal-

low architectures, which contain a single

layer of nonlinear feature transformation.

Examples of shallow architectures are

conventional hidden Markov models

(HMMs), linear or nonlinear dynamical

systems, conditional random fields

(CRFs), maximum entropy (MaxEnt)

models, support vector machines (SVMs),

kernel regression, and multilayer percep-

tron (MLP) with a single hidden layer. A

property common to these shallow learn-

ing models is the simple architecture that

consists of only one layer responsible for

transforming the raw input signals or fea-

tures into a problem-specific feature

space, which may be unobservable. Take

the example of a support vector machine.

It is a shallow linear separation model

with one feature transformation layer

when kernel trick is used, and with zero

feature transformation layer when kernel

trick is not used.

Human information processing

mechanisms (e.g., vision and speech),

however, suggest the need of deep archi-

tectures for extracting complex structure

and building internal representation

from rich sensory inputs (e.g., natural

image and its motion, speech, and

music). For example, human speech pro-

duction and perception systems are both

equipped with clearly layered hierarchical

structures in transforming information

from the waveform level to the linguistic

level and vice versa. It is natural to

believe that the state of the art can be

advanced in processing these types of

media signals if efficient and effective

deep learning algorithms are developed.

Signal processing systems with deep

architectures are composed of many lay-

ers of nonlinear processing stages, where

each lower layer’s outputs are fed to its

immediate higher layer as the input. The

successful deep learning techniques

developed so far share two additional key

properties: the generative nature of the

model, which typically requires an addi-

tional top layer to perform the discrimi-

native task, and an unsupervised

pretraining step that makes effective use

of large amounts of unlabeled training

data for extracting structures and regular-

ities in the input features.

A BRIEF HISTORY

The concept of deep learning originated

from artificial neural network research.

Multilayer perceptron with many hidden

layers is a good example of the models

with deep architectures. Backpropagation,

invented in 1980s, has been a well-known

algorithm for learning the weights of

these networks. Unfortunately backpropa-

gation alone does not work well in prac-

tice for learning networks with more than

a small number of hidden layers (see a

review and interesting analysis in [1]).

The pervasive presence of local optima in

the nonconvex objective function of the

deep networks is the main source of diffi-

culty in learning. Backpropagation is

based on local gradient descent and starts

usually at some random initial points. It

often gets trapped in poor local optima

and the severity increases significantly as

the depth of the networks increases. This

difficulty is partially responsible for steer-

ing away most of the machine learning

and signal processing research from neu-

ral networks to shallow models that have

convex loss functions (e.g., SVMs, CRFs,

and MaxEnt models) for which global

optimum can be efficiently obtained at

the cost of less powerful models.

The optimization difficulty associated

with the deep models was empirically

IEEE SIGNAL PROCESSING MAGAZINE [145] JANUARY 2011

Digital Object Identifier 10.1109/MSP.2010.939038

Dong Yu and Li Deng

Deep Learning and Its Applications

to Signal and Information Processing

Date of publication: 17 December 2010

本内容试读结束，登录后可阅读更多

下载后可阅读完整内容，剩余5页未读，立即下载

评论收藏

内容反馈

Jason29zhang

2014-01-08

很有用啊，综述类的，刚开始接触的应该看看

jay7575

粉丝: 5

Deep learning and its applications to signal and information pro...

最新资源

Deep learning and its applications to signal and information pro...

Deep learning

deep learning

Deep Learning

Pattern Recognition using Neural and Functional Networks(Springer2009新书)

2013-Deep Learning for Signal and Information Processing

Understanding Deep Learning

国家开放大学计算机应用基础终结性考试（大作业）

离散数学知识点整理（超级全面详细！）

《科研伦理与学术规范》期末考试文档2（40题）

Word2Recite 桌面单词

2021全国及分省市县行政区划矢量图层shp文件.rar

38000词汇思维导图（1-50词根）β版.rar

Zotero及常用插件

博士“申请-考核制”面试——英文提问问题/答案模板

PWM脉冲调制直流电机simulink仿真

iris-data.csv

Revit 各版本官方族库及项目样板下载和安装方法，2016-2021族库离线包下载.rar

ActivityTcl和oommf安装包.rar

2020C题数学建模国赛一等奖论文+完整代码和excel数据处理表格.zip

cad卸载不干净无法重装 CAD卸载工具 Autodesk系列软件一件卸载工具 3.3.0.0.7z

某餐饮企业的订单详情表数据（博客练习专用）.zip

Elsevier爱思唯尔的word模板.zip

动态衡量式A星算法代码中涉及的音乐文件.zip

2021CFA一级Notes1-5（完）.zip

《CAD字体字库》-目前市场上最全面的字体库

php算法之选择排序

ADO.NET 本质论

最新资源