Overview: This paper explores the application of memory-augmented neural networks (MANNs) to meta-learning tasks. Traditional deep neural networks need large amounts of training data and perform poorly in "one-shot learning" settings. By introducing an external memory module, a MANN can rapidly encode and retrieve new information, overcoming this limitation. The paper proposes a new memory-access mechanism, Least Recently Used Access (LRUA), which addresses memory purely by content rather than by location. Experiments show that MANNs perform strongly on classification and regression tasks, outperforming conventional LSTMs and non-parametric methods especially when few samples are available. The study also finds that MANNs retain high accuracy as the number of classes grows and adapt well in continual-learning tasks.

Intended audience: researchers and practitioners with a strong interest in machine learning, especially deep learning, and in particular those working on few-shot learning and meta-learning.

Use cases and goals: (1) studying few-shot learning and methods for rapid adaptation to new tasks; (2) exploring how neural networks combined with external memory modules apply to classification and regression; (3) evaluating the effectiveness of different memory-access mechanisms, such as LRUA and the NTM scheme; (4) providing theoretical and experimental support for developing more efficient meta-learning models.

Additional notes: beyond demonstrating MANN's strong performance on concrete tasks, the paper discusses its parallels with human cognition and proposes future directions, including automatically discovering optimal memory-access mechanisms, tackling catastrophic forgetting in continual learning, and applications in active-learning settings.
Meta-Learning with Memory-Augmented Neural Networks
Adam Santoro ADAMSANTORO@GOOGLE.COM
Google DeepMind
Sergey Bartunov SBOS@SBOS.IN
Google DeepMind, National Research University Higher School of Economics (HSE)
Matthew Botvinick BOTVINICK@GOOGLE.COM
Daan Wierstra WIERSTRA@GOOGLE.COM
Timothy Lillicrap COUNTZERO@GOOGLE.COM
Google DeepMind
Abstract

Despite recent breakthroughs in the applications of deep neural networks, one setting that presents a persistent challenge is that of "one-shot learning." Traditional gradient-based networks require a lot of data to learn, often through extensive iterative training. When new data is encountered, the models must inefficiently relearn their parameters to adequately incorporate the new information without catastrophic interference. Architectures with augmented memory capacities, such as Neural Turing Machines (NTMs), offer the ability to quickly encode and retrieve new information, and hence can potentially obviate the downsides of conventional models. Here, we demonstrate the ability of a memory-augmented neural network to rapidly assimilate new data, and leverage this data to make accurate predictions after only a few samples. We also introduce a new method for accessing an external memory that focuses on memory content, unlike previous methods that additionally use memory location-based focusing mechanisms.
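For concreteness, here is a minimal single-head NumPy sketch of this pure content-based access scheme, following the LRUA design the paper develops later: reads use a cosine-similarity softmax over memory rows, and writes go either to the most recently read slot or to the least-used slot, gated by a learned scalar. The class name, the decay rate gamma, and treating the gate as a per-step input are illustrative assumptions, not the paper's reference implementation.

```python
import numpy as np

def cosine_similarity(key, memory, eps=1e-8):
    """Cosine similarity between a key (D,) and each memory row (N, D)."""
    dots = memory @ key
    norms = np.linalg.norm(memory, axis=1) * np.linalg.norm(key) + eps
    return dots / norms

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

class LRUAMemory:
    """Single read/write head, Least Recently Used Access (sketch)."""

    def __init__(self, n_slots, dim, gamma=0.95):
        self.memory = np.zeros((n_slots, dim))
        self.w_read = np.zeros(n_slots)   # read weights from the last step
        self.w_usage = np.zeros(n_slots)  # decaying usage accumulator
        self.gamma = gamma                # usage decay rate (assumed value)

    def step(self, key, gate_logit):
        # Least-used weights: one-hot on the slot with minimal usage
        # (with a single read head, exactly one slot qualifies).
        least_used = np.argmin(self.w_usage)
        w_lu = np.zeros_like(self.w_usage)
        w_lu[least_used] = 1.0

        # Write weights interpolate between the previous read location
        # and the least-used location via a sigmoid gate.
        g = 1.0 / (1.0 + np.exp(-gate_logit))
        w_write = g * self.w_read + (1.0 - g) * w_lu

        # The least-used slot is erased before the additive write.
        self.memory[least_used] = 0.0
        self.memory += np.outer(w_write, key)

        # Purely content-based read over the updated memory: no
        # location-based shifting or interpolation as in the original NTM.
        self.w_read = softmax(cosine_similarity(key, self.memory))
        read_vec = self.w_read @ self.memory

        # Usage decays and accumulates both read and write weights.
        self.w_usage = self.gamma * self.w_usage + self.w_read + w_write
        return read_vec
```

In the paper's architecture the key and gate are produced by an LSTM controller at every time step and the read vector feeds the output layer; the sketch above isolates only the memory mechanics.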
1. Introduction
The current success of deep learning hinges on the ability to apply gradient-based optimization to high-capacity models. This approach has achieved impressive results on many large-scale supervised tasks with raw sensory input, such as image classification (He et al., 2015), speech recognition (Yu & Deng, 2012), and games (Mnih et al., 2015; Silver et al., 2016). Notably, performance in such tasks is typically evaluated after extensive, incremental training on large data sets. In contrast, many problems of interest require rapid inference from small quantities of data. In the limit of "one-shot learning," single observations should result in abrupt shifts in behavior.
This kind of flexible adaptation is a celebrated aspect of human learning (Jankowski et al., 2011), manifesting in settings ranging from motor control (Braun et al., 2009) to the acquisition of abstract concepts (Lake et al., 2015). Generating novel behavior based on inference from a few scraps of information – e.g., inferring the full range of applicability for a new word, heard in only one or two contexts – is something that has remained stubbornly beyond the reach of contemporary machine intelligence. It appears to present a particularly daunting challenge for deep learning. In situations when only a few training examples are presented one-by-one, a straightforward gradient-based solution is to completely re-learn the parameters from the data available at the moment. Such a strategy is prone to poor learning, and/or catastrophic interference. In view of these hazards, non-parametric methods are often considered to be better suited.
However, previous work does suggest one potential strategy for attaining rapid learning from sparse data, and hinges on the notion of meta-learning (Thrun, 1998; Vilalta & Drissi, 2002). Although the term has been used in numerous senses (Schmidhuber et al., 1997; Caruana, 1997; Schweighofer & Doya, 2003; Brazdil et al., 2003), meta-learning generally refers to a scenario in which an agent learns at two levels, each associated with different time scales. Rapid learning occurs within a task, for example, when learning to accurately classify within a particular dataset. This learning is guided by knowledge accrued more gradually across tasks, which captures the way in which task structure varies across target domains.
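As a rough illustration of this two-timescale setup (a sketch under assumed details, not the paper's code), the snippet below draws one training episode in the style the paper describes later: a few classes are sampled, their labels are freshly shuffled for that episode so no fixed input-label mapping can be memorized across episodes, and the label for the input at step t is presented one step later, alongside the next input. The class_pool structure and all sizes are hypothetical.

```python
import numpy as np

def sample_episode(class_pool, n_classes=5, episode_len=50, rng=None):
    """Draw one meta-learning episode from a pool of labeled examples.

    class_pool: dict mapping a class id to a list of feature vectors
    (hypothetical structure). Labels are re-randomized per episode, so
    a model must bind observed samples to labels in memory and reuse
    those bindings within the episode, rather than learning a fixed
    input-to-label mapping in its weights.
    """
    rng = rng or np.random.default_rng()
    classes = rng.choice(list(class_pool), size=n_classes, replace=False)
    # Fresh, episode-specific labels for the chosen classes.
    label_of = dict(zip(classes, rng.permutation(n_classes)))

    xs, ys = [], []
    for _ in range(episode_len):
        c = rng.choice(classes)
        examples = class_pool[c]
        xs.append(examples[rng.integers(len(examples))])
        ys.append(label_of[c])

    ys = np.array(ys)
    # Time-offset scheme: the label for x_t arrives with the next input,
    # so the per-step input is (x_t, y_{t-1}) and the target is y_t.
    y_shifted = np.roll(ys, 1)
    y_shifted[0] = 0  # dummy label at the first step
    return np.array(xs), y_shifted, ys
```

A model is then trained across many such episodes: the slow, across-episode learning shapes the network weights, while within-episode adaptation happens through the memory.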