《深度学习入门：基于Python的理论与实现》（deeplearning-from-scratch）下载mnist数据集的解决方案

阿颖&阿伟

已于 2024-02-22 16:57:14 修改

阅读量990

点赞数 5

CC 4.0 BY-SA版权

分类专栏：【1-1】深度学习理论及问题文章标签： python 深度学习

于 2021-10-25 21:30:11 首次发布

本文链接：https://blog.csdn.net/sazass/article/details/120960222

【1-1】深度学习理论及问题专栏收录该内容

18 篇文章

订阅专栏

在《深度学习入门：基于Python的理论与实现》章节的第三章就开始以MNIST数据集为基础编写代码。然而根据源码的操作，很有可能会出现mnist下载超时的情况。以下是解决方案：

1. 获取代码读取数据集的路径

以mnist_show.py为例：
mnist_show.py源码：

# coding: utf-8
import sys, os
sys.path.append(os.pardir)  # 为了导入父目录的文件而进行的设定
import numpy as np
from dataset.mnist import load_mnist
from PIL import Image


def img_show(img):
    pil_img = Image.fromarray(np.uint8(img))
    pil_img.show()

(x_train, t_train), (x_test, t_test) = load_mnist(flatten=True, normalize=False)

img = x_train[0]
label = t_train[0]
print(label)  # 5

print(img.shape)  # (784,)
img = img.reshape(28, 28)  # 把图像的形状变为原来的尺寸
print(img.shape)  # (28, 28)

img_show(img)

然后就会执行load_mnist函数。
dataset目录内mnist.py文件的 load_mnist函数代码的开头：

    print("!!!!",save_file) # 打印读取的路径
    if not os.path.exists(save_file):
        init_mnist()

然后打印save_file变量，获得路径地址，一般都是默认的dataset目录内。

2. 手动下载MNIST数据集

MNIST数据集的官网，下载：

train-images-idx3-ubyte: training set images
train-labels-idx1-ubyte: training set labels
t10k-images-idx3-ubyte:  test set images
t10k-labels-idx1-ubyte:  test set labels

在这里插入图片描述
保存到第一步中获取的路径。