mltk.datasets.image.mnist¶
MNIST¶
This is a dataset of 60,000 28x28 grayscale images of the 10 digits, along with a test set of 10,000 images. More info can be found at the MNIST homepage
Example
(x_train, y_train), (x_test, y_test) = mnist.load_data()
assert x_train.shape == (60000, 28, 28)
assert x_test.shape == (10000, 28, 28)
assert y_train.shape == (60000,)
assert y_test.shape == (10000,)
License¶
Yann LeCun and Corinna Cortes hold the copyright of MNIST dataset, which is a derivative work from original NIST datasets. MNIST dataset is made available under the terms of the Creative Commons Attribution-Share Alike 3.0 license.
Variables
The shape of each sample |
|
Labels for dataset samples |
|
Public download URL |
|
SHA1 hash of archive file |
Functions
|
Download the dataset, extract, load into memory, and return as a tuple of numpy arrays |
|
Download the dataset, extract all sample images to a directory, and return the path to the directory. |
- INPUT_SHAPE = (28, 28)¶
The shape of each sample
- CLASSES = ['0', '1', '2', '3', '4', '5', '6', '7', '8', '9']¶
Labels for dataset samples
- DOWNLOAD_URL = 'https://storage.googleapis.com/tensorflow/tf-keras-datasets/mnist.npz'¶
Public download URL
- VERIFY_SHA1 = '731c5ac602752760c8e48fbffcf8c3b850d9dc2a2aedcf2cc48468fc17b673d1'¶
SHA1 hash of archive file
- load_data(dest_dir=None, dest_subdir='datasets/mnist', logger=None, clean_dest_dir=False)[source]¶
Download the dataset, extract, load into memory, and return as a tuple of numpy arrays
- Returns:
(x_train, y_train), (x_test, y_test)
- Return type:
Tuple of NumPy arrays
- x_train: uint8 NumPy array of grayscale image data with shapes
(60000, 28, 28)
, containing the training data. Pixel values range from 0 to 255.- y_train: uint8 NumPy array of digit labels (integers in range 0-9)
with shape
(60000,)
for the training data.- x_test: uint8 NumPy array of grayscale image data with shapes
(10000, 28, 28), containing the test data. Pixel values range from 0 to 255.
- y_test: uint8 NumPy array of digit labels (integers in range 0-9)
with shape
(10000,)
for the test data.
- Parameters:
dest_dir (str) –
logger (Logger) –
- load_data_directory(dest_dir=None, dest_subdir='datasets/mnist', logger=None, clean_dest_dir=False)[source]¶
Download the dataset, extract all sample images to a directory, and return the path to the directory.
Each sample type is extract to its corresponding subdirectory, e.g.:
~/.mltk/datasets/mnist/0 ~/.mltk/datasets/mnist/1 …
- Return type:
Path to extract directory
- Parameters:
dest_dir (str) –
logger (Logger) –