mltk.datasets.image.fashion_mnist

Fashion-MNIST

This is a dataset of 60,000 28x28 grayscale images of 10 fashion categories, along with a test set of 10,000 images. This dataset can be used as a drop-in replacement for MNIST.

The classes are:

  • T-shirt/top

  • Trouser

  • Pullover

  • Dress

  • Coat

  • Sandal

  • Shirt

  • Sneaker

  • Bag

  • Ankle boot

Returns: Tuple of NumPy arrays: (x_train, y_train), (x_test, y_test).

x_train: uint8 NumPy array of grayscale image data with shapes (60000, 28, 28), containing the training data.

y_train: uint8 NumPy array of labels (integers in range 0-9) with shape (60000,) for the training data.

x_test: uint8 NumPy array of grayscale image data with shapes (10000, 28, 28), containing the test data.

y_test: uint8 NumPy array of labels (integers in range 0-9) with shape (10000,) for the test data.

Example:

(x_train, y_train), (x_test, y_test) = fashion_mnist.load_data()
assert x_train.shape == (60000, 28, 28)
assert x_test.shape == (10000, 28, 28)
assert y_train.shape == (60000,)
assert y_test.shape == (10000,)
License:

The copyright for Fashion-MNIST is held by Zalando SE. Fashion-MNIST is licensed under the MIT license

Variables

INPUT_SHAPE

The shape of each sample

CLASSES

Labels for dataset samples

Functions

load_data([dest_dir, dest_subdir, logger, ...])

Download the dataset, extract, load into memory, and return as a tuple of numpy arrays

load_data_directory([dest_dir, dest_subdir, ...])

Download the dataset, extract all sample images to a directory, and return the path to the directory.

INPUT_SHAPE = (28, 28)

The shape of each sample

CLASSES = ['tshirt', 'trouser', 'pullover', 'dress', 'coat', 'sandal', 'shirt', 'sneaker', 'bag', 'boot']

Labels for dataset samples

load_data(dest_dir=None, dest_subdir='datasets/flash_mnist', logger=None, clean_dest_dir=False)[source]

Download the dataset, extract, load into memory, and return as a tuple of numpy arrays

Returns:

(x_train, y_train), (x_test, y_test)

Return type:

Tuple of NumPy arrays

Parameters:
  • dest_dir (str) –

  • logger (Logger) –

load_data_directory(dest_dir=None, dest_subdir='datasets/fashion_mnist', logger=None, clean_dest_dir=False)[source]

Download the dataset, extract all sample images to a directory, and return the path to the directory.

Each sample type is extract to its corresponding subdirectory, e.g.:

~/.mltk/datasets/fashion_mnist/tshirt ~/.mltk/datasets/fashion_mnist/dress …

Return type:

Path to extract directory

Parameters:
  • dest_dir (str) –

  • logger (Logger) –