mltk.datasets.audio.ten_digits

Ten Digits

This is a synthetically generated dataset with the keywords:

  • zero

  • one

  • two

  • three

  • four

  • five

  • six

  • seven

  • eight

  • nine

The samples are 16kHz, 16-bit PCM .wav files.

Variables

DOWNLOAD_URL

Public download URL

VERIFY_SHA1

SHA1 hash of the downloaded archive file

CLASSES

The class labels of the dataset samples

Functions

download([dest_dir, dest_subdir, logger, ...])

Download and extract the dataset

generate_dataset([out_dir])

Generate the dataset

DOWNLOAD_URL = 'https://www.dropbox.com/s/o8e84fsb5abu3t1/sl_synthetic_ten_digits.7z?dl=1'

Public download URL

VERIFY_SHA1 = 'a716e86a27fb5255d2a3bff4441dcd5b4044ea3a'

SHA1 hash of the downloaded archive file

CLASSES = ['zero', 'one', 'two', 'three', 'four', 'five', 'six', 'seven', 'eight', 'nine']

The class labels of the dataset samples

download(dest_dir=None, dest_subdir='datasets/ten_digits', logger=None, clean_dest_dir=False)[source]

Download and extract the dataset

Return type:

str

Returns:

The directory path to the extracted dataset

Parameters:
  • dest_dir (str) –

  • logger (Logger) –

generate_dataset(out_dir=None)[source]

Generate the dataset

This generates the dataset using the AudioDatasetGenerator Python package provided by the MLTK.

Parameters:

out_dir (str) –