mltk.datasets.audio.ten_digits¶
Ten Digits¶
This is a synthetically generated dataset with the keywords:
zero
one
two
three
four
five
six
seven
eight
nine
The samples are 16kHz, 16-bit PCM .wav
files.
Variables
Public download URL |
|
SHA1 hash of the downloaded archive file |
|
The class labels of the dataset samples |
Functions
|
Download and extract the dataset |
|
Generate the dataset |
- DOWNLOAD_URL = 'https://www.dropbox.com/s/o8e84fsb5abu3t1/sl_synthetic_ten_digits.7z?dl=1'¶
Public download URL
- VERIFY_SHA1 = 'a716e86a27fb5255d2a3bff4441dcd5b4044ea3a'¶
SHA1 hash of the downloaded archive file
- CLASSES = ['zero', 'one', 'two', 'three', 'four', 'five', 'six', 'seven', 'eight', 'nine']¶
The class labels of the dataset samples
- download(dest_dir=None, dest_subdir='datasets/ten_digits', logger=None, clean_dest_dir=False)[source]¶
Download and extract the dataset
- Return type:
str
- Returns:
The directory path to the extracted dataset
- Parameters:
dest_dir (str) –
logger (Logger) –
- generate_dataset(out_dir=None)[source]¶
Generate the dataset
This generates the dataset using the AudioDatasetGenerator Python package provided by the MLTK.
- Parameters:
out_dir (str) –