mltk.datasets.audio.direction_commands

Direction Commands

This is a synthetically generated dataset with the keywords:

  • left

  • right

  • up

  • down

  • stop

  • go

The samples are 16 kHz, 16-bit PCM .wav files.
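As a quick sanity check, the stated sample format can be confirmed on any extracted file with Python's standard `wave` module. The sketch below is illustrative only: the helper names `describe_wav` and `matches_expected_format` are hypothetical and are not part of the MLTK.

```python
import wave


def describe_wav(path):
    """Return basic format information for a .wav file.

    wave.open raises wave.Error for files it cannot parse.
    """
    with wave.open(str(path), 'rb') as wav_file:
        sample_rate_hz = wav_file.getframerate()
        return {
            'sample_rate_hz': sample_rate_hz,
            # getsampwidth() is in bytes per sample
            'bits_per_sample': wav_file.getsampwidth() * 8,
            'channels': wav_file.getnchannels(),
            'duration_s': wav_file.getnframes() / sample_rate_hz,
        }


def matches_expected_format(path, sample_rate_hz=16000, bits_per_sample=16):
    """Check a file against the 16 kHz, 16-bit format stated for this dataset."""
    info = describe_wav(path)
    return (info['sample_rate_hz'] == sample_rate_hz
            and info['bits_per_sample'] == bits_per_sample)
```

The channel count is reported but not checked, since this page does not state whether the samples are mono.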

Variables

DOWNLOAD_URL

Public download URL

VERIFY_SHA1

SHA1 hash of the downloaded archive file

CLASSES

The class labels of the dataset samples

Functions

download([dest_dir, dest_subdir, logger, ...])

Download and extract the dataset

generate_dataset([out_dir])

Generate the synthetic dataset

DOWNLOAD_URL = 'https://www.dropbox.com/s/ulumv0sxbtcssvr/synthetic_direction_commands.7z?dl=1'

Public download URL

VERIFY_SHA1 = '9e9295b4eda3d9da9cd242063c1646b15908da55'

SHA1 hash of the downloaded archive file
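If the archive is fetched manually instead of through download(), its integrity can be checked against this hash. The following is a minimal sketch using only `hashlib`; the helper names `sha1_of_file` and `archive_is_valid` are hypothetical, and this is not necessarily how the MLTK performs the check internally.

```python
import hashlib

# Value copied from VERIFY_SHA1 above
VERIFY_SHA1 = '9e9295b4eda3d9da9cd242063c1646b15908da55'


def sha1_of_file(path, chunk_size=1 << 20):
    """Compute the SHA1 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha1()
    with open(path, 'rb') as archive:
        # iter() with a sentinel stops when read() returns empty bytes
        for chunk in iter(lambda: archive.read(chunk_size), b''):
            digest.update(chunk)
    return digest.hexdigest()


def archive_is_valid(path, expected_sha1=VERIFY_SHA1):
    """Return True if the file's SHA1 matches the expected digest."""
    return sha1_of_file(path).lower() == expected_sha1.lower()
```

Reading in chunks keeps memory use flat regardless of archive size.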

CLASSES = ['left', 'right', 'up', 'down', 'stop', 'go']

The class labels of the dataset samples
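When training a classifier, labels are commonly mapped to integer ids. The sketch below assumes that a label's position in CLASSES serves as its class id; that convention is an assumption here, not something this page states. The names `LABEL_TO_INDEX`, `encode_label` and `decode_index` are hypothetical.

```python
# Value copied from CLASSES above
CLASSES = ['left', 'right', 'up', 'down', 'stop', 'go']

# Assumption: list position doubles as the integer class id
LABEL_TO_INDEX = {label: index for index, label in enumerate(CLASSES)}


def encode_label(label):
    """Map a keyword to its integer class id; raises KeyError if unknown."""
    return LABEL_TO_INDEX[label]


def decode_index(index):
    """Map an integer class id back to its keyword."""
    return CLASSES[index]
```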

download(dest_dir=None, dest_subdir='datasets/direction_commands', logger=None, clean_dest_dir=False)

Download and extract the dataset

Return type:

str

Returns:

The directory path to the extracted dataset

Parameters:

  • dest_dir (str) – optional; defaults to None

  • logger (Logger) – optional; defaults to None
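A typical use is to call download() with its defaults and then inspect the returned directory. The sketch below counts the .wav files per keyword. It assumes the extracted archive contains one subdirectory per class label, which this page does not confirm. The helper names `count_samples_per_class` and `download_and_summarize` are hypothetical.

```python
from pathlib import Path

# Value copied from CLASSES above
CLASSES = ['left', 'right', 'up', 'down', 'stop', 'go']


def count_samples_per_class(dataset_dir, classes=CLASSES):
    """Count .wav files under <dataset_dir>/<label>/ for each label.

    Assumption: one subdirectory per class label. Missing
    subdirectories are reported with a count of 0.
    """
    root = Path(dataset_dir)
    counts = {}
    for label in classes:
        class_dir = root / label
        if class_dir.is_dir():
            counts[label] = sum(1 for _ in class_dir.rglob('*.wav'))
        else:
            counts[label] = 0
    return counts


def download_and_summarize():
    """Download the dataset with default arguments and count its samples."""
    # Imported lazily so this sketch loads even without the MLTK installed
    from mltk.datasets.audio import direction_commands

    dataset_dir = direction_commands.download()
    return count_samples_per_class(dataset_dir, direction_commands.CLASSES)
```

If every count is 0, the directory layout likely differs from the one assumed here.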

generate_dataset(out_dir=None)

Generate the synthetic dataset

This generates the dataset using the AudioDatasetGenerator Python package provided by the MLTK.

Parameters:

  • out_dir (str) – optional; defaults to None