Download
You can get the complete dataset (includes SQLite3, CSV and .onset files, as well as the ground truth data).
The dataset is also provided in three different formats separately:
You can also get the ground truth data (MIDI renditions of the excerpts included in the dataset, in MIDI and .onset formats).
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States License.
Description
The dataset is managed in three tables:
Songs table
This table contains information about the songs used in the data collection. The fields of the table are:

Field name | Data Type | Description |
---|---|---|
id | int | Unique ID of the entry |
filename | string | Name of the audio file (only useful to the data collection system) |
song_title | string | Name of the song |
artist | string | Name of the performing artist |
year | int | Year the song was recorded |
start_time | time | Start time of the audio excerpt used in the data collection |
end_time | time | End time of the audio excerpt used in the data collection |
song_section | string | Included to help researchers locate the part of the song used in the query. Main Theme (MT) and Chorus (C) distinctions are ambiguous for some songs. |
Users table
Contains information about the experiment participants. From the point of view of the data collection process, this table could be named subjects or participants, but from the perspective of a QBT system, they are users.

Field name | Data Type | Description |
---|---|---|
id | int | Unique ID of the entry |
age | int | User reported age |
gender | string | User reported gender |
listening_habits | float | 0--5 range |
instrument_training | float | 0--5 range |
theory_training | float | 0--5 range |
handedness | int | -1 (left handed), 0 (ambidextrous) or 1 (right handed) |
tone_deaf | int | 0 means not tone deaf and 1 means tone deaf |
arrhythmic | int | 0 means not arrhythmic and 1 means arrhythmic |
user_id | string | UUID |
native_language | string | User reported native language |
specific_training | string | List of numbers of years of training for each instrument the user has been trained in. |
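
The integer codes in the Users table can be turned into readable values with a small helper. The following is only a sketch: it assumes a row has already been loaded as a Python dictionary keyed by the field names above (for example from the CSV files), and the function name `decode_user` and the example row are hypothetical, not part of the dataset.

```python
# Codes taken from the Users table description above.
HANDEDNESS = {-1: "left handed", 0: "ambidextrous", 1: "right handed"}


def decode_user(row):
    """Return a copy of a Users row with the coded integer fields made readable.

    `row` is assumed to be a dict keyed by the field names of the Users table.
    Values may be ints or numeric strings (as they would be when read from CSV).
    """
    decoded = dict(row)
    decoded["handedness"] = HANDEDNESS[int(row["handedness"])]
    decoded["tone_deaf"] = bool(int(row["tone_deaf"]))    # 1 means tone deaf
    decoded["arrhythmic"] = bool(int(row["arrhythmic"]))  # 1 means arrhythmic
    return decoded


# Made-up example row for illustration only; not taken from the dataset.
example_user = {"id": 1, "handedness": "-1", "tone_deaf": "0", "arrhythmic": "1"}
decoded_user = decode_user(example_user)
print(decoded_user["handedness"], decoded_user["tone_deaf"], decoded_user["arrhythmic"])
```
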
Tasks table
Each entry in this table corresponds to the data tapped by a user on a single song. There can be multiple entries for a single user/song combination, because the collection condition might differ (i.e. tapping from long term memory vs. short term memory).

Field name | Data Type | Description |
---|---|---|
id | int | Unique ID of the entry |
version_number | string | Version of the front end software used to capture the data |
song_title | string | Title of the song the user was asked to tap |
user_id | string | UUID of the user tapping (see Users table) |
session_id | datetime | Date and time of the collection |
experimenter_id | string | Identifier of the device used to collect the data |
task_order | int | Incremental integer indicating the order of the songs tapped by a single user in a single tapping session (the same number is repeated for the long and short term memory conditions) |
device_type | string | Description of the device used for tapping |
song_familiarity | float | 0--5 range: indicates the reported song familiarity. This value only makes sense for the long term condition (i.e. when the with_music field is 0). If with_music is 1, then this field has a value of 0. |
with_music | int | 0 indicates long term memory tapping and 1 indicates short term memory tapping |
audio_helpful | int | 1--6 range: number reporting whether listening to the song helped. This value only makes sense for the short term condition (i.e. when with_music is 1). If with_music is 0, then this field has a value of 0. Values other than 0 correspond to: |
tap_data | string | Comma separated list of tap on times (in seconds) |
tap_off_data | string | Comma separated list of tap off times (in seconds) |
tap_x_data | string | Comma separated list of the tap on's x (horizontal) coordinates (in pixels) |
tap_y_data | string | Comma separated list of the tap on's y (vertical) coordinates (in pixels) |
tap_off_x_data | string | Comma separated list of the tap off's x (horizontal) coordinates (in pixels) |
tap_off_y_data | string | Comma separated list of the tap off's y (vertical) coordinates (in pixels) |
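
As a rough illustration of how the tap fields of a Tasks entry might be consumed, the sketch below parses the comma separated strings into numbers, derives inter-onset intervals and tap hold durations, and picks the rating that is meaningful for the entry's collection condition. The helper names and the example entry are hypothetical. The sketch also assumes that the tap on and tap off lists are aligned one to one, which should be checked against the actual data.

```python
def parse_times(field):
    """Split a comma separated string of numbers into a list of floats."""
    return [float(x) for x in field.split(",") if x.strip()]


def inter_onset_intervals(tap_on):
    """Time (in seconds) between each tap on and the next one."""
    return [later - earlier for earlier, later in zip(tap_on, tap_on[1:])]


def hold_durations(tap_on, tap_off):
    """Time (in seconds) each tap was held.

    Assumes the i-th tap off belongs to the i-th tap on; zip() silently
    drops unmatched trailing values if the lists differ in length.
    """
    return [off - on for on, off in zip(tap_on, tap_off)]


def condition_rating(task):
    """Return (condition, field name, value) for the rating that applies.

    with_music == 0: long term memory, song_familiarity is meaningful.
    with_music == 1: short term memory, audio_helpful is meaningful.
    """
    if int(task["with_music"]) == 1:
        return ("short term", "audio_helpful", task["audio_helpful"])
    return ("long term", "song_familiarity", task["song_familiarity"])


# Made-up example entry for illustration only; not taken from the dataset.
example_task = {
    "with_music": 0,
    "song_familiarity": 4.0,
    "audio_helpful": 0,
    "tap_data": "0.5,1.0,1.75",
    "tap_off_data": "0.625,1.125,2.0",
}

tap_on = parse_times(example_task["tap_data"])
tap_off = parse_times(example_task["tap_off_data"])
iois = inter_onset_intervals(tap_on)
holds = hold_durations(tap_on, tap_off)
rating = condition_rating(example_task)
print(iois, holds, rating)
```

Inter-onset intervals are a common starting point for rhythm-based matching because they do not depend on when the user started tapping, only on the relative timing between taps.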