Mass project

From CCRMA Wiki
Revision as of 09:44, 29 July 2006 by Cc (Talk | contribs)

Welcome to the Masking Ambient Speech Sounds project Wiki. The project is on track to begin listening trials on the 21st of July.

Experiments

Experiment 1

The first listening tests will involve project staff members, to check that the procedure makes sense. If the results look good, we'll start working with non-project volunteers. Experiment 1, in the CCRMA "Pit," will take about 30 minutes and involve 30 trials: 6 conditions of masking sound crossed with 5 conditions of speech sounds. The masker (FM noise) and the speech sounds will be presented as if the sources were outside the room. We'll use the measured room model from Tokyo with the exterior sound source position (hallway). The "as if" impression will be created by convolving the sounds with the measured impulse responses.
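As a minimal sketch of the convolution step, the function below applies an impulse response to a dry signal in direct form. The name convolve and the use of double-precision vectors are assumptions made for illustration; the project's actual processing chain is not described on this page.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Direct-form convolution of a dry signal with a measured impulse response.
// The output has dry.size() + ir.size() - 1 samples, so the tail of the
// room response is preserved after the dry signal ends.
std::vector<double> convolve(const std::vector<double>& dry,
                             const std::vector<double>& ir)
{
    assert(!dry.empty() && !ir.empty());
    std::vector<double> wet(dry.size() + ir.size() - 1, 0.0);
    for (std::size_t n = 0; n < dry.size(); ++n)
        for (std::size_t k = 0; k < ir.size(); ++k)
            wet[n + k] += dry[n] * ir[k];  // each dry sample launches a scaled copy of the IR
    return wet;
}
```

The direct form costs one multiply per pair of dry and impulse-response samples, so for 15 sec. clips and long room responses an FFT-based method would likely be preferable; the result is the same.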

Necessary ingredients: (x = done)

  1. (x) ambient room sound recording from Tokyo
  2. (x) 15 sec. recordings of FM noise masker with parameter variation
  3. (x) 4 min. recordings of 4 conversations (animated / not-animated, crowd / pair, always 50% gender balance)
  4. (x) 15 sec. clips cut from conversations
  5. (x) convolved versions of 15 sec. files putting them "as if" in the hallway
  6. (x) GUI for running randomized listening, A/B forced choice, logging results
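
The crossed design (6 masker conditions by 5 speech conditions) and the randomized presentation order can be sketched as follows. This is an illustration only: the Trial struct, the function makeTrialOrder, and the seed parameter are hypothetical names and are not taken from the experiment GUI's source.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <random>
#include <vector>

// One trial pairs a masker condition with a speech condition.
struct Trial {
    int masker;  // index in 0..numMaskers-1
    int speech;  // index in 0..numSpeech-1
};

// Build every masker/speech pairing once, then shuffle the presentation order.
std::vector<Trial> makeTrialOrder(int numMaskers, int numSpeech, unsigned seed)
{
    assert(numMaskers > 0 && numSpeech > 0);
    std::vector<Trial> trials;
    trials.reserve(static_cast<std::size_t>(numMaskers) * numSpeech);
    for (int m = 0; m < numMaskers; ++m)
        for (int s = 0; s < numSpeech; ++s)
            trials.push_back({m, s});
    std::mt19937 rng(seed);  // a logged seed lets a session's order be reproduced
    std::shuffle(trials.begin(), trials.end(), rng);
    return trials;
}
```

Calling makeTrialOrder(6, 5, seed) yields the 30 trials of Experiment 1 in random order, each pairing appearing exactly once.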

Exp1GUI.png

Strategies to define conditions for FM masking noise

To define the conditions of this first experiment, the approach will be to leave all the parameters fixed, except the modulation frequency.

Noise set contains complete technical documentation of the masking noise generation. It also contains the sound files.

The conditions of the masking FM noise will be defined by the following criteria:

  • 3 bands of FM noise will be used (centered at 200, 350, and 500 Hz):
    These bands were selected based on an analysis of speech recorded in the Tokyo office. The motivation behind this decision is to identify the relevant parameters in the leaking voice. For example, we know that the wall filters out much of the high-frequency content, so that is relevant to the selection of the main frequencies.
  • The amplitude (volume) of each band will be fixed:
    The amplitude was tuned in order to psychoacoustically balance the level of the three noise bands that will be used. This balance was done without modulation.
  • The amplitude of the modulation will be proportional to the modulation frequency:
    The motivation behind this choice is to minimize the annoyance effect. When the modulation rate is low, higher amplitudes are more noticeable and annoying.
  • The relation between the modulation frequencies of the 3 bands is then the main factor that defines the conditions:
    For this experiment, 3 modulation rates are selected: 2, 5, and 7 Hz. The idea is to span some of the frequencies in the range of 2 to 7 Hz. Basically, all the combinations of these 3 rates are used for each center frequency, plus a case with no modulation at all.
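
The rule that modulation amplitude scales with modulation rate can be sketched as below. Several things here are assumptions rather than project facts: that "amplitude of the modulation" means peak frequency deviation in Hz, that the modulator is sinusoidal, and the constant depthPerHz, which is hypothetical. The Noise set documentation is the authority on how the maskers were actually generated.

```cpp
#include <cassert>
#include <cmath>

// Peak deviation (Hz) proportional to the modulation rate, so slow
// modulation gets a small excursion and fast modulation a larger one.
// depthPerHz is a hypothetical tuning constant, not a project value.
double modulationDepth(double modRateHz, double depthPerHz)
{
    assert(modRateHz >= 0.0);
    return depthPerHz * modRateHz;
}

// Instantaneous center frequency of one noise band at time tSeconds,
// assuming a sinusoidal modulator. A rate of 0 Hz gives the unmodulated case.
double instantaneousCenterHz(double centerHz, double modRateHz,
                             double depthPerHz, double tSeconds)
{
    const double twoPi = 6.283185307179586;
    return centerHz + modulationDepth(modRateHz, depthPerHz)
                    * std::sin(twoPi * modRateHz * tSeconds);
}
```

With depthPerHz set to 10, for example, the 2 Hz rate would swing a band by 20 Hz around its center and the 7 Hz rate by 70 Hz.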

NOTE: I'm going to upload pictures in this document as soon as Nando can open the permissions to do so.

--Jcaceres 17:09, 24 July 2006 (PDT)

The beta-test of the experiment tool took longer than anticipated. Some minor fixes remain. The ones I remember from yesterday (Friday, 28th) and the ToDo list for Monday:

  1. delete slider from bottom of GUI (in Qt Designer)
  2. when user hits "OK, Next" button, clear the buttons, with button->setDown(false)
  3. comment out all the "cout" statements that are printing during trials
  4. find a way to enforce machine speed at max during trial (automatic energy saving causes stuttering)
  5. convert QString to const char for logger file open (use const char * QString::latin1 ())
  6. create a "shuffle" method in MainDialog.cpp and apply it for the actual first test
  7. I think the repeat that gets triggered for each individual mono file is ok, but I'm worried that they could slip out of sync. Better to trigger the repeat for all four from the first channel's repeat
  8. add envelopes at all file starts, stops, repeats (with STK's Asymp class), pipe the file's output through it
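
The envelope idea in item 8 can be illustrated with a plain C++ stand-in. The sketch below is a generic one-pole exponential envelope and does not reproduce the STK Asymp interface; the class name FadeEnvelope, the helper applyEnvelope, and the time-constant parameter are all hypothetical.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// One-pole exponential envelope: each tick moves the gain a fixed fraction
// of the remaining distance toward the target, avoiding clicks at starts,
// stops, and repeats.
class FadeEnvelope {
public:
    FadeEnvelope(double sampleRate, double timeConstantSeconds)
        : coeff_(std::exp(-1.0 / (sampleRate * timeConstantSeconds)))
    {
        assert(sampleRate > 0.0 && timeConstantSeconds > 0.0);
    }

    // 1.0 requests a fade-in, 0.0 a fade-out.
    void setTarget(double target) { target_ = target; }

    // Advance one sample and return the current gain.
    double tick()
    {
        gain_ = target_ + coeff_ * (gain_ - target_);
        return gain_;
    }

private:
    double coeff_;
    double gain_ = 0.0;    // start silent so a file start fades in
    double target_ = 0.0;
};

// Pipe a buffer through the envelope in place, one gain value per sample.
void applyEnvelope(std::vector<double>& samples, FadeEnvelope& env)
{
    for (double& s : samples)
        s *= env.tick();
}
```

Setting the target to 1.0 before a file starts and to 0.0 shortly before it stops or repeats gives a smooth fade in each direction.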

_____________

  9. IF there is still a problem with 12 disk files keeping up, you may see the message "behind" printed from FileWvIn. The next fix to try (and this might be important anyway) is to go to quad files rather than 4 mono files.
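
As a sketch of what moving from 4 mono files to one quad file involves, the function below interleaves four equal-length channels into a single frame-ordered buffer, the usual sample layout for multichannel sound files. The function name interleaveQuad and the float sample type are assumptions; reading and writing the files themselves is left out.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <vector>

// Interleave four equal-length mono channels into one quad buffer laid out
// frame by frame: ch0[0], ch1[0], ch2[0], ch3[0], ch0[1], ...
std::vector<float> interleaveQuad(const std::array<std::vector<float>, 4>& mono)
{
    const std::size_t frames = mono[0].size();
    for (const auto& ch : mono)
        assert(ch.size() == frames);  // all channels must have the same length
    std::vector<float> quad(frames * 4);
    for (std::size_t f = 0; f < frames; ++f)
        for (std::size_t c = 0; c < 4; ++c)
            quad[f * 4 + c] = mono[c][f];
    return quad;
}
```

Because all four channels of a frame are then read together from one stream, they cannot slip out of sync with each other, which also addresses the worry in item 7.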

_____________

  10. there are probably more things I'm forgetting, but this is close

GOOD LUCK! --Cc 09:42, 29 July 2006 (PDT)

Conference Call Meetings

July 18, 2006

  • FM Modulation discussion (Yasushi's comments, with Juan-Pablo's answers marked A:):
  1. Do you have any idea how to specify frequency modulation for each frequency band?
    • A: based on speech freq, ~2-8 Hz
  2. The period in time for each frequency should be the same?
    • A: No, different. When it's the same, the masking efficiency decreases. It also seems more annoying.
  3. Modulation speed will be getting faster according to higher frequency, or
    • A: I don't know yet, this is going to be the main parameter in the first experiment I think.
  4. The frequency modulation considering the voice sound
  5. We have to analyze how the voice sound is modulated in different frequency bands?
    • A: I think this is the best way, and we have to consider that the wall is filtering almost all the high frequencies.
  • Discussion of the experiment setup.
  • Look at the documentation, the new example of impulse responses, and delay of arrival.

July 24, 2006

Tuesday 9:30AM Japan - Monday 5:30PM Stanford

  • Discuss Experiment 1.
  • Ask Atsuko about calibration files and SPL meter.
  • Comment diffusion in the Pit with PZM system (Hiroko).
  • Discuss Experiment Design written by Hiroko and Atsuko.

July 31, 2006

Tuesday 9:30AM Japan - Monday 5:30PM Stanford

  • Discuss Experiment Design written by Hiroko and Atsuko.

Links

--Jcaceres 17:33, 17 July 2006 (PDT)

--Hiroko 18:06, 26 July 2006 (PDT)