Next: Bayesian Framework Application Up: BAYESIAN TWO SOURCE MODELING Previous: DUET and DASSS Review

Bayesian Framework

As suggested above, the DASSS data produced by equation 4 may in fact reveal which two sources are active at a particular point in time-frequency space if exactly two sources are active. This is very useful information, because once the active sources are known, they may be demixed by solving for

and

in:

$\displaystyle \left[ \begin{array}{c} S_u \\ S_v \\ \end{array} \right]$

$\textstyle =$

$\displaystyle \left[ \begin{array}{cc} 1 & 1 \\ a_u e^{-j\omega\delta_u} & a_v... ...nd{array} \right]^{-1} \left[ \begin{array}{c} X_1 \\ X_2 \end{array} \right],$

(12)

which follows directly from equations 1 and 2 when only two sources are active. Formally, we express the two most likely sources given some DUET or DASSS data

as those maximizing $p(u,v\vert D)$ . Applying Bayes' rule, we can express this as

$\displaystyle p(u,v\vert D)$

$\textstyle =$

$\displaystyle \ensuremath{\frac{p(D\vert u,v)p(u,v)}{p(D)}}.$

(13)

We can see by inspection of equations 9 through 11 that the STFT frequency under consideration, $\omega$ , affects DASSS data

(the

values). So, we are mindful that the problem is a different one for each of the frequency values $\omega$ under consideration. Since

is not a function of

, it is possible to discard it from the maximization (though we may wish to use it later if a confidence measure is sought). The quantity

is largely estimated with musical knowledge. For example we may know that the clarinet (source

for example) tends to play at the same time as and less loudly than the violin (say, source

), whose frequency components tend to be at harmonics of frequencies above 200 Hz, and rarely throughout time. Though very useful, such information is not within our current signal processing interest, and is not considered now. (For now, we will treat all

as equally likely.) We are left, then, to consider $p(D\vert u,v)$ for each $\omega$ , the probability that particular DASSS data is produced when sources

and

(but no others) are active at frequency $\omega$ . To this end, we next explicitly return to the distributions suggested by equations 9 through 11. By doing so, we identify the necessary values of $p(D\vert u,v)$ where

represents DASS scores

Next: Bayesian Framework Application Up: BAYESIAN TWO SOURCE MODELING Previous: DUET and DASSS Review

Aaron S. Master 2003-10-30