We will denote as ``X'' a population of m registered speakers:
For scoring purposes, we will consider a set of test utterances. We will use the term genuine test utterances for those which correspond to registered speakers, and the term impostor test utterances for those which are treated as belonging to impostor speakers. Note that the same speech segment can be used as a genuine utterance and as an impostor utterance, in some test configurations. Therefore, different notations can correspond to the same speech utterance.
Each registered speaker is supposed to have produced genuine test utterances , the set of which will be denoted as ``'':
where superscript k denotes the test utterance of speaker .
In the rest of this chapter, we will denote as c the total number of
genuine test utterances , and as the proportion of utterances
belonging to speaker in the test set , that is:
With the convention:
we will denote:
Integer is the number of registered speakers for which there is at
least one genuine test utterance .
Finally, we will denote as M the set of male registered speakers, as F the set of female registered speakers and as and the respective number of male and female registered speakers for which there is at least one genuine test utterance .
In the most general case, the whole set of impostor test utterances can be divided in subsets corresponding to one among n impostors using the system with a claimed identity . Hence the general notation
to denote the set of impostor test utterances produced by impostor claiming he is .
Similarly to genuine test utterances, we will denote as d the total number of impostor utterances, and as the proportion of impostor tests by impostor against registered speaker , that is:
We will also write:
When the identity of impostors is not a relevant factor (or is unknown), subscript j can be dropped, and denotes the impostor attempt against registered speaker (). Conversely, in open-set speaker identification , impostors do not claim a particular identity; they just try to be identified as one of the registered speakers , whoever this speaker may be. In this case, subscript can be dropped, and denotes the impostor attempt by impostor (). If, moreover, the impostor's identity does not matter, subscript j can also be dropped, and simply denotes the impostor attempt ().
With the conventions:
if | and | otherwise | |||
if | and | otherwise | |||
if | and | otherwise |
we will write:
and
Integer is the number of impostors for which there is
at least one test utterance against registered speaker , integer
represents the number of registered speakers against which
there is at least one test utterance from impostor , is the
number of impostors from which there is at least one impostor test utterance,
is the number of registered speakers against which there is at
least one impostor test utterance , and denotes the total number of
couples for which there is at least one impostor test
utterance (from against ).
Finally, we will denote as the set of male impostor speakers, as the set of female impostor speakers, and