next up previous contents index
Next: Speaker File: SPEAKERS.DBF Up: Files used in a Previous: Files used in a

Corpus File: CORPUS.DBF

The Corpus File lists all the prompt files used in a particular set of recordings. This file is scanned whenever a file is specified for a recording ``take''. The prompt files, which have a 2-character name and extension .txt, are arranged in an alphabetical catalogue A-Z, each letter containing up to 36 prompt files (e.g. A0-A9 plus AA-AZ). Data entries for the prompt files are of two types:

  1. those required for the prompting process, and
  2. those required for the file header.

Order of data entries:

CMT: Optional entry providing information about the prompt file, 
     typically the prompt file name and a short description. 
CCD: Corpus code  ( 2 letters (ex: P2) ) 
DBN: Database name (20 characters max) 
CNM: Corpus name (40 characters max) 
CTY: Item type (one letter) 
NBI: Number of items in the file (int) 
LAN: Language (char) 
PCF: Protocol Description File  (XXXXXXXX.DES) 
-------------------------------------------------------------
A 
- 
-------------------------------------------------- 
......... 
-------------------------------------------------- 
P 
- 
CMT:       P1.TXT Prompt text for passage 1
CCD: P1 
DBN: EUROM_1 
CNM: A reading passage (ESPRIT CD) 
CTY: P 
NBI: 1 
LAN: E 
PCF: PASSAGE.DES 
-------------------------------------------------- 
Q 
- 
........ 
-------------------------------------------------- 
S 
- 
CMT:       S2.TXT Prompt text for Block 2 of 5 sentences
CCD: S2 
DBN: EUROM_1 
CNM: Sentences (UCL) 
CTY: S 
NBI: 5 
LAN: E 
PCF: SENTEN.DES
-------------------------------------------------------- 
T 
- 
CMT:        T3.TXT Prompt text for Block 3 of 30 digit triples
CCD: T3 
DBN: EUROM_1 
CNM: list 3D Digits Triples 
CTY: N 
NBI: 30 
LAN: E 
PCF: DIGIT.DES 
---------------------------------------------------




EAGLES SWLG SoftEdition, May 1997. Get the book...