- Abercrombie (1967)
D. Abercrombie (1967).
Elements of general phonetics.
Edinburgh University Press, Edinburgh.
- Aho et al. (1987)
A. Aho, B. Kernighan & P. Weinberger (1987).
The AWK programming language.
Addison-Wesley Publishing Company, Reading, Mass., etc.
- Ainsworth (1988)
W. Ainsworth (1988).
Speech recognition by machine.
Peter Peregrinus.
- Aitchison (1994)
J. Aitchison (1994).
Words in the mind. An introduction to the mental lexicon.
Blackwell, Oxford.
- Aitkin et al. (1989)
M. Aitkin, D. Anderson, B. Francis & J. Hinde (1989).
Statistical modelling in GLIM.
Clarendon Press, Oxford.
- Akers & Lennig (1985)
G. Akers & M. Lennig (1985).
Intonation in text-to-speech synthesis: Evaluation of algorithms.
Journal of the Acoustical Society of America, JASA 77:
- Akmajian (1984)
A. Akmajian (1984).
Linguistics: An introduction to language and communication.
The MIT Press, Cambridge, Massachusetts.
- Allen (1988)
G. Allen (1988).
The PHONASCII system.
Journal of the International Phonetic Association 18(1):
- Allen et al. (1987)
J. Allen, M. Hunnicutt & D. Klatt (1987).
From text to speech: The MITalk system.
Cambridge University Press, Cambridge.
- Allerhand (1987)
M. Allerhand (1987).
Knowledge-based speech pattern recognition.
Kogan Page, London.
- Alleva et al. (1992)
F. Alleva, H. Hon, X. Huang, M. Hwang, R. Rosenfeld & R. Weide (1992).
Applying SPHINX-II to the DARPA Wall Street Journal CSR
In: Speech and Natural Language workshop,
393-398, Harriman, New York.
- Alleva et al. (1993)
F. Alleva, X. Huang & M.-Y. Hwang (1993).
An improved search algorithm using incremental knowledge for
continuous speech recognition.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
II, 307-311, Minneapolis, MN, April.
- Althoff et al. (1996)
F. Althoff, G. Drexel, H. Lüngen, M. Pampel & C. Schillo (1996).
The treatment of compounds in a morphological component for speech
In: D. Gibbon, ed., Natural language
processing and speech technology. Results of the 3rd KONVENS Conference,
Bielefeld, October 1996, 71-76. Mouton de Gruyter, Berlin, New
- Andernach et al. (1993)
T. Andernach, G. Deville & L. Mortier (1993).
The design of a real world Wizard of Oz experiment for a speech
driven telephone directory information system.
In: Proceedings of the 3rd European Conference
on Speech Communication and Technology, 1165-1168, Berlin, September.
- Andry et al. (1990)
F. Andry, E. Bilange, F. Charpentier, K. Choukri, M. Ponamali & S. Soudoplatoff (1990).
Computerised simulation tools for the design of an oral dialogue
In: Proceedings of the ESPRIT Technical
Conference, Brussels, November.
- Andry et al. (1992)
F. Andry, S. McGlashan, N. Youd, N. Fraser & S. Thornton (1992).
Making DATR work for speech: Lexicon compilation in SUNDIAL.
Computational Linguistics 18(3): 245-267.
- Argente (1991)
J. Argente (1991).
From speech to speaking styles.
In: Proceedings of the ESCA Workshop `Phonetics and phonology of speaking styles: Reduction and elaboration in speech communication', 1-1, 1-12, Barcelona.
- Atal (1976)
B. Atal (1976).
Automatic recognition of speakers from their voices.
Proceedings of the IEEE, April, 64(4): 460.
- Atal et al. (1991)
B. Atal, J. Miller & R. Kent, eds. (1991).
Papers in speech communication: Speech processing.
Acoustical Society of America.
- Aubergé (1992)
V. Aubergé (1992).
Developing a structured lexicon for synthesis of prosody.
In: G. Bailly, C. Benoît & T. Sawallis,
eds., Talking machines: Theories, models and designs,
307-321. North-Holland, Amsterdam.
- Austin (1962)
J. Austin (1962).
How to do things with words.
Oxford University Press, Oxford.
- Autesserre et al. (1989)
D. Autesserre, G. Pérennou & M. Rossi (1989).
Methodology for the transcription and labeling of a speech corpus.
Journal of the International Phonetic Association 19(1):
- Averbuch et al. (1987)
A. Averbuch, L. Bahl & R. Bakis (1987).
Experiments with the TANGORA 20000 word speech recognizer.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
- Averbuch et al. (1986)
A. Averbuch, L. Bahl, R. Bakis, P. Brown, A. Cole, G. Daggett, S. Das, K. Davies, S. De Gennaro, P. De Souza, E. Epstein, D. Fraleigh, F. Jelinek, S. Katz, B. Lewis, R. Mercer, A. Nadas, D. Nahamoo, M. Picheny, G. Shichman & P. Spinelli (1986).
An IBM PC-based large-vocabulary isolated-utterance speech
In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 53-56.
- Baayen (1991)
H. Baayen (1991).
De CELEX lexicale databank.
Forum der Letteren 32(3): 221-231.
- Bahl et al. (1989)
L. Bahl, P. Brown, P. De Souza & R. Mercer (1989).
A tree-based statistical language model for natural language speech
IEEE Transactions on Acoustics, Speech and Signal Processing,
ASSP-37(7) 1001-1008.
Also in: A. Waibel, K.-F. Lee, eds. (1990), Readings in speech
recognition, Morgan Kaufmann Publishers, San Mateo, California, 507-514.
- Bahl et al. (1983)
L. Bahl, F. Jelinek & R. Mercer (1983).
A maximum likelihood approach to continuous speech recognition.
IEEE Transactions on Pattern Analysis and Machine Intelligence,
March, 5: 179-190.
- Bahl et al. (1984)
L. Bahl, F. Jelinek, R. Mercer & A. Nadas (1984).
Next word statistical predictor.
IBM Tech. Disclosure Bulletin, December, 27(7A): 3941-3942.
- Bailleul (1987)
C. Bailleul (1987).
Evaluation des performances d'un système de reconnaissance vocale
dans des tâches de contrôle airiens.
Note Interne, CENA/N87083, 22 June.
- Bailly (1994)
G. Bailly (1994).
Rule compilers and text-to-speech systems.
Les Cahiers de l'ICP 3: 87-91.
- Bailly & Benoît (1992)
G. Bailly & C. Benoît, eds. (1992).
Talking machines: Theories, models and designs.
North-Holland, Elsevier Science Publishers, Amsterdam.
- Baker (1975a)
J. Baker (1975a).
The DRAGON system - An overview.
IEEE Transactions on Acoustics, Speech and Signal Processing,
ASSP-23 24-29.
- Baker (1975b)
J. Baker (1975b).
Stochastic modeling for automatic speech understanding.
In: D. Reddy, ed., Speech recognition,
521-541. Academic Press, New York, N.Y.
Also in: A. Waibel, K.-F. Lee, eds. (1990), Readings in speech
recognition, Morgan Kaufmann Publishers, San Mateo, California, 297-307.
- Baker (1989)
J. Baker (1989).
Dragondictate-30k: Natural language speech recognition with 30000
In: Proceedings of the European Conference on
Speech Technology, 2, 161-163.
- Baker et al. (1992)
J. Baker, P. Bamberg, K. Bishop, L. Gillick, V. Helman, Z. Huang, Y. Ito, S. Lowe, B. Peskin, R. Roth & F. Scattone (1992).
Large vocabulary recognition of Wall Street Journal sentences
at Dragon systems.
In: Speech and Natural Language Workshop,
387-392, Harriman, New York, 23-26 February.
- Ball (1991)
M. Ball (1991).
Computer coding of the IPA: Extensions to the IPA.
Journal of the International Phonetic Association 21(1):
- Ballou (1987)
G. Ballou, ed. (1987).
Handbook for sound engineers.
W. Sams & Co., Indianapolis, U.S.A.
- Barber et al. (1989)
S. Barber, R. Carlson, P. Cosi, M. Di Benedetto, B. Granström & K. Vagges (1989).
A rule-based Italian text-to-speech system.
In: Proceedings of the Eurospeech '89,
2, 517-520, Paris.
- Barry & Fourcin (1990)
W. Barry & A. Fourcin (1990).
Speaker selection criteria.
In: ESPRIT Project 2589 (SAM),
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
Interim Report Year I, Reference SAM-UCL-G002, Document
- Barry & Fourcin (1992)
W. Barry & A. Fourcin (1992).
Levels of labelling.
Computer Speech and Language 6: 1-14.
- Barry et al. (1989)
W. Barry, M. Grice, V. Hazan & A. Fourcin (1989).
Excitation distributions for synthesised speech.
In: Proceedings of the Eurospeech '89,
1, 353-356, Paris.
- Bartlett (1987)
B. Bartlett (1987).
Choosing the right microphones by understanding design tradeoffs.
J. Audio. Eng. Soc. 35.
- Bates & Ayuso (1991)
M. Bates & D. Ayuso (1991).
A proposal for incremental dialogue evaluation.
In: Proceedings of the DARPA Workshop on
Speech and Natural Language, 319-322, Pacific Grove, CA, February.
- Bates et al. (1990)
M. Bates, S. Boisen & J. Makhoul (1990).
Developing an evaluation methodology for spoken language systems.
In: Proceedings of the DARPA Workshop on
Speech and Natural Language, 102-108, Hidden Valley, PA, June.
- Baum (1900)
F. Baum (1900).
The Wizard of Oz.
Collins, London.
Edition of 1974.
- Baum (1972)
L. Baum (1972).
An inequality and associated maximization technique in statistical
estimation of a Markov process.
Inequalities 3(1): 1-8.
- Beckman (1986)
M. Beckman (1986).
Stress and non-stress accent.
Foris, Dordrecht.
- Belina & Hogrefe (1988)
F. Belina & D. Hogrefe (1988).
The CCITT specification and design language SDL.
Computer networks and ISDN systems 16: 311-341.
- Bell et al. (1990)
T. Bell, J. Cleary & I. Witten (1990).
Text compression.
Prentice Hall, Englewood Cliffs, NJ.
- Benoît (1989)
C. Benoît (1989).
Intelligibility test for the assessment of French synthesizers
using semantically unpredictable sentences.
In: Proceedings of the ESCA Workshop on Speech
Input/Output Assessment and Speech Databases, 1.7.1-1.7.4.
- Benoît (1991)
C. Benoît (1991).
On the assessment of audio-visual speech synthesis.
In: Proceedings of the Workshop on
International Cooperation and Standardisations of Speech Databases and Speech
I/O Assessment Methods, Chiavari, Italy.
- Benoît et al. (1992)
C. Benoît, T. Lallouache, T. Mohamadi & C. Abry (1992).
A set of French visemes for visual speech synthesis.
In: G. Bailly & C. Benoît, eds.,
Talking machines: Theories, models, and design, 485-504.
North Holland, Elsevier Science Publishers, Amsterdam.
- Benoît et al. (1989)
C. Benoît, A. Van Erp, M. Grice, V. Hazan & U. Jekosch (1989).
Multilingual synthesizer assessment using semantically unpredictable
In: Proceedings of the Eurospeech '89,
2, 633-636, Paris.
- Bentler (1985)
P. Bentler (1985).
Theory and implementation of EQS, a structural equations
BMDP Statistical Software Inc., Los Angeles.
- Berendsen et al. (1986)
E. Berendsen, S. Langeweg & H. Van Leeuwen (1986).
Computational phonology: Merged not mixed.
In: Proceedings of the International
Conference on Computational Linguistics '86, 612-614.
- Berger et al. (1994)
A. Berger, P. Brown, J. Cocke, S. Della Pietra, V. Della Pietra, J. Gillett, J. Lafferty, R. Mercer, H. Printz & L. Ures (1994).
The Candide system for machine translation.
In: Proceedings of the ARPA Human Language
Technology Workshop, 152-157, Plainsboro, NJ, March.
- Berkley & Flanagan (1990)
D. Berkley & J. Flanagan (1990).
Integration of speech recognition, text-to-speech synthesis, and
talker verification into a hands free audio/image teleconferencing system
ICSLP 20(1): 861-864.
- Bimbot et al. (1995)
F. Bimbot, I. Magrin-Chagnolleau & L. Mathan (1995).
Second-order statistical measures for text-independent speaker
Speech Communication 17.
- Bimbot & Mathan (1993)
F. Bimbot & L. Mathan (1993).
Text-free speaker recognition using an arithmetic-harmonic sphericity
In: Proceedings of the Eurospeech,
- Black et al. (1991)
E. Black, S. Abney, D. Flickinger, C. Gdaniec, R. Grishman, P. Harrison, D. Hindle, R. Ingria, F. Jelinek, J. Klavans, M. Liberman, M. Marcus, R. Roukos, B. Santorini & T. Strazalkowski (1991).
A procedure for quantitatively comparing the syntactic coverage of
English grammars.
In: Proceedings of the DARPA Workshop on
Speech and Natural Language, 306-311, Pacific Grove, CA, February.
- Bladon (1990)
A. Bladon (1990).
Evaluating the prosody of text-to-speech synthesizers.
In: Proceedings of the Speech Tech '90,
- Blauert (1983)
J. Blauert (1983).
Spatial hearing.
MIT Press, Cambridge.
- Bleiching (1992)
D. Bleiching (1992).
Prosodisches Wissen im Lexikon.
In: G. Görz, ed., KONVENS 92, 1.
Konferenz ``Verarbeitung natürlicher Sprache'', Nürnberg, 7.-9.
Oktober 1992, 59-68. Springer-Verlag, Berlin.
- Bleiching et al. (1996)
D. Bleiching, G. Drexel & D. Gibbon (1996).
Ein Synkretismusmodell für die deutsche Morphologie.
In: D. Gibbon, ed., Natural language
processing and speech technology. Results of the 3rd KONVENS Conference,
Bielefeld, October 1996, 237-248. Mouton de Gruyter, Berlin, New
- Bleiching & Gibbon (1994)
D. Bleiching & D. Gibbon (1994).
Handbuch zur Demonstrator-Wortliste.
V1.1. May 1994, Bielefeld University, Bielefeld, Germany.
- Bloothooft et al. (1995)
G. Bloothooft, V. Hazan, D. Huber & J. Llisterri (1995).
European studies in phonetics and speech communication.
OTS Publications, Utrecht.
- Bobrow & Winograd (1977)
D. Bobrow & T. Winograd (1977).
An overview of KRL, a knowledge representation language.
Cognitive Science 1: 3-46.
- Boguraev et al. (1988)
B. Boguraev, J. Carroll, S. Pulman, G. Russell, G. Ritchie, A. Black, E. Briscoe & C. Grover (1988).
The lexical component of a natural language toolkit.
In: D. Walker, A. Zampolli & N. Calzolari,
eds., Automating the lexicon: Research and practice in a
multilingual environment. Cambridge University Press, Cambridge.
- Bolinger (1972)
D. Bolinger (1972).
Accent is predictable (if you're a mind-reader).
Language 48: 633-644.
- Bolt (1970)
R. Bolt (1970).
Speaker identification by speech spectrograms: A scientists' view
of its reliability for legal purposes.
JASA 47(2): 597.
Part 2.
- Boogaart & Silverman (1992)
T. Boogaart & K. Silverman (1992).
Evaluating the overall comprehensibility of speech synthesizers.
In: Proceedings of the 2nd International
Conference on Spoken Language Processing, ICSLP, 1207-1210, Banff.
- Boogart et al. (1993)
T. Boogart, P. Van Alphen & J. Doll (1993).
Application oriented assessment of dialogue systems.
In: Joint ESCA - NATO/RSG10 Tutorial and
Research Workshop on Applications of Speech Technology, Lautrach, September.
- Boves (1984)
L. Boves (1984).
The phonetic basis of perceptual ratings of running speech.
Foris, Dordrecht.
- Brachman & Levesque (1985)
R. Brachman & H. Levesque (1985).
Readings in knowledge representation.
Morgan Kaufmann Publishers, Inc., Los Altos, California.
- Breiman et al. (1984)
L. Breiman, J. Friedman, R. Olshen & C. Stone (1984).
Classification and regression trees.
Wadsworth, Belmont, CA.
- Bridle et al. (1982)
J. Bridle, M. Brown & R. Chamberlain (1982).
An algorithm for connected word recognition.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
899-902, Paris, May.
- Brietzmann et al. (1983)
A. Brietzmann, H. Hein, H. Niemann & P. Regel (1983).
The Erlangen system for understanding continuous German speech.
In: IEEE International Conference on
Acoustics, Speech and Signal Processing, ICASSP, 304-307, Boston.
- Bristow (1984)
G. Bristow (1984).
Electronic speech synthesis.
Collins, London.
- Bristow (1986)
G. Bristow (1986).
Electronic speech recognition.
Collins, London.
- Brouwer & De Haan (1987)
D. Brouwer & D. De Haan, eds. (1987).
Women's language, socialization and self-image.
Foris Publications, Dordrecht.
- Browman (1980)
C. Browman (1980).
Rules for demisyllable synthesis using Lingua, a language
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
561-564, Denver.
- Brown et al. (1992)
P. Brown, V. Della Pietra, P. De Souza & R. Mercer (1992).
Class-based n-gram models of natural language.
Computational Linguistics 18(4): 467-479.
- Bruce (1989)
G. Bruce (1989).
Report from the IPA Working Group on suprasegmental categories.
Working Papers 35, Lund University, Department of Linguistics,
Lund 25-40.
- Bunt et al. (1985)
H. Bunt, R.-J. Beun, F. Dols, J. von der Linden & G. thoe Schwartzenberg (1985).
The TENDUM dialogue system and its theoretical basis.
IPO Annual Progress Report 19: 105-113.
- Burrell (1991)
M. Burrell (1991).
Assessment of the degradations of synthetic speech and time frequency
warping over different listening levels.
In: Proceedings of the Institute of
Acoustics, 13, Pt. 2.
- Button (1990)
G. Button (1990).
Going up a blind alley: Conflating conversation analysis and
computational modelling.
In: P. Luff, G. Gilbert & D. Frohlich,
eds., Computers and conversation, 67-90. Academic
Press, London.
- Cahill (1993)
L. Cahill (1993).
Morphonology in the lexicon.
In: Proceedings of the Sixth Conference of the
European Chapter of the Association for Computational Linguistics,
87-96, Utrecht.
- Cahill & Evans (1990)
L. Cahill & R. Evans (1990).
An application of DATR: The TIC lexicon.
In: R. Evans & G. Gazdar, eds.,
The DATR Papers, 31-39. School of Cognitive and Computing
Science, University of Sussex, Brighton.
- Campbell (1995)
J. Campbell (1995).
Testing with the YOHO CD-ROM Voice Verification Corpus.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
- Carbonell & Pierrel (1986)
N. Carbonell & J. Pierrel (1986).
Architecture and knowledge sources in a human computer oral dialog
In: Proceedings of the NATO workshop:
Structure of multimodal dialogues including voice, Corsica, France.
- Carlson et al. (1979)
R. Carlson, B. Granström & D. Klatt (1979).
Some notes on the perception of temporal patterns in speech.
In: Proceedings of the 9th International
Congress of Phonetic Sciences, 2, 260-267,
- Carroll & Chang (1970)
J. Carroll & J. Chang (1970).
Analysis of individual differences in multidimensional scaling via an
n-way generalization of ``Eckart-Young'' decomposition.
Psychometrika 35: 283-319.
- Carson-Berndsen (1993)
J. Carson-Berndsen (1993).
Time map phonology and the projection problem in spoken language
Doctoral dissertation, University of Bielefeld, Bielefeld, Germany.
- Cartier et al. (1992)
M. Cartier, F. Emerald, D. Pascal, P. Combescure & A. Soubigou (1992).
Une méthode d'évaluation multicritère de sorties vocales:
Application au test de 4 systèmes de synthèse à partir du texte.
In: 19èmes Journées d'Étude sur la
Parole, Brussels.
- CCITT (1988a)
CCITT (1988a).
Artificial voices.
Blue Book IXth Plenary Assembly V: 87-99.
Recommendation P.50.
- CCITT (1988b)
CCITT (1988b).
Objective measurement of active speech level.
Rec. P. 56 Melbourne, CCITT.
- Chafe (1992)
W. Chafe (1992).
The importance of corpus linguistics to understanding the nature of
In: J. Svartvik, ed., Directions in
corpus linguistics: Proceedings of the Nobel Symposium 82, New York,
79-97, Berlin. Mouton de Gruyter.
- Charniak & McDermott (1985)
E. Charniak & D. McDermott (1985).
Introduction to Artificial Intelligence.
Addison-Wesley, Reading, Massachusetts.
- Chollet & Gagnoulet (1981)
G. Chollet & C. Gagnoulet (1981).
On the evaluation of recognizers and databases using a reference
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP, Atlanta.
- Chomsky (1965)
N. Chomsky (1965).
Aspects of the theory of syntax.
The MIT Press, Cambridge, MA.
- Chomsky & Halle (1968)
N. Chomsky & M. Halle (1968).
The sound pattern of English.
Harper and Row, New York, Evanston, London.
- Choukri et al. (1988)
K. Choukri, G. Chollet & C. Montacié (1988).
Test workstation for the evaluation of speech recognition algorithms,
applications and databases.
In: Proceedings of the 7th FASE Symposium
(Speech'88), 145-151, Edinburgh, August 1988.
- Church (1987a)
K. Church (1987a).
Phonological parsing and lexical retrieval.
Cognition 25: 53-69.
- Church (1987b)
K. Church (1987b).
Phonological parsing in speech recognition.
Kluwer Academic Publishers, Boston, Dordrecht, Lancaster.
- Coates (1986)
J. Coates (1986).
Women, men and language: A sociolinguistic account of sex
differences in language.
Longman, London.
- Cole (1995)
R. Cole (1995).
The challenge of spoken language systems: Research directions for
the nineties.
IEEE Transactions on Speech and Audio Processing 3: 1-20.
- Combescure (1981)
P. Combescure (1981).
20 listes de dix phrases phonétiquement équilibrées.
Revue d'Acoustique 56: 34-38.
- Content et al. (1990)
A. Content, P. Mousty & M. Radeau (1990).
Brulex, une base de données lexicales informatisée pour le
français écrit et parlé.
L'Année Psychologique 90: 551-566.
- Cookson (1988)
S. Cookson (1988).
Final evaluation of VODIS voice operated database inquiry system.
In: Proceedings of Speech-88, 7th FASE
Symposium, 1311-1320, Edinburgh, August.
- Cosi & Omologo (1991)
P. Cosi & M. Omologo (1991).
Caratterizzazione statistica della segmentazione manuale del segnale
Associazione Italiana Acustica (AIA) Meeting. Napoli, Italy,
10-12 April. Cited in Barry and Fourcin 1992.
- Crowdy (1993)
S. Crowdy (1993).
Spoken corpus design and transcription.
Longman, Harlow.
- Cruse (1986)
D. Cruse (1986).
Lexical semantics.
CUP, Cambridge.
- Crystal (1980)
D. Crystal (1980).
Introduction to language pathology.
Edward Arnold Ltd., London.
- Crystal (1985)
D. Crystal (1985).
A dictionary of linguistics and phonetics.
Basil Blackwell, Oxford, UK.
- Cucchiarini (1993)
C. Cucchiarini (1993).
Phonetic transcription: A methodological and empirical study.
Doctoral thesis, University of Nijmegen, Nijmegen.
- Dahlbäck & Jönsson (1986)
N. Dahlbäck & A. Jönsson (1986).
A system for studying human-computer dialogues in natural language.
Research Report LiTH-IDA-R-86-42, Department of Computer and
Information Science, Linköping University, Linköping.
- Dahlbäck & Jönsson (1989)
N. Dahlbäck & A. Jönsson (1989).
Empirical studies of discourse representations for natural language
In: Proceedings of the 4th Conference of the
European Chapter of the Association for Computational Linguistics,
291-298, Manchester.
- Dalsgaard & Baekgaard (1994)
P. Dalsgaard & A. Baekgaard (1994).
Spoken language dialogue systems.
In: H. Niemann, R. De Mori & G. Hanrieder,
eds., Progress and prospects in speech and language technology,
178-191. Infix, Sankt Augustin.
- Damhuis et al. (1994)
M. Damhuis, T. Boogaart, C. in 't Veld, M. Versteijlen, W. Schelvis, L. Bos & L. Boves (1994).
Creation & analysis of the Dutch Polyphone Corpus.
In: Proceedings of the International
Conference on Spoken Language Processing, ICSLP, 1803-1806,
- Davis & Davis (1975)
D. Davis & C. Davis (1975).
Sound system engineering.
W. Sams & Co., Indianapolis, U.S.A.
- De Mori et al. (1984)
R. De Mori, M. Gilloux, G. Mercier, M. Simon, C. Tarrides & J. Vaissière (1984).
Integration of acoustic, phonetic, prosodic and lexical knowledge in
an expert system for speech understanding.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP.
- De Pijper (1983)
J. De Pijper (1983).
Modelling British English intonation.
Foris, Dordrecht.
- Della Pietra et al. (1994)
S. Della Pietra, V. Della Pietra, J. Gillett, J. Lafferty, H. Printz & L. Ures (1994).
Inference and estimation of a long-range trigram model.
Second International Colloquium `Grammatical Inference and
Applications', Alicante, Spain, September 1994 78-92.
Springer-Verlag, Berlin.
- Delogu et al. (1993a)
C. Delogu, A. Di Carlo, C. Sementina & S. Stecconi (1993a).
A methodology for evaluating human-machine spoken language
In: Proceedings of the 3rd European Conference
on Speech Communication and Technology, 1427-1430, Berlin,
- Delogu et al. (1991)
C. Delogu, A. Paoloni, P. Pocci & C. Sementina (1991).
Quality evaluations of text-to-speech synthesizers using magnitude
estimation, categorical estimation, pair comparison and reaction time methods.
In: Proceedings of the Eurospeech '91,
353-355, Genova.
- Delogu et al. (1993b)
C. Delogu, A. Paoloni, P. Ridolfi & K. Vagges (1993b).
Intelligibility of Italian text-to-speech synthesizers over
orthophonic and telephonic channel.
In: Proceedings of the Eurospeech '93,
3, 1893-1896, Berlin.
- Delogu et al. (1992a)
C. Delogu, A. Paoloni & C. Sementina (1992a).
Comprehension of natural and synthetic speech: Preliminary studies.
In: ESPRIT Project 2589 (SAM),
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
Final report, Year three, 1.III.91-28.II.1992. SAM Internal Report
- Delogu et al. (1992b)
C. Delogu, A. Paoloni, P. Pocci & C. Sementina (1992b).
A comparison among different methodologies for evaluating the quality
of text-to-speech synthesis systems.
In: ESPRIT Project 2589 (SAM),
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
Final report, Year three, 1.III.91-28.II.1992. SAM Internal Report
- Delomier et al. (1989)
D. Delomier, A. Meunier & M.-A. Morel (1989).
Linguistic features of human-machine oral interaction.
In: Proceedings of the Eurospeech '89,
2, 236-239, Paris.
- Dempster et al. (1977)
A. Dempster, N. Laird & D. Rubin (1977).
Maximum likelihood from incomplete data via the EM algorithm.
J. Royal Statist. Soc. Ser. B (methodological) 39: 1-38.
- Den Os (1994)
E. Den Os (1994).
Transliteration of the Dutch Speech Styles Corpus.
In: Proceedings of the Institute of Phonetic
Sciences, 18, 87-94, University of Amsterdam.
- Derouault & Merialdo (1986)
A. Derouault & B. Merialdo (1986).
Natural language modelling for phoneme-to-text transcription.
IEEE Transactions on Pattern Analysis and Machine Intelligence,
November, 8: 742-749.
- Diaper (1986)
D. Diaper (1986).
Identifying the knowledge requirements of an expert system's natural
language processing interface.
In: M. Harrison & A. Monk, eds.,
People and Computers V: Proceedings of the 2nd Conference of the
British Computer Society Human-Computer Interaction Specialist Group,
Cambridge. Cambridge University Press.
- Diaper (1989)
D. Diaper (1989).
The Wizard's apprentice: A program to help analyse natural
language dialogues.
In: A. Sutcliffe & L. Macaulay, eds.,
People and Computers: Designing for usability. Proceedings of the 2nd
Conference of the British Computer Society Human-Computer Interaction
Specialist Group, Cambridge. Cambridge University Press.
- Doddington (1985)
G. Doddington (1985).
Speaker recognition - Identifying people by their voices.
Proceedings of the IEEE, November, 73(11): 1651.
- Dolmazon et al. (1990)
J.-M. Dolmazon, J.-C. Caërou & W. Barry (1990).
Initial development of SAM standard workstation.
SAM-UCL-022, December, Appendix Se.10, University College London,
- Dougherty (1990)
D. Dougherty (1990).
sed & awk.
O'Reilly & Associates Inc., Sebastopol, CA.
- Dreckschmidt (1987)
G. Dreckschmidt (1987).
The linguistic component in the speech understanding system SPICOS.
In: H. Tillmann & G. Willée, eds.,
Analyse und Synthese gesprochener Sprache, Jahrestagung der
Gesellschaft für Linguistische Datenverarbeitung, Bonn,
96-101. Olms, Hildesheim.
- Drullman & Collier (1993)
R. Drullman & R. Collier (1993).
Speech synthesis with accented and unaccented diphones.
In: V. Van Heuven & L. Pols, eds.,
Analysis and synthesis of speech, strategic research towards
high-quality text-to-speech generation, 147-156. Mouton de
Gruyter, Berlin.
- Duda & Hart (1973)
R. Duda & P. Hart (1973).
Pattern classification and scene analysis.
J. Wiley, New York.
- Duncan (1974)
S. Duncan (1974).
On signalling that it's your turn to speak.
Journal of Experimental Social Psychology 10: 234-247.
- Dybkjaer et al. (1993)
H. Dybkjaer, N. Bernsen & L. Dybkjaer (1993).
Wizard-of-Oz and the trade-off between naturalness and recognizer
In: Proceedings of the 3rd European Conference
on Speech Communication and Technology, 947-950, Berlin,
- Eargle (1976)
J. Eargle (1976).
Sound recording.
Van Nostrand Reinhold Company, New York, USA.
- Edwards & Lampert (1993)
J. Edwards & M. Lampert, eds. (1993).
Talking data: Transcription and coding in discourse
Lawrence Erlbaum, Hillsdale.
- Efron & Tibshirani (1993)
B. Efron & R. Tibshirani (1993).
An introduction to the bootstrap.
Chapman & Hall, New York.
- Egan (1948)
J. Egan (1948).
Articulation testing methods.
Laryngoscope 58: 955-991.
- Ehrlich (1986)
U. Ehrlich (1986).
Ein Lexikon für das natürlich-sprachliche Dialogsystem
Arbeitsberichte des IMMD, vol. 19, University of
Erlangen-Nürnberg, Erlangen, Germany.
- Eisen (1993)
B. Eisen (1993).
Reliability of speech segmentation and labelling at different levels
of transcription.
In: Proceedings of the Third European
Conference on Speech Communication and Technology, 1,
673-676, 21-23 September 1993, Berlin, Germany.
- Erman (1977)
L. Erman (1977).
A functional description of the HEARSAY-II speech
understanding system.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP, Hartford.
- Erman & Hayes-Roth (1981)
L. Erman & F. Hayes-Roth (1981).
The HEARSAY-II speech understanding system: Integrating
knowledge to resolve uncertainty.
In: B. Webber & N. Nilsson, eds.,
Readings in Artificial Intelligence, 349-389. Tioga, Palo
Alto, CA.
- Erman & Lesser (1980)
L. Erman & V. Lesser (1980).
The HEARSAY-II speech understanding system: A tutorial.
In: W. Lea, ed., Trends in speech
recognition, 361-381. Prentice Hall, Englewood Cliffs, NJ.
Also in: A. Waibel and K.-F. Lee, eds. (1990), Readings in speech
recognition, Morgan Kaufmann Publishers, San Mateo, California, 235-245.
- Esling (1988)
J. Esling (1988).
7.1 Computer coding of IPA symbols and 7.3 detailed phonetic
representation of computer data bases.
Journal of the International Phonetic Association 18(2):
- Esling (1990)
J. Esling (1990).
Computer coding of the IPA: Supplementary report.
Journal of the International Phonetic Association 20(1):
- Esling & Gaylord (1993)
J. Esling & H. Gaylord (1993).
Computer codes for phonetic symbols.
Journal of the International Phonetic Association 23(2):
- Essen & Steinbiss (1992)
U. Essen & V. Steinbiss (1992).
Cooccurrence smoothing for stochastic language modelling.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
I, 161-164, San Francisco, CA, March.
- Evans & Gazdar (1989)
R. Evans & G. Gazdar (1989).
The DATR papers.
Research Report: May 1989, School of Cognitive and Computing
Science, University of Sussex, Brighton.
- Evans & Gazdar (1990)
R. Evans & G. Gazdar (1990).
The DATR papers.
Research Report: February 1990, School of Cognitive and Computing
Science, University of Sussex, Brighton.
- Federico (1989)
A. Federico (1989).
Comparison between automatic methods and human listeners in speaker
recognition tasks.
In: Proceedings of the Eurospeech,
- Fellbaum et al. (1994)
K. Fellbaum, H. Klaus & J. Sotscheck (1994).
Hörversuche zur Beurteilung der Sprachqualität von
Sprachsynthesesystemen für die deutsche Sprache.
In: Fortschritte der Akustik,
Plenarvorträge und Fachbeiträge der 20. Deutschen Jahrestagung
für Akustik, 117-122, Dresden, DPG GmbH.
- Ferguson (1976)
G. Ferguson (1976).
Statistical analysis in psychology and education.
McGraw-Hill, Tokyo.
- Ferrané et al. (1992)
I. Ferrané, M. De Calmès, D. Cotto, J.-M. Pécatte & G. Pérennou (1992).
Statistiques lexicales sur le corpus de textes utilisés dans le
projet BREF: Questions de couverture lexicale.
In: Proceedings Communication Homme-Machine,
Séminaire LEXIQUE, 217-226, 21-22 January 1992, IRIT-UPS,
- Fillmore (1968)
C. Fillmore (1968).
The case for case.
In: E. Bach & R. Harms, eds.,
Universals in linguistic theory, 1-88. Holt, Rinehart and
Winston, New York.
- Fissore et al. (1993)
L. Fissore, E. Giachin, P. Laface & P. Massafra (1993).
Using grammars in forward and backward search.
In: Proceedings of the European Conference on
Speech Communication and Technology, 1525-1528, Berlin, September.
- Flanagan et al. (1991)
J. Flanagan, D. Berkley, G. Elko & M. Sondhi (1991).
Autodirective microphone systems.
Acustica 73: 58-71.
- Fourcin (1993)
A. Fourcin (1993).
The SAM project.
Ellis Horwood, Chichester.
- Fourcin et al. (1989)
A. Fourcin, G. Harland, W. Barry & V. Hazan, eds. (1989).
Speech input and output assessment. Multilingual methods and
Ellis Horwood Ltd., Chichester.
- Fraser (1991)
N. Fraser (1991).
Corpus-based evaluation of the SUNDIAL system.
In: J. Neal & S. Walter, eds.,
Proceedings of the Natural Language Processing Systems Evaluation
Workshop, Rome. Rome Laboratory.
Technical Report RL-TR-91-362.
- Fraser & Gilbert (1991a)
N. Fraser & G. Gilbert (1991a).
Effects of system voice quality on user utterances in speech dialogue
In: Proceedings of the Second European
Conference on Speech Communication and Technology, 57-60, Genova,
- Fraser & Gilbert (1991b)
N. Fraser & G. Gilbert (1991b).
Simulating speech systems.
Computer Speech and Language 5: 81-99.
- Fraser et al. (1992)
N. Fraser, N. Gilbert & C. McDermid (1992).
The value of simulation data.
In: Proceedings of the Workshop on Empirical
Models and Methodology for Natural Language Dialogue Systems, Trento, April.
- French (1991)
J. French (1991).
Updated notes for soundprint transcribers + one page sample text from
COBUILD corpus.
Working paper, NERC-WP4-47, October, J.P. French Associates,
York and COBUILD, Birmingham.
- French (1992)
J. French (1992).
Transcription proposals: Multi-level system.
Working paper, NERC-WP 4-50, October, University of
Birmingham, Birmingham.
- Fu (1982)
K. Fu (1982).
Syntactic pattern recognition and applications.
Prentice-Hall, Englewood Cliffs, NJ.
- Furui (1981)
S. Furui (1981).
Cepstral analysis technique for automatic speaker verification.
IEEE Transactions on Acoustics, Speech and Signal Processing
- Furui (1994)
S. Furui (1994).
An overview of speaker verification technology.
In: ESCA-ETRW Workshop, 1-10,
- Generet et al. (1995)
M. Generet, H. Ney & F. Wessel (1995).
Extensions of absolute discounting for language modelling.
In: Proceedings of the Fourth European
Conference on Speech Communication and Technology, 1245-1248,
Madrid, September.
- Gerbino et al. (1993)
E. Gerbino, P. Baggia, A. Ciaramella & C. Rullent (1993).
Test and evaluation of a spoken dialogue system.
In: Proceedings of the International
Conference on Acoustics, Speech and Signal Processing, ICASSP'93,
Minneapolis, April.
- Geutner (1995)
P. Geutner (1995).
Using morphology towards better large-vocabulary speech
recognition systems.
Interactive Systems Laboratories, University of Karlsruhe,
Karlsruhe, Germany.
- Gibbon (1991)
D. Gibbon (1991).
Lexical signs and lexicon structure: Phonology and prosody in the
Research Report ASL-MEMO-20-91/UBI, University of Bielefeld,
Bielefeld, Germany.
- Gibbon (1992a)
D. Gibbon (1992a).
ILEX: A linguistic approach to computational lexica.
In: U. Klenk, ed., Computatio linguae.
Aufsätze zur algorithmischen und quantitativen Analyse der Sprache,
32-51. Franz Steiner Verlag, Stuttgart.
- Gibbon (1992b)
D. Gibbon (1992b).
Language and software, or: Fritzl's quest.
In: C. Floyd, H. Züllighoven, R. Budde &
R. Keil-Slavik, eds., Software Development and Reality
Construction, 376-390. Springer Verlag, Berlin, Heidelberg, New
- Gibbon (1993)
D. Gibbon (1993).
Generalized DATR for flexible lexical access: PROLOG
VERBMOBIL Report 2, October 1993, Bielefeld University,
Bielefeld, Germany.
- Gibbon (1995)
D. Gibbon (1995).
The VERBMOBIL lexicon: Bielefeld lexicon database V2.1.
VERBMOBIL Technisches Dokument 21, 31 January 1995, Bielefeld
University, Bielefeld, Germany.
- Gibbon & Ehrlich (1995)
D. Gibbon & U. Ehrlich (1995).
Spezifikationen für ein VERBMOBIL-Lexikondatenbankkonzept.
VERBMOBIL Memo 69, Bielefeld University & Daimler Benz AG,
Bielefeld, Ulm.
- Gilbert & Weismer (1974)
H. Gilbert & G. Weismer (1974).
The effect of smoking on the speaking fundamental frequency of adult
Journal of Psycholinguistic Research 3: 225-231.
- Gish et al. (1986)
H. Gish, M. Kraner, W. Russel & J. Wolf (1986).
Methods and experiments for text-independent speaker recognition over
the telephone line.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP, 865.
- Gish & Schmidt (1994)
H. Gish & M. Schmidt (1994).
Text-independent speaker identification.
In: IEEE Signal Processing, 11,
- Goldsmith (1990)
J. Goldsmith (1990).
Autosegmental and metrical phonology.
Indiana University Linguistics Club, Bloomington, Indiana.
- Goldstein (1995)
M. Goldstein (1995).
Classification of methods used for assessment of text-to-speech
systems according to the demands placed on the listener.
Speech Communication 16: 225-244.
- Goldstein et al. (1992)
M. Goldstein, B. Lindström & O. Till (1992).
Assessing global performance of speech synthesizers: Context effects when assessing naturalness of Swedish sentence-pairs generated by 4 systems using 3 different assessment procedures (free number magnitude estimation, 5- and 11-point category scales).
In: ESPRIT Project 2589 (SAM),
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
SAM Internal Report II.a, Final report, Year three:
- Goldstein & Till (1992)
M. Goldstein & O. Till (1992).
Assessing segmental intelligibility of two rule-based synthesizers
and natural speech using the ESPRIT/SAM VCV test procedures (SOAP v3.0)
in Swedish and testing for differences between two correlated proportions.
In: ESPRIT Project 2589 (SAM),
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
SAM Internal Report II.b, Final report, Year three:
- Gong (1995)
Y. Gong (1995).
Speech recognition in noisy environments: A survey.
Speech Communication 16: 261-291.
- Gonzalez & Thomason (1978)
R. Gonzalez & M. Thomason (1978).
Syntactic pattern recognition: An introduction.
Addison-Wesley, Reading, MA.
- Good (1953)
I. Good (1953).
The population frequencies of species and the estimation of
population parameters.
Biometrika, December, 40: 237-264.
- Goodine et al. (1992)
D. Goodine, L. Hirschman, J. Polifroni, S. Seneff & V. Zue (1992).
Evaluating interactive spoken language systems.
In: Proceedings of the International
Conference on Spoken Language Processing, ICSLP'92, 201-204,
Banff, October.
- Goorfin (1989)
L. Goorfin (1989).
Electronic dictionary pronounces over 83,000 words.
Speech Technology 4(4): 49-51.
- Gorin et al. (1991)
A. Gorin, S. Levinson, A. Gertner & E. Goldman (1991).
Adaptive acquisition of language.
Computer, Speech and Language, April, 5(2): 101-132.
- Gray & Kopp (1944)
C. Gray & G. Kopp (1944).
Voiceprint identification.
Bell Telephone Report, Bell Laboratories.
- Green (1986)
D. Green (1986).
Control, activation and resource: A framework and a model for the
control of speech in bilinguals.
Brain and Language 27: 210-223.
- Greenspan et al. (1985)
S. Greenspan, H. Nusbaum & D. Pisoni (1985).
Perception of speech generated by rule: Effects of training and
attentional limitations.
Research on Speech Perception Progress Report 11, pages 263-287,
Indiana University, Indianapolis.
- Grenier (1977)
Y. Grenier (1977).
Identification du locuteur et adaptation au locuteur d'un
système de reconnaissance phonémique.
Ph.D. Thesis.
- Grice (1975)
H. Grice (1975).
Logic and conversation.
In: P. Cole & J. Morgan, eds.,
Syntax and semantics 3: Pragmatics, 41-58. Academic Press,
New York.
- Grice et al. (1991)
M. Grice, K. Vagges & D. Hirst (1991).
Assessment of intonation in text-to-speech synthesis systems - A
pilot test in English and Italian.
In: Proceedings of the Eurospeech '91,
2, 879-882, Genova.
- Grice et al. (1992a)
M. Grice, K. Vagges & D. Hirst (1992a).
Prosodic form tests.
In: ESPRIT Project 2589 (SAM),
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
Final report, Year three, 1.III.91-28.II.1992, Stage report So. 5,
Part One.
- Grice et al. (1992b)
M. Grice, K. Vagges & D. Hirst (1992b).
Prosodic function tests.
In: ESPRIT Project 2589 (SAM),
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
Final report, Year three, 1.III.91-28.II.1992, Stage report So. 5,
Part Two.
- Grosz (1977)
B. Grosz (1977).
The representation and use of focus in dialogue understanding.
University of California.
- Guindon (1988)
R. Guindon (1988).
A multidisciplinary perspective on dialogue structure in user-advisor
In: R. Guindon, ed., Cognitive Science
and its applications for human-computer interaction, 163-200.
- Guindon et al. (1987)
R. Guindon, K. Shuldberg & J. Connor (1987).
Grammatical and ungrammatical structures in user-advisor dialogues:
Evidence for sufficiency of restricted languages in natural language
interfaces to advisory systems.
In: Proceedings of the 25th Annual Meeting of
the Association for Computational Linguistics, 41-44, Stanford.
- Guindon et al. (1986)
R. Guindon, P. Sladky, H. Brunner & J. Connor (1986).
The structure of user-adviser dialogues: Is there method in their
In: Proceedings of the 24th Annual Meeting of
the Association for Computational Linguistics, 224-230.
- Guyomard & Siroux (1986a)
M. Guyomard & J. Siroux (1986a).
PALABRE Phase 1 experimental protocol.
CNET/TSS/RCP WP4 task 3, April.
- Guyomard & Siroux (1986b)
M. Guyomard & J. Siroux (1986b).
PALABRE Phase 2 experimental protocol.
CNET/TSS/RCP WP4 task 3, May.
- Guyomard & Siroux (1987)
M. Guyomard & J. Siroux (1987).
Experimentation in the specification of an oral dialogue.
In: H. Niemann, M. Lang & G. Sagerer, eds.,
Recent Advances in Speech Understanding and Dialog Systems.
NATO ASI Series. Series F: Computer and Systems Sciences, Vol. 46,
497-501. Springer-Verlag, Berlin, Heidelberg, New York, London, Paris,
- Guyomard & Siroux (1988)
M. Guyomard & J. Siroux (1988).
Constitution incrementale d'un corpus de dialogues oraux cooperatifs.
Journal Acoustique 1.
- Haeb-Umbach & Ney (1994)
R. Haeb-Umbach & H. Ney (1994).
Improvements in time-synchronous beam search for 10000-word
continuous speech recognition.
IEEE Transactions on Speech and Audio Processing, April, 2:
- Hansen et al. (1992)
J. Hansen, C. Pelaez, L. Solana & P. Vossen (1992).
Performance assessment and evaluation: Specification document.
SUNSTAR Report II.4.
- Hauptmann & Rudnicky (1988)
A. Hauptmann & A. Rudnicky (1988).
Talking to computers: An empirical investigation.
International Journal of Man-Machine Studies 28: 583-604.
- Hayes (1963)
W. Hayes (1963).
Holt, Rinehart and Winston, Inc., New York.
- Hazan & Grice (1989)
V. Hazan & M. Grice (1989).
The assessment of synthetic speech intelligibility using semantically
unpredictable sentences.
In: Proceedings of the ESCA Workshop on Speech
Input/Output Assessment and Speech Databases, 1.6.1-1.6.4.
- Hazan & Shi (1993)
V. Hazan & B. Shi (1993).
Individual variability in the perception of synthetic speech.
In: Proceedings of the Eurospeech '93,
3, 1849-1852, Berlin.
- Heemskerk & Van Heuven (1993)
J. Heemskerk & V. Van Heuven (1993).
MORPA, a morpheme lexicon based morphological parser.
In: V. Van Heuven & L. Pols, eds.,
Analysis and synthesis of speech, strategic research towards
high-quality text-to-speech generation, 67-85. Mouton de Gruyter,
- Helfrich (1979)
H. Helfrich (1979).
Age markers in speech.
In: K. Scherer & H. Giles, eds.,
Social markers in speech, 63-107. Cambridge University
Press, Cambridge.
- Hertz et al. (1985)
S. Hertz, J. Kadin & K. Karplus (1985).
The DELTA rule development system for speech synthesis from text.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
- Hess (1983)
W. Hess (1983).
Pitch determination of speech signals.
Springer-Verlag, Heidelberg, F.R.G.
- Hess et al. (1995)
W. Hess, K. Kohler & H. Tillmann (1995).
The PhonDat/Verbmobil Speech Corpus.
In: Proceedings of the Eurospeech 95, Madrid.
- Heyer et al. (1991)
G. Heyer, K. Waldhur & H. Khatchadourian (1991).
Motivation, goals and milestones of ESPRIT II MULTILEX.
In: Génie Linguistique 91,
1, Versailles, France, 16-17 January.
- Hieronymus et al. (1990)
J. Hieronymus, H. Alexander, C. Bennett, I. Cohen, D. Davies, J. Dalby, J. Laver, W. Barry, A. Fourcin & J. Wells (1990).
Proposed speech segmentation criteria for the SCRIBE project.
SCRIBE Project Report.
- Hirschman et al. (1990)
L. Hirschman, D. Dahl, D. McKay, L. Norton & M. Linebarger (1990).
Beyond class A: A proposal for automatic evaluation of discourse.
In: Proceedings of the DARPA Workshop on
Speech and Natural Language, 109-112, Hidden Valley, PA, June.
- Hjelmquist et al. (1987)
E. Hjelmquist, B. Jansson & G. Torell (1987).
Psychological aspects on blind people's reading of radio-distributed
daily newspapers.
In: B. Knave & P. Widebäck, eds.,
Work with display units 86, 187-201. North-Holland, Elsevier
Science Publishers, Amsterdam.
- Hockett (1958)
C. Hockett (1958).
A course in modern linguistics.
Macmillan, New York.
- Höge et al. (1985)
H. Höge, E. Marschall, O. Schmidbauer & R. Sommer (1985).
Worthypothesengenerierung im Projekt SPICOS.
In: H. Niemann, ed., Mustererkennung 85,
7. DAGM-Symposium Erlangen, Informatik-Fachberichte, vol. 107,
175-179. Springer-Verlag, Berlin.
- Holmes (1988)
J. Holmes (1988).
Speech synthesis and recognition.
Van Nostrand Reinhold (UK) Co. Ltd., Wokingham.
- Homayounpour et al. (1993)
M. Homayounpour, J. Goldman, G. Chollet & J. Vaissiere (1993).
Performance comparison of machine and human speaker verification.
In: Proceedings of the Eurospeech,
- House (1988)
A. House (1988).
The recognition of speech by machine - A bibliography.
Academic Press Ltd., New York, N.Y.
- House et al. (1965)
A. House, C. Williams, M. Hecker & K. Kryter (1965).
Articulation testing methods: Consonantal differentiation with a
closed response set.
Journal of the Acoustical Society of America, JASA 37:
- House et al. (1992)
J. House, Y. Shitara, M. Grice & P. Howard-Jones (1992).
Evaluation of prosody in dialogue synthesis.
Speech, Hearing and Language 6: 89-108.
- Houtgast & Verhave (1991)
T. Houtgast & J. Verhave (1991).
A physical approach to speech quality assessment: Correlation
patterns in the speech spectrogram.
In: Proceedings of the Eurospeech '91,
1, 285-288, Genova.
- Houtgast & Verhave (1992)
T. Houtgast & J. Verhave (1992).
An objective approach to speech quality.
In: ESPRIT Project 2589 (SAM),
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
Stage report So. 9, Final report, Year three: 1.III.91-28.II.1992.
- Howard-Jones (1992a)
P. Howard-Jones (1992a).
SOAP, Speech Output Assessment Package.
Version 4.0, ESPRIT SAM-UCL-042.
- Howard-Jones (1992b)
P. Howard-Jones (1992b).
Specification of listener dimensions.
In: ESPRIT Project 2589 (SAM),
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
Stage report So. 8, Part One, Final report, Year three:
- Howell (1990)
P. Howell (1990).
Clear speech and turn-taking cues in telephone dialogue.
Report to BT, University College London, London.
- Hunt (1991)
A. Hunt (1991).
New commercial applications of telephone-network-based speech
recognition and speaker verification.
Proceedings of the Eurospeech 15(2): 431.
- Hunt (1990)
M. Hunt (1990).
Figures of merit for assessing connected-word recognizers.
Speech Communication 9: 329-336.
- IPDS (1995)
IPDS (1995).
CD-ROM#2: The Kiel corpus of spontaneous speech. vol. 1,
- IPDS (1996)
IPDS (1996).
CD-ROM#3: The Kiel corpus of spontaneous speech. vol. 2,
- ITU-T (1993)
ITU-T (1993).
Draft recommendation P.8S - Subjective performance assessment
of the quality of speech voice output devices.
Study group 12 - contribution 6, ITU-T.
- Jakobson et al. (1951)
R. Jakobson, G. Fant & M. Halle (1951).
Preliminaries to speech analysis.
The MIT Press, Cambridge.
- Jassem & Łobacz (1989)
W. Jassem & P. Łobacz (1989).
IPA phonemic transcription using an IBM PC and compatibles.
Journal of the International Phonetic Association 19(1):
- Jekosch (1992)
U. Jekosch (1992).
The Cluster-Identification Test.
In: ESPRIT Project 2589 (SAM),
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
Internal report II.e, Final report, Year three: 1.III.91-28.II.1992.
- Jekosch & Pols (1994)
U. Jekosch & L. Pols (1994).
A feature-profile for application-specific speech synthesis
assessment and evaluation.
In: Proceedings of the 3rd International
Conference on Spoken Language Processing, ICSLP, Yokohama.
- Jelinek (1985)
F. Jelinek (1985).
A real-time, isolated-word, speech recognition system for dictation
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
- Jelinek (1991)
F. Jelinek (1991).
Self-organized language modeling for speech recognition.
In: A. Waibel & K.-F. Lee, eds.,
Readings in speech recognition, 450-506. Morgan Kaufmann
Publishers, San Mateo, CA.
- Jelinek et al. (1992)
F. Jelinek, J. Lafferty & R. Mercer (1992).
Basic methods of probabilistic context free grammars.
In: P. Laface & R. De Mori, eds.,
Speech recognition and understanding, 347-360. Springer,
- Jelinek & Mercer (1980)
F. Jelinek & R. Mercer (1980).
Interpolated estimation of Markov source parameters from sparse
In: E. Gelsema & L. Kanal, eds.,
Pattern recognition in practice, 381-397. North-Holland
Publishing Company, Amsterdam.
- Jelinek et al. (1990)
F. Jelinek, R. Mercer & S. Roukos (1990).
Classifying words for improved statistical language models.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
621-624, Albuquerque, NM, April.
- Jelinek et al. (1991a)
F. Jelinek, R. Mercer & S. Roukos (1991a).
Principles of lexical language modeling for speech recognition.
In: S. Furui & M. Sondhi, eds.,
Advances in Speech Signal Processing, 651-699. Marcel
Dekker, New York.
- Jelinek et al. (1991b)
F. Jelinek, B. Merialdo, S. Roukos & M. Strauss (1991b).
A dynamic language model for speech recognition.
In: Proceedings of the DARPA Workshop `Speech
and Natural Language Workshop', 293-295, Pacific Grove, CA,
- Johnston (1993)
R. Johnston (1993).
An on-going series of subjective experiments to assess speech output
from text-to-speech systems.
Unpublished report to CCITT Study Group, No. 12.
- Jongenburger & Van Bezooijen (1992)
W. Jongenburger & R. Van Bezooijen (1992).
Evaluatie van ELK: Attitudes van de gebruikers,
verstaanbaarheid en acceptabiliteit van de spraaksynthese, bruikbaarheid van
het zoeksysteem.
Stichting Spraaktechnologie, Utrecht.
- Jönsson & Dalbäck (1988)
A. Jönsson & N. Dalbäck (1988).
Talking to your computer is not like talking to your best friend.
In: Proceedings of the First Scandinavian
Conference on Artificial Intelligence, Tromso, Norway.
- Joreskog & Sorbom (1984)
J. Joreskog & D. Sorbom (1984).
Lisrel VI. Analysis of linear structural relationships by
maximum likelihood, instrument variables, and least squares methods.
Scientific Software, Mooresville, IN.
- Karttunen (1983)
L. Karttunen (1983).
KIMMO: A general morphological processor.
Texas Linguistic Forum 22: 165-186.
- Kasuya et al. (1993)
H. Kasuya, Y. Endo & S. Saliu (1993).
Novel acoustic measurements of jitter and shimmer characteristics
from pathological voice.
In: Proceedings of the Eurospeech '93,
- Katz (1987)
S. Katz (1987).
Estimation of probabilities from sparse data for the language model
component of a speech recognizer.
IEEE Transactions on Acoustics, Speech and Signal Processing,
March, 35: 400-401.
- Kelley (1983a)
J. Kelley (1983a).
An empirical methodology for writing user-friendly natural language
computer applications.
In: Proceedings of the International
Conference of Computer-Human Interaction, CHI '83.
- Kelley (1983b)
J. Kelley (1983b).
Natural language and computers: Six steps for writing an
easy-to-use computer application.
The Johns Hopkins University, Baltimore.
- Kelley (1984)
J. Kelley (1984).
An interactive design methodology for user-friendly natural language
office information applications.
Association for Computing Machinery Transactions on Office
Information Systems 2: 26-41.
- Kerkhoff et al. (1984)
J. Kerkhoff, J. Wester & L. Boves (1984).
A compiler for implementing the linguistic phase of a text-to-speech
conversion system.
In: H. Bennis & W. Van Lessen-Kloecke, eds.,
Linguistics in The Netherlands 1984, 111-119.
Foris, Dordrecht.
- Kersta (1962)
L. Kersta (1962).
Voiceprint infallibility.
In: Meeting of Acoust. Soc. Am., Seattle.
- Kinsey (1994)
G. Kinsey (1994).
Using voice recognition with IVR systems.
In: AVIOS conference proceedings,
49-56, San Jose.
- Kirchhoff (1996)
K. Kirchhoff (1996).
Phonologisch strukturierte HMMs zur automatischen Spracherkennung.
In: D. Gibbon, ed., Natural language
processing and speech technology. Results of the 3rd KONVENS Conference,
Bielefeld, October 1996, 55-63. Mouton de Gruyter, Berlin, New
- Klatt (1976)
D. Klatt (1976).
The linguistic uses of segmental duration in English: Acoustic
and perceptual evidence.
Journal of the Acoustical Society of America, JASA 59:
- Klatt (1977)
D. Klatt (1977).
Review of the ARPA speech understanding project.
Journal of the Acoustical Society of America, JASA 62(6):
Also in: A. Waibel, K.-F. Lee, eds., (1990), Readings in speech
recognition, Morgan Kaufmann Publishers, San Mateo, California, 554-575.
- Klatt (1982)
D. Klatt (1982).
The KLATTalk text-to-speech conversion system.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
- Klatt (1987)
D. Klatt (1987).
Review of text-to-speech conversion in English.
Journal of the Acoustical Society of America 82: 737-793.
- Kneser & Ney (1995)
R. Kneser & H. Ney (1995).
Improved backing-off for m-gram language modeling.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
I, 49-52, Detroit, MI, May.
- Knowles & Alderson (1995)
G. Knowles & P. Alderson (1995).
Working with speech: The computational analysis of formal
British English speech.
Longmans, London.
- Knowles et al. (1995)
G. Knowles, L. Taylor & B. Williams (1995).
A corpus of formal British English speech.
Longmans, London.
- Knuth (1973)
D. Knuth (1973).
The art of computer programming 3: Sorting and searching.
Addison-Wesley, Reading, Massachusetts.
- Kohler et al. (1995)
K. Kohler, M. Pätzold & A. Simpson (1995).
From scenario to segment: The controlled elicitation,
transcription, segmentation and labelling of spontaneous speech.
Arbeitsberichte (AIPUK) 29, Institut für Phonetik und Digitale
Sprachverarbeitung, IPDS, Universität Kiel, Kiel/Germany.
- Kornai (1991)
A. Kornai (1991).
Formal phonology.
Doctoral dissertation, Stanford University, Stanford.
- Koskenniemi (1983)
K. Koskenniemi (1983).
Two-level morphology: A general computational model for
word-form recognition and production.
University of Helsinki, Department of General Linguistics, Helsinki,
- Kraft & Portele (1995)
V. Kraft & T. Portele (1995).
Quality evaluation of five German speech synthesis systems.
Acta Acustica 3: 351-365.
- Kryter (1962a)
K. Kryter (1962a).
Methods for the calculation and use of the Articulation Index.
Journal of the Acoustical Society of America, JASA 34:
- Kryter (1962b)
K. Kryter (1962b).
Validation of the Articulation Index.
Journal of the Acoustical Society of America, JASA 34:
- Kuhn & De Mori (1990)
R. Kuhn & R. De Mori (1990).
A cache-based natural language model for speech recognition.
IEEE Transactions on Pattern Analysis and Machine Intelligence,
June, 12: 570-583.
- Labov (1972)
W. Labov (1972).
Sociolinguistic patterns.
University of Pennsylvania Press, Pennsylvania.
- Labov (1994)
W. Labov (1994).
Principles of linguistic change. Volume 1: Internal
Blackwell, Oxford.
- Labrador & Dinesh (1984)
C. Labrador & P. Dinesh (1984).
Experiments in speech interaction with conventional data services.
Interact '84, 104-108.
- Lacouture & Normandin (1993)
R. Lacouture & Y. Normandin (1993).
Efficient lexical access strategies.
In: Proceedings of the European Conference on
Speech Technology.
- Ladefoged (1975)
P. Ladefoged (1975).
A course in phonetics.
Harcourt, Brace, Jovanovich, New York.
- Lafferty et al. (1992)
J. Lafferty, D. Sleator & D. Temperley (1992).
Grammatical trigrams: A probabilistic model of link grammars.
In: Proceedings of the AAAI Fall Symposium on
Probabilistic Approaches to Natural Language, Cambridge, MA.
- Langer & Gibbon (1992)
H. Langer & D. Gibbon (1992).
DATR as a graph representation language for ILEX speech oriented
Research Report, March 1992, ASL-TR-43-92/UBI, University of
Bielefeld, Bielefeld, Germany.
- Langeweg (1988)
S. Langeweg (1988).
The stress system of Dutch.
Doctoral dissertation, Leiden University, Leiden.
- Larmouth (1986)
D. Larmouth (1986).
The legal and ethical status of surreptitious recording in dialect
research: Do human subjects guidelines apply?
In: D. Larmouth, T. Murray & C. Murray, eds.,
Legal and ethical issues in surreptitious recording,
Publication of the American Dialect Society, number 76. University of Alabama
Press, Tuscaloosa and London.
- Lau et al. (1993)
R. Lau, R. Rosenfeld & S. Roukos (1993).
Trigger-based language models: A maximum entropy approach.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
II, 45-48, Minneapolis, MN, April.
- Laver (1991)
J. Laver (1991).
The gift of speech.
Papers in the analysis of speech and voice, Edinburgh University
Press, Edinburgh.
- Laver (1994)
J. Laver (1994).
Principles of phonetics.
Cambridge University Press, Cambridge.
- Laver et al. (1988)
J. Laver, J. McAllister, M. McAllister & M. Jack (1988).
A Prolog-based automatic text-to-phoneme conversion system for
British English.
In: Proceedings of the Second Symposium on
Advanced Man-Machine Interface through Spoken Language, November 19-22,
- Laver et al. (1989)
J. Laver, M. McAllister & J. McAllister (1989).
Pre-processing of anomalous text-strings in an automatic
text-to-speech system.
In: S. Ramsaran, ed., Studies in the
pronunciation of English: A commemorative volume in memory of A.C. Gimson.
Croom Helm, London.
- Lea (1980)
W. Lea, ed. (1980).
Trends in speech recognition.
Prentice-Hall, Englewood Cliffs, NJ.
- Lee et al. (1990)
K.-F. Lee, H.-W. Hon & R. Reddy (1990).
An overview of the SPHINX speech recognition system.
In: A. Waibel & K.-F. Lee, eds.,
Readings in speech recognition, 600-610. Morgan Kaufmann
Publishers, San Mateo, California.
- Leggett & Williams (1984)
J. Leggett & G. Williams (1984).
An empirical investigation of voice as an input modality for computer
International Journal of Man-Machine Studies 21: 493-520.
- Lehiste (1970)
I. Lehiste (1970).
MIT Press, Cambridge, Mass.
- Lehiste et al. (1976)
I. Lehiste, J. Olive & L. Streeter (1976).
Role of duration in disambiguating syntactically ambiguous sentences.
Journal of the Acoustical Society of America, JASA 60:
- Lehmann (1983)
E. Lehmann (1983).
Theory of point estimation.
J. Wiley, New York.
- Lehnert & Giron (1995)
H. Lehnert & F. Giron (1995).
Vocal communication in virtual environments.
In: Conference documentation of Virtual
Reality World '95, 279-293, Stuttgart/Germany.
- Lesser et al. (1975)
V. Lesser, R. Fennell, L. Erman & D. Reddy (1975).
Organization of the HEARSAY-II speech understanding system.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP-23,
- Levelt (1989)
J. Levelt (1989).
Speaking: From intention to articulation.
ACL-MIT Press Series in Natural Language Processing. Bradford Book -
The MIT-Press, Cambridge Massachusetts, London, England.
- Levinson et al. (1983)
S. Levinson, L. Rabiner & M. Sondhi (1983).
An introduction to the application of the theory of probabilistic
functions of a Markov process to automatic speech recognition.
The Bell System Technical Journal, April, 62(4): 1035-1074.
- Life et al. (1988)
M. Life, M. Lee & J. Long (1988).
Assessing the usability of future speech technology: Towards a
In: Speech '88: 7th FASE Symposium,
- Likert (1932)
R. Likert (1932).
A technique for the measurement of attitudes.
Archives of Psychology 140.
- Linggard (1985)
R. Linggard (1985).
Electronic synthesis of speech.
Cambridge University Press, Cambridge.
- Llisterri (1994)
J. Llisterri (1994).
Prosody Encoding Survey, Multext - LRE Project 62-050.
- Llisterri & Mariño (1993)
J. Llisterri & J. Mariño (1993).
Spanish adaptation of SAMPA and automatic phonetic transcription.
In: ESPRIT Project 6819 (SAM-A),
Speech technology assessment in multilingual applications, Year 1, 1
April 1993-30 September 1993, 1-9. London.
SAM-A periodic progress report, Document No: SAM-A/UPC/001/V1.
- Logan et al. (1989)
J. Logan, B. Greene & D. Pisoni (1989).
Measuring the segmental intelligibility of synthetic speech produced
by ten text-to-speech systems.
Journal of the Acoustical Society of America, JASA 86:
- Logan et al. (1985)
J. Logan, D. Pisoni & B. Greene (1985).
Measuring the segmental intelligibility of synthetic speech:
Results from eight text-to-speech systems.
Research on speech perception Progress Report 11, 3-31, Indiana
University, Indianapolis.
- Loman & Boves (1993)
H. Loman & L. Boves (1993).
Development of rule based synthesis for text-to-speech.
In: V. Van Heuven & L. Pols, eds.,
Analysis and synthesis of speech, strategic research towards
high-quality text-to-speech generation, 157-168. Mouton de
Gruyter, Berlin.
- Lowerre & Reddy (1980)
B. Lowerre & R. Reddy (1980).
The HARPY speech understanding system.
In: W. Lea, ed., Trends in speech
recognition, 340-360. Prentice Hall, Englewood Cliffs, NJ.
Also in: A. Waibel, K.-F. Lee, eds., (1990), Readings in speech
recognition, Morgan Kaufmann Publishers, San Mateo, California, 576-586.
- Luce et al. (1983)
P. Luce, T. Feustel & D. Pisoni (1983).
Capacity demands in short-term memory for synthetic and natural word
Human Factors 25: 17-32.
- Luzzati & Néel (1989)
D. Luzzati & F. Néel (1989).
Dialogue behaviour induced by machine.
In: Proceedings of the Eurospeech '89,
2, 601-604, Paris.
- Lyons (1977)
J. Lyons (1977).
Semantics. Volumes I and II.
Cambridge University Press, Cambridge.
- Maassen & Povel (1985)
B. Maassen & D.-J. Povel (1985).
The effect of segmental and suprasegmental corrections on the
intelligibility of deaf speech.
Journal of the Acoustical Society of America, JASA 78:
- MacDermid (1993)
C. MacDermid (1993).
Features of naive callers' dialogues with a simulated speech
understanding and dialogue system.
In: Proceedings of the 3rd European Conference
on Speech Communication and Technology, 955-958, Berlin,
- MacWhinney (1995)
B. MacWhinney (1995).
The CHILDES Project: Tools for analyzing talk.
Lawrence Erlbaum, Hillsdale, NJ.
- Manous et al. (1985)
L. Manous, M. Dedina, H. Nusbaum & D. Pisoni (1985).
Speeded sentence verification of natural and synthetic speech.
Research on Speech Perception Progress Report 11, Indiana
University, Indianapolis.
- Marascuilo & Serlin (1988)
L. Marascuilo & R. Serlin (1988).
Statistical methods for the social and behavioral sciences.
Freeman and Company, New York.
- Mariani (1989)
J. Mariani (1989).
Recent advances in speech processing.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
- Marslen-Wilson (1989)
W. Marslen-Wilson, ed. (1989).
Lexical representation and process.
The MIT Press, Cambridge, Massachusetts and London, England.
- Mérialdo (1988)
B. Mérialdo (1988).
Multi-level decoding for very-large-size-dictionary speech
IBM Journal of Research and Development 32(2): 169-301.
- Michaelis & Strube (1995)
D. Michaelis & H. Strube (1995).
Orthogonale akustische Stimmgüteparameter zur
Fortschritte der Akustik - DAGA '95 to be printed.
- Monaghan & Ladd (1989)
A. Monaghan & D. Ladd (1989).
Evaluating intonation in the CSTR text-to-speech system.
In: Proceedings of the ESCA Workshop on Speech
I/O Assessment and speech databases, Noordwijkerhout.
- Monaghan & Ladd (1990)
A. Monaghan & D. Ladd (1990).
Symbolic output as the basis for evaluating intonation in
text-to-speech systems.
Speech Communication 9: 305-314.
- Moody (1991)
A. Moody (1991).
Speaker verification.
Internal Report, January 1991, Ensigma Ltd.
- Moore (1977)
R. Moore (1977).
Evaluating speech recognizers.
IEEE Transactions on Acoustics, Speech and Signal Processing
25(2): 178-183.
- Moore (1986)
R. Moore (1986).
The NATO research study group on speech processing: RSG10.
In: Proceedings of the Speech Tech'86,
201-203, New York, 28-30 April 1986.
- Moore (1988)
R. Moore (1988).
The technology of speech recognition.
In: Proceedings of the CCTA/Blenheim-Online
Conference on Knowledge Based Systems in Government, Bristol, 8-10 November
- Moore (1991)
R. Moore (1991).
International coordination of research standards in speech science
and technology.
In: Proceedings of the ICSLP-90 Workshop on
International Coordination of Spoken Language Database and Assessment
Techniques for Speech Input/Output, Kobe, Japan, November 1991.
- Moore (1992a)
R. Moore (1992a).
Speech recognition: Available assessment methods and needs for
In: Proceedings of the Workshop on
International Cooperation and Standardisation of Spoken Language Databases
and Speech I/O Assessment Techniques, Chiavari, Italy, 26-28 September
- Moore (1992b)
R. Moore (1992b).
User needs in speech research.
In: Proceedings of the Workshop on European
Textual Corpora, Pisa, Italy, 23-26 January 1992.
- Moore (1994a)
R. Moore (1994a).
The "Capability Profile".
DRA-CSE Research Note DRA CIS CSE1 RN94/08, August 1994, DRA Speech
Research Unit, Malvern, Worcs., UK.
- Moore (1994b)
R. Moore (1994b).
The EAGLES working group on spoken language, Advanced
Speech Applications. European research on speech technology.
In: K. Varghese, S. Pfleger & J. Lefevre, eds.,
Research Reports ESPRIT Volume 1. Springer-Verlag, Berlin.
- Mori et al. (1992)
S. Mori, C. Suen & K. Yamamoto (1992).
Historical review of OCR research and development.
Proceedings of the IEEE, July, 80(7): 1029-1058.
- Morimoto et al. (1990)
T. Morimoto, K. Shikano, H. Iida & A. Kurematsu (1990).
Integration of speech recognition and language processing in the
spoken language translation system SL-TRANS.
In: Proceedings of the International
Conference on Spoken Language Processing, ICSLP, 921-928, Kyoto.
- Moulines & Charpentier (1990)
E. Moulines & F. Charpentier (1990).
Pitch synchronous waveform processing techniques for text-to-speech
synthesis using diphones.
Speech Communication 9: 453-467.
- Müller & Runge (1993)
C. Müller & F. Runge (1993).
Dialogue design principles - key for usability of voice processing.
In: Proceedings of the 3rd European Conference
on Speech Communication and Technology, 943-946, Berlin,
- Murray & Arnott (1993)
I. Murray & J. Arnott (1993).
Toward the simulation of emotion in synthetic speech: A review of
the literature on human vocal emotion.
Journal of the Acoustical Society of America, JASA 93:
- Murray & Murray (1986)
T. Murray & C. Murray (1986).
On the legality and ethics of surreptitious recording.
In: D. Larmouth, T. Murray & C. Murray, eds.,
Legal and ethical issues in surreptitious recording,
Publication of the American Dialect Society, number 76. University of Alabama
Press, Tuscaloosa and London.
- Murveit et al. (1993)
H. Murveit, J. Butzberger, V. Digalakis & M. Weintraub (1993).
Large vocabulary dictation using SRI's Decipher speech
recognition system: Progressive search techniques.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
II, 319-322, Minneapolis, MN, April.
- Nadas (1984)
A. Nadas (1984).
Estimation of probabilities in the language model of the IBM speech
recognition system.
IEEE Transactions on Acoustics, Speech and Signal Processing,
August, 32: 859-861.
- Nadas (1985)
A. Nadas (1985).
On Turing's formula for word probabilities.
IEEE Transactions on Acoustics, Speech and Signal Processing,
December, 33: 1414-1416.
- Nespor & Vogel (1986)
M. Nespor & I. Vogel (1986).
Prosodic phonology.
Foris, Dordrecht.
- Newell (1978)
A. Newell (1978).
The palantype transcription unit - its history and progress to date.
Hearing, 99-104.
- Newell (1989)
A. Newell (1989).
Speech simulation studies - performance and dialogue specification.
In: J. Peckham, ed., Recent developments
and applications of natural language processing, 141-157. Kogan
Page, London.
- Ney (1984)
H. Ney (1984).
The use of a one-stage dynamic programming algorithm for connected
word recognition.
IEEE Transactions on Acoustics, Speech and Signal Processing,
April, 32(2): 263-271.
- Ney & Aubert (1994)
H. Ney & X. Aubert (1994).
A word graph algorithm for large vocabulary, continuous speech
In: Proceedings of the International
Conference on Spoken Language Processing, ICSLP, 1355-1358,
Yokohama, Japan, September.
- Ney & Essen (1993)
H. Ney & U. Essen (1993).
Estimating small probabilities by leaving-one-out.
In: Third European Conference on Speech
Communication and Technology, 2239-2242, Berlin, September.
- Ney et al. (1994)
H. Ney, U. Essen & R. Kneser (1994).
On structuring probabilistic dependencies in language modelling.
Computer Speech and Language 8: 1-38.
- Ney et al. (1992)
H. Ney, D. Mergel, A. Noll & A. Paeseler (1992).
Data driven search organization for continuous speech recognition.
IEEE Transactions on Signal Processing, February, 40(2):
- Ney et al. (1988)
H. Ney, D. Mergel, A. Noll & A. Paeseler (1988).
Overview of speech recognition in the SPICOS system.
In: H. Niemann, M. Lang & G. Sagerer, eds.,
Recent advances in speech understanding and dialog systems,
NATO ASI Series F, vol. 46, 305-310.
Springer-Verlag, Berlin.
- Niemann et al. (1985)
H. Niemann, A. Brietzmann, R. Mühlfeld, P. Regel & G. Schukat (1985).
The speech understanding and dialog system EVAR.
In: R. De Mori & C. Suen, eds.,
New systems and architectures for automatic speech recognition and
synthesis, NATO ASI Series F, vol. 16, 271-302. Springer-Verlag,
- Niemann et al. (1992)
H. Niemann, E. Nöth, M. Mast & E. Schukat-Talamazzini (1992).
Ein Lexikon für ein natürlich-sprachliches Dialogsystem.
In: Beiträge des ASL-Lexikonworkshops,
15-18, Wandlitz, 26-27 November.
- Nolan (1987)
F. Nolan (1987).
The limits of segmental description.
In: Proceedings of the Eleventh International
Conference of Phonetic Sciences, 5, 411-414, 1-7
August 1987, Tallinn, Estonia.
- Nooteboom & Kruijt (1987)
S. Nooteboom & J. Kruijt (1987).
Accents, focus distribution, and the perceived distribution of given
and new information.
Journal of the Acoustical Society of America, JASA 82:
- Nossin (1991)
M. Nossin (1991).
Le projet GENELEX: EUREKA pour les dictionnaires
Génie Linguistique 91, volume 1.
Versailles, France, 16-17 January 1991.
- Nunn & Van Heuven (1993)
A. Nunn & V. Van Heuven (1993).
MORPHON: Lexicon-based text-to-phoneme conversion and
phonological rules.
In: V. Van Heuven & L. Pols, eds.,
Analysis and synthesis of speech, strategic research towards
high-quality text-to-speech generation, 88-113. Mouton de Gruyter,
- Nusbaum et al. (1986)
H. Nusbaum, S. Greenspan & D. Pisoni (1986).
Perceptual attention in monitoring natural and synthetic speech.
Research on Speech Perception Progress Report 12, Indiana
University, Indianapolis.
- Nye & Gaitenby (1974)
P. Nye & J. Gaitenby (1974).
The intelligibility of synthetic monosyllabic words in short,
syntactically normal sentences.
Haskins Laboratories Status Report on Speech Research, 37/38, pages
- Nye et al. (1975)
P. Nye, F. Ingemann & L. Donald (1975).
Synthetic speech comprehension: A comparison of listener
performances with and preferences among different speech forms.
Haskins Laboratories Status Report on Speech Research, 41.
- Oerder & Ney (1993)
M. Oerder & H. Ney (1993).
Word graphs: An efficient interface between continuous speech
recognition and language understanding.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
II, 119-122, Minneapolis, MN, April.
- Oglesby (1994)
J. Oglesby (1994).
What's in a number? Moving beyond the equal error rate.
To appear in Speech Communication, August 1995. Preliminary version
published in Martigny ETRW, pp. 87-90.
- Olsen & Olsen (1990)
G. Olsen & J. Olsen (1990).
User-centered design of Collaborative Technology.
Cognitive Science and Machine Intelligence Laboratory.
To appear in Organizational Computing 32.
- O'Malley & Caisse (1987)
M. O'Malley & M. Caisse (1987).
How to evaluate text-to-speech systems.
Speech Technology 3: 66-75.
- O'Neill (1975)
J. O'Neill (1975).
Measurement of hearing by tests of speech and language.
In: S. Singh, ed., Measurement procedures
in speech, hearing, and language, 219-252. University Park Press,
- Oppenheim (1978)
A. Oppenheim (1978).
Applications of digital signal processing.
Prentice-Hall, Englewood Cliffs, N.J.
- O'Shaughnessy (1986)
D. O'Shaughnessy (1986).
Speaker recognition.
IEEE ASSP Magazine, 4-17.
- O'Shaughnessy (1987)
D. O'Shaughnessy (1987).
Speech communication - human and machine.
Addison-Wesley, New York.
- Pallet et al. (1990)
D. Pallet, W. Fisher & J. Garofolo (1990).
DARPA ATIS results, June 1990.
In: Proceedings of the DARPA Workshop on
Speech and Natural Language, 114-121, Hidden Valley, PA, June.
- Pallett (1985)
D. Pallett (1985).
Performance assessment of automatic speech recognizers.
Journal of the National Bureau of Standards 90(5).
September-October 1985.
- Parducci (1965)
A. Parducci (1965).
Category judgement: A range-frequency model.
Psychological Review 72: 407-418.
- Pavlovic et al. (1990)
C. Pavlovic, M. Rossi & R. Espesser (1990).
Use of the magnitude estimation technique for assessing the
performance of text-to-speech synthesis system.
Journal of the Acoustical Society of America, JASA 87:
- Pavlovic et al. (1991)
C. Pavlovic, M. Rossi & R. Espesser (1991).
Perceived spectral energy distributions for EUROM-0 speech and for
some synthetic speech.
In: Proceedings of the 12th International
Congress of Phonetic Sciences, 5, 418-421,
- Peckels & Rossi (1973)
J. Peckels & M. Rossi (1973).
Le test diagnostic par paires minimales. Adaptation au
Français du "Diagnostic Rhyme Test" de W.D. Voiers.
Revue d'Acoustique 27: 245-262.
- Peckham (1990)
J. Peckham (1990).
An overview of speaker verification technology and application over
the telephone.
In: Proceedings of the Voice System
Worldwide, 166.
- Peckham (1993)
J. Peckham (1993).
A new generation of spoken dialogue systems: Results and lessons
from the SUNDIAL project.
In: Proceedings of the 3rd European Conference
on Speech Communication and Technology, 33-40, Berlin, September.
- Peckham & Thomas (1990)
J. Peckham & T. Thomas (1990).
Recognizer sensitivity analysis: A method for assessing the
performance of speech recognizers.
Speech Communication 9: 317-328.
- Pérennou et al. (1991)
G. Pérennou, D. Cotto, M. De Calmès, I. Ferrané, J. Pécatte & J. Tihoni (1991).
Composantes phonologique et orthographique de BDLEX.
In: Deuxièmes Journées Nationales du
GRECO-PRC Communication Homme-Machine, 351-362, Toulouse, 29-30
- Pérennou et al. (1992)
G. Pérennou, D. Cotto, M. De Calmès, I. Ferrané & J.-M. Pécatte (1992).
Le projet BDLEX de base de données lexicales du Français
écrit et parlé.
In: Proceedings Communication Homme-Machine,
Séminaire LEXIQUE, 153-171, 21-22 January 1992, IRIT-UPS
- Pérennou & De Calmès (1987)
G. Pérennou & M. De Calmès (1987).
BDLEX lexical data and knowledge base of spoken and written
In: European Conference on Speech Technology,
1, 393-396, Edinburgh.
- Pérennou & Tihoni (1992)
G. Pérennou & J. Tihoni (1992).
Lexique et phonologie en reconnaissance de la parole.
In: Proceedings Communication Homme-Machine,
Séminaire LEXIQUE, 41-57, 21-22 January 1992, IRIT-UPS
- Perkins (1977)
W. Perkins (1977).
Speech pathology, an applied behavioral science.
The C.V. Mosby Company, Saint Louis.
- Philips et al. (1987)
S. Philips, S. Stelle & C. Tanz (1987).
Language, gender and sex in comparative perspective.
Cambridge University Press, Cambridge.
- Pieraccini et al. (1993)
R. Pieraccini, E. Levin & E. Vidal (1993).
Learning how to understand language.
In: Third European Conference on Speech
Communication and Technology, 1407-1412, Berlin, September.
- Pierce (1991)
A. Pierce (1991).
Acoustics: An introduction to its physical principles and
McGraw Hill, Inc., New York.
- Pisoni et al. (1985a)
D. Pisoni, B. Greene & H. Nusbaum (1985a).
Perception of synthetic speech generated by rule.
Proceedings of the IEEE 73: 1665-1676.
- Pisoni et al. (1985b)
D. Pisoni, B. Greene & H. Nusbaum (1985b).
Some human factors issues in the perception of synthetic speech.
In: Proceedings Speech Tech '85,
57-61, New York.
- Pitrelli et al. (1994)
J. Pitrelli, M. Beckman & J. Hirschberg (1994).
Evaluation of prosodic transcription labeling reliability in the
ToBI framework.
In: Proceedings of the International
Conference on Spoken Language Processing, ICSLP, 18-22 September 1994,
Yokohama, Japan.
- Plenat (1991)
M. Plenat (1991).
Vers une phonémisation des sigles.
In: Deuxièmes journées du GDR-PRC
Communication Homme-Machine, EC2 Editeur, 363-371, Toulouse,
29-30 January.
- Plomp & Mimpen (1979)
R. Plomp & A. Mimpen (1979).
Improving the reliability of testing the speech reception threshold
for sentences.
Audiology 8: 43-52.
- Pols (1991)
L. Pols (1991).
Quality assessment of text-to-speech synthesis-by-rule.
In: S. Furui & M. Sondhi, eds.,
Advances in speech signal processing, 387-416. Marcel Dekker
Inc., New York.
- Pols et al. (1987)
L. Pols, J.-P. Lefevre, G. Boxelaar & N. Van Son (1987).
Word intelligibility of a rule synthesis system for French.
In: Proceedings of the European Conference on
Speech Technology, 1, 179-182, Edinburgh.
- Ponamale et al. (1990)
M. Ponamale, E. Bilange, K. Choukri & S. Soudoplatoff (1990).
A computer-aided approach to the design of an oral dialogue system.
In: Proceedings of Eastern Multiconference,
- Portele et al. (1994)
T. Portele, B. Heuft, F. Höfer, H. Meyer & W. Hess (1994).
A new high quality speech synthesis system for German.
In: Proceedings Yokohama/New Paltz.
- Pratt (1987)
R. Pratt (1987).
Quantifying the performance of text-to-speech synthesizers.
Speech Technology, 54-64.
- Price (1990)
P. Price (1990).
Evaluation of spoken language systems: The ATIS domain.
In: Proceedings of the DARPA Workshop on
Speech and Natural Language, 91-95, Hidden Valley, PA, June.
- Quené (1993)
H. Quené (1993).
Segment durations and accent as cues to word segmentation in Dutch.
Journal of the Acoustical Society of America, JASA 94:
- Rabiner & Schafer (1978)
L. Rabiner & R. Schafer (1978).
Digital processing of speech signals.
Prentice-Hall, Englewood Cliffs, N.J.
- Radford (1988)
A. Radford (1988).
Transformational grammar: A first course.
CUP, Cambridge.
- Ralston et al. (1991)
J. Ralston, D. Pisoni, S. Lively, B. Greene & J. Mullennix (1991).
Comprehension of synthetic speech produced by rule: Word monitoring
and sentence-by-sentence listening times.
Human Factors 33: 471-491.
- Rayner et al. (1993)
M. Rayner, H. Alshawi, I. Breton, D. Carter, V. Digalakis, B. Gamback, J. Kaja, J. Karlgren, B. Lyberg, S. Pulman, P. Price & C. Samuelsson (1993).
A speech to speech translation system built from standard components.
In: Proceedings of a Workshop: Human
Language Technology, 217-222, Princeton, NJ, 21-24 March.
- Reilly (1987)
R. Reilly (1987).
Ill-formedness and mis-communication in person-machine dialogue.
Information and Software Technology 29: 69-74.
- Reyelt et al. (1996)
M. Reyelt, M. Grice, R. Benzmüller, J. Mayer & A. Batliner (1996).
Prosodische Etikettierung des Deutschen mit ToBI.
In: D. Gibbon, ed., Natural language processing and speech technology. Results of the 3rd KONVENS Conference, Bielefeld, October 1996, 144-155. Mouton de Gruyter, Berlin, New York.
- Reynolds (1994)
D. Reynolds (1994).
Speaker identification and verification using Gaussian mixture
speaker models.
To appear in Speech Communication, August 1995. Preliminary version
published in ETRW Martigny, pp. 27-30.
- Richards & Underwood (1984a)
M. Richards & K. Underwood (1984a).
How should people and computers speak to each other?
Interact '84, 33-36.
- Richards & Underwood (1984b)
M. Richards & K. Underwood (1984b).
Talking to machines. How are people naturally inclined to speak?
In: E. Megaw, ed., Contemporary
Ergonomics. Taylor and Francis, London.
- Ritchie et al. (1992)
G. Ritchie, A. Black, G. Russell & S. Pulman (1992).
Computational morphology.
The MIT Press, Cambridge, Massachusetts and London.
- Roach et al. (1993)
P. Roach, G. Knowles, T. Varadi & S. Arnfield (1993).
MARSEC: A machine-readable Spoken English corpus.
Journal of the International Phonetic Association 23(2):
- Roach et al. (1990)
P. Roach, H. Roach, A. Dew & P. Rowlands (1990).
Phonetic analysis and the automatic segmentation and labeling of
speech sounds.
Journal of the International Phonetic Association 20(1):
- Roe & Wilpon (1994)
D. Roe & J. Wilpon (1994).
Voice communication between humans and machines.
National Academy Press, Washington.
- Roelofs (1987)
J. Roelofs (1987).
Synthetic speech in practice: Acceptance and efficiency.
Behaviour and Information Technology 6: 403-410.
- Rose (1971)
D. Rose (1971).
Audiological assessment.
Prentice-Hall International, Inc., London.
- Rosenberg (1973)
A. Rosenberg (1973).
Listener performance in speaker verification tasks.
IEEE Transactions on Audio and Electroacoustics 21: 221-225.
- Rosenberg (1976)
A. Rosenberg (1976).
Automatic speaker verification: A review.
Proceedings of the IEEE, April, 64(4): 475.
- Rosenfeld (1994)
R. Rosenfeld (1994).
Adaptive statistical language modeling: A maximum entropy
Ph.D. Thesis, School of Computer Science, Carnegie Mellon
University, Pittsburgh, PA.
- Rossi (1988)
M. Rossi (1988).
Acoustics and electroacoustics.
Artech House, Norwood, MA, USA.
- Rowden (1992)
C. Rowden (1992).
Speech processing.
McGraw-Hill Book Company, London.
- Rudnicky et al. (1987)
A. Rudnicky, L. Baumeister, K. De Graff & E. Lehmann (1987).
The lexical access component of the CMU continuous speech
recognition system.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP.
- Ruske (1985)
G. Ruske (1985).
Demisyllables as processing units for automatic speech recognition
and lexical access.
In: R. De Mori & C. Suen, eds.,
New systems and architectures for automatic speech recognition and
synthesis, NATO ASI Series F, vol. 16,
593-611. Springer-Verlag, Berlin.
- Ruske & Schotola (1981)
G. Ruske & T. Schotola (1981).
The efficiency of demisyllable segmentation in the recognition of
spoken words.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
971-974, Atlanta.
- Sacks et al. (1974)
H. Sacks, E. Schegloff & G. Jefferson (1974).
A simplest systematics for the organization of turn-taking in
Language 50: 697-735.
- Sagerer (1990)
G. Sagerer (1990).
Automatisches Verstehen gesprochener Sprache.
Reihe Informatik, vol. 74.
Bibliographisches Institut, Mannheim.
- Sakoe (1979)
H. Sakoe (1979).
Two-level DP matching - A dynamic programming-based pattern
matching algorithm for connected word recognition.
IEEE Transactions on Acoustics, Speech and Signal Processing
27: 588-595.
- Salza et al. (1993)
P. Salza, G. Di Fabbrizio, M. Oreglia, M. Falcone, C. Sementina & C. Delogu (1993).
Development of a context dependent methodology for text-to-speech
synthesis evaluation in interactive dialogue systems.
In: ESPRIT Project 6819 (SAM-A),
Speech technology assessment in multilingual applications. London.
Report R2, SAM-A Periodic Progress Report. Year 1, 1 April 1993-30
September 1993.
- SAM (1992)
SAM (1992).
Multi-lingual speech input/output assessment, methodology and standardisation.
ESPRIT project 2589 (SAM), Final report, Year three,
1 III 91-28 II 1992, Ref: SAM-UCL-G004, University College London, London.
- SAM-A (1993)
SAM-A (1993).
Speech technology assessment in multilingual applications.
ESPRIT Project 6819 (SAM-A), Report No. 2, Year 1, Ref SAM-A/G002.
- Scharpff & Van Heuven (1988)
P. Scharpff & V. Van Heuven (1988).
Effects of pause insertion on the intelligibility of low quality
In: Proceedings of the 7th FASE/Speech '88
Symposium, 261-269, Edinburgh.
- Scherer & Giles (1979)
K. Scherer & H. Giles, eds. (1979).
Social markers in speech.
Cambridge University Press, Cambridge.
- Schmidt & Watson (1991)
M. Schmidt & G. Watson (1991).
The evaluation and optimization of automatic speech segmentation.
In: Proceedings of the Second European
Conference on Speech Communication and Technology, Eurospeech 91,
2, 701-704, 24-26 September 1991, Genova, Italy.
- Schröder et al. (1987)
S. Schröder, G. Sagerer & H. Niemann (1987).
Wissensakquisition mit semantischen Netzwerken.
In: E. Paulus, ed., Mustererkennung 87,
9. DAGM-Symposium Braunschweig, Informatik-Fachberichte, 305-309.
Springer-Verlag, Berlin.
- Schukat-Talamazzini (1993)
E. Schukat-Talamazzini (1993).
Automatische Spracherkennung.
Habilitationsschrift, Erlangen University, Erlangen, Germany.
- Schwab et al. (1985)
E. Schwab, H. Nusbaum & D. Pisoni (1985).
Some effects of training on the perception of synthetic speech.
Human Factors 27(4): 395-408.
- Schwartz & Austin (1991)
R. Schwartz & S. Austin (1991).
A comparison of several approximate algorithms for finding multiple
(n-best) sentence hypotheses.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
701-704, Toronto, May.
- Searle (1969)
J. Searle (1969).
Speech acts: An essay in the philosophy of language.
Cambridge University Press, Cambridge.
- Searle (1979)
J. Searle (1979).
Expression and meaning.
Cambridge University Press, Cambridge.
- Sells (1985)
P. Sells (1985).
Lectures on contemporary syntactic theories: An introduction to
Government-Binding theory, Generalized Phrase Structure Grammar, and
Lexical-Functional Grammar.
CSLI Center for the Study of Language and Information, Stanford,
- Siegel (1956)
S. Siegel (1956).
Nonparametric statistics for the behavioral sciences.
McGraw-Hill, New York.
- Silverman et al. (1990)
K. Silverman, S. Basson & S. Levas (1990).
Evaluating synthesizer performance: Is segmental intelligibility
In: Proceedings of the International
Conference on Spoken Language Processing, ICSLP, 981-984, Kobe.
- Silverman et al. (1992)
K. Silverman, M. Beckman, J. Pitrelli, M. Ostendorf, C. Wightman, P. Price, J. Pierrehumbert & J. Hirschberg (1992).
ToBI: A standard for labeling English prosody.
In: Proceedings of the 1992 International
Conference on Spoken Language Processing, ICSLP, 2,
867-870, 12-16 October 1992, Banff, Canada.
- Simpson & Fraser (1993)
A. Simpson & N. Fraser (1993).
Black box and glass box evaluation of the SUNDIAL system.
In: Proceedings of the 3rd European Conference
on Speech Communication and Technology, 1423-1426, Berlin,
- Simpson & Ruth (1987a)
C. Simpson & J. Ruth (1987a).
The phonetic discrimination test for speech recognizers: Part I.
Speech Technology March/April.
- Simpson & Ruth (1987b)
C. Simpson & J. Ruth (1987b).
The phonetic discrimination test for speech recognizers: Part II.
Speech Technology October/November.
- Skinner et al. (1992)
T. Skinner, J. Holt & N. Nguyen (1992).
Automatic identity confirmation and adaptive solutions.
Speech Technology 106-111.
February 1992.
- Smith (1979)
P. Smith (1979).
Sex markers in speech.
In: K. Scherer & H. Giles, eds.,
Social markers in speech, 109-146. Cambridge University
Press, Cambridge.
- Smith et al. (1992)
R. Smith, D. Hipp & A. Biermann (1992).
A dialog control algorithm and its performance.
In: Proceedings of the 3rd Conference on
Applied Natural Language Processing, 9-16, Trento, April.
- Soclof (1990)
M. Soclof (1990).
A comparison of spontaneous speech and read speech in human-machine
problem solving dialogues.
Massachusetts Institute of Technology.
- Soong & Huang (1991)
F. Soong & E.-F. Huang (1991).
A Tree-Trellis Fast Search for finding the n-best sentence
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
705-708, Toronto, May.
- Soong et al. (1987)
F. Soong, A. Rosenberg, B. Juang & L. Rabiner (1987).
A Vector Quantization approach to speaker recognition.
AT&T Technical Journal 66(2).
- Sorin (1994)
C. Sorin (1994).
Towards high-quality multilingual text-to-speech.
In: Proceedings of the CRIM/FORWISS workshop,
53-62, Munich.
Also to appear in H. Niemann, ed., Progress and prospects in research
and technology, Infix Publishing Company, Sankt Augustin.
- Sotscheck (1982)
J. Sotscheck (1982).
Ein Reimtest für Verständlichkeitsmessungen mit deutscher
Sprache als ein verbessertes Verfahren zur Bestimmung der
Der Fernmeldung 36: 1-84.
- Sperberg-McQueen & Burnard (1994)
C. Sperberg-McQueen & L. Burnard, eds. (1994).
Guidelines for electronic text encoding and interchange.
TEI P3. Chapter 1 Transcription of Speech. Association for
Computational Linguistics, Association for Computers and the Humanities,
Association for Literary and Linguistic Computing, Chicago and Oxford.
- Spiegel et al. (1990)
M. Spiegel, M. Altom, M. Macchi & K. Wallace (1990).
Comprehensive assessment of the telephone intelligibility of
synthesized and natural speech.
Speech Communication 9: 279-291.
- Sproat et al. (1992)
R. Sproat, J. Hirschberg & D. Yarowsky (1992).
A corpus-based synthesizer.
In: Proceedings of the 2nd International
Conference on Spoken Language Processing, ICSLP, 1,
563-566, Banff.
- Steeneken (1982)
H. Steeneken (1982).
Ontwikkeling en toetsing van een Nederlandstalige Diagnostische
Rijmtest voor het testen van spraakcommunicatiekanalen.
Rapport IZF 1982-13, IZF, Soesterberg.
- Steeneken (1987)
H. Steeneken (1987).
Diagnostic information from subjective and objective intelligibility
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP, Dallas.
- Steeneken (1989)
H. Steeneken (1989).
Objective and diagnostic assessment of (isolated) word recognizers.
In: Proceedings of the European Speech
Conference ESCA, Paris.
- Steeneken (1991)
H. Steeneken (1991).
RAMOS - Recognizer Assessment by means of Manipulation Of
Speech applied.
In: Proceedings of the European Speech
Conference ESCA, Genova.
- Steinbiss et al. (1994)
V. Steinbiss, B.-H. Tran & H. Ney (1994).
Improvements in beam search.
In: Proceedings of the International
Conference on Spoken Language Processing, ICSLP, 2143-2146,
Yokohama, Japan, September.
- Stevens et al. (1968)
K. Stevens, C. Williams, J. Carbonell & B. Woods (1968).
Speaker authentication and identification: A comparison of
spectrographic and auditory presentations of speech material.
Journal of the Acoustical Society of America, JASA 44: 1596-1607.
- Stubbs (1984)
M. Stubbs (1984).
Discourse analysis. The sociolinguistic analysis of natural
Blackwell, Oxford.
- Sundheim (1991)
B. Sundheim (1991).
Third message understanding evaluation and conference (MUC-3):
Phase 1 status report.
In: Proceedings of the DARPA Workshop on
Speech and Natural Language, 301-305, Pacific Grove, CA, February.
- Syrdal & Sciacca (1994)
A. Syrdal & B. Sciacca (1994).
Testing the intelligibility of text-to-speech output with the
Diagnostic Pairs Sentence Intelligibility Evaluation.
ITD-94-23828A, Technical Memorandum. Submitted to the Journal of
the Acoustical Society of America, JASA, AT&T Bell Laboratories.
- 't Hart et al. (1990)
J. 't Hart, R. Collier & A. Cohen (1990).
A perceptual study of intonation.
Cambridge University Press, Cambridge.
- Terken (1985)
J. Terken (1985).
Use and function of accentuation. Some experiments.
Doctoral dissertation, Leiden University, Leiden.
- Terken (1993)
J. Terken (1993).
Human and synthetic intonation: A case study.
In: V. Van Heuven & L. Pols, eds.,
Analysis and synthesis of speech, strategic research towards
high-quality text-to-speech generation, 241-259. Mouton de
Gruyter, Berlin.
- Terken & Collier (1989)
J. Terken & R. Collier (1989).
Automatic synthesis of natural-sounding intonation for text-to-speech
conversion in Dutch.
In: Proceedings of the Eurospeech '89,
1, 357-359, Paris.
- Thielen (1992)
M. Thielen (1992).
Male and female speech.
Ph.D. Thesis, University of Amsterdam, Amsterdam.
- Thorsen (1980)
N. Thorsen (1980).
A study of the perception of sentence intonation - Evidence from
Journal of the Acoustical Society of America, JASA 67:
- Thurmair (1986)
G. Thurmair (1986).
Linguistische Analyse im Projekt SPICOS.
Kleinheubacher Berichte 29.
- Tomlinson (1990)
M. Tomlinson (1990).
Guide to database generation - recording protocol.
In: ESPRIT Project 2589 (SAM),
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
Interim Report Year I, Reference SAM-UCL-G002, Document
- Tosi et al. (1972)
O. Tosi, H. Oyer, W. Asbrook, W. Pedrey, C. Nicol & E. Nash (1972).
Experiment of voice identification.
Journal of the Acoustical Society of America, JASA 51: 2030-2043.
- Tubach & Doignon (1991)
J. Tubach & P. Doignon (1991).
A system for natural spoken language queries: Design,
implementation and assessment.
In: Proceedings of the 2nd European Conference
on Speech Communication and Technology, 1473-1476, Genova,
- Tubach & Bok (1985)
J.-P. Tubach & L.-J. Bok (1985).
ZUT - Petit dictionnaire français.
Institut de Phonétique de Grenoble, avec le concours du CNRS (GRECO
Comm. Parlée), Grenoble.
- Turing (1950)
A. Turing (1950).
Computing machinery and intelligence.
Mind 59: 433-460.
- Valtech et al. (1994)
V. Valtech, J. Odell, P. Woodland & S. Young (1994).
A dynamic network decoder design for large vocabulary speech
In: Proceedings of the International
Conference on Spoken Language Processing, ICSLP, 1351-1354,
Yokohama, Japan, September.
- Van Bezooijen (1986)
R. Van Bezooijen (1986).
Lay ratings of long-term voice-and-speech characteristics.
In: F. Beukema & A. Hulk, eds.,
Linguistics in the Netherlands 1986, 1-7. Foris, Dordrecht.
- Van Bezooijen (1988)
R. Van Bezooijen (1988).
Evaluation of two synthesis systems for Dutch - Development and
applications of intelligibility tests.
SPIN-ASSP Report No. 5, Stichting Spraaktechnologie, Utrecht.
- Van Bezooijen (1989)
R. Van Bezooijen (1989).
Evaluation of the suitability of Dutch text-to-speech conversion
for application in a digital daily newspaper.
In: Proceedings of the ESCA Workshop Speech
I/O Assessment and Speech Databases, 6.3.1-6.3.4, Noordwijkerhout.
- Van Bezooijen & Jongenburger (1993)
R. Van Bezooijen & W. Jongenburger (1993).
Evaluation of an electronic newspaper for the blind in the
Netherlands - intelligibility, acceptability, adequacy, and users' attitudes.
In: Proceedings of the ESCA Workshop on Speech
and Language Technology for Disabled Persons, 195-198, Stockholm.
- Van Bezooijen & Pols (1987)
R. Van Bezooijen & L. Pols (1987).
Evaluation of two synthesis-by-rule systems for Dutch.
In: Proceedings of the European Conference on
Speech Technology, 1, 179-183.
- Van Bezooijen & Pols (1989)
R. Van Bezooijen & L. Pols (1989).
Evaluation of a sentence accentuation algorithm for a Dutch
text-to-speech system.
In: Proceedings of the Eurospeech '89,
1, 218-221, Paris.
- Van Bezooijen & Pols (1990)
R. Van Bezooijen & L. Pols (1990).
Evaluating text-to-speech systems: Some methodological aspects.
Speech Communication 9: 263-270.
- Van Bezooijen & Pols (1993)
R. Van Bezooijen & L. Pols (1993).
Evaluation of text-to-speech conversion for Dutch.
In: V. Van Heuven & L. Pols, eds.,
Analysis and synthesis of speech: Strategic research towards
high-quality text-to-speech conversion, 339-360. Mouton de Gruyter, Berlin.
- Van Bezooijen & Van Hout (1985)
R. Van Bezooijen & R. Van Hout (1985).
Accentedness ratings and phonological variables as measures of
variation in pronunciation.
Language and Speech 28: 129-142.
- Van Coile (1989)
B. Van Coile (1989).
The DEPES development system for text-to-speech synthesis.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
- Van Compernolle et al. (1991)
D. Van Compernolle, J. Smolders, P. Jaspers & T. Hellemans (1991).
Speaker clustering for dialectic robustness in speaker independent
In: Proceedings of Eurospeech '91,
2, 723-726, Genova.
- Van Dommelen (1993)
W. Van Dommelen (1993).
Speaker height and weight identification: A re-evaluation of some
old data.
Journal of Phonetics 21: 337-341.
- Van Hemert et al. (1987)
J. Van Hemert, U. Adriaens-Porzig & L. Adriaens (1987).
Speech synthesis in the SPICOS-project.
In: H. Tillmann & G. Willée, eds.,
Analyse und Synthese gesprochener Sprache: Vorträge im Rahmen der
Jahrestagung 1987 der Gesellschaft für Linguistische Datenverarbeitung
e.V., Bonn, 4-6 March, 34-39. Olms, Hildesheim.
- Van Heuven & Scharpff (1991)
V. Van Heuven & P. Scharpff (1991).
Acceptability of several speech pausing strategies in low quality
speech synthesis; interaction with intelligibility.
In: Proceedings of the 12th International
Congress of Phonetic Sciences, 458-461, Aix-en-Provence.
- Van Holsteijn (1993)
Y. Van Holsteijn (1993).
TextScan: A preprocessing module for automatic text-to-speech
In: V. Van Heuven & L. Pols (eds.),
Analysis and synthesis of speech: Strategic research towards
high-quality text-to-speech generation, 27-41. Mouton de Gruyter, Berlin.
- Van Hout (1989)
R. Van Hout (1989).
De structuur van taalvariatie, een sociolinguistisch onderzoek naar
het stadsdialect van Nijmegen.
Doctoral dissertation, University of Nijmegen, Nijmegen.
- Van Santen (1992)
J. Van Santen (1992).
Diagnostic perceptual experiments for text-to-speech system
In: Proceedings of the International
Conference on Spoken Language Processing, ICSLP, 1,
- Van Santen (1993)
J. Van Santen (1993).
Perceptual experiments for diagnostic testing of text-to-speech
Computer Speech and Language 7: 49-100.
- Van Santen (1994)
J. Van Santen (1994).
Using statistics in text-to-speech system construction.
In: Proceedings of the ESCA/IEEE Workshop on
Speech Synthesis, 240-243, Mohonk NY.
- Van Son et al. (1988)
N. Van Son, L. Pols, S. Sandri & P. Salza (1988).
First quality evaluation of a diphone-based speech synthesis system
for Italian.
In: Proceedings of the 7th FASE/Speech '88
Symposium, 2, 429-436, Edinburgh.
- Vergeynst et al. (1993)
N. Vergeynst, K. Edwards, J. Foster & M. Jack (1993).
Spoken dialogues for human-computer interaction over the telephone:
Complexity measures.
In: Proceedings of the 3rd European Conference
on Speech Communication and Technology, 1415-1418, Berlin,
- Vintsyuk (1971)
T. Vintsyuk (1971).
Elementwise recognition of continuous speech composed of words from a
specified dictionary.
Cybernetics, March-April, 7: 133-143.
- Voiers (1977)
W. Voiers (1977).
Diagnostic evaluation of speech intelligibility.
Speech intelligibility and speaker recognition 2: 374-384.
Benchmark papers in acoustics, M.E. Hawley (ed.).
- Voiers (1983)
W. Voiers (1983).
Evaluating processed speech using the Diagnostic Rhyme Test.
Speech Technology 1: 338-352.
- Voiers et al. (1975)
W. Voiers, A. Sharpley & C. Hehmsoth (1975).
Research on diagnostic evaluation of speech intelligibility.
Research Report AFCRL-72-0694, Air Force Cambridge Research
Laboratories, Bedford, Massachusetts.
- Vroomen et al. (1993)
J. Vroomen, R. Collier & S. Mozziconacci (1993).
Duration and intonation in emotional speech.
In: Proceedings of Eurospeech '93,
1, 577-580, Berlin.
- Wahlster (1993)
W. Wahlster (1993).
VERBMOBIL, translation of face-to-face dialogs.
In: Proceedings of Eurospeech '93, opening
and plenary sessions, 29-38, Berlin.
- Waibel (1988)
A. Waibel (1988).
Prosody and speech recognition.
Research notes in artificial intelligence, Pitman Publishing, London.
- Waibel et al. (1991)
A. Waibel, A. Jain, A. McNair, H. Saito, A. Hauptmann & J. Tebelskis (1991).
A speech-to-speech translation system using connectionist and
symbolic processing strategies.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech, and Signal Processing, ICASSP-91,
- Waibel & Lee (1990)
A. Waibel & K.-F. Lee, eds. (1990).
Readings in speech recognition.
Morgan Kaufmann Publishers, San Mateo, California.
- Wall & Schwartz (1991)
L. Wall & R. Schwartz (1991).
Programming Perl.
O'Reilly & Associates Inc., Sebastopol, CA.
- Webers (1985)
J. Webers (1985).
Franzis, Munich, Germany.
- Wells (1987)
J. Wells (1987).
Computer-coded phonetic transcription.
Journal of the International Phonetic Association 17(2):
- Wells (1989)
J. Wells (1989).
Computer-coded phonemic notation of individual languages of the
European Community.
Journal of the International Phonetic Association 19(1):
- Wells (1993a)
J. Wells (1993a).
Applying SAM-PA to Spanish, Portuguese, and Greek: A
preliminary discussion document.
In: ESPRIT Project 6819 (SAM-A),
Speech technology assessment in multilingual applications. London.
Document No: SAM-A/D1-Appendix B, SAM-A periodic progress report,
Year 1, 1 April 1993-30 September 1993.
- Wells (1993b)
J. Wells (1993b).
An update on SAMPA.
In: ESPRIT Project 6819 (SAM-A),
Speech technology assessment in multilingual applications,
1-6. London.
Document No: SAM-A/D1-Appendix A, SAM-A periodic progress report,
Year 1, 1 April 1993-30 September 1993.
- Whittaker & Stenton (1989)
S. Whittaker & P. Stenton (1989).
User studies and the design of natural language systems.
In: Proceedings of the 4th Conference of the
European Chapter of the Association for Computational Linguistics,
116-123, Manchester.
- Willems et al. (1988)
N. Willems, R. Collier & J. 't Hart (1988).
Synthesis scheme for British English intonation.
Journal of the Acoustical Society of America, JASA 84:
- Winer (1971)
B. Winer (1971).
Statistical principles in experimental design.
McGraw-Hill, New York.
- Winski & Fourcin (1994)
R. Winski & A. Fourcin (1994).
A common European approach to assessment, corpora and standards.
In: K. Varghese, S. Pfleger & J. Lefevre (eds.),
Advanced speech applications. European research on speech
technology (Research Reports ESPRIT Volume 1). Springer-Verlag, Berlin.
- Winski et al. (1995)
R. Winski, R. Moore & D. Gibbon (1995).
EAGLES spoken language working group: Overview and results.
In: Proceedings of the 4th European Conference
on Speech Communication and Technology - Eurospeech'95, 841-844,
Madrid, September 1995.
- Witten (1982)
I. Witten (1982).
Principles of computer speech.
Academic Press, New York, N.Y.
- Woodland et al. (1995)
P. Woodland, C. Leggetter, J. Odell, V. Valtchev & S. Young (1995).
The 1994 HTK large vocabulary speech recognition system.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
I, 73-76, Detroit, MI, May.
- Woods & Zue (1976)
W. Woods & V. Zue (1976).
Dictionary expansion via phonological rules for a speech
understanding system.
In: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
561-564, Philadelphia.
- Wooffitt & Fraser (1992)
R. Wooffitt & N. Fraser (1992).
We're off to ring the Wizard, the wonderful Wizard of Oz.
In: G. Button (ed.), Technology in Working
Order: Studies of work, interaction and technology, 211-230.
Routledge, London.
- Woszczyna et al. (1993)
M. Woszczyna, N. Coccaro, A. Eisele, A. Lavie, A. McNair, T. Polzin, I. Rogina, C. Rose, T. Sloboda, M. Tomita, J. Tsutsumi, N. Waibel & W. Ward (1993).
Recent advances in Janus: A speech translation system.
In: Proceedings of a Workshop: Human Language
Technology, 211-216, 21-24 March, Princeton, NJ.
- Wright et al. (1993)
J. Wright, G. Jones & H. Lloyd-Thomas (1993).
A consolidated language model for speech recognition.
In: Proceedings of the European Conference on
Speech Communication and Technology, 977-980, Berlin, September.
- Yamron (1994)
J. Yamron (1994).
A generalization of n-grams.
In: Proceedings of the DARPA Workshop on
Robust Speech Recognition, Rutgers University, Piscataway, NJ, July-August.
- Yarrington & Foulds (1993)
D. Yarrington & R. Foulds (1993).
Personalizing synthesized voices.
In: Proceedings of the ESCA Workshop on Speech
and Language Technologies for Disabled Persons, 169-172,
- Young et al. (1989)
S. Young, A. Hauptmann, W. Ward, E. Smith & P. Werner (1989).
High level knowledge sources in usable speech recognition systems.
Communications of the ACM 32(2): 183-194.
Also in: A. Waibel and K.-F. Lee, eds., (1990), Readings in speech
recognition, Morgan Kaufmann Publishers, San Mateo, California, 538-549.
- Zue et al. (1991)
V. Zue, J. Glass, D. Goodine, L. Hirschman, H. Leung, M. Phillips, J. Polifroni & S. Seneff (1991).
The MIT ATIS system: Preliminary development, spontaneous
speech data collection, and performance evaluation.
In: Proceedings of the 2nd European Conference
on Speech Communication and Technology, 537-540, Genova,
EAGLES SLWG SoftEdition, May 1997.