References

Next: Spoken language reference materials Up: EAGLES SLWG Handbook Previous: Bibliographical references

References

Abercrombie (1967): D. Abercrombie (1967). Elements of general phonetics. Edinburgh University Press, Edinburgh.
Aho et al. (1987): A. Aho, B. Kernighan & P. Weinberger (1987). The AWK programming language. Addison-Wesley Publishing Company, Reading, Mass., etc.
Ainsworth (1988): W. Ainsworth (1988). Speech recognition by machine. Peter Peregrinus.
Aitchison (1994): J. Aitchison (1994). Words in the mind. An introduction to the mental lexicon. Blackwell, Oxford.
Aitkin et al. (1989): M. Aitkin, D. Anderson, B. Francis & J. Hinde (1989). Statistical modelling in GLIM. Clarendon Press, Oxford.
Akers & Lennig (1985): G. Akers & M. Lennig (1985). Intonation in text-to-speech synthesis: Evaluation of algorithms. Journal of the Acoustical Society of America, JASA 77: 2157-2165.
Akmajian (1984): A. Akmajian (1984). Linguistics: An introduction to language and communication. The MIT Press, Cambridge, Massachusetts, .
Allen (1988): G. Allen (1988). The PHONASCII system. Journal of the International Phonetic Association 18(1): 9-25.
Allen et al. (1987): J. Allen, M. Hunnicutt & D. Klatt (1987). From text to speech: The MITalk system. Cambridge University Press, Cambridge.
Allerhand (1987): M. Allerhand (1987). Knowledge-based speech pattern recognition. Kogan Page, London.
Alleva et al. (1992): F. Alleva, H. Hon, X. Huang, M. Hwang, R. Rosenfeld & R. Weide (1992). Applying SPHINX-II to the DARPA Wall Street Journal CSR task. : Speech and Natural Language workshop, 393-398, Harriman, New York.
Alleva et al. (1993): F. Alleva, X. Huang & M.-Y. Hwang (1993). An improved search algorithm using incremental knowledge for continuous speech recognition. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, II, 307-311, Minneapolis, MN, April.
Althoff et al. (1996): F. Althoff, G. Drexel, H. Lüngen, M. Pampel & C. Schillo (1996). The treatment of compounds in a morphological component for speech recognition. : D. Gibbon, , Natural language processing and speech technology. Results of the 3rd KONVENS Conference, Bielefeld, October 1996, 71-76. Mouton de Gruyter, Berlin, New York.
Andernach et al. (1993): T. Andernach, G. Deville & L. Mortier (1993). The design of a real world Wizard of Oz experiment for a speech driven telephone directory information system. : Proceedings of the 3rd European Conference on Speech Communication and Technology, 1165-1168, Berlin, September.
Andry et al. (1990): F. Andry, E. Bilange, F. Charpentier, K. Choukri, M. Ponamali & S. Soudoplatoff (1990). Computerised simulation tools for the design of an oral dialogue system. : Proceedings of the ESPRIT Technical Conference, Brussels, November.
Andry et al. (1992): F. Andry, S. McGlashan, N. Youd, N. Fraser & S. Thornton (1992). Making DATR work for speech: Lexicon compilation in SUNDIAL. Computational Linguistics 18(3): 245-267.
Argente (1991): J. Argente (1991). From speech to speaking styles. : Proceedings of the ESCA Workshop `Phonetics and phonology of speaking styles: Reduction and elaboration in speech communication', 1-1, 1-12, Barcelona.
Atal (1976): B. Atal (1976). Automatic recognition of speakers from their voices. Proceedings of the IEEE, April, 64(4): 460.
Atal et al. (1991): B. Atal, J. Miller & R. Kent, (1991). Papers in speech communication: Speech processing. Acoustical Society of America.
Aubergé (1992): V. Aubergé (1992). Developing a structured lexicon for synthesis of prosody. : G. Bailly, C. Benoît & T. Sawallis, , Talking machines: Theories, models and designs, 307-321. North-Holland, Amsterdam.
Austin (1962): J. Austin (1962). How to do things with words. Oxford University Press, Oxford.
Autesserre et al. (1989): D. Autesserre, G. Pérennou & M. Rossi (1989). Methodology for the transcription and labeling of a speech corpus. Journal of the International Phonetic Association 19(1): 2-15.
Averbuch et al. (1987): A. Averbuch, L. Bahl & R. Bakis (1987). Experiments with the TANGORA 20000 word speech recognizer. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 701-704.
Averbuch et al. (1986): A. Averbuch, L. Bahl, R. Bakis, P. Brown, A. Cole, G. Daggett, S. Das, K. Davies, S. De Gennaro, P. De Souza, E. Epstein, D. Fraleigh, F. Jelinek, S. Katz, B. Lewis, R. Mercer, A. Nadas, D. Nahamoo, M. Picheny, G. Shichman & P. Spinelli (1986). An IBM PC-based large-vocabulary isolated-utterance speech recognizer. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 53-56.
Baayen (1991): H. Baayen (1991). De CELEX lexicale databank. Forum der Letteren 32(3): 221-231.
Bahl et al. (1989): L. Bahl, P. Brown, P. De Souza & R. Mercer (1989). A tree-based statistical language model for natural language speech recognition. IEEE Transactions on Acoustics, Speech and Signal Processing, ASSP-37(7) 1001-1008. Also in: A. Waibel, K.-F. Lee, eds. (1990), Readings in speech recognition, Morgan Kaufmann Publishers, San Mateo, California, 507-514.
Bahl et al. (1983): L. Bahl, F. Jelinek & R. Mercer (1983). A maximum likelihood approach to continuous speech recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, March, 5: 179-190.
Bahl et al. (1984): L. Bahl, F. Jelinek, R. Mercer & A. Nadas (1984). Next word statistical predictor. IBM Tech. Disclosure Bulletin, December, 27(7A): 3941-3942.
Bailleul (1987): C. Bailleul (1987). Evaluation des performances d'un système de reconnaissance vocale dans des tâches de contrôle airiens. Note Interne, CENA/N87083, 22 June.
Bailly (1994): G. Bailly (1994). Rule compilers and text-to-speech systems. Les Cahiers de l'ICP 3: 87-91.
Bailly & Benoît (1992): G. Bailly & C. Benoît, (1992). Talking machines: Theories, models and designs. North-Holland, Elsevier Science Publishers, Amsterdam.
Baker (1975a): J. Baker (1975a). The DRAGON system - An overview. IEEE Transactions on Acoustics, Speech and Signal Processing, ASSP-23 24-29.
Baker (1975b): J. Baker (1975b). Stochastic modeling for automatic speech understanding. : D. Reddy, , Speech recognition, 521-541. Academic Press, New York, N.Y. Also in: A. Waibel, K.-F. Lee, eds. (1990), Readings in speech recognition, Morgan Kaufmann Publishers, San Mateo, California, 297-307.
Baker (1989): J. Baker (1989). Dragondictate-30k: Natural language speech recognition with 30000 words. : Proceedings of the European Conference on Speech Technology, 2, 161-163.
Baker et al. (1992): J. Baker, P. Bamberg, K. Bishop, L. Gillick, V. Helman, Z. Huang, Y. Ito, S. Lowe, B. Peskin, R. Roth & F. Scattone (1992). Large vocabulary recognition of Wall Street Journal sentences at Dragon systems. : Speech and Natural Language Workshop, 387-392, Harriman, New York, 23-26 February.
Ball (1991): M. Ball (1991). Computer coding of the IPA: Extensions to the IPA. Journal of the International Phonetic Association 21(1): 36-41.
Ballou (1987): G. Ballou, (1987). Handbook for sound engineers. W. Sams & Co., Indianapolis, U.S.A.
Barber et al. (1989): S. Barber, R. Carlson, P. Cosi, M. Di Benedetto, B. Granström & K. Vagges (1989). A rule-based Italian text-to-speech system. : Proceedings of the Eurospeech '89, 2, 517-520, Paris.
Barry & Fourcin (1990): W. Barry & A. Fourcin (1990). Speaker selection criteria. : ESPRIT Project 2589 (SAM), , Multilingual speech input/output assessment, methodology and standardisation. University College London, London. Interim Report Year I, Reference SAM-UCL-G002, Document SAM-UCL-001.
Barry & Fourcin (1992): W. Barry & A. Fourcin (1992). Levels of labelling. Computer Speech and Language 6: 1-14.
Barry et al. (1989): W. Barry, M. Grice, V. Hazan & A. Fourcin (1989). Excitation distributions for synthesised speech. : Proceedings of the Eurospeech '89, 1, 353-356, Paris.
Bartlett (1987): B. Bartlett (1987). Choosing the right microphones by understanding design tradeoffs. J. Audio. Eng. Soc. 35.
Bates & Ayuso (1991): M. Bates & D. Ayuso (1991). A proposal for incremental dialogue evaluation. : Proceedings of the DARPA Workshop on Speech and Natural Language, 319-322, Pacific Grove, CA, February.
Bates et al. (1990): M. Bates, S. Boisen & J. Makhoul (1990). Developing an evaluation methodology for spoken language systems. : Proceedings of the DARPA Workshop on Speech and Natural Language, 102-108, Hidden Valley, PA, June.
Baum (1900): F. Baum (1900). The Wizard of Oz. Collins, London. Edition of 1974.
Baum (1972): L. Baum (1972). An inequality and associated maximization technique in statistical estimation of a Markov process. Inequalities 3(1): 1-8.
Beckman (1986): M. Beckman (1986). Stress and non-stress accent. Foris, Dordrecht.
Belina & Hogrefe (1988): F. Belina & D. Hogrefe (1988). The CCITT specification and design language SDL. Computer networks and ISDN systems 16: 311-341.
Bell et al. (1990): T. Bell, J. Cleary & I. Witten (1990). Text compression. Prentice Hall, Englewood Cliffs, NJ.
Benoît (1989): C. Benoît (1989). Intelligibility test for the assessment of French synthesizers using semantically unpredictable sentences. : Proceedings of the ESCA Workshop on Speech Input/Output Assessment and Speech Databases, 1.7.1-1.7.4.
Benoît (1991): C. Benoît (1991). On the assessment of audio-visual speech synthesis. : Proceedings of the Workshop on International Cooperation and Standardisations of Speech Databases and Speech I/O Assessment Methods, Chiavari, Italy.
Benoît et al. (1992): C. Benoît, T. Lallouache, T. Mohamadi & C. Abry (1992). A set of French visemes for visual speech synthesis. : G. Bailly & C. Benoît, , Talking machines: Theories, models, and design, 485-504. North Holland, Elsevier Science Publishers, Amsterdam.
Benoît et al. (1989): C. Benoît, A. Van Erp, M. Grice, V. Hazan & U. Jekosch (1989). Multilingual synthesizer assessment using semantically unpredictable sentences. : Proceedings of the Eurospeech '89, 2, 633-636, Paris.
Bentler (1985): P. Bentler (1985). Theory and implementation of EQS, a structural equations program. BMDP Statistical Software Inc., Los Angeles.
Berendsen et al. (1986): E. Berendsen, S. Langeweg & H. Van Leeuwen (1986). Computational phonology: Merged not mixed. : Proceedings of the International Conference on Computational Linguistics '86, 612-614.
Berger et al. (1994): A. Berger, P. Brown, J. Cocke, S. Della Pietra, V. Della Pietra,J. Gillett, J. Lafferty, R. Mercer, H. Printz & L. Ures (1994). The Candide system for machine translation. : Proceedings of the ARPA Human Language Technology Workshop, 152-157, Plainsboro, NJ, March.
Berkley & Flanagan (1990): D. Berkley & J. Flanagan (1990). Integration of speech recognition, text-to-speech synthesis, and talker verification into a hands free audio/image teleconferencing system (humanet). ICSLP 20(1): 861-864.
Bimbot et al. (1995): F. Bimbot, I. Magrin-Chagnolleau & L. Mathan (1995). Second-order statistical measures for text-independent speaker identification. Speech Communication 17. 1-2.
Bimbot & Mathan (1993): F. Bimbot & L. Mathan (1993). Text-free speaker recognition using an arithmetic-harmonic sphericity measure. : Proceedings of the Eurospeech, 169-172.
Black et al. (1991): E. Black, S. Abney, D. Flickinger, C. Gdaniec, R. Grishman, P. Harrison, D. Hindle, R. Ingria, F. Jelinek, J. Klavans, M. Liberman, M. Marcus, R. Roukos, B. Santorini & T. Strazalkowski (1991). A procedure for quantitatively comparing the syntactic coverage of English grammars. : Proceedings of the DARPA Workshop on Speech and Natural Language, 306-311, Pacific Grove, CA, February.
Bladon (1990): A. Bladon (1990). Evaluating the prosody of text-to-speech synthesizers. : Proceedings of the Speech Tech '90, 215-220.
Blauert (1983): J. Blauert (1983). Spatial hearing. MIT Press, Cambridge.
Bleiching (1992): D. Bleiching (1992). Prosodisches Wissen im Lexikon. : G. Görz, , KONVENS 92, 1. Konferenz ``Verarbeitung natürlicher Sprache'', Nürnberg, 7.-9. Oktober 1992, 59-68. Springer-Verlag, Berlin.
Bleiching et al. (1996): D. Bleiching, G. Drexel & D. Gibbon (1996). Ein synkretismusmodell für die deutsche morphologie. : D. Gibbon, , Natural language processing and speech technology. Results of the 3rd KONVENS Conference, Bielefeld, October 1996, 237-248. Mouton de Gruyter, Berlin, New York.
Bleiching & Gibbon (1994): D. Bleiching & D. Gibbon (1994). Handbuch zur Demonstrator-Wortliste. V1.1. May 1994, Bielefeld University, Bielefeld, Germany.
Bloothooft et al. (1995): G. Bloothooft, V. Hazan, D. Huber & J. Llisterri (1995). European studies in phonetics and speech communication. OTS Publications, Utrecht.
Bobrow & Winograd (1977): D. Bobrow & T. Winograd (1977). An overview of KRL, a knowledge representation language. Cognitive Science 1: 3-46.
Boguraev et al. (1988): B. Boguraev, J. Carroll, S. Pulman, G. Russell, G. Ritchie, A. Black, E. Briscoe & C. Grover (1988). The lexical component of a natural language toolkit. : D. Walker, A. Zampolli & N. Calzolari, , Automating the lexicon: Research and practice in a multilingual environment. Cambridge University Press, Cambridge.
Bolinger (1972): D. Bolinger (1972). Accent is predictable (if you're a mind-reader). Language 48: 633-644.
Bolt (1970): R. Bolt (1970). Speaker identification by speech spectrograms: A scientists' view of its reliability for legal purposes. JASA 47(2): 597. Part 2.
Boogaart & Silverman (1992): T. Boogaart & K. Silverman (1992). Evaluating the overall comprehensibility of speech synthesizers. : Proceedings of the 2nd International Conference on Spoken Language Processing, ICSLP, 1207-1210, Banff.
Boogart et al. (1993): T. Boogart, P. Van Alphen & J. Doll (1993). Application oriented assessment of dialogue systems. : Joint ESCA - NATO/RSG10 Tutorial and Research Workshop on Applications of Speech Technology, Lautrach, September.
Boves (1984): L. Boves (1984). The phonetic basis of perceptual ratings of running speech. Foris, Dordrecht.
Brachman & Levesque (1985): R. Brachman & H. Levesque (1985). Readings in knowledge representation. Morgan Kaufmann Publishers, Inc., Los Altos, California.
Breiman et al. (1984): L. Breiman, J. Friedman, R. Ohlsen & C. Stone (1984). Classification and regression trees. Wadsworth, Belmont, CA.
Bridle et al. (1982): J. Bridle, M. Brown & R. Chamberlain (1982). An algorithm for connected word recognition. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 899-902, Paris, May.
Brietzmann et al. (1983): A. Brietzmann, H. Hein, H. Niemann & P. Regel (1983). The Erlangen system for understanding continuous German speech. : IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 304-307, Boston.
Bristow (1984): G. Bristow (1984). Electronic speech synthesis. Collins, London.
Bristow (1986): G. Bristow (1986). Electronic speech recognition. Collins, London.
Brouwer & De Haan (1987): D. Brouwer & D. De Haan, (1987). Woman's language, socialization and self-image. Foris Publications, Dordrecht.
Browman (1980): C. Browman (1980). Rules for demisyllable synthesis using Lingua, a language interpreter. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 561-564, Denver.
Brown et al. (1992): P. Brown, V. Della Pietra, P. De Souza & R. Mercer (1992). Class-based n-gram models of natural language. Computational Linguistics 18(4): 467-479.
Bruce (1989): G. Bruce (1989). Report from the IPA Working Group on suprasegmental categories. Working Papers 35, Lund University, Department of Linguistics, Lund 25-40.
Bunt et al. (1985): H. Bunt, R.-J. Beun, F. Dols, J. von der Linden & G. thoe Schwartzenberg (1985). The TENDUM dialogue system and its theoretical basis. IPO Annual Progress Report 19: 105-113.
Burrell (1991): M. Burrell (1991). Assessment of the degradations of synthetic speech and time frequency warping over different listening levels. : Proceedings of the Institute of Acoustics, 13, Pt. 2.
Button (1990): G. Button (1990). Going up a blind alley: Conflating conversation analysis and computational modelling. : P. Luff, G. Gilbert & D. Frohlich, , Computers and conversation, 67-90. Academic Press, London.
Cahill (1993): L. Cahill (1993). Morphonology in the lexicon. : Proceedings of the Sixth Conference of the European Chapter of the Association for Computational Linguistics, 87-96, Utrecht.
Cahill & Evans (1990): L. Cahill & R. Evans (1990). An application of DATR: The TIC lexicon. : R. Evans & G. Gazdar, , The DATR Papers, 31-39. School of Cognitive and Computing Science, University of Sussex, Brighton, .
Campbell (1995): J. Campbell (1995). Testing with the YOHO CD-ROM Voice Verification Corpus. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 341-344.
Carbonell & Pierrel (1986): N. Carbonell & J. Pierrel (1986). Architecture and knowledge sources in a human computer oral dialog system. : Proceedings of the NATO workshop: Structure of multimodal dialogues including voice, Corsica, France.
Carlson et al. (1979): R. Carlson, B. Granström & D. Klatt (1979). Some notes on the perception of temporal patterns in speech. : Proceedings of the 9th International Congress of Phonetics Sciences, 2, 260-267, Copenhagen.
Carroll & Chang (1970): J. Carroll & J. Chang (1970). Analysis of individual differences in multidimensional scaling via an n-way generalization of the ``eckhard-young'' composition. Psychometrika 35: 283-319.
Carson-Berndsen (1993): J. Carson-Berndsen (1993). Time map phonology and the projection problem in spoken language recognition. Doctoral dissertation, University of Bielefeld, Bielefeld, Germany.
Cartier et al. (1992): M. Cartier, F. Emerald, D. Pascal, P. Combescure & A. Soubigou (1992). Une méthode d'évaluation multicritère de sorties vocales: Application au test de 4 systèmes de synthèse à partir du texte. : 19èmes Journées d'Étude sur la Parole, Brussels.
CCITT (1988a): CCITT (1988a). Artificial voices. Blue Book IXth Plenary Assembly V: 87-99. Recommendation P.50.
CCITT (1988b): CCITT (1988b). Objective measurement of active speech level. Rec. P. 56 Melbourne, CCITT.
Chafe (1992): W. Chafe (1992). The importance of corpus linguistics to understanding the nature of language. : J. Svartvik, , Directions in corpus linguistics: Proceedings of the Nobel Symposium 82, New York, 79-97, Berlin. Mouton de Gruyter.
Charniak & McDermott (1985): E. Charniak & D. McDermott (1985). Introduction to Artificial Intelligence. Addison-Wesley, Reading, Massachusetts.
Chollet & Gagnoulet (1981): G. Chollet & C. Gagnoulet (1981). On the evaluation of recognizers and databases using a reference system. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Atlanta.
Chomsky (1965): N. Chomsky (1965). Aspects of the theory of syntax. The MIT Press, Cambridge, MA.
Chomsky & Halle (1968): N. Chomsky & M. Halle (1968). The sound pattern of English. Harper and Row, New York, Evanston, London.
Choukri et al. (1988): K. Choukri, G. Chollet & C. Montacié (1988). Test workstation for the evaluation of speech recognition algorithms, applications and databases. : Proceedings of the 7th FASE Symposium (Speech'88), 145-151, Edinburgh, August 1988.
Church (1987a): K. Church (1987a). Phonological parsing and lexical retrieval. Cognition 25: 53-69.
Church (1987b): K. Church (1987b). Phonological parsing in speech recognition. Kluwer Academic Publishers, Boston, Dordrecht, Lancaster.
Coates (1986): J. Coates (1986). Women, men and language: A sociolinguistic account of sex differences in language. Longman, London.
Cole (1995): Cole (1995). The challenge of spoken language systems: Research directions for the nineties. IEEE Transactions on Speech and Audio Processing 3: 1-20.
Combescure (1981): P. Combescure (1981). 20 listes de dix phrases phonétiquement équilibrées. Revue d'Acoustique 56: 34-38.
Content et al. (1990): A. Content, P. Mousty & M. Radeau (1990). Brulex, une base de données lexicales informatise pour le français écrit et parlé. L'Année Psychologique 90: 551-566.
Cookson (1988): S. Cookson (1988). Final evaluation of VODIS voice operated database inquiry system. : Proceedings of Speech-88, 7th FASE Symposium, 1311-1320, Edinburgh, August.
Cosi & Omologo (1991): P. Cosi & M. Omologo (1991). Caratterizzazione statistica della segmentazione manuale del segnale vocale. Associazione Italiana Acustica (AIA) Meeting. Napoli, Italy, 10-12 April. Cited in Barry and Fourcin 1992.
Crowdy (1993): S. Crowdy (1993). Spoken corpus design and transcription. Longman, Harlow.
Cruse (1986): D. Cruse (1986). Lexical semantics. CUP, Cambridge.
Crystal (1980): D. Crystal (1980). Introduction to language pathology. Edward Arnold Ltd., London.
Crystal (1985): D. Crystal (1985). A dictionary of linguistics and phonetics. Basil Blackwell, Oxford, UK.
Cucchiarini (1993): C. Cucchiarini (1993). Phonetic transcription: A methodological and empirical study. Doctoral thesis, University of Nijmegen, Nijmegen.
Dahlbäck & Jönsson (1986): N. Dahlbäck & A. Jönsson (1986). A system for studying human-computer dialogues in natural language. Research Report LiTH-IDA-R-86-42, Department of Computer and Information Science, Linköping University, Linköping.
Dahlbäck & Jönsson (1989): N. Dahlbäck & A. Jönsson (1989). Empirical studies of discourse representations for natural language interfaces. : Proceedings of the 4th Conference of the European Chapter of the Association for Computational Linguistics, 291-298, Manchester.
Dalsgaard & Baekgaard (1994): P. Dalsgaard & A. Baekgaard (1994). Spoken language dialogue systems. : H. Niemann, R. De Mori & G. Hanrieder, , Progress and prospects in speech and language technology, 178-191. Infix, Sankt Augustin.
Damhuis et al. (1994): M. Damhuis, T. Boogaart, C. in 't Veld, M. Versteijlen, W. Schelvis, L. Bos & L. Boves (1994). Creation & analysis of the Dutch Polyphone Corpus. : Proceedings of the International Conference on Spoken Language Processing, ICSLP, 1803-1806, Yokohama.
Davis & Davis (1975): D. Davis & C. Davis (1975). Sound system engineering. W. Sams & Co., Indianapolis, U.S.A.
De Mori et al. (1984): R. De Mori, M. Gilloux, G. Mercier, M. Simon, C. Tarrides & J. Vaissière (1984). Integration of acoustic, phonetic, prosodic and lexical knowledge in an expert system for speech understanding. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP. 42.9.1-42.9.4.
De Pijper (1983): J. De Pijper (1983). Modelling British English intonation. Foris, Dordrecht.
Della Pietra et al. (1994): S. Della Pietra, V. Della Pietra, J. Gillett, J. Lafferty, H. Printz & L. Ures (1994). Inference and estimation of a long-range trigram model. Second International Colloquium `Grammatical Inference and Applications', Alicante, Spain, September 1994 78-92. Springer-Verlag, Berlin.
Delogu et al. (1993a): C. Delogu, A. Di Carlo, C. Sementino & S. Stecconi (1993a). A methodology for evaluating human-machine spoken language interaction. : Proceedings of the 3rd European Conference on Speech Communication and Technology, 1427-1430, Berlin, September.
Delogu et al. (1991): C. Delogu, A. Paoloni, P. Pocci & C. Sementina (1991). Quality evaluations of text-to-speech synthesizers using magnitude estimation, categorical estimation, pair comparison and reaction time methods. : Proceedings of the Eurospeech '91, 353-355, Genova.
Delogu et al. (1993b): C. Delogu, A. Paoloni, P. Ridolfi & K. Vagges (1993b). Intelligibility of Italian text-to-speech synthesizers over ortophonic and telephonic channel. : Proceedings of the Eurospeech '93, 3, 1893-1896, Berlin.
Delogu et al. (1992a): C. Delogu, A. Paoloni & C. Sementina (1992a). Comprehension of natural and synthetic speech: Preliminary studies. : ESPRIT Project 2589 (SAM), , Multilingual speech input/output assessment, methodology and standardisation. University College London, London. Final report, Year three, 1.III.91-28.II.1992. SAM Internal Report II.c.
Delogu et al. (1992b): C. Delogu, P. Paoloni, P. Pocci & C. Sementina (1992b). A comparison among different methodologies for evaluating the quality of text-to-speech synthesis systems. : ESPRIT Project 2589 (SAM), , Multilingual speech input/output assessment, methodology an standardisation. University College London, London. Final report, Year three, 1.III.91-28.II.1992. SAM Internal Report II.d.
Delomier et al. (1989): D. Delomier, A. Meunier & M.-A. Morel (1989). Linguistic features of human-machine oral interaction. : Proceedings of the Eurospeech '89, 2, 236-239, Paris.
Dempster et al. (1977): A. Dempster, M. Laird & D. Rubin (1977). Maximum likelihood from incomplete data via the EM algorithm. J. Royal Statist. Soc. Ser. B (methodological) 39: 1-38.
Den Os (1994): E. Den Os (1994). Transliteration of the Dutch Speech Styles Corpus. : Proceedings of the Institute of Phonetic Sciences, 18, 87-94, University of Amsterdam.
Derouault & Merialdo (1986): A. Derouault & B. Merialdo (1986). Natural language modelling for phoneme-to-text transcription. IEEE Transactions on Pattern Analysis and Machine Intelligence, November, 8: 742-749.
Diaper (1986): D. Diaper (1986). Identifying the knowledge requirements of an expert system's natural language processing interface. : M. Harrison & A. Monk, , People and Computers V: Proceedings of the 2nd Conference of the British Computer Society Human-Computer Interaction Specialist Group, Cambridge. Cambridge University Press.
Diaper (1989): D. Diaper (1989). The Wizard's apprentice: A program to help analyse natural language dialogues. : A. Sutcliffe & L. Macaulay, , People and Computers: Designing for usability. Proceedings of the 2nd Conference of the British Computer Society Human-Computer Interaction Specialist Group, Cambridge. Cambridge University Press.
Doddington (1985): G. Doddington (1985). Speaker recognition - Identifying people by their voices. Proceedings of the IEEE, November, 73(11): 1651.
Dolmazon et al. (1990): J.-M. Dolmazon, J.-C. Caërou & W. Barry (1990). Initial development of SAM standard workstation. SAM-UCL-022, December, Appendix Se.10, University College London, London.
Dougherty (1990): D. Dougherty (1990). sed & awk. O'Reilly & Associates Inc., Sebastopol, CA.
Dreckschmidt (1987): G. Dreckschmidt (1987). The linguistic component in the speech understanding system SPICOS. : H. Tillmann & G. Willée, , Analyse und Synthese gesprochener Sprache, Jahrestagung der Gesellschaft für Linguistische Datenverarbeitung, Bonn, 96-101. Olms, Hildesheim.
Drullman & Collier (1993): R. Drullman & R. Collier (1993). Speech synthesis with accented and unaccented diphones. : V. Van Heuven & L. Pols, , Analysis and synthesis of speech, strategic research towards high-quality text-to-speech generation, 147-156. Mouton de Gruyter, Berlin.
Duda & Hart (1973): R. Duda & P. Hart (1973). Pattern classification and scene analysis. J. Wiley, New York.
Duncan (1974): S. Duncan (1974). On signalling that it's your turn to speak. Journal of Experimental Social Psychology 10: 234-247.
Dybkjaer et al. (1993): H. Dybkjaer, N. Bernsen & L. Dybkjaer (1993). Wizard-of-Oz and the trade-off between naturalness and recognizer constraints. : Proceedings of the 3rd European Conference on Speech Communication and Technology, 947-950, Berlin, September.
Eargle (1976): J. Eargle (1976). Sound recording. Van Nostrand Reinhold Company, New York, USA.
Edwards & Lampert (1993): J. Edwards & M. Lampert, (1993). Talking data: Transcription and coding in discourse research. Lawrence Erlbaum, Hillsdale.
Efron & Tibshirani (1993): B. Efron & R. Tibshirani (1993). An introduction to the bootstrap. Chapman & Hall, New York.
Egan (1948): J. Egan (1948). Articulation testing methods. Laryngoscope 58: 955-991.
Ehrlich (1986): U. Ehrlich (1986). Ein Lexikon für das natürlich-sprachliche Dialogsystem EVAR. Arbeitsberichte des IMMD, vol. 19, University of Erlangen-Nürnberg, Erlangen, Germany.
Eisen (1993): B. Eisen (1993). Reliability of speech segmentation and labelling at different levels of transcription. : Proceedings of the Third European Conference on Speech Communication and Technology, 1, 673-676, 21-23 September 1993, Berlin, Germany.
Erman (1977): L. Erman (1977). A functional description of the HEARSAY-II speech understanding system. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Hartford.
Erman & Hayes-Roth (1981): L. Erman & F. Hayes-Roth (1981). The HEARSAY-II speech understanding system: Integrating knowledge to resolve uncertainty. : B. Webber & N. Nilsson, , Readings in Artificial Intelligence, 349-389. Tioga, Palo Alto, CA.
Erman & Lesser (1980): L. Erman & V. Lesser (1980). The HEARSAY-II speech understanding system: A tutorial. : W. Lea, , Trends in speech recognition, 361-381. Prentice Hall, Englewood Cliffs, NJ. Also in: A. Waibel and K.-F. Lee, eds. (1990), Readings in speech recognition, Morgan Kaufmann Publishers, San Mateo, California, 235-245.
Esling (1988): J. Esling (1988). 7.1 Computer coding of IPA symbols and 7.3 detailed phonetic representation of computer data bases. Journal of the International Phonetic Association 18(2): 99-106.
Esling (1990): J. Esling (1990). Computer coding of the IPA: Supplementary report. Journal of the International Phonetic Association 20(1): 22-26.
Esling & Gaylord (1993): J. Esling & H. Gaylord (1993). Computer codes for phonetic symbols. Journal of the International Phonetic Association 23(2): 83-97.
Essen & Steinbiss (1992): U. Essen & V. Steinbiss (1992). Cooccurrence smoothing for stochastic language modelling. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, I, 161-164, San Francisco, CA, March.
Evans & Gazdar (1989): R. Evans & G. Gazdar (1989). The DATR papers. Research Report: May 1989, School of Cognitive and Computing Science, University of Sussex, School of Cognitive and Computing Science, University of Sussex, Brighton.
Evans & Gazdar (1990): R. Evans & G. Gazdar (1990). The DATR papers. Research Report: February 1990, School of Cognitive and Computing Science, University of Sussex, School of Cognitive and Computing Science, University of Sussex, Brighton.
Federico (1989): A. Federico (1989). Comparison between automatic methods and human listeners in speaker recognition tasks. : Proceedings of the Eurospeech, 279-282.
Fellbaum et al. (1994): K. Fellbaum, H. Klaus & J. Sotscheck (1994). Hörversuche zur Beurteilung der Sprachqualität von Sprachsynthesesystemen für die deutsche Sprache. : Fortschritte der Akustik, Plenarvorträge und Fachbeiträge der 20. Deutschen Jahrestagung für Akustik, 117-122, Dresden, DPG GmbH.
Ferguson (1976): G. Ferguson (1976). Statistical analysis in psychology and education. McGraw-Hill, Tokyo.
Ferrané et al. (1992): I. Ferrané, M. De Calmès, D. Cotto, J.-M. Pécatte & G. Pérennou (1992). Statistiques lexicales sur le corpus de textes utilisés dans le projet BREF: Questions de couverture lexicale. : Proceedings Communication Homme-Machine, Séminaire LEXIQUE, 217-226, 21-22 January 1992, IRIT-UPS, Toulouse.
Fillmore (1968): C. Fillmore (1968). The case for case. : E. Bach & R. Harms, , Universals in linguistic theory, 1-88. Holt, Rinehart and Winston, New York.
Fissore et al. (1993): L. Fissore, E. Giachin, P. Laface & P. Massafra (1993). Using grammars in forward and backward search. : Proceedings of the European Conference on Speech Communication and Technology, 1525-1528, Berlin, September.
Flanagan et al. (1991): J. Flanagan, D. Berkley, G. Elko & M. Sondhi (1991). Autodirective microphone systems. Acoustica 73: 58-71.
Fourcin (1993): A. Fourcin (1993). The SAM project. Ellis Horwood, Chichester.
Fourcin et al. (1989): A. Fourcin, G. Harland, W. Barry & V. Hazan, (1989). Speech input and output assessment. Multilingual methods and standards. Ellis Horwood Ltd., Chichester.
Fraser (1991): N. Fraser (1991). Corpus-based evaluation of the SUNDIAL system. : J. Neal & S. Walter, , Proceedings of the Natural Language Processing Systems Evaluation Workshop, Rome. Rome Laboratory. Technical Report RL-TR-91-362.
Fraser & Gilbert (1991a): N. Fraser & G. Gilbert (1991a). Effects of system voice quality on user utterances in speech dialogue systems. : Proceedings of the Second European Conference on Speech Communication and Technology, 57-60, Genova, September.
Fraser & Gilbert (1991b): N. Fraser & G. Gilbert (1991b). Simulating speech systems. Computer Speech and Language 5: 81-99.
Fraser et al. (1992): N. Fraser, N. Gilbert & C. McDermid (1992). The value of simulation data. : Proceedings of the Workshop on Empirical Models and Methodology for Natural Language Dialogue Systems, Trento, April.
French (1991): J. French (1991). Updated notes for soundprint transcribers + one page sample text from COBUILD corpus. Working paper, NERC-WP4-47, October, J.P. French Associated, York and COBUILD, Birmingham.
French (1992): J. French (1992). Transcription proposals: Multi-level system. Working paper, NERC-WP 4-50, October, University of Birmigham, Birmingham.
Fu (1982): K. Fu (1982). Syntactic pattern recognition and applications. Prentice-Hall, Englewood Cliffs, NJ.
Furui (1981): S. Furui (1981). Cepstral analysis technique for automatic speaker verification. IEEE Transactions on Acoustics, Speech and Signal Processing 29(2).
Furui (1994): S. Furui (1994). An overview of speaker verification technology. : ESCA-ETRW Workshop, 1-10, Martigny.
Generet et al. (1995): M. Generet, H. Ney & F. Wessel (1995). Extensions of absolute discounting for language modelling. : Proceedings of the Fourth European Conference on Speech Communication and Technology, 1245-1248, Madrid, September.
Gerbino et al. (1993): E. Gerbino, P. Baggia, A. Ciaramella & C. Rullent (1993). Test and evaluation of a spoken dialogue system. : Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP'93, Minneapolis, April.
Geutner (1995): P. Geutner (1995). Using morphology towards better large-vocabulary speech recognition systems. Interactive Systems Laboratories, University of Karlsruhe, Karlsruhe, Germany.
Gibbon (1991): D. Gibbon (1991). Lexical signs and lexicon structure: Phonology and prosody in the ASL-lexicon. Research Report ASL-MEMO-20-91/UBI, University of Bielefeld, Bielefeld, Germany.
Gibbon (1992a): D. Gibbon (1992a). ILEX: A linguistic approach to computational lexica. : U. Klenk, , Computatio linguae. Aufsätze zur algorithmischen und quantitativen Analyse der Sprache, 32-51. Franz Steiner Verlag, Stuttgart.
Gibbon (1992b): D. Gibbon (1992b). Language and software, or: Fritzl's quest. : C. Floyd, H. Züllighoven, R. Budde & R. Keil-Slavik, , Software Development and Reality Construction, 376-390. Springer Verlag, Berlin, Heidelberg, New York.
Gibbon (1993): D. Gibbon (1993). Generalized DATR for flexible lexical access: PROLOG specification. VERBMOBIL Report 2, October 1993, Bielefeld University, Bielefeld, Germany.
Gibbon (1995): D. Gibbon (1995). The VERBMOBIL lexicon: Bielefeld lexicon database V2.1. VERBMOBIL Technisches Dokument 21, 31 January 1995, Bielefeld University, Bielefeld, Germany.
Gibbon & Ehrlich (1995): D. Gibbon & U. Ehrlich (1995). Spezifikationen für ein VERBMOBIL-Lexikondatenbankkonzept. VERBMOBIL Memo 69, Bielefeld University & Daimler Benz AG, Bielefeld, Ulm.
Gilbert & Weismer (1974): H. Gilbert & G. Weismer (1974). The effect of smoking on the speaking fundamental frequency of adult women. Journal of Psycholinguistic Research 3: 225-231.
Gish et al. (1986): H. Gish, M. Kraner, W. Russel & J. Wolf (1986). Methods and experiments for text-independent speaker recognition over the telephone line. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 865. 17.2.1.
Gish & Schmidt (1994): H. Gish & M. Schmidt (1994). Text-independent speaker identification. : IEEE Signal Processing, 11, 18-32.
Goldsmith (1990): J. Goldsmith (1990). Autosegmental and metrical phonology. Indiana University Linguistics Club, Bloomington, Indiana.
Goldstein (1995): M. Goldstein (1995). Classification of methods used for assessment of text-to-speech systems according to the demands placed on the listener. Speech Communication 16: 225-244.
Goldstein et al. (1992): M. Goldstein, B. Lindström & O. Till (1992). Assessing global performance of speech synthesizers: Context effects when assessing naturalness of Swedish sentence-pairs generated by 4 systems using 3 different assessment procedures (free number magnitude estimation, 5- and 11-point category scales). : ESPRIT Project 2589 (SAM), , Multilingual speech input/output assessment, methodology and standardisation. University College London, London. SAM Internal Report II.a, Final report, Year three: 1.III.91-28.II.1992.
Goldstein & Till (1992): M. Goldstein & O. Till (1992). Assessing segmental intelligibility of two rule-based synthesizers and natural speech using the ESPRIT/SAMVCV test procedures (SOAP v3.0) in Swedish and testing for differences between two correlated proportions. : ESPRIT Project 2589 (SAM), , Multilingual speech input/output assessment, methodology and standardisation. Univeristy College London, London. SAM Internal Report II.b, Final report, Year three: 1.III.91-28.II.1992.
Gong (1995): Y. Gong (1995). Speech recognition in noisy environments: A survey. Speech Communication 16: 261-291.
Gonzalez & Thomason (1978): R. Gonzalez & M. Thomason (1978). Syntactic pattern recognition: An introduction. Addison-Wesley, Reading, MA.
Good (1953): I. Good (1953). The population frequencies of species and the estimation of population parameters. Biometrika, December, 40: 237-264.
Goodine et al. (1992): D. Goodine, L. Hirschman, J. Polifroni, S. Seneff & V. Zue (1992). Evaluating interactive spoken language systems. : Proceedings of the International Conference on Spoken Language Processing, ICSLP'92, 201-204, Banff, October.
Goorfin (1989): L. Goorfin (1989). Electronic dictionary pronounces over 83,000 words. Speech Technology 4(4): 49-51.
Gorin et al. (1991): A. Gorin, S. Levinson, A. Gertner & E. Goldman (1991). Adaptive acquisition of language. Computer, Speech and Language, April, 5(2): 101-132.
Gray & Kopp (1944): C. Gray & G. Kopp (1944). Voiceprint identification. Bell Telephone Report, Bell Laboratories.
Green (1986): D. Green (1986). Control, activation and resource: A framework and a model for the control of speech in bilinguals. Brain and Language 27: 210-223.
Greenspan et al. (1985): S. Greenspan, H. Nusbaum & D. Pisoni (1985). Perception of speech generated by rule: Effects of training and attentional limitations. Research on Speech Perception Progress Report 11, pages 263-287, Indiana University, Indianapolis.
Grenier (1977): Y. Grenier (1977). Identification du locuteur et adaptation au locuteur d'un système de reconnaissance phonémique. Ph.D. Thesis.
Grice (1975): H. Grice (1975). Logic and conversation. : P. Cole & J. Morgan, , Syntax and semantics 3: Pragmatics, 41-58. Academic Press, New York.
Grice et al. (1991): M. Grice, K. Vagges & D. Hirst (1991). Assessment of intonation in text-to-speech synthesis systems - A pilot test in English and Italian. : Proceedings of the Eurospeech '91, 2, 879-882, Genova.
Grice et al. (1992a): M. Grice, K. Vagges & D. Hirst (1992a). Prosodic form tests. : ESPRIT Project 2589 (SAM), , Multilingual speech input/output assessment, methodology and standardisation. University College London, London. Final report, Year three, 1.III.91-28.II.1992, Stage report So. 5, Part One.
Grice et al. (1992b): M. Grice, K. Vagges & D. Hirst (1992b). Prosodic function tests. : ESPRIT Project 2589 (SAM), , Multilingual speech input/output assessment, methodology and standardisation. University College London, London. Final report, Year three, 1.III.91-28.II.1992, Stage report So. 5, Part Two.
Grosz (1977): B. Grosz (1977). The representation and use of focus in dialogue understanding. University of California.
Guindon (1988): R. Guindon (1988). A multidisciplinary perspective on dialogue structure in user-advisor dialogues. : R. Guindon, , Cognitive Science and its applications for human-computer interaction, 163-200.
Guindon et al. (1987): R. Guindon, K. Shuldberg & J. Connor (1987). Grammatical and ungrammatical structures in user-advisor dialogues: Evidence for sufficiency of restricted languages in natural language interfaces to advisory systems. : Proceedings of the 25th Annual Meeting of the Association for Computational Linguistics, 41-44, Stanford.
Guindon et al. (1986): R. Guindon, P. Sladky, H. Brunner & J. Connor (1986). The structure of user-adviser dialogues: Is there method in their madness? : Proceedings of the 24th Annual Meeting of the Association for Computational Linguistics, 224-230.
Guyomard & Siroux (1986a): M. Guyomard & J. Siroux (1986a). PALABRE Phase 1 experimental protocol. . CNET/TSS/RCP WP4 task 3, April.
Guyomard & Siroux (1986b): M. Guyomard & J. Siroux (1986b). PALABRE Phase 2 experimental protocol. . CNET/TSS/RCP WP4 task 3, May.
Guyomard & Siroux (1987): M. Guyomard & J. Siroux (1987). Experimentation in the specification of an oral dialogue. : H. Niemann, M. Lang & G. Sagerer, , Recent Advances in Speech Understanding and Dialog Systems. NATO ASI Series. Series F: Computer and Systems Sciences, Vol. 46, 497-501. Springer-Verlag, Berlin, Heidelberg, New York, London, Paris, Tokyo.
Guyomard & Siroux (1988): M. Guyomard & J. Siroux (1988). Constitution incrementale d'un corpus de dialogues oraux cooperatifs. Journal Acoustique 1.
Haeb-Umbach & Ney (1994): R. Haeb-Umbach & H. Ney (1994). Improvements in time-synchronous beam search for 10000-word continuous speech recognition. IEEE Transactions on Speech and Audio Processing, April, 2: 353-356.
Hansen et al. (1992): J. Hansen, C. Pelaez, L. Solana & P. Vossen (1992). Performance assessment and evaluation: Specification document. SUNSTAR Report II.4.
Hauptmann & Rudnicky (1988): A. Hauptmann & A. Rudnicky (1988). Talking to computers: An empirical investigation. International Journal of Man-Machine Studies 28: 583-604.
Hayes (1963): W. Hayes (1963). Statistics. Holt, Rinehart and Winston, Inc., New York.
Hazan & Grice (1989): V. Hazan & M. Grice (1989). The assessment of synthetic speech intelligibility using semantically unpredictable sentences. : Proceedings of the ESCA Workshop on Speech Input/Output Assessment and Speech Databases, 1.6.1-1.6.4.
Hazan & Shi (1993): V. Hazan & B. Shi (1993). Individual variability in the perception of synthetic speech. : Proceedings of the Eurospeech '93, 3, 1849-1852, Berlin.
Heemskerk & Van Heuven (1993): J. Heemskerk & V. Van Heuven (1993). MORPA, a morpheme lexicon based morphological parser. : V. Van Heuven & L. Pols, , Analysis and synthesis of speech, strategic research towards high-quality text-to-speech generation, 67-85. Mouton de Gruyter, Berlin.
Helfrich (1979): H. Helfrich (1979). Age markers in speech. : K. Scherer & H. Giles, , Social markers in speech, 63-107. Cambridge University Press, Cambridge.
Hertz et al. (1985): S. Hertz, J. Kadin & K. Karplus (1985). The DELTA rule development system for speech synthesis from text. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 1589-1601.
Hess (1983): W. Hess (1983). Pitch determination of speech signals. Springer-Verlag, Heidelberg, F.R.G.
Hess et al. (1995): W. Hess, K. Kohler & H. Tillmann (1995). The PhonDat/Verbmobil Speech Corpus. : Proceedings of the Eurospeech 95, Madrid.
Heyer et al. (1991): G. Heyer, K. Waldhur & H. Khatchadourian (1991). Motivation, goals and milestones of ESPRIT II MULTILEX. : Génie Linguistique 91, 1, Versailles, France, 16-17 January.
Hieronymus et al. (1990): J. Hieronymus, H. Alexander, C. Bennett, I. Cohen, D. Davies, J. Dalby, J. Laver, W. Barry, A. Fourcin & J. Wells (1990). Proposed speech segmentation criteria for the SCRIBE project. SCRIBE Project Report.
Hirschman et al. (1990): L. Hirschman, D. Dahl, D. McKay, L. Norton & M. Linebarger (1990). Beyond class A: A proposal for automatic evaluation of discourse. : Proceedings of the DARPA Workshop on Speech and Natural Language, 109-112, Hidden Valley, PA, June.
Hjelmquist et al. (1987): E. Hjelmquist, B. Jansson & G. Torell (1987). Psychological aspects on blind people's reading of radio-distributed daily newspapers. : B. Knave & P. Widebäck, , Work with display units 86, 187-201. North-Holland, Elsevier Science Publishers, Amsterdam.
Hockett (1958): C. Hockett (1958). A course in modern linguistics. Macmillan, New York.
Höge et al. (1985): H. Höge, E. Marschall, O. Schmidbauer & R. Sommer (1985). Worthypothesengenerierung im Projekt SPICOS. : H. Niemann, , Mustererkennung 85, 7. DAGM-Symposium Erlangen, Informatik-Fachberichte, vol. 107, 175-179. Springer-Verlag, Berlin.
Holmes (1988): J. Holmes (1988). Speech synthesis and recognition. Van Nostrand Reinhold (UK) Co. Ltd., Wokingham.
Homayounpour et al. (1993): M. Homayounpour, J. Goldman, G. Chollet & J. Vaissiere (1993). Performance comparison of machine and human speaker verification. : Proceedings of the Eurospeech, 2295.
House (1988): A. House (1988). The recognition of search by machine - A bibliography. Academic Press Ltd., New York, N.Y.
House et al. (1965): A. House, C. Williams, M. Hecker & K. Kryter (1965). Articulation testing methods: Consonantal differentiation with a closed response set. Journal of the Acoustical Society of America, JASA 37: 158-166.
House et al. (1992): J. House, Y. Shitara, M. Grice & P. Howard-Jones (1992). Evaluation of prosody in dialogue synthesis. Speech, Hearing and Language 6: 89-108.
Houtgast & Verhave (1991): T. Houtgast & J. Verhave (1991). A physical approach to speech quality assessment: Correlation patterns in the speech spectrogram. : Proceedings of the Eurospeech '91, 1, 285-288, Genova.
Houtgast & Verhave (1992): T. Houtgast & J. Verhave (1992). An objective approach to speech quality. : ESPRIT Project 2589 (SAM), , Multilingual speech input/output assessment, methodology and standardisation. University College London, London. Stage report So. 9, Final report, Year three: 1.III.91-28.II.1992.
Howard-Jones (1992a): P. Howard-Jones (1992a). SOAP, Speech Output Assessment Package. Version 4.0, ESPRIT SAM-UCL-042.
Howard-Jones (1992b): P. Howard-Jones (1992b). Specification of listener dimensions. : ESPRIT Project 2589 (SAM), , Multilingual speech input/output assessment, methodology and standardisation. University College London, London. Stage report So. 8, Part One, Final report, Year three: 1.III.91-28.II.1992.
Howell (1990): P. Howell (1990). Clear speech and turn-taking cues in telephone dialogue. Report to BT, University College London, London.
Hunt (1991): A. Hunt (1991). New commercial applications of telephone-network-based speech recognition and speaker verification. Proceedings of the Eurospeech 15(2): 431.
Hunt (1990): M. Hunt (1990). Figures of merit for assessing connected-word recognizers. Speech Communication 9: 329-336.
IPDS (1995): IPDS (1995). CD-ROM#2: The Kiel corpus of spontaneous speech. vol. 1, kiel.
IPDS (1996): IPDS (1996). CD-ROM#3: The Kiel corpus of spontaneous speech. vol. 2, kiel.
ITU-T (1993): ITU-T (1993). Draft recommendation P.8S - Subjective performance assessment of the quality of speech voice output devices. Study group 12 - contribution 6, ITU-T.
Jakobson et al. (1951): R. Jakobson, G. Fant & M. Halle (1951). Preliminaries to speech analysis. The MIT Press, Cambridge.
Jassem & obacz (1989): W. Jassem & P. obacz (1989). IPA phonemic transcription using an IBM PC and compatibles. Journal of the International Phonetic Association 19(1): 16-23.
Jekosch (1992): U. Jekosch (1992). The Cluster-Identification Test. : ESPRIT Project 2589 (SAM), , Multilingual speech input/output assessment, methodology and standardisation. University College London, London. Internal report II.e, Final report, Year three: 1.III.91-28.II.1992.
Jekosch & Pols (1994): U. Jekosch & L. Pols (1994). A feature-profile for application-specific speech synthesis assessment and devaluation. : Proceedings of the 3rd International Conference on Spoken Language Processing, ICSLP, Yokohama.
Jelinek (1985): F. Jelinek (1985). A real-time, isolated-word, speech recognition system for dictation transcription. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 858-861.
Jelinek (1991): F. Jelinek (1991). Self-organized language modeling for speech recognition. : A. Waibel & K.-F. Lee, , Readings in speech recognition, 450-506. Morgan Kaufmann Publishers, San Mateo, CA.
Jelinek et al. (1992): F. Jelinek, J. Lafferty & R. Mercer (1992). Basic methods of probabilistic context free grammars. : P. Laface & R. De Mori, , Speech recognition and understanding, 347-360. Springer, Berlin.
Jelinek & Mercer (1980): F. Jelinek & R. Mercer (1980). Interpolated estimation of Markov source parameters from sparse data. : E. Gelsema & L. Kanal, , Pattern recognition in practice, 381-397. North-Holland Publishing Company, Amsterdam.
Jelinek et al. (1990): F. Jelinek, R. Mercer & S. Roukos (1990). Classifying words for improved statistical language models. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 621-624, Albuquerque, NM, April.
Jelinek et al. (1991a): F. Jelinek, R. Mercer & S. Roukos (1991a). Principles of lexical language modeling for speech recognition. : S. Furui & M. Sondhi, , Advances in Speech Signal Processing, 651-699. Marcel Dekker, New York.
Jelinek et al. (1991b): F. Jelinek, B. Merialdo, S. Roukos & M. Strauss (1991b). A dynamic language model for speech recognition. : Proceedings of the DARPA Workshop `Speech and Natural Language Workshop', 293-295, Pacific Grove, CA, February.
Johnston (1993): R. Johnston (1993). An on-going series of subjective experiments to assess speech output from text-to-speech systems. Unpublished report to CCITT Study Group, No. 12.
Jongenburger & Van Bezooijen (1992): W. Jongenburger & R. Van Bezooijen (1992). Evaluatie van ELK: Attitudes van de gebruikers, verstaanbaarheid en acceptabiliteit van de spraaksynthese, bruikbaarheid van het zoeksysteem. Stichting Spraaktechnologie, Utrecht.
Jönsson & Dalbäck (1988): A. Jönsson & N. Dalbäck (1988). Talking to your computer is not like talking to your best friend. : Proceedings of the First Scandinavian Conference on Artificial Intelligence, Tromso, Norway.
Joreskog & Sorbom (1984): J. Joreskog & D. Sorbom (1984). Lisrel VI. Analysis of linear structural relationships by maximum likelihood, instrument variables, and least squares methods. Scientific software, Mooreville, IN.
Karttunen (1983): L. Karttunen (1983). KIMMO: A general morphological processor. Texas Linguistic Forum 22: 165-186.
Kasuya et al. (1993): H. Kasuya, Y. Endo & S. Saliu (1993). Novel acoustic measurements of jitter and shimmer characteristics from pathological voice. : Proceedings of the Eurospeech '93, 1973-1976.
Katz (1987): S. Katz (1987). Estimation of probabilities from sparse data for the language model component of a speech recognizer. IEEE Transactions on Acoustics, Speech and Signal Processing, March, 35: 400-401.
Kelley (1983a): J. Kelley (1983a). An empirical methodology for writing user-friendly natural language computer applications. : Proceedings of the International Conference of Computer-Human Interaction, CHI '83.
Kelley (1983b): J. Kelley (1983b). Natural language and computers: Six steps for writing an easy-to-use computer application. The Johns Hopkins University, Baltimore.
Kelley (1984): J. Kelley (1984). An interactive design methodology for user-friendly natural language office information applications. Association for Computing Machinery Transactions on Office Information Systems 2: 26-41.
Kerkhoff et al. (1984): J. Kerkhoff, J. Wester & L. Boves (1984). A compiler for implementing the linguistic phase of a text-to-speech conversion system. : H. Bennis & W. Van Lessen-Kloecke, , Linguistics in The Netherlands 1984, 111-119. Foris, Dordrecht.
Kersta (1962): L. Kersta (1962). Voiceprint infallibility. : Meeting of Acoust. Soc. Am., Seattle.
Kinsey (1994): G. Kinsey (1994). Using voice recognition with IVR systems. : AVIOS conference proceedings, 49-56, San Jose.
Kirchhoff (1996): K. Kirchhoff (1996). Phonologisch strukturierte hmms zur automatischen spracherkennung. : D. Gibbon, , Natural language processing and speech technology. Results of the 3rd KONVENS Conference, Bielefeld, October 1996, 55-63. Mouton de Gruyter, Berlin, New York.
Klatt (1976): D. Klatt (1976). The linguistics uses of segmental duration in English: Acoustic and perceptual evidence. Journal of the Acoustical Society of America, JASA 59: 1208-1221.
Klatt (1977): D. Klatt (1977). Review of the ARPA speech understanding project. Journal of the Acoustical Society of America, JASA 62(6): 1345-1366. Also in: A. Waibel, K.-F. Lee, eds., (1990), Readings in speech recognition, Morgan Kaufmann Publishers, San Mateo, California, 554-575.
Klatt (1982): D. Klatt (1982). The KLATTalk text-to-speech conversion system. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 1589-1592.
Klatt (1987): D. Klatt (1987). Review of text-to-speech conversion in English. Journal of the Acoustical Society of America 82: 737-793.
Kneser & Ney (1995): R. Kneser & H. Ney (1995). Improved backing-off for m-gram language modeling. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, I, 49-52, Detroit, MI, May.
Knowles & Alderson (1995): G. Knowles & P. Alderson (1995). Working with speech: The computational analysis of formal British English speech. Longmans, London.
Knowles et al. (1995): G. Knowles, L. Taylor & B. Williams (1995). A corpus of formal British English speech. Longmans, London.
Knuth (1973): D. Knuth (1973). The art of computer programming 3: Sorting and searching. Addison-Wesley, Reading, Massachusetts.
Kohler et al. (1995): K. Kohler, M. Pätzold & A. Simpson (1995). From scenario to segment: The controlled elicitation, transcription, segmentation and labelling of spontaneous speech. Arbeitsberichte (AIPUK) 29, Institut für Phonetik und Digitale Sprachverarbeitung, IPDS, Universität Kiel, Kiel/Germany.
Kornai (1991): A. Kornai (1991). Formal phonology. Doctoral dissertation, Stanford University, Stanford.
Koskenniemi (1983): K. Koskenniemi (1983). Two-level morphology: A general computational model for word-form recognition and production. University of Helsinki, Department of General Linguistics, Helsinki, Finland.
Kraft & Portele (1995): V. Kraft & T. Portele (1995). Quality evaluation of five German speech synthesis systems. Acta Acustica 3: 351-365.
Kryter (1962a): K. Kryter (1962a). Methods for the calculation and use of the Articulation Index. Journal of the Acoustical Society of America, JASA 34: 1689-1697.
Kryter (1962b): K. Kryter (1962b). Validation of the Articulation Index. Journal of the Acoustical Society of America, JASA 34: 1698-1702.
Kuhn & De Mori (1990): R. Kuhn & R. De Mori (1990). A Cache-based natural language model for speech recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, June, 12: 570-583.
Labov (1972): W. Labov (1972). Sociolinguistic patterns. University of Pennsylvania Press, Pennsylvania.
Labov (1994): W. Labov (1994). Principles of linguistic change. Volume 1: Internal factors. Blackwell, Oxford.
Labrador & Dinesh (1984): C. Labrador & P. Dinesh (1984). Experiments in speech interaction with conventional data services. Interact '84, 104-108.
Lacouture & Normandin (1993): R. Lacouture & Y. Normandin (1993). Efficient lexical access strategies. : Proceedings of the European Conference on Speech Technology.
Ladefoged (1975): P. Ladefoged (1975). A course in phonetics. Harcourt, Brace, Jovanovich, New York.
Lafferty et al. (1992): J. Lafferty, D. Sleator & D. Temperley (1992). Grammatical trigrams: A probabilistic model of link grammars. : Proceedings of the AAAI Fall Symposium on Probabilistic Approaches to Natural Language, Cambridge, MA.
Langer & Gibbon (1992): H. Langer & D. Gibbon (1992). DATR as a graph representation language for ILEX speech oriented lexica. Research Report, March 1992, ASL-TR-43-92/UBI, University of Bielefeld, Bielefeld, Germany.
Langeweg (1988): S. Langeweg (1988). The stress system of Dutch. Doctoral dissertation, Leiden University, Leiden.
Larmouth (1986): D. Larmouth (1986). The legal and ethical status of surreptitious recording in dialect research: Do human subjects guidelines apply? : D. Larmouth, T. Murray & C. Murray, , Legal and ethical issues in surreptitious recording, Publication of the American Dialect Society, number 76. University of Alabama Press, Tuscaloosa and London.
Lau et al. (1993): R. Lau, R. Rosenfeld & S. Roukos (1993). Trigger-based language models: A maximum entropy approach. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, II, 45-48, Minneapolis, MN, April.
Laver (1991): J. Laver (1991). The gift of speech. Papers in the analysis of speech and voice, Edinburgh University Press, Edinburgh.
Laver (1994): J. Laver (1994). Principles of phonetics. Cambridge University Press, Cambridge.
Laver et al. (1988): J. Laver, J. McAllister, M. McAllister & M. Jack (1988). A Prolog-based automatic text-to-phoneme conversion system for British English. : Proceedings of the Second Symposium on Advanced Man-Machine Interface through Spoken Language, November 19-22, Hawaii.
Laver et al. (1989): J. Laver, M. McAllister & J. McAllister (1989). Pre-processing of anomalous text-strings in an automatic text-to-speech system. : S. Ramsaran, , Studies in the pronunciation of English: A commemorative volume in memory of A.C. Gimson. Croon Helm, London.
Lea (1980): W. Lea, (1980). Trends in speech recognition. Prentice-Hall, Englewood Cliffs, NJ.
Lee et al. (1990): K.-F. Lee, H.-W. Hon & R. Reddy (1990). An overview of the SHPINX speech recognition system. : A. Waibel & K.-F. Lee, , Readings in speech recognition, 600-610. Morgan Kaufmann Publishers, San Mateo, California.
Leggett & Williams (1984): J. Leggett & G. Williams (1984). An empirical investigation of voice as an input modality for computer programming. International Journal of Man-Machine Studies 21: 493-520.
Lehiste (1970): I. Lehiste (1970). Suprasegmentals. MIT Press, Cambridge, Mass.
Lehiste et al. (1976): I. Lehiste, J. Olive & L. Streeter (1976). Role of duration in disambiguating syntactically ambiguous sentences. Journal of the Acoustical Society of America, JASA 60: 1199-1202.
Lehmann (1983): E. Lehmann (1983). Theory of point estimation. J. Wiley, New York.
Lehnert & Giron (1995): H. Lehnert & F. Giron (1995). Vocal communication in virtual environments. : Conference documentation of Virtual Reality World '95, 279-293, Stuttgart/Germany.
Lesser et al. (1975): V. Lesser, R. Fennell, L. Erman & D. Reddy (1975). Organization of the HEARSAY-II speech understanding system. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP-23, 11-23.
Levelt (1989): J. Levelt (1989). Speaking: From intonation to articulation. ACL-MIT Press Series in Natural Language Processing. Bradford Book - The MIT-Press, Cambridge Massachusetts, London, England.
Levinson et al. (1983): S. Levinson, L. Rabiner & M. Sondhi (1983). An introduction to the application of the theory of probabilistic functions of a Markov process to automatic speech recognition. The Bell System Technical Journal, April, 62(4): 1035-1074.
Life et al. (1988): M. Life, M. Lee & J. Long (1988). Assessing the usability of future speech technology: Towards a method. : Speech '88: 7th FASE Symposium, Edinburgh.
Likert (1932): R. Likert (1932). A technique for the measurement of attitudes. Archives of Psychology 140.
Linggard (1985): R. Linggard (1985). Electronic synthesis of speech. Cambridge University Press, Cambridge.
Llisterri (1994): J. Llisterri (1994). Prosody Encoding Survey, Multext - LRE Project 62-050.
Llisterri & Mariño (1993): J. Llisterri & J. Mariño (1993). Spanish adaptation of SAMPA and automatic phonetic transcription. : ESPRIT Project 6819 (SAM-A), , Speech technology assessment in multilingual applications, Year 1, 1 April 1993-30 September 1993, 1-9. London. SAM-A periodic progress report, Document No: SAM-A/UPC/001/V1.
Logan et al. (1989): J. Logan, B. Greene & D. Pisoni (1989). Measuring the segmental intelligibility of synthetic speech produced by ten text-to-speech systems. Journal of the Acoustical Society of America, JASA 86: 566-581.
Logan et al. (1985): J. Logan, D. Pisoni & B. Greene (1985). Measuring the segmental intelligibility of synthetic speech: Results from eight text-to-speech systems. Research on speech perception Progress Report 11, 3-31, Indiana University, Indianapolis.
Loman & Boves (1993): H. Loman & L. Boves (1993). Development of rule based synthesis for text-to-speech. : V. Van Heuven & L. Pols, , Analysis and synthesis of speech, strategic research towards high-quality text-to-speech generation, 157-168. Mouton de Gruyter, Berlin.
Lowerre & Reddy (1980): B. Lowerre & R. Reddy (1980). The HARPY speech understanding system. : W. Lea, , Trends in speech recognition, 340-360. Prentice Hall, Englewood Cliffs, NJ. Also in: A. Waibel, K.-F. Lee, eds., (1990), Readings in speech recognition, Morgan Kaufmann Publishers, San Mateo, California, 576-586.
Luce et al. (1983): P. Luce, T. Feustel & D. Pisoni (1983). Capacity demands in short-term memory for synthetic and natural word lists. Human Factors 25: 17-32.
Luzzati & Néeel (1989): D. Luzzati & F. Néel (1989). Dialogue behaviour induced by machine. : Proceedings of the Eurospeech '89, 2, 601-604, Paris.
Lyons (1977): J. Lyons (1977). Semantics. Volumes I and II. Cambridge University Press, Cambridge.
Maassen & Povel (1985): B. Maassen & D.-J. Povel (1985). The effect of segmental and suprasegmental corrections on the intelligibility of deaf speech. Journal of the Acoustical Society of America, JASA 78: 877-886.
MacDermid (1993): C. MacDermid (1993). Features of naive callers' dialogues with a simulated speech understanding and dialogue system. : Proceedings of the 3rd European Conference on Speech Communication and Technology, 955-958, Berlin, September.
MacWhinney (1995): B. MacWhinney (1995). The CHILDES Project: Tools for analyzing talk. Lawrence Erlbaum, Hillsdale, NJ.
Manous et al. (1985): L. Manous, M. Dedina, H. Nusbaum & D. Pisoni (1985). Speeded sentence verification of natural and synthetic speech. Research on Speech Perception Progress Report 11, Indiana University, Indianapolis.
Marascuilo & Serlin (1988): L. Marascuilo & R. Serlin (1988). Statistical methods for the social, and behavioral sciences. Freeman and company, New York.
Mariani (1989): J. Mariani (1989). Recent advances in speech processing. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 429-440.
Marslen-Wilson (1989): W. Marslen-Wilson, (1989). Lexical representation and process. The MIT Press, Cambridge, Massachusetts and London, England.
Mérialdo (1988): B. Mérialdo (1988). Multi-level decoding for very-large-size-dictionary speech recognition. IBM Journal of Research and Development 32(2): 169-301.
Michaelis & Strube (1995): D. Michaelis & H. Strube (1995). Orthogonale akustische Stimmgüteparameter zur Stimmtherapiedokumentation. Fortschritte der Akustik - DAGA '95 to be printed.
Monaghan & Ladd (1989): A. Monaghan & D. Ladd (1989). Evaluating intonation in the CSTR text-to-speech system. : Proceedings of the ESCA Workshop on Speech I/O Assessment and speech databases, Noordwijkerhout. 3.6.1-3.6.4.
Monaghan & Ladd (1990): A. Monaghan & D. Ladd (1990). Symbolic output as the basis for evaluating intonation in text-to-speech systems. Speech Communication 9: 305-314.
Moody (1991): A. Moody (1991). Speaker verification. Internal Report, January 1991, Ensigma Ltd.
Moore (1977): R. Moore (1977). Evaluating speech recognizers. IEEE Transactions on Acoustics, Speech and Signal Processing 25(2): 178-183.
Moore (1986): R. Moore (1986). The NATO research study group on speech processing: RSG10. : Proceedings of the Speech Tech'86, 201-203, New York, 28-30 April 1986.
Moore (1988): R. Moore (1988). The technology of speech recognition. : Proceedings of the CCTA/Blenheim-Online Conference on Knowledge Based Systems in Government, Bristol, 8-10 November 1988.
Moore (1991): R. Moore (1991). International coordination of research standards in speech science and technology. : Proceedings of the ICSLP-90 Workshop on International Coordination of Spoken Language Database and Assessment Techniques for Speech Input/Output, Kobe, Japan, November 1991.
Moore (1992a): R. Moore (1992a). Speech recognition: Available assessment methods and needs for standardisation. : Proceedings of the Workshop on International Cooperation and Standardisation of Spoken Language Databases and Speech I/O Assessment Techniques, Chiavari, Italy, 26-28 September 1992.
Moore (1992b): R. Moore (1992b). User needs in speech research. : Proceedings of the Workshop on European Textual Corpora, Pisa, Italy, 23-26 January 1992.
Moore (1994a): R. Moore (1994a). The ``Capability Profile''. DRA-CSE Research Note DRA CIS CSE1 RN94/08, August 1994, DRA Speech Research Unit, Malvern, Worcs., UK.
Moore (1994b): R. Moore (1994b). The EAGLES working group on spoken language, Advanced Speech Applications. European research on speech technology. : K. Varghese, S. Pfleger & J. Lefevre, , Research Reports ESPRIT Volume 1. Springer-Verlag, Berlin.
Mori et al. (1992): S. Mori, C. Suen & K. Yamamoto (1992). Historical review of OCR research and development. Proceedings of the IEEE, July, 80(7): 1029-1058.
Morimoto et al. (1990): T. Morimoto, K. Shikano, H. Iida & A. Kurematsu (1990). Integration of speech recognition and language processing in the spoken language translation system SL-TRANS. : Proceedings of the International Conference on Spoken Language Processing, ICSLP, 921-928, Kyoto.
Moulines & Charpentier (1990): E. Moulines & F. Charpentier (1990). Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Communication 9: 453-467.
Müller & Runge (1993): C. Müller & F. Runge (1993). Dialogue design principles - key for usability of voice processing. : Proceedings of the 3rd European Conference on Speech Communication and Technology, 943-946, Berlin, September.
Murray & Arnott (1993): I. Murray & J. Arnott (1993). Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion. Journal of the Acoustical Society of America, JASA 93: 1097-1108.
Murray & Murray (1986): T. Murray & C. Murray (1986). On the legality and ethics of surreptitious recording. : D. Larmouth, T. Murray & C. Murray, , Legal and ethical issues in surreptitious recording, Publication of the American Dialect Society, number 76. University of Alabama Press, Tuscaloosa and London.
Murveit et al. (1993): H. Murveit, J. Butzberger, V. Digalakis & M. Weintraub (1993). Large vocabulary dictation using SRI's Decipher speech recognition system: Progressive search techniques. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, II, 319-322, Minneapolis, MN, April.
Nadas (1984): A. Nadas (1984). Estimation of probabilities in the language model of the IBM speech recognition system. IEEE Transactions on Acoustics, Speech and Signal Processing, August, 32: 859-861.
Nadas (1985): A. Nadas (1985). On Turing's formula for word probabilities. IEEE Transactions on Acoustics, Speech and Signal Processing, December, 33: 1414-1416.
Nespor & Vogel (1986): M. Nespor & I. Vogel (1986). Prosodic phonology. Foris, Dordrecht.
Newell (1978): A. Newell (1978). The palantype transcription unit - its history and progress to date. Hearing, 99-104. May/June.
Newell (1989): A. Newell (1989). Speech simulation studies - performance and dialogue specification. : J. Peckham, , Recent developments and applications of natural language processing, 141-157. Kogan Page, London.
Ney (1984): H. Ney (1984). The use of a one-stage dynamic programming algorithm for connected word recognition. IEEE Transactions on Acoustics, Speech and Signal Processing, April, 32(2): 263-271.
Ney & Aubert (1994): H. Ney & X. Aubert (1994). A word graph algorithm for large vocabulary, continuous speech recognition. : Proceedings of the International Conference on Spoken Language Processing, ICSLP, 1355-1358, Yokohama, Japan, September.
Ney & Essen (1993): H. Ney & U. Essen (1993). Estimating small probabilities by leaving-one-out. : Third European Conference on Speech Communication and Technology, 2239-2242, Berlin, September.
Ney et al. (1994): H. Ney, U. Essen & R. Kneser (1994). On structuring probabilistic dependencies in language modelling. Computer Speech and Language 8: 1-38.
Ney et al. (1992): H. Ney, D. Mergel, A. Noll & A. Paesele (1992). Data driven search organization for continuous speech recognition. IEEE Transactions on Signal Processing, February, 40(2): 272-281.
Ney et al. (1988): H. Ney, D. Mergel, A. Noll & A. Paeseler (1988). Overview of speech recognition in the SPICOS system. : H. Niemann, M. Lang & G. Sagerer, , Recent advances in speech understanding and dialog systems, 46 NATO ASI Series F, 305-310. Springer-Verlag, Berlin.
Niemann et al. (1985): H. Niemann, A. Brietzmann, R. Mühlfeld, P. Regel & G. Schukat (1985). The speech understanding and dialog system EVAR. : R. De Mori & C. Suen, , New systems and architectures for automatic speech recognition and synthesis, NATO ASI Series F, vol. 16, 271-302. Springer-Verlag, Berlin.
Niemann et al. (1992): H. Niemann, E. Nöth, M. Mast & E. Schukat-Talamazzini (1992). Ein Lexikon für ein natürlich-sprachliches Dialogsystem. : Beiträge des ASL-Lexikonworkshops, 15-18, Wandlitz, 26-27 November. ASL-TR-40-92/ZSB.
Nolan (1987): F. Nolan (1987). The limits of segmental description. : Proceedings of the Eleventh International Conference of Phonetic Sciences, 5, 411-414, 1-7 August 1987, Tallinn, Estonia.
Nooteboom & Kruijt (1987): S. Nooteboom & J. Kruijt (1987). Accents, focus distribution, and the perceived distribution of given and new information. Journal of the Acoustical Society of America, JASA 82: 1512-1524.
Nossin (1991): M. Nossin (1991). Le projet GENELEX: EUREKA pour les dictionnaires génériques. Génie Linguistique 91, volume 1. Versailles, France, 16-17 January 1991.
Nunn & Van Heuven (1993): A. Nunn & V. Van Heuven (1993). MORPHON: Lexicon-based text-to-phoneme conversion and phonological rules. : V. Van Heuven & L. Pols, , Analysis and synthesis of speech, strategic research towards high-quality text-to-speech generation, 88-113. Mouton de Gruyter, Berlin.
Nusbaum et al. (1986): H. Nusbaum, S. Greenspan & D. Pisoni (1986). Perceptual attention in monitoring natural and synthetic speech. Research on Speech Perception Progress Report 12, Indiana University, Indianapolis.
Nye & Gaitenby (1974): P. Nye & J. Gaitenby (1974). The intelligibility of synthetic monosyllabic words in short, syntactically normal sentences. Haskins Laboratories Status Report on Speech Research, 37/38, pages 169-190.
Nye et al. (1975): P. Nye, F. Ingemann & L. Donald (1975). Synthetic speech comprehension: A comparison of listener performances with and preferences among different speech forms. Haskins Laboratories Status Report on Speech Research, 41.
Oerder & Ney (1993): M. Oerder & H. Ney (1993). Word graphs: An efficient interface between continuous speech recognition and language understanding. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, II, 119-122, Minneapolis, MN, April.
Oglesby (1994): J. Oglesby (1994). What's in a number? Moving beyond the equal error rate. To appear in Speech Communication, August 1995. Preliminary version published in Martigny ETRW, pp. 87-90.
Olsen & Olsen (1990): G. Olsen & J. Olsen (1990). User-centered design of Collaborative Technology. Cognitive Science and Machine Intelligence Laboratory. To appear in Organizational Computing 32.
O'Malley & Caisse (1987): M. O'Malley & M. Caisse (1987). How to evaluate text-to-speech systems. Speech Technology 3: 66-75.
O'Neill (1975): J. O'Neill (1975). Measurement of hearing by tests of speech and language. : S. Singh, , Measurement procedures in speech, hearing, and language, 219-252. University Park Press, Baltimore.
Oppenheim (1978): A. Oppenheim (1978). Applications of digital signal processing. Prentice-Hall, Englewood Cliffs, N.J.
O'Shaughnessy (1986): D. O'Shaughnessy (1986). Speaker recognition. IEEE ASSP Magazine, 4-17.
O'Shaughnessy (1987): D. O'Shaughnessy (1987). Speech communiacation - human and machine. Addison-Wesley, New York.
Pallet et al. (1990): D. Pallet, W. Fisher & J. Garofolo (1990). DARPA ATIS results, June 1990. : Proceedings of the DARPA Workshop on Speech and Natural Language, 114-121, Hidden Valley, PA, June.
Pallett (1985): D. Pallett (1985). Performance assessment of automatic speech recognizers. Journal of the National Bureau of Standards 90(5). September-October 1985.
Parducci (1965): A. Parducci (1965). Category judgement: A range-frequency model. Psychological Review 72: 407-418.
Pavlovic et al. (1990): C. Pavlovic, M. Rossi & R. Espesser (1990). Use of the magnitude estimation technique for assessing the performance of text-to-speech synthesis system. Journal of the Acoustical Society of America, JASA 87: 373-381.
Pavlovic et al. (1991): C. Pavlovic, M. Rossi & R. Espesser (1991). Perceived spectral energy distributions for EUROM-0 speech and for some synthetic speech. : Proceedings of the 12th International Congress of Phonetic Sciences, 5, 418-421, Aix-en-Provence.
Peckels & Rossi (1973): J. Peckels & M. Rossi (1973). Le test diagnostic par paires minimales. Adaptation au Français du ``Diagnostic Rhyme Test" de W.D. Voiers. Revue d'Acoustique 27: 245-262.
Peckham (1990): J. Peckham (1990). An overview of speaker verification technology and application over the telephone. : Proceedings of the Voice System Worldwide, 166.
Peckham (1993): J. Peckham (1993). A new generation of spoken dialogue systems: Results and lessons from the SUNDIAL project. : Proceedings of the 3rd European Conference on Speech Communication and Technology, 33-40, Berlin, September.
Peckham & Thomas (1990): J. Peckham & T. Thomas (1990). Recognizer sensitivity analysis: A method for assessing the performance of speech recognizers. Speech Communication 9: 317-328.
Pérennou et al. (1991): G. Pérennou, D. Cotto, M. De Calmès, I. Ferrané, J. Pécatte & J. Tihoni (1991). Composantes phonologique et orthographique de BDLEX. : Deuxièmes Journées Nationales du GRECO-PRC Communication Homme-Machine, 351-362, Toulouse, 29-30 January.
Pérennou et al. (1992): G. Pérennou, D. Cotto, M. De Calmès, I. Ferrané & J.-M. Pécatte (1992). Le projet BDLEX de base de données lexicales du Français écrit et parlé. : Proceedings Communication Homme-Machine, Séminaire LEXIQUE, 153-171, 21-22 January 1992, IRIT-UPS Toulouse.
Pérennou & De Calmès (1987): G. Pérennou & M. De Calmès (1987). BDLEX lexical data and knowledge base of spoken and written French. : European Conference on Speech Technology, 1, 393-396, Edinburgh.
Pérennou & Tihoni (1992): G. Pérennou & J. Tihoni (1992). Lexique et phonologie en reconnaissance de la parole. : Proceedings Communication Homme-Machine, Séminaire LEXIQUE, 41-57, 21-22 January 1992, IRIT-UPS Toulouse.
Perkins (1977): W. Perkins (1977). Speech pathology, an applied behavioral science. The C.V. Mosby Company, Saint Louis.
Philips et al. (1987): S. Philips, S. Stelle & C. Tanz (1987). Language, gender and sex in comparative perspective. Cambridge University Press, Cambridge.
Pieraccini et al. (1993): R. Pieraccini, E. Levin & E. Vidal (1993). Learning how to understand language. : Third European Conference on Speech Communication and Technology, 1407-1412, Berlin, September.
Pierce (1991): A. Pierce (1991). Acoustics: An introduction to its physical principles and applications. McGraw Hill, Inc., New York.
Pisoni et al. (1985a): D. Pisoni, B. Greene & H. Nusbaum (1985a). Perception of synthetic speech generated by rule. Proceedings of the IEEE 73: 1665-1676.
Pisoni et al. (1985b): D. Pisoni, B. Greene & H. Nusbaum (1985b). Some human factors issues in the perception of synthetic speech. : Proceedings Speech Tech '85, 57-61, New York.
Pitrelli et al. (1994): J. Pitrelli, M. Beckman & J. Hirschberg (1994). Evaluation of prosodic transcription labeling reliability in the ToBI framework. : Proceedings of the International Conference on Spoken Language Processing, ICSLP, 18-22 September 1994, Yokohama, Japan.
Plenat (1991): M. Plenat (1991). Vers d'une phonémisation des sigles. : Deuxièmes journées du GDR-PRC Communication Homme-Machine, EC2 Editeur, 363-371, Toulouse, 29-30 January.
Plomp & Mimpen (1979): R. Plomp & A. Mimpen (1979). Improving the reliability of testing the speech reception threshold for sentences. Audiology 8: 43-52.
Pols (1991): L. Pols (1991). Quality assessment of text-to-speech synthesis-by-rule. : S. Furui & M. Sondhi, , Advances in speech signal processing, 387-416. Marcel Dekker Inc., New York.
Pols et al. (1987): L. Pols, J.-P. Lefevre, G. Boxelaar & N. Van Son (1987). Word intelligibility of a rule synthesis system for French. : Proceedings of the European Conference on Speech Technology, 1, 179-182, Edinburgh.
Ponamale et al. (1990): M. Ponamale, E. Bilange, K. Choukri & S. Soudoplatoff (1990). A computer-aided approach to the design of an oral dialogue system. : Proceedings of Eastern Multiconference, Nashville.
Portele et al. (1994): T. Portele, B. Heuft, F. Höfer, H. Meyer & W. Hess (1994). A new high quality speech synthesis system for German. : Proceedings Yokohama/New Paltz.
Pratt (1987): R. Pratt (1987). Quantifying the performance of text-to-speech synthesizers. Speech Technology, 54-64.
Price (1990): P. Price (1990). Evaluation of spoken language systems: The ATIS domain. : Proceedings of the DARPA Workshop on Speech and Natural Language, 91-95, Hidden Valley, PA, June.
Quené (1993): H. Quené (1993). Segment durations and accent as cues to word segmentation in Dutch. Journal of the Acoustical Society of America, JASA 94: 2027-2035.
Rabiner & Schafer (1978): L. Rabiner & R. Schafer (1978). Digital processing of speech signals. Prentice-Hall, Englewood Cliffs, N.J.
Radford (1988): A. Radford (1988). Transformational grammar: A first course. CUP, Cambridge.
Ralston et al. (1991): J. Ralston, D. Pisoni, S. Lively, B. Greene & J. Mullennix (1991). Comprehension of synthetic speech produced by rule: Word monitoring and sentence-by-sentence listening times. Human Factors 33: 471-491.
Rayner et al. (1993): M. Rayner, H. Alshawi, I. Breton, D. Carter, V. Digalakis, B. Gamback, J. Kaja, J. Karlgren, B. Lyberg, S. Pulman, P. Price & C. Samuelsson (1993). A speech to speech translation system built from standard components. : Proceedings of a Workshop: Human Language Technology, 217-222, Princeton, NJ, 21-24 March.
Reilly (1987): R. Reilly (1987). Ill-formedness and mis-communication in person-machine dialogue. Information and Software Technology 29: 69-74.
Reyelt et al. (1996): M. Reyelt, M. Grice, R. Benzmüller, J. Mayer & A. Batliner (1996). Prosodische Etikettierung des Deutschen mit ToBI. : D. Gibbon, , Natural language processing and speech technology. Results of the 3rd KONVENS Conference, Bielefeld, October 1996, 144-155. Mouton de Gruyter, Berlin, New York.
Reynolds (1994): D. Reynolds (1994). Speaker identification and verification using Gaussian mixture speaker models. To appear in Speech Communication, August 1995. Preliminary version published in ETRW Martigny, pp. 27-30.
Richards & Underwood (1984a): M. Richards & K. Underwood (1984a). How should people and computers speak to each other? Interact '84, 33-36.
Richards & Underwood (1984b): M. Richards & K. Underwood (1984b). Talking to machines. How are people naturally inclined to speak? : E. Megaw, , Contemporary Ergonomics. Taylor and Francis, London.
Ritchie et al. (1992): G. Ritchie, A. Black, G. Russell & S. Pulman (1992). Computational morphology. The MIT Press, Cambridge, Massachusetts and London.
Roach et al. (1993): P. Roach, G. Knowles, T. Varadi & S. Arnfield (1993). MARSEC: A machine-readable Spoken English corpus. Journal of the International Phonetic Association 23(2): 47-53.
Roach et al. (1990): P. Roach, H. Roach, A. Dew & P. Rowlands (1990). Phonetic analysis and the automatic segmentation and labeling of speech sounds. Journal of the International Phonetic Association 20(1): 15-21.
Roe & Wilpon (1994): D. Roe & J. Wilpon (1994). Voice communication between humans and machines. National Academy Press, Washington.
Roelofs (1987): J. Roelofs (1987). Synthetic speech in practice: Acceptance and efficiency. Behaviour and Information Technology 6: 403-410.
Rose (1971): D. Rose (1971). Audiological assessment. Prentice-Hall International, Inc., London.
Rosenberg (1973): A. Rosenberg (1973). Listener performance in speaker verification tasks. IEEE Transactions on Audio Electroacoustic 21: 221-225.
Rosenberg (1976): A. Rosenberg (1976). Automatic speaker verification: A review. Proceedings of the IEEE, April, 64(4): 475.
Rosenfeld (1994): R. Rosenfeld (1994). Adaptive statistical language modeling: A maximum entropy approach. Ph.D. Thesis, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA. CMU-CS-94-138.
Rossi (1988): M. Rossi (1988). Acoustics and electroacoustics. Artech House, Norwood, MA, USA.
Rowden (1992): C. Rowden (1992). Speech processing. McGraw-Hill Book Company, London.
Rudnicky et al. (1987): A. Rudnicky, L. Baumeister, K. De Graff & E. Lehmann (1987). The lexical access component of the CMU continuous speech recognition system. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP.
Ruske (1985): G. Ruske (1985). Demisyllables as processing units for automatic speech recognition and lexical access. : R. De Mori & C. Suen, , New systems and architectures for automatic speech recognition and synthesis, 16 NATO ASI Series F, 593-611. Springer-Verlag, Berlin.
Ruske & Schotola (1981): G. Ruske & T. Schotola (1981). The efficiency of demisyllable segmentation in the recognition of spoken words. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 971-974, Atlanta.
Sacks et al. (1974): H. Sacks, E. Schlegloff & G. Jefferson (1974). A simplest systematics for the organization of turn-taking in conversation. Language 50: 697-735.
Sagerer (1990): G. Sagerer (1990). Automatisches Verstehen gesprochener Sprache, 74 Reihe Informatik. Bibliographisches Institut, Mannheim.
Sakoe (1979): H. Sakoe (1979). Two-level DP matching - A dynamic programming-based pattern matching algorithm for connected word recognition. IEEE Transactions on Acoustics, Speech and Signal Processing 27: 588-595.
Salza et al. (1993): P. Salza, G. Di Fabbrizio, M. Oreglia, M. Falcone, C. Sementina & C. Delogu (1993). Development of a context dependent methodology for text-to-speech synthesis evaluation in interactive dialogue systems. : ESPRIT Project 6819 (SAM-A), , Speech technology assessment in multilingual applications. London. Report R2, SAM-A Periodic Progress Report. Year 1, 1 April 1993-30 September 1993.
SAM (1992): SAM (1992). Multi-lingual speech input/output assessment, methodology and standardization. ESPRIT project 2589 (SAM), Final report, Year three, 1 III 91-28 II 1992, Ref: SAM-UCL-G004, Univeristy College London, London.
SAM-A (1993): SAM-A (1993). Speech technology assessment in multilingual applications. ESPRIT Project 6819 (SAM-A), Report No. 2, Year 1, Ref SAM-A/G002.
Scharpff & Van Heuven (1988): P. Scharpff & V. Van Heuven (1988). Effects of pause insertion on the intelligibility of low quality speech. : Proceedings of the 7th FASE/Speech '88 Symposium, 261-269, Edinburgh.
Scherer & Giles (1979): K. Scherer & H. Giles, (1979). Social markers in speech. Cambridge University Press, Cambridge.
Schmidt & Watson (1991): M. Schmidt & G. Watson (1991). The evaluation and optimization of automatic speech segmentation. : Proceedings of the Second European Conference on Speech Communication and Technology, Eurospeech 91, 2, 701-704, 24-26 September 1991, Genova, Italy.
Schröder et al. (1987): S. Schröder, G. Sagerer & H. Niemann (1987). Wissensakquisition mit semantischen Netzwerken. : E. Paulus, , Mustererkennung 87, 9. DAGM-Symposium Braunschweig, Informatik-Fachberichte, 305-309. Springer-Verlag, Berlin.
Schukat-Talamazzini (1993): E. Schukat-Talamazzini (1993). Automatische Spracherkennung. Habilitationsschrift, Erlangen University, Erlangen, Germany.
Schwab et al. (1985): E. Schwab, H. Nusbaum & D. Pisoni (1985). Some effects of training on the perception of synthetic speech. Human Factors 27(4): 395-408.
Schwartz & Austin (1991): R. Schwartz & S. Austin (1991). A comparison of several approximate algorithms for finding multiple (n-best) sentence hypotheses. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 701-704, Toronto, May.
Searle (1969): J. Searle (1969). Speech acts: An essay in the philosophy of language. Cambridge University Press, Cambridge.
Searle (1979): J. Searle (1979). Expression and meaning. Cambridge University Press, Cambridge.
Sells (1985): P. Sells (1985). Lectures on contemporary syntactic theories: An introduction to Government-Binding theory, Generalized Phrase Structure Grammar, and Lexical-Functional Grammar. CSLI Center for the Study of Language and Information, Stanford, California.
Siegel (1956): S. Siegel (1956). Nonparametric statistics for the behavioral sciences. McGraw-Hill, New York.
Silverman et al. (1990): K. Silverman, S. Basson & S. Levas (1990). Evaluating synthesizer performance: Is segmental intelligibility enough? : Proceedings of the International Conference on Spoken Language Processing, ICSLP, 981-984, Kobe.
Silverman et al. (1992): K. Silverman, M. Beckman, J. Pitrelli, M. Ostendorf, C. Wightman, P. Price, J. Pierrehumbert & J. Hirschberg (1992). ToBI: A standard for labeling English prosody. : Proceedings of the 1992 International Conference on Spoken Language Processing, ICSLP, 2, 867-870, 12-16 October 1992, Banff, Canada.
Simpson & Fraser (1993): A. Simpson & N. Fraser (1993). Black box and glass box evaluation of the SUNDIAL system. : Proceedings of the 3rd European Conference on Speech Communication and Technology, 1423-1426, Berlin, September.
Simpson & Ruth (1987a): C. Simpson & J. Ruth (1987a). The phonetic discrimination test for speech recognizers: Part I. Speech Technology March/April.
Simpson & Ruth (1987b): C. Simpson & J. Ruth (1987b). The phonetic discrimination test for speech recognizers: Part II. Speech Technology October/November.
Skinner et al. (1992): T. Skinner, J. Holt & N. Nguyen (1992). Automatic identity confirmation and adaptive solutions. Speech Technology 106-111. February 1992.
Smith (1979): P. Smith (1979). Sex markers in speech. : K. Scherer & H. Giles, , Social markers in speech, 109-146. Cambridge University Press, Cambridge.
Smith et al. (1992): R. Smith, D. Hipp & A. Biermann (1992). A dialog control algorithm and its performance. : Proceedings of the 3rd Conference on Applied Natural Language Processing, 9-16, Trento, April.
Soclof (1990): M. Soclof (1990). A comparison of spontaneous speech and read speech in human-machine problem solving dialogues. Massachusetts Institute of Technology.
Soong & Huang (1991): F. Soong & E.-F. Huang (1991). A Tree-Trellis Fast Search for finding the n-best sentence hypotheses. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 705-708, Toronto, May.
Soong et al. (1987): F. Soong, A. Rosenberg, B. Juang & L. Rabiner (1987). A Vector Quantization approach to speaker recognition. AT&T Technical Journal 66. Issue 2.
Sorin (1994): C. Sorin (1994). Towards high-quality multilingual text-to-speech. : Proceedings of the CRIM/FORWISS workshop, 53-62, Munich. Also to appear in H. Niemann, ed., Progress and prospects in research and technology, Infix Publishing Company, Sankt Augustin.
Sotscheck (1982): J. Sotscheck (1982). Ein Reimtest für Verständlichkeitsmessungen mit deutscher Sprache als ein verbessertes Verfahren zur Bestimmung der Sprachübertragungsgeräte. Der Fernmeldung 36: 1-84.
Sperberg-McQueen & Burnard (1994): C. Sperberg-McQueen & L. Burnard, (1994). Guidelines for electronic text encoding and interchange. TEI P3. Chapter 1 Transcription of Speech. Association for Computational Linguistics, Association for Computers and the Humanities, Association for Literary and Linguistic Computing, Chicago and Oxford.
Spiegel et al. (1990): M. Spiegel, M. Altom, M. Macchi & K. Wallace (1990). Comprehensive assessment of the telephone intelligibility of synthesized and natural speech. Speech Communication 9: 279-291.
Sproat et al. (1992): R. Sproat, J. Hirschberg & D. Yarowsky (1992). A corpus-based synthesizer. : Proceedings of the 2nd International Conference on Spoken Language Processing, ICSLP, 1, 563-566, Banff.
Steeneken (1982): H. Steeneken (1982). Ontwikkeling en toetsing van een Nederlandstalige Diagnostische Rijmtest voor het testen van spraakcommunicatiekanalen. Rapport IZF 1982-13, IZF, Soesterberg.
Steeneken (1987): H. Steeneken (1987). Diagnostic information from subjective and objective intelligibility tests. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Dallas.
Steeneken (1989): H. Steeneken (1989). Objective and diagnostic assessment of (isolated) word recognizers. : Proceedings of the European Speech Conference ESCA, Paris.
Steeneken (1991): H. Steeneken (1991). RAMOS - Recognizer Assessment by means of Manipulation Of Speech applied. : Proceedings of the European Speech Conference ESCA, Genova.
Steinbiss et al. (1994): V. Steinbiss, B.-H. Tran & H. Ney (1994). Improvements in beam search. : Proceedings of the International Conference on Spoken Language Processing, ICSLP, 2143-2146, Yokohama, Japan, September.
Stevens et al. (1968): K. Stevens, C. Williams, J. Carbonell & B. Woods (1968). Speaker authentication and identification: A comparison of spectrographic and auditory presentations of speech material. JASA 44: 1596-1607.
Stubbs (1984): M. Stubbs (1984). Discourse analysis. The sociolinguistic analysis of natural language. Blackwell, Oxford.
Sundheim (1991): B. Sundheim (1991). Third message understanding evaluation and conference (MUC-3): Phase 1 status report. : Proceedings of the DARPA Workshop on Speech and Natural Language, 301-305, Pacific Grove, CA, February.
Syrdal & Sciacca (1994): A. Syrdal & B. Sciacca (1994). Testing the intelligibility of text-to-speech output with the Diagnostic Pairs Sentence Intelligibility Evaluation. ITD-94-23828A, Technical Memorandum. Submitted to the Journal of the Acoustical Society of America, JASA, AT&T Bell Laboratories.
't Hart et al. (1990): J. 't Hart, R. Collier & A. Cohen (1990). A perceptual study of intonation. Cambridge University Press, Cambridge.
Terken (1985): J. Terken (1985). Use and function of accentuation. Some experiments. Doctoral dissertation, Leiden University, Leiden.
Terken (1993): J. Terken (1993). Human and synthetic intonation: A case study. : V. Van Heuven & L. Pols, , Analysis and synthesis of speech, strategic research towards high-quality text-to-speech generation, 241-259. Mouton de Gruyter, Berlin.
Terken & Collier (1989): J. Terken & R. Collier (1989). Automatic synthesis of natural-sounding intonation for text-to-speech conversion in Dutch. : Proceedings of the Eurospeech '89, 1, 357-359, Paris.
Thielen (1992): M. Thielen (1992). Male and female speech. Ph.D. Thesis, University of Amsterdam, Amsterdam.
Thorsen (1980): N. Thorsen (1980). A study of the perception of sentence intonation - Evidence from Danish. Journal of the Acoustical Society of America, JASA 67: 1014-1030.
Thurmair (1986): G. Thurmair (1986). Linguistische Analyse im Projekt SPICOS. Kleinheubacher Berichte 29.
Tomlinson (1990): M. Tomlinson (1990). Guide to database generation - recording protocol. : ESPRIT Project 2589 (SAM), , Multilingual speech input/output assessment, methodology and standardisation. University College London, London. Interim Report Year I, Reference SAM-UCL-G002, Document SAM-RSRE-012.
Tosi et al. (1972): O. Tosi, H. Oyer, W. Asbrook, W. Pedrey, C. Nicol & E. Nash (1972). Experiment of voice identification. JASA 51: 2030-2043.
Tubach & Doignon (1991): J. Tubach & P. Doignon (1991). A system for natural spoken language queries: Design, implementation and assessment. : Proceedings of the 2nd European Conference on Speech Communication and Technology, 1473-1476, Genova, September.
Tubach & Bok (1985): J.-P. Tubach & L.-J. Bok (1985). ZUT - Petit dictionnaire français. Institut de Phonitique de Grenoble, avec le concours du CNRS (GRECO Comm. Parlie), Grenoble.
Turing (1950): A. Turing (1950). Computing machinery and intelligence. Mind 59: 433-460.
Valtech et al. (1994): V. Valtech, J. Odell, P. Woodland & S. Young (1994). A dynamic network decoder design for large vocabulary speech recognition. : Proceedings of the International Conference on Spoken Language Processing, ICSLP, 1351-1354, Yokohama, Japan, September.
Van Bezooijen (1986): R. Van Bezooijen (1986). Lay ratings of long-term voice-and-speech characteristics. : F. Beukema & A. Hulk, , Linguistics in the Netherlands 1986, 1-7. Foris, Dordrecht.
Van Bezooijen (1988): R. Van Bezooijen (1988). Evaluation of two synthesis systems for Dutch - Development and applications of intelligibility tests. SPIN-ASSP Report No. 5, Stichting Spraaktechnologie, Utrecht.
Van Bezooijen (1989): R. Van Bezooijen (1989). Evaluation of the suitability of Dutch text-to-speech conversion for application in a digital daily newspaper. : Proceedings of the ESCA Workshop Speech I/O Assessment and Speech Databases, 6.3.1-6.3.4, Noordwijkerhout.
Van Bezooijen & Jongenburger (1993): R. Van Bezooijen & W. Jongenburger (1993). Evaluation of an electronic newspaper for the blind in the Netherlands - intelligibility, acceptability, adequacy, and users'attitudes. : Proceedings of the ESCA Workshop on Speech and Language Technology for Disabled Persons, 195-198, Stockholm.
Van Bezooijen & Pols (1987): R. Van Bezooijen & L. Pols (1987). Evaluation of two synthesis-by-rule systems for Dutch. : Proceedings of the European Conference on Speech Technology, 1, 179-183.
Van Bezooijen & Pols (1989): R. Van Bezooijen & L. Pols (1989). Evaluation of a sentence accentuation algorithm for a Dutch text-to-speech system. : Proceedings of the Eurospeech '89, 1, 218-221, Paris.
Van Bezooijen & Pols (1990): R. Van Bezooijen & L. Pols (1990). Evaluating text-to-speech systems: Some methodological aspects. Speech Communication 9: 263-270.
Van Bezooijen & Pols (1993): R. Van Bezooijen & L. Pols (1993). Evaluation of text-to-speech conversion for Dutch. : V. Van Heuven & L. Pols, , Analysis and synthesis of speech: Strategic research towards high-quality text-to-speech conversion, 339-360. Mouton de Gruyter, Berlin.
Van Bezooijen & Van Hout (1985): R. Van Bezooijen & R. Van Hout (1985). Accentedness ratings and phonological variables as measures of variation in pronunciation. Language and Speech 28: 129-142.
Van Coile (1989): B. Van Coile (1989). The DEPES development system for text-to-speech synthesis. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 250-253.
Van Compernolle et al. (1991): D. Van Compernolle, J. Smolders, P. Jaspers & T. Hellemans (1991). Speaker clustering for dialectic robustness in speaker independent recognition. : Proceedings of Eurospeech '91, 2, 723-726, Genova.
Van Dommelen (1993): W. Van Dommelen (1993). Speaker height and weight identification: A re-evaluation of some old dates. Journal of Phonetics 21: 337-341.
Van Hemert et al. (1987): J. Van Hemert, U. Adriaens-Porzig & L. Adriaens (1987). Speech synthesis in the SPICOS-project. : H. Tillmann & G. Willée, , Analyse und Synthese gesprochener Sprache: Vorträge im Rahmen der Jahrestagung 1987 der Gesellschaft für Linguistische Datenverarbeitung e.V., Bonn, 4-6 March, 34-39. Olms, Hildesheim.
Van Heuven & Scharpff (1991): V. Van Heuven & P. Scharpff (1991). Acceptability of several speech pausing strategies in low quality speech synthesis; interaction with intelligibility. : Proceedings of the 12th International Congress of Phonetic Sciences, 458-461, Aix-en-Provence.
Van Holsteijn (1993): Y. Van Holsteijn (1993). TextScan: A preprocessing module for automatic text-to-speech conversion. : V. Van Heuven & L. Pols, , Analysis and synthesis of speech, strategic research towards high-quality text-to-speech generation, 27-41. Mouton de Gruyter, Berlin.
Van Hout (1989): R. Van Hout (1989). De structuur van taalvariatie, een sociolinguistisch onderzoek naar het stadsdialect van nijmegen. Doctoral dissertation, University of Nijmegen, Nijmegen.
Van Santen (1992): J. Van Santen (1992). Diagnostic perceptual experiments for text-to-speech system evaluation. : Proceedings of the International Conference on Spoken Language Processing, ICSLP, 1, 555-558.
Van Santen (1993): J. Van Santen (1993). Perceptual experiments for diagnostic testing of text-to-speech systems. Computer Speech and Language 7: 49-100.
Van Santen (1994): J. Van Santen (1994). Using statistics in text-to-speech system construction. : Proceedings of the ESCA/IEEE Workshop on Speech Synthesis, 240-243, Mohonk NY.
Van Son et al. (1988): N. Van Son, L. Pols, S. Sandri & P. Salza (1988). First quality evaluation of a diphone-based speech synthesis system for Italian. : Proceedings of the 7th FASE/Speech '88 Symposium, 2, 429-436, Edinburgh.
Vergeynst et al. (1993): N. Vergeynst, K. Edwards, J. Foster & M. Jack (1993). Spoken dialogues for human-computer interaction over the telephone: Complexity measures. : Proceedings of the 3rd European Conference on Speech Communication and Technology, 1415-1418, Berlin, September.
Vintsyuk (1971): T. Vintsyuk (1971). Elementwise recognition of continuous speech composed of words from a specified dictionary. Cybernetics, March-April, 7: 133-143.
Voiers (1977): W. Voiers (1977). Diagnostic evaluation of speech intelligibility. Speech intelligibility and speaker recognition 2: 374-384. Benchmark papers in acoustics, M.E. Hawley (ed.).
Voiers (1983): W. Voiers (1983). Evaluating processed speech using the Diagnostic Rhyme Test. Speech Technology 1: 338-352.
Voiers et al. (1975): W. Voiers, A. Sharpley & C. Hehmsoth (1975). Research on diagnostic evaluation of speech intelligibility. Research Report AFCRL-72-0694, Air Force Cambridge Research Laboratories, Bedford, Massachusetts.
Vroomen et al. (1993): J. Vroomen, R. Collier & S. Mozziconacci (1993). Duration and intonation in emotional speech. : Proceedings of the Eurospeech '93, 1, 577-580, Berlin.
Wahlster (1993): W. Wahlster (1993). VERBMOBIL, translation of face-to-face dialogs. : Proceedings of the Eurospeech '93, opening and plenary sessions, 29-38, Berlin.
Waibel (1988): A. Waibel (1988). Prosody and speech recognition. Research notes in artificial intelligence, Pitman Publishing, London.
Waibel et al. (1991): A. Waibel, A. Jain, A. McNair, H. Saito, A. Hauptmann & J. Tebelskis (1991). A speech-to-speech translation system using connectionist and symbolic processing strategies. : Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP-91, 793-796.
Waibel & Lee (1990): A. Waibel & K.-F. Lee, (1990). Readings in speech recognition. Morgan Kaufmann Publishers, San Mateo, California.
Wall & Schwartz (1991): L. Wall & R. Schwartz (1991). Programming perl. O'Reilly & Associates Inc., Sebastopol, CA.
Webers (1985): J. Webers (1985). Tonstudiotechnik. Franzis, Munich, Germany.
Wells (1987): J. Wells (1987). Computer-coded phonetic transcription. Journal of the International Phonetic Association 17(2): 94-114.
Wells (1989): J. Wells (1989). Computer-coded phonemic notation of individual languages of the European Community. Journal of the International Phonetic Association 19(1): 31-54.
Wells (1993a): J. Wells (1993a). Applying SAM-PA to Spanish, Portuguese, and Greek: A preliminary discussion document. : ESPRIT Project 6819 (SAM-A), , Speech technology assessment in multilingual applications. London. Document No: SAM-A/D1-Appendix B, SAM-A periodic progress report, Year 1, 1 April 1993-30 September 1993.
Wells (1993b): J. Wells (1993b). An update on SAMPA. : ESPRIT Project 6819 (SAM-A), , Speech technology assessment in multilingual applications, 1-6. London. Document No: SAM-A/D1-Appendix A, SAM-A periodic progress report, Year 1, 1 April 1993-30 September 1993.
Whittaker & Stenton (1989): S. Whittaker & P. Stenton (1989). User studies and the design of natural language systems. : Proceedings of the 4th conference of the European Chapter of the Association for Computational Linguistics, 116-123, Manchester.
Willems et al. (1988): N. Willems, R. Collier & J. 't Hart (1988). Synthesis scheme for British English intonation. Journal of the Acoustical Society of America, JASA 84: 1250-1261.
Winer (1971): B. Winer (1971). Statistical principles in experimental design. McGraw-Hill, New York, .
Winski & Fourcin (1994): R. Winski & A. Fourcin (1994). A common European approach to assessment, corpora and standards. : K. Varghese, S. Pfleger & J. Lefevre, , Advanced speech applications. European research on speech technology (Research Reports ESPRIT Volume 1). Springer-Verlag, Berlin.
Winski et al. (1995): R. Winski, R. Moore & D. Gibbon (1995). EAGLES spoken language working group: Overview and results. : Proceedings of the 4th European Conference on Speech Communication and Technology - Eurospeech'95, 841-844, Madrid, September 1995.
Witten (1982): I. Witten (1982). Principles of computer speech. Academic Press, New York, N.Y.
Woodland et al. (1995): P. Woodland, C. Leggetter, J. Odell, V. Valtech & S. Young (1995). The 1994 HTK large vocabulary speech recognition system. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, I, 73-76, Detroit, MI, May.
Woods & Zue (1976): W. Woods & V. Zue (1976). Dictionary expansion via phonological rules for a speech understanding system. : Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 561-564, Philadelphia.
Wooffitt & Fraser (1992): R. Wooffitt & N. Fraser (1992). We're off to ring the Wizard, the wonderful Wizard of Oz. : G. Button, , Technology in Working Order: Studies of work, interaction and technology, 211-230. Routeledge, London.
Woszczyna et al. (1993): M. Woszczyna, N. Coccaro, A. Eisele, A. Lavie, A. McNair, T. Polzin, I. Rogina, C. Rose, T. Sloboda, M. Tomita, J. Tsutsumi, N. Waibel & W. Ward (1993). Recent advances in Janus: A speech translation system. : Proceedings of a Workshop: Human Language Technology, 211-216, 21-24 March, Princeton, NJ.
Wright et al. (1993): J. Wright, G. Jones & H. Lloyd-Thomas (1993). A consolidated language model for speech recognition. : Proceedings of the European Conference on Speech Communication and Technology, 977-980, Berlin, September.
Yamron (1994): J. Yamron (1994). A generalization of n-grams. : Proceedings of the DARPA Workshop on Robust Speech Recognition, Rutgers University, Piscataway, NJ, July-August.
Yarrington & Foulds (1993): D. Yarrington & R. Foulds (1993). Personalizing synthesized voices. : Proceedings of the ESCA Workshop on Speech and Language Technologies for Disabled Persons, 169-172, Stockholm.
Young et al. (1989): S. Young, A. Hauptmann, W. Ward, E. Smith & P. Werner (1989). High level knowledge sources in usable speech recognition systems. Communications of the ACM 32(2): 183-194. Also in: A. Waibel and K.-F. Lee, eds., (1990), Readings in speech recognition, Morgan Kaufmann Publishers, San Mateo, California, 538-549.
Zue et al. (1991): V. Zue, J. Glass, D. Goodine, L. Hirschman, H. Leung, M. Phillips, J. Polifroni & S. Seneff (1991). The MIT ATIS system: Preliminary development, spontaneous speech data collection, and performance evaluation. : Proceedings of the 2nd European Conference on Speech Communication and Technology, 537-540, Genova, September.

EAGLES SWLG SoftEdition, May 1997. Get the book...