Next: Spoken language reference materials
Up: EAGLES SLWG Handbook
Previous: Bibliographical references
References
- Abercrombie (1967)
-
D. Abercrombie (1967).
Elements of general phonetics.
Edinburgh University Press, Edinburgh.
- Aho et al. (1987)
-
A. Aho, B. Kernighan & P. Weinberger (1987).
The AWK programming language.
Addison-Wesley Publishing Company, Reading, Mass., etc.
- Ainsworth (1988)
-
W. Ainsworth (1988).
Speech recognition by machine.
Peter Peregrinus.
- Aitchison (1994)
-
J. Aitchison (1994).
Words in the mind. An introduction to the mental lexicon.
Blackwell, Oxford.
- Aitkin et al. (1989)
-
M. Aitkin, D. Anderson, B. Francis & J. Hinde (1989).
Statistical modelling in GLIM.
Clarendon Press, Oxford.
- Akers & Lennig (1985)
-
G. Akers & M. Lennig (1985).
Intonation in text-to-speech synthesis: Evaluation of algorithms.
Journal of the Acoustical Society of America, JASA 77:
2157-2165.
- Akmajian (1984)
-
A. Akmajian (1984).
Linguistics: An introduction to language and communication.
The MIT Press, Cambridge, Massachusetts, .
- Allen (1988)
-
G. Allen (1988).
The PHONASCII system.
Journal of the International Phonetic Association 18(1):
9-25.
- Allen et al. (1987)
-
J. Allen, M. Hunnicutt & D. Klatt (1987).
From text to speech: The MITalk system.
Cambridge University Press, Cambridge.
- Allerhand (1987)
-
M. Allerhand (1987).
Knowledge-based speech pattern recognition.
Kogan Page, London.
- Alleva et al. (1992)
- F. Alleva, H. Hon, X. Huang, M. Hwang, R. Rosenfeld & R. Weide (1992).
Applying SPHINX-II to the DARPA Wall Street Journal CSR
task.
: Speech and Natural Language workshop,
393-398, Harriman, New York.
- Alleva et al. (1993)
-
F. Alleva, X. Huang & M.-Y. Hwang (1993).
An improved search algorithm using incremental knowledge for
continuous speech recognition.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
II, 307-311, Minneapolis, MN, April.
- Althoff et al. (1996)
-
F. Althoff, G. Drexel, H. Lüngen, M. Pampel & C. Schillo (1996).
The treatment of compounds in a morphological component for speech
recognition.
: D. Gibbon, , Natural language
processing and speech technology. Results of the 3rd KONVENS Conference,
Bielefeld, October 1996, 71-76. Mouton de Gruyter, Berlin, New
York.
- Andernach et al. (1993)
-
T. Andernach, G. Deville & L. Mortier (1993).
The design of a real world Wizard of Oz experiment for a speech
driven telephone directory information system.
: Proceedings of the 3rd European Conference
on Speech Communication and Technology, 1165-1168, Berlin, September.
- Andry et al. (1990)
-
F. Andry, E. Bilange, F. Charpentier, K. Choukri, M. Ponamali & S. Soudoplatoff (1990).
Computerised simulation tools for the design of an oral dialogue
system.
: Proceedings of the ESPRIT Technical
Conference, Brussels, November.
- Andry et al. (1992)
-
F. Andry, S. McGlashan, N. Youd, N. Fraser & S. Thornton (1992).
Making DATR work for speech: Lexicon compilation in SUNDIAL.
Computational Linguistics 18(3): 245-267.
- Argente (1991)
-
J. Argente (1991).
From speech to speaking styles.
: Proceedings of the ESCA Workshop `Phonetics and phonology of speaking styles: Reduction and elaboration in speech communication', 1-1, 1-12, Barcelona.
- Atal (1976)
-
B. Atal (1976).
Automatic recognition of speakers from their voices.
Proceedings of the IEEE, April, 64(4): 460.
- Atal et al. (1991)
-
B. Atal, J. Miller & R. Kent, (1991).
Papers in speech communication: Speech processing.
Acoustical Society of America.
- Aubergé (1992)
-
V. Aubergé (1992).
Developing a structured lexicon for synthesis of prosody.
: G. Bailly, C. Benoît & T. Sawallis,
, Talking machines: Theories, models and designs,
307-321. North-Holland, Amsterdam.
- Austin (1962)
-
J. Austin (1962).
How to do things with words.
Oxford University Press, Oxford.
- Autesserre et al. (1989)
-
D. Autesserre, G. Pérennou & M. Rossi (1989).
Methodology for the transcription and labeling of a speech corpus.
Journal of the International Phonetic Association 19(1):
2-15.
- Averbuch et al. (1987)
-
A. Averbuch, L. Bahl & R. Bakis (1987).
Experiments with the TANGORA 20000 word speech recognizer.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
701-704.
- Averbuch et al. (1986)
-
A. Averbuch, L. Bahl, R. Bakis, P. Brown, A. Cole, G. Daggett, S. Das, K. Davies, S. De Gennaro, P. De Souza, E. Epstein, D. Fraleigh, F. Jelinek, S. Katz, B. Lewis, R. Mercer, A. Nadas, D. Nahamoo, M. Picheny, G. Shichman & P. Spinelli (1986).
An IBM PC-based large-vocabulary isolated-utterance speech
recognizer.
: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 53-56.
- Baayen (1991)
-
H. Baayen (1991).
De CELEX lexicale databank.
Forum der Letteren 32(3): 221-231.
- Bahl et al. (1989)
-
L. Bahl, P. Brown, P. De Souza & R. Mercer (1989).
A tree-based statistical language model for natural language speech
recognition.
IEEE Transactions on Acoustics, Speech and Signal Processing,
ASSP-37(7) 1001-1008.
Also in: A. Waibel, K.-F. Lee, eds. (1990), Readings in speech
recognition, Morgan Kaufmann Publishers, San Mateo, California, 507-514.
- Bahl et al. (1983)
-
L. Bahl, F. Jelinek & R. Mercer (1983).
A maximum likelihood approach to continuous speech recognition.
IEEE Transactions on Pattern Analysis and Machine Intelligence,
March, 5: 179-190.
- Bahl et al. (1984)
-
L. Bahl, F. Jelinek, R. Mercer & A. Nadas (1984).
Next word statistical predictor.
IBM Tech. Disclosure Bulletin, December, 27(7A): 3941-3942.
- Bailleul (1987)
-
C. Bailleul (1987).
Evaluation des performances d'un système de reconnaissance vocale
dans des tâches de contrôle airiens.
Note Interne, CENA/N87083, 22 June.
- Bailly (1994)
-
G. Bailly (1994).
Rule compilers and text-to-speech systems.
Les Cahiers de l'ICP 3: 87-91.
- Bailly & Benoît (1992)
-
G. Bailly & C. Benoît, (1992).
Talking machines: Theories, models and designs.
North-Holland, Elsevier Science Publishers, Amsterdam.
- Baker (1975a)
-
J. Baker (1975a).
The DRAGON system - An overview.
IEEE Transactions on Acoustics, Speech and Signal Processing,
ASSP-23 24-29.
- Baker (1975b)
-
J. Baker (1975b).
Stochastic modeling for automatic speech understanding.
: D. Reddy, , Speech recognition,
521-541. Academic Press, New York, N.Y.
Also in: A. Waibel, K.-F. Lee, eds. (1990), Readings in speech
recognition, Morgan Kaufmann Publishers, San Mateo, California, 297-307.
- Baker (1989)
-
J. Baker (1989).
Dragondictate-30k: Natural language speech recognition with 30000
words.
: Proceedings of the European Conference on
Speech Technology, 2, 161-163.
- Baker et al. (1992)
-
J. Baker, P. Bamberg, K. Bishop, L. Gillick, V. Helman, Z. Huang, Y. Ito, S. Lowe, B. Peskin, R. Roth & F. Scattone (1992).
Large vocabulary recognition of Wall Street Journal sentences
at Dragon systems.
: Speech and Natural Language Workshop,
387-392, Harriman, New York, 23-26 February.
- Ball (1991)
-
M. Ball (1991).
Computer coding of the IPA: Extensions to the IPA.
Journal of the International Phonetic Association 21(1):
36-41.
- Ballou (1987)
-
G. Ballou, (1987).
Handbook for sound engineers.
W. Sams & Co., Indianapolis, U.S.A.
- Barber et al. (1989)
-
S. Barber, R. Carlson, P. Cosi, M. Di Benedetto, B. Granström & K. Vagges (1989).
A rule-based Italian text-to-speech system.
: Proceedings of the Eurospeech '89,
2, 517-520, Paris.
- Barry & Fourcin (1990)
-
W. Barry & A. Fourcin (1990).
Speaker selection criteria.
: ESPRIT Project 2589 (SAM), ,
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
Interim Report Year I, Reference SAM-UCL-G002, Document
SAM-UCL-001.
- Barry & Fourcin (1992)
-
W. Barry & A. Fourcin (1992).
Levels of labelling.
Computer Speech and Language 6: 1-14.
- Barry et al. (1989)
-
W. Barry, M. Grice, V. Hazan & A. Fourcin (1989).
Excitation distributions for synthesised speech.
: Proceedings of the Eurospeech '89,
1, 353-356, Paris.
- Bartlett (1987)
-
B. Bartlett (1987).
Choosing the right microphones by understanding design tradeoffs.
J. Audio. Eng. Soc. 35.
- Bates & Ayuso (1991)
-
M. Bates & D. Ayuso (1991).
A proposal for incremental dialogue evaluation.
: Proceedings of the DARPA Workshop on
Speech and Natural Language, 319-322, Pacific Grove, CA, February.
- Bates et al. (1990)
-
M. Bates, S. Boisen & J. Makhoul (1990).
Developing an evaluation methodology for spoken language systems.
: Proceedings of the DARPA Workshop on
Speech and Natural Language, 102-108, Hidden Valley, PA, June.
- Baum (1900)
-
F. Baum (1900).
The Wizard of Oz.
Collins, London.
Edition of 1974.
- Baum (1972)
-
L. Baum (1972).
An inequality and associated maximization technique in statistical
estimation of a Markov process.
Inequalities 3(1): 1-8.
- Beckman (1986)
-
M. Beckman (1986).
Stress and non-stress accent.
Foris, Dordrecht.
- Belina & Hogrefe (1988)
-
F. Belina & D. Hogrefe (1988).
The CCITT specification and design language SDL.
Computer networks and ISDN systems 16: 311-341.
- Bell et al. (1990)
-
T. Bell, J. Cleary & I. Witten (1990).
Text compression.
Prentice Hall, Englewood Cliffs, NJ.
- Benoît (1989)
-
C. Benoît (1989).
Intelligibility test for the assessment of French synthesizers
using semantically unpredictable sentences.
: Proceedings of the ESCA Workshop on Speech
Input/Output Assessment and Speech Databases, 1.7.1-1.7.4.
- Benoît (1991)
-
C. Benoît (1991).
On the assessment of audio-visual speech synthesis.
: Proceedings of the Workshop on
International Cooperation and Standardisations of Speech Databases and Speech
I/O Assessment Methods, Chiavari, Italy.
- Benoît et al. (1992)
-
C. Benoît, T. Lallouache, T. Mohamadi & C. Abry (1992).
A set of French visemes for visual speech synthesis.
: G. Bailly & C. Benoît, ,
Talking machines: Theories, models, and design, 485-504.
North Holland, Elsevier Science Publishers, Amsterdam.
- Benoît et al. (1989)
-
C. Benoît, A. Van Erp, M. Grice, V. Hazan & U. Jekosch (1989).
Multilingual synthesizer assessment using semantically unpredictable
sentences.
: Proceedings of the Eurospeech '89,
2, 633-636, Paris.
- Bentler (1985)
-
P. Bentler (1985).
Theory and implementation of EQS, a structural equations
program.
BMDP Statistical Software Inc., Los Angeles.
- Berendsen et al. (1986)
-
E. Berendsen, S. Langeweg & H. Van Leeuwen (1986).
Computational phonology: Merged not mixed.
: Proceedings of the International
Conference on Computational Linguistics '86, 612-614.
- Berger et al. (1994)
-
A. Berger, P. Brown, J. Cocke, S. Della Pietra, V. Della Pietra,J. Gillett, J. Lafferty, R. Mercer, H. Printz & L. Ures (1994).
The Candide system for machine translation.
: Proceedings of the ARPA Human Language
Technology Workshop, 152-157, Plainsboro, NJ, March.
- Berkley & Flanagan (1990)
-
D. Berkley & J. Flanagan (1990).
Integration of speech recognition, text-to-speech synthesis, and
talker verification into a hands free audio/image teleconferencing system
(humanet).
ICSLP 20(1): 861-864.
- Bimbot et al. (1995)
-
F. Bimbot, I. Magrin-Chagnolleau & L. Mathan (1995).
Second-order statistical measures for text-independent speaker
identification.
Speech Communication 17.
1-2.
- Bimbot & Mathan (1993)
-
F. Bimbot & L. Mathan (1993).
Text-free speaker recognition using an arithmetic-harmonic sphericity
measure.
: Proceedings of the Eurospeech,
169-172.
- Black et al. (1991)
-
E. Black, S. Abney, D. Flickinger, C. Gdaniec, R. Grishman, P. Harrison, D. Hindle, R. Ingria, F. Jelinek, J. Klavans, M. Liberman, M. Marcus, R. Roukos, B. Santorini & T. Strazalkowski (1991).
A procedure for quantitatively comparing the syntactic coverage of
English grammars.
: Proceedings of the DARPA Workshop on
Speech and Natural Language, 306-311, Pacific Grove, CA, February.
- Bladon (1990)
-
A. Bladon (1990).
Evaluating the prosody of text-to-speech synthesizers.
: Proceedings of the Speech Tech '90,
215-220.
- Blauert (1983)
-
J. Blauert (1983).
Spatial hearing.
MIT Press, Cambridge.
- Bleiching (1992)
-
D. Bleiching (1992).
Prosodisches Wissen im Lexikon.
: G. Görz, , KONVENS 92, 1.
Konferenz ``Verarbeitung natürlicher Sprache'', Nürnberg, 7.-9.
Oktober 1992, 59-68. Springer-Verlag, Berlin.
- Bleiching et al. (1996)
-
D. Bleiching, G. Drexel & D. Gibbon (1996).
Ein synkretismusmodell für die deutsche morphologie.
: D. Gibbon, , Natural language
processing and speech technology. Results of the 3rd KONVENS Conference,
Bielefeld, October 1996, 237-248. Mouton de Gruyter, Berlin, New
York.
- Bleiching & Gibbon (1994)
-
D. Bleiching & D. Gibbon (1994).
Handbuch zur Demonstrator-Wortliste.
V1.1. May 1994, Bielefeld University, Bielefeld, Germany.
- Bloothooft et al. (1995)
-
G. Bloothooft, V. Hazan, D. Huber & J. Llisterri (1995).
European studies in phonetics and speech communication.
OTS Publications, Utrecht.
- Bobrow & Winograd (1977)
-
D. Bobrow & T. Winograd (1977).
An overview of KRL, a knowledge representation language.
Cognitive Science 1: 3-46.
- Boguraev et al. (1988)
-
B. Boguraev, J. Carroll, S. Pulman, G. Russell, G. Ritchie, A. Black, E. Briscoe & C. Grover (1988).
The lexical component of a natural language toolkit.
: D. Walker, A. Zampolli & N. Calzolari,
, Automating the lexicon: Research and practice in a
multilingual environment. Cambridge University Press, Cambridge.
- Bolinger (1972)
-
D. Bolinger (1972).
Accent is predictable (if you're a mind-reader).
Language 48: 633-644.
- Bolt (1970)
-
R. Bolt (1970).
Speaker identification by speech spectrograms: A scientists' view
of its reliability for legal purposes.
JASA 47(2): 597.
Part 2.
- Boogaart & Silverman (1992)
-
T. Boogaart & K. Silverman (1992).
Evaluating the overall comprehensibility of speech synthesizers.
: Proceedings of the 2nd International
Conference on Spoken Language Processing, ICSLP, 1207-1210, Banff.
- Boogart et al. (1993)
-
T. Boogart, P. Van Alphen & J. Doll (1993).
Application oriented assessment of dialogue systems.
: Joint ESCA - NATO/RSG10 Tutorial and
Research Workshop on Applications of Speech Technology, Lautrach, September.
- Boves (1984)
-
L. Boves (1984).
The phonetic basis of perceptual ratings of running speech.
Foris, Dordrecht.
- Brachman & Levesque (1985)
-
R. Brachman & H. Levesque (1985).
Readings in knowledge representation.
Morgan Kaufmann Publishers, Inc., Los Altos, California.
- Breiman et al. (1984)
-
L. Breiman, J. Friedman, R. Ohlsen & C. Stone (1984).
Classification and regression trees.
Wadsworth, Belmont, CA.
- Bridle et al. (1982)
-
J. Bridle, M. Brown & R. Chamberlain (1982).
An algorithm for connected word recognition.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
899-902, Paris, May.
- Brietzmann et al. (1983)
-
A. Brietzmann, H. Hein, H. Niemann & P. Regel (1983).
The Erlangen system for understanding continuous German speech.
: IEEE International Conference on
Acoustics, Speech and Signal Processing, ICASSP, 304-307, Boston.
- Bristow (1984)
-
G. Bristow (1984).
Electronic speech synthesis.
Collins, London.
- Bristow (1986)
-
G. Bristow (1986).
Electronic speech recognition.
Collins, London.
- Brouwer & De Haan (1987)
-
D. Brouwer & D. De Haan, (1987).
Woman's language, socialization and self-image.
Foris Publications, Dordrecht.
- Browman (1980)
-
C. Browman (1980).
Rules for demisyllable synthesis using Lingua, a language
interpreter.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
561-564, Denver.
- Brown et al. (1992)
-
P. Brown, V. Della Pietra, P. De Souza & R. Mercer (1992).
Class-based n-gram models of natural language.
Computational Linguistics 18(4): 467-479.
- Bruce (1989)
-
G. Bruce (1989).
Report from the IPA Working Group on suprasegmental categories.
Working Papers 35, Lund University, Department of Linguistics,
Lund 25-40.
- Bunt et al. (1985)
-
H. Bunt, R.-J. Beun, F. Dols, J. von der Linden & G. thoe Schwartzenberg (1985).
The TENDUM dialogue system and its theoretical basis.
IPO Annual Progress Report 19: 105-113.
- Burrell (1991)
-
M. Burrell (1991).
Assessment of the degradations of synthetic speech and time frequency
warping over different listening levels.
: Proceedings of the Institute of
Acoustics, 13, Pt. 2.
- Button (1990)
-
G. Button (1990).
Going up a blind alley: Conflating conversation analysis and
computational modelling.
: P. Luff, G. Gilbert & D. Frohlich,
, Computers and conversation, 67-90. Academic
Press, London.
- Cahill (1993)
-
L. Cahill (1993).
Morphonology in the lexicon.
: Proceedings of the Sixth Conference of the
European Chapter of the Association for Computational Linguistics,
87-96, Utrecht.
- Cahill & Evans (1990)
-
L. Cahill & R. Evans (1990).
An application of DATR: The TIC lexicon.
: R. Evans & G. Gazdar, ,
The DATR Papers, 31-39. School of Cognitive and Computing
Science, University of Sussex, Brighton, .
- Campbell (1995)
-
J. Campbell (1995).
Testing with the YOHO CD-ROM Voice Verification Corpus.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
341-344.
- Carbonell & Pierrel (1986)
-
N. Carbonell & J. Pierrel (1986).
Architecture and knowledge sources in a human computer oral dialog
system.
: Proceedings of the NATO workshop:
Structure of multimodal dialogues including voice, Corsica, France.
- Carlson et al. (1979)
-
R. Carlson, B. Granström & D. Klatt (1979).
Some notes on the perception of temporal patterns in speech.
: Proceedings of the 9th International
Congress of Phonetics Sciences, 2, 260-267,
Copenhagen.
- Carroll & Chang (1970)
-
J. Carroll & J. Chang (1970).
Analysis of individual differences in multidimensional scaling via an
n-way generalization of the ``eckhard-young'' composition.
Psychometrika 35: 283-319.
- Carson-Berndsen (1993)
-
J. Carson-Berndsen (1993).
Time map phonology and the projection problem in spoken language
recognition.
Doctoral dissertation, University of Bielefeld, Bielefeld, Germany.
- Cartier et al. (1992)
-
M. Cartier, F. Emerald, D. Pascal, P. Combescure & A. Soubigou (1992).
Une méthode d'évaluation multicritère de sorties vocales:
Application au test de 4 systèmes de synthèse à partir du texte.
: 19èmes Journées d'Étude sur la
Parole, Brussels.
- CCITT (1988a)
-
CCITT (1988a).
Artificial voices.
Blue Book IXth Plenary Assembly V: 87-99.
Recommendation P.50.
- CCITT (1988b)
-
CCITT (1988b).
Objective measurement of active speech level.
Rec. P. 56 Melbourne, CCITT.
- Chafe (1992)
-
W. Chafe (1992).
The importance of corpus linguistics to understanding the nature of
language.
: J. Svartvik, , Directions in
corpus linguistics: Proceedings of the Nobel Symposium 82, New York,
79-97, Berlin. Mouton de Gruyter.
- Charniak & McDermott (1985)
-
E. Charniak & D. McDermott (1985).
Introduction to Artificial Intelligence.
Addison-Wesley, Reading, Massachusetts.
- Chollet & Gagnoulet (1981)
-
G. Chollet & C. Gagnoulet (1981).
On the evaluation of recognizers and databases using a reference
system.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP, Atlanta.
- Chomsky (1965)
-
N. Chomsky (1965).
Aspects of the theory of syntax.
The MIT Press, Cambridge, MA.
- Chomsky & Halle (1968)
-
N. Chomsky & M. Halle (1968).
The sound pattern of English.
Harper and Row, New York, Evanston, London.
- Choukri et al. (1988)
-
K. Choukri, G. Chollet & C. Montacié (1988).
Test workstation for the evaluation of speech recognition algorithms,
applications and databases.
: Proceedings of the 7th FASE Symposium
(Speech'88), 145-151, Edinburgh, August 1988.
- Church (1987a)
-
K. Church (1987a).
Phonological parsing and lexical retrieval.
Cognition 25: 53-69.
- Church (1987b)
-
K. Church (1987b).
Phonological parsing in speech recognition.
Kluwer Academic Publishers, Boston, Dordrecht, Lancaster.
- Coates (1986)
-
J. Coates (1986).
Women, men and language: A sociolinguistic account of sex
differences in language.
Longman, London.
- Cole (1995)
-
Cole (1995).
The challenge of spoken language systems: Research directions for
the nineties.
IEEE Transactions on Speech and Audio Processing 3: 1-20.
- Combescure (1981)
-
P. Combescure (1981).
20 listes de dix phrases phonétiquement équilibrées.
Revue d'Acoustique 56: 34-38.
- Content et al. (1990)
-
A. Content, P. Mousty & M. Radeau (1990).
Brulex, une base de données lexicales informatise pour le
français écrit et parlé.
L'Année Psychologique 90: 551-566.
- Cookson (1988)
-
S. Cookson (1988).
Final evaluation of VODIS voice operated database inquiry system.
: Proceedings of Speech-88, 7th FASE
Symposium, 1311-1320, Edinburgh, August.
- Cosi & Omologo (1991)
-
P. Cosi & M. Omologo (1991).
Caratterizzazione statistica della segmentazione manuale del segnale
vocale.
Associazione Italiana Acustica (AIA) Meeting. Napoli, Italy,
10-12 April. Cited in Barry and Fourcin 1992.
- Crowdy (1993)
-
S. Crowdy (1993).
Spoken corpus design and transcription.
Longman, Harlow.
- Cruse (1986)
-
D. Cruse (1986).
Lexical semantics.
CUP, Cambridge.
- Crystal (1980)
-
D. Crystal (1980).
Introduction to language pathology.
Edward Arnold Ltd., London.
- Crystal (1985)
-
D. Crystal (1985).
A dictionary of linguistics and phonetics.
Basil Blackwell, Oxford, UK.
- Cucchiarini (1993)
-
C. Cucchiarini (1993).
Phonetic transcription: A methodological and empirical study.
Doctoral thesis, University of Nijmegen, Nijmegen.
- Dahlbäck & Jönsson (1986)
-
N. Dahlbäck & A. Jönsson (1986).
A system for studying human-computer dialogues in natural language.
Research Report LiTH-IDA-R-86-42, Department of Computer and
Information Science, Linköping University, Linköping.
- Dahlbäck & Jönsson (1989)
-
N. Dahlbäck & A. Jönsson (1989).
Empirical studies of discourse representations for natural language
interfaces.
: Proceedings of the 4th Conference of the
European Chapter of the Association for Computational Linguistics,
291-298, Manchester.
- Dalsgaard & Baekgaard (1994)
-
P. Dalsgaard & A. Baekgaard (1994).
Spoken language dialogue systems.
: H. Niemann, R. De Mori & G. Hanrieder,
, Progress and prospects in speech and language technology,
178-191. Infix, Sankt Augustin.
- Damhuis et al. (1994)
-
M. Damhuis, T. Boogaart, C. in 't Veld, M. Versteijlen, W. Schelvis, L. Bos & L. Boves (1994).
Creation & analysis of the Dutch Polyphone Corpus.
: Proceedings of the International
Conference on Spoken Language Processing, ICSLP, 1803-1806,
Yokohama.
- Davis & Davis (1975)
-
D. Davis & C. Davis (1975).
Sound system engineering.
W. Sams & Co., Indianapolis, U.S.A.
- De Mori et al. (1984)
-
R. De Mori, M. Gilloux, G. Mercier, M. Simon, C. Tarrides & J. Vaissière (1984).
Integration of acoustic, phonetic, prosodic and lexical knowledge in
an expert system for speech understanding.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP.
42.9.1-42.9.4.
- De Pijper (1983)
-
J. De Pijper (1983).
Modelling British English intonation.
Foris, Dordrecht.
- Della Pietra et al. (1994)
-
S. Della Pietra, V. Della Pietra, J. Gillett, J. Lafferty, H. Printz & L. Ures (1994).
Inference and estimation of a long-range trigram model.
Second International Colloquium `Grammatical Inference and
Applications', Alicante, Spain, September 1994 78-92.
Springer-Verlag, Berlin.
- Delogu et al. (1993a)
-
C. Delogu, A. Di Carlo, C. Sementino & S. Stecconi (1993a).
A methodology for evaluating human-machine spoken language
interaction.
: Proceedings of the 3rd European Conference
on Speech Communication and Technology, 1427-1430, Berlin,
September.
- Delogu et al. (1991)
-
C. Delogu, A. Paoloni, P. Pocci & C. Sementina (1991).
Quality evaluations of text-to-speech synthesizers using magnitude
estimation, categorical estimation, pair comparison and reaction time methods.
: Proceedings of the Eurospeech '91,
353-355, Genova.
- Delogu et al. (1993b)
-
C. Delogu, A. Paoloni, P. Ridolfi & K. Vagges (1993b).
Intelligibility of Italian text-to-speech synthesizers over
ortophonic and telephonic channel.
: Proceedings of the Eurospeech '93,
3, 1893-1896, Berlin.
- Delogu et al. (1992a)
-
C. Delogu, A. Paoloni & C. Sementina (1992a).
Comprehension of natural and synthetic speech: Preliminary studies.
: ESPRIT Project 2589 (SAM), ,
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
Final report, Year three, 1.III.91-28.II.1992. SAM Internal Report
II.c.
- Delogu et al. (1992b)
-
C. Delogu, P. Paoloni, P. Pocci & C. Sementina (1992b).
A comparison among different methodologies for evaluating the quality
of text-to-speech synthesis systems.
: ESPRIT Project 2589 (SAM), ,
Multilingual speech input/output assessment, methodology an standardisation. University College London, London.
Final report, Year three, 1.III.91-28.II.1992. SAM Internal Report
II.d.
- Delomier et al. (1989)
-
D. Delomier, A. Meunier & M.-A. Morel (1989).
Linguistic features of human-machine oral interaction.
: Proceedings of the Eurospeech '89,
2, 236-239, Paris.
- Dempster et al. (1977)
-
A. Dempster, M. Laird & D. Rubin (1977).
Maximum likelihood from incomplete data via the EM algorithm.
J. Royal Statist. Soc. Ser. B (methodological) 39: 1-38.
- Den Os (1994)
-
E. Den Os (1994).
Transliteration of the Dutch Speech Styles Corpus.
: Proceedings of the Institute of Phonetic
Sciences, 18, 87-94, University of Amsterdam.
- Derouault & Merialdo (1986)
-
A. Derouault & B. Merialdo (1986).
Natural language modelling for phoneme-to-text transcription.
IEEE Transactions on Pattern Analysis and Machine Intelligence,
November, 8: 742-749.
- Diaper (1986)
-
D. Diaper (1986).
Identifying the knowledge requirements of an expert system's natural
language processing interface.
: M. Harrison & A. Monk, ,
People and Computers V: Proceedings of the 2nd Conference of the
British Computer Society Human-Computer Interaction Specialist Group,
Cambridge. Cambridge University Press.
- Diaper (1989)
-
D. Diaper (1989).
The Wizard's apprentice: A program to help analyse natural
language dialogues.
: A. Sutcliffe & L. Macaulay, ,
People and Computers: Designing for usability. Proceedings of the 2nd
Conference of the British Computer Society Human-Computer Interaction
Specialist Group, Cambridge. Cambridge University Press.
- Doddington (1985)
-
G. Doddington (1985).
Speaker recognition - Identifying people by their voices.
Proceedings of the IEEE, November, 73(11): 1651.
- Dolmazon et al. (1990)
-
J.-M. Dolmazon, J.-C. Caërou & W. Barry (1990).
Initial development of SAM standard workstation.
SAM-UCL-022, December, Appendix Se.10, University College London,
London.
- Dougherty (1990)
-
D. Dougherty (1990).
sed & awk.
O'Reilly & Associates Inc., Sebastopol, CA.
- Dreckschmidt (1987)
-
G. Dreckschmidt (1987).
The linguistic component in the speech understanding system SPICOS.
: H. Tillmann & G. Willée, ,
Analyse und Synthese gesprochener Sprache, Jahrestagung der
Gesellschaft für Linguistische Datenverarbeitung, Bonn,
96-101. Olms, Hildesheim.
- Drullman & Collier (1993)
-
R. Drullman & R. Collier (1993).
Speech synthesis with accented and unaccented diphones.
: V. Van Heuven & L. Pols, ,
Analysis and synthesis of speech, strategic research towards
high-quality text-to-speech generation, 147-156. Mouton de
Gruyter, Berlin.
- Duda & Hart (1973)
-
R. Duda & P. Hart (1973).
Pattern classification and scene analysis.
J. Wiley, New York.
- Duncan (1974)
-
S. Duncan (1974).
On signalling that it's your turn to speak.
Journal of Experimental Social Psychology 10: 234-247.
- Dybkjaer et al. (1993)
-
H. Dybkjaer, N. Bernsen & L. Dybkjaer (1993).
Wizard-of-Oz and the trade-off between naturalness and recognizer
constraints.
: Proceedings of the 3rd European Conference
on Speech Communication and Technology, 947-950, Berlin,
September.
- Eargle (1976)
-
J. Eargle (1976).
Sound recording.
Van Nostrand Reinhold Company, New York, USA.
- Edwards & Lampert (1993)
-
J. Edwards & M. Lampert, (1993).
Talking data: Transcription and coding in discourse
research.
Lawrence Erlbaum, Hillsdale.
- Efron & Tibshirani (1993)
-
B. Efron & R. Tibshirani (1993).
An introduction to the bootstrap.
Chapman & Hall, New York.
- Egan (1948)
-
J. Egan (1948).
Articulation testing methods.
Laryngoscope 58: 955-991.
- Ehrlich (1986)
-
U. Ehrlich (1986).
Ein Lexikon für das natürlich-sprachliche Dialogsystem
EVAR.
Arbeitsberichte des IMMD, vol. 19, University of
Erlangen-Nürnberg, Erlangen, Germany.
- Eisen (1993)
-
B. Eisen (1993).
Reliability of speech segmentation and labelling at different levels
of transcription.
: Proceedings of the Third European
Conference on Speech Communication and Technology, 1,
673-676, 21-23 September 1993, Berlin, Germany.
- Erman (1977)
-
L. Erman (1977).
A functional description of the HEARSAY-II speech
understanding system.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP, Hartford.
- Erman & Hayes-Roth (1981)
-
L. Erman & F. Hayes-Roth (1981).
The HEARSAY-II speech understanding system: Integrating
knowledge to resolve uncertainty.
: B. Webber & N. Nilsson, ,
Readings in Artificial Intelligence, 349-389. Tioga, Palo
Alto, CA.
- Erman & Lesser (1980)
-
L. Erman & V. Lesser (1980).
The HEARSAY-II speech understanding system: A tutorial.
: W. Lea, , Trends in speech
recognition, 361-381. Prentice Hall, Englewood Cliffs, NJ.
Also in: A. Waibel and K.-F. Lee, eds. (1990), Readings in speech
recognition, Morgan Kaufmann Publishers, San Mateo, California, 235-245.
- Esling (1988)
-
J. Esling (1988).
7.1 Computer coding of IPA symbols and 7.3 detailed phonetic
representation of computer data bases.
Journal of the International Phonetic Association 18(2):
99-106.
- Esling (1990)
-
J. Esling (1990).
Computer coding of the IPA: Supplementary report.
Journal of the International Phonetic Association 20(1):
22-26.
- Esling & Gaylord (1993)
-
J. Esling & H. Gaylord (1993).
Computer codes for phonetic symbols.
Journal of the International Phonetic Association 23(2):
83-97.
- Essen & Steinbiss (1992)
-
U. Essen & V. Steinbiss (1992).
Cooccurrence smoothing for stochastic language modelling.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
I, 161-164, San Francisco, CA, March.
- Evans & Gazdar (1989)
-
R. Evans & G. Gazdar (1989).
The DATR papers.
Research Report: May 1989, School of Cognitive and Computing
Science, University of Sussex, School of Cognitive and Computing Science,
University of Sussex, Brighton.
- Evans & Gazdar (1990)
-
R. Evans & G. Gazdar (1990).
The DATR papers.
Research Report: February 1990, School of Cognitive and Computing
Science, University of Sussex, School of Cognitive and Computing Science,
University of Sussex, Brighton.
- Federico (1989)
-
A. Federico (1989).
Comparison between automatic methods and human listeners in speaker
recognition tasks.
: Proceedings of the Eurospeech,
279-282.
- Fellbaum et al. (1994)
-
K. Fellbaum, H. Klaus & J. Sotscheck (1994).
Hörversuche zur Beurteilung der Sprachqualität von
Sprachsynthesesystemen für die deutsche Sprache.
: Fortschritte der Akustik,
Plenarvorträge und Fachbeiträge der 20. Deutschen Jahrestagung
für Akustik, 117-122, Dresden, DPG GmbH.
- Ferguson (1976)
-
G. Ferguson (1976).
Statistical analysis in psychology and education.
McGraw-Hill, Tokyo.
- Ferrané et al. (1992)
-
I. Ferrané, M. De Calmès, D. Cotto, J.-M. Pécatte & G. Pérennou (1992).
Statistiques lexicales sur le corpus de textes utilisés dans le
projet BREF: Questions de couverture lexicale.
: Proceedings Communication Homme-Machine,
Séminaire LEXIQUE, 217-226, 21-22 January 1992, IRIT-UPS,
Toulouse.
- Fillmore (1968)
-
C. Fillmore (1968).
The case for case.
: E. Bach & R. Harms, ,
Universals in linguistic theory, 1-88. Holt, Rinehart and
Winston, New York.
- Fissore et al. (1993)
-
L. Fissore, E. Giachin, P. Laface & P. Massafra (1993).
Using grammars in forward and backward search.
: Proceedings of the European Conference on
Speech Communication and Technology, 1525-1528, Berlin, September.
- Flanagan et al. (1991)
-
J. Flanagan, D. Berkley, G. Elko & M. Sondhi (1991).
Autodirective microphone systems.
Acoustica 73: 58-71.
- Fourcin (1993)
-
A. Fourcin (1993).
The SAM project.
Ellis Horwood, Chichester.
- Fourcin et al. (1989)
-
A. Fourcin, G. Harland, W. Barry & V. Hazan, (1989).
Speech input and output assessment. Multilingual methods and
standards.
Ellis Horwood Ltd., Chichester.
- Fraser (1991)
-
N. Fraser (1991).
Corpus-based evaluation of the SUNDIAL system.
: J. Neal & S. Walter, ,
Proceedings of the Natural Language Processing Systems Evaluation
Workshop, Rome. Rome Laboratory.
Technical Report RL-TR-91-362.
- Fraser & Gilbert (1991a)
-
N. Fraser & G. Gilbert (1991a).
Effects of system voice quality on user utterances in speech dialogue
systems.
: Proceedings of the Second European
Conference on Speech Communication and Technology, 57-60, Genova,
September.
- Fraser & Gilbert (1991b)
-
N. Fraser & G. Gilbert (1991b).
Simulating speech systems.
Computer Speech and Language 5: 81-99.
- Fraser et al. (1992)
-
N. Fraser, N. Gilbert & C. McDermid (1992).
The value of simulation data.
: Proceedings of the Workshop on Empirical
Models and Methodology for Natural Language Dialogue Systems, Trento, April.
- French (1991)
-
J. French (1991).
Updated notes for soundprint transcribers + one page sample text from
COBUILD corpus.
Working paper, NERC-WP4-47, October, J.P. French Associated,
York and COBUILD, Birmingham.
- French (1992)
-
J. French (1992).
Transcription proposals: Multi-level system.
Working paper, NERC-WP 4-50, October, University of
Birmigham, Birmingham.
- Fu (1982)
-
K. Fu (1982).
Syntactic pattern recognition and applications.
Prentice-Hall, Englewood Cliffs, NJ.
- Furui (1981)
-
S. Furui (1981).
Cepstral analysis technique for automatic speaker verification.
IEEE Transactions on Acoustics, Speech and Signal Processing
29(2).
- Furui (1994)
-
S. Furui (1994).
An overview of speaker verification technology.
: ESCA-ETRW Workshop, 1-10,
Martigny.
- Generet et al. (1995)
-
M. Generet, H. Ney & F. Wessel (1995).
Extensions of absolute discounting for language modelling.
: Proceedings of the Fourth European
Conference on Speech Communication and Technology, 1245-1248,
Madrid, September.
- Gerbino et al. (1993)
-
E. Gerbino, P. Baggia, A. Ciaramella & C. Rullent (1993).
Test and evaluation of a spoken dialogue system.
: Proceedings of the International
Conference on Acoustics, Speech and Signal Processing, ICASSP'93,
Minneapolis, April.
- Geutner (1995)
-
P. Geutner (1995).
Using morphology towards better large-vocabulary speech
recognition systems.
Interactive Systems Laboratories, University of Karlsruhe,
Karlsruhe, Germany.
- Gibbon (1991)
-
D. Gibbon (1991).
Lexical signs and lexicon structure: Phonology and prosody in the
ASL-lexicon.
Research Report ASL-MEMO-20-91/UBI, University of Bielefeld,
Bielefeld, Germany.
- Gibbon (1992a)
-
D. Gibbon (1992a).
ILEX: A linguistic approach to computational lexica.
: U. Klenk, , Computatio linguae.
Aufsätze zur algorithmischen und quantitativen Analyse der Sprache,
32-51. Franz Steiner Verlag, Stuttgart.
- Gibbon (1992b)
-
D. Gibbon (1992b).
Language and software, or: Fritzl's quest.
: C. Floyd, H. Züllighoven, R. Budde &
R. Keil-Slavik, , Software Development and Reality
Construction, 376-390. Springer Verlag, Berlin, Heidelberg, New
York.
- Gibbon (1993)
-
D. Gibbon (1993).
Generalized DATR for flexible lexical access: PROLOG
specification.
VERBMOBIL Report 2, October 1993, Bielefeld University,
Bielefeld, Germany.
- Gibbon (1995)
-
D. Gibbon (1995).
The VERBMOBIL lexicon: Bielefeld lexicon database V2.1.
VERBMOBIL Technisches Dokument 21, 31 January 1995, Bielefeld
University, Bielefeld, Germany.
- Gibbon & Ehrlich (1995)
-
D. Gibbon & U. Ehrlich (1995).
Spezifikationen für ein VERBMOBIL-Lexikondatenbankkonzept.
VERBMOBIL Memo 69, Bielefeld University & Daimler Benz AG,
Bielefeld, Ulm.
- Gilbert & Weismer (1974)
-
H. Gilbert & G. Weismer (1974).
The effect of smoking on the speaking fundamental frequency of adult
women.
Journal of Psycholinguistic Research 3: 225-231.
- Gish et al. (1986)
-
H. Gish, M. Kraner, W. Russel & J. Wolf (1986).
Methods and experiments for text-independent speaker recognition over
the telephone line.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP, 865.
17.2.1.
- Gish & Schmidt (1994)
-
H. Gish & M. Schmidt (1994).
Text-independent speaker identification.
: IEEE Signal Processing, 11,
18-32.
- Goldsmith (1990)
-
J. Goldsmith (1990).
Autosegmental and metrical phonology.
Indiana University Linguistics Club, Bloomington, Indiana.
- Goldstein (1995)
-
M. Goldstein (1995).
Classification of methods used for assessment of text-to-speech
systems according to the demands placed on the listener.
Speech Communication 16: 225-244.
- Goldstein et al. (1992)
-
M. Goldstein, B. Lindström & O. Till (1992).
Assessing global performance of speech synthesizers: Context effects when assessing naturalness of Swedish sentence-pairs generated by 4 systems using 3 different assessment procedures (free number magnitude estimation, 5- and 11-point category scales).
: ESPRIT Project 2589 (SAM), ,
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
SAM Internal Report II.a, Final report, Year three:
1.III.91-28.II.1992.
- Goldstein & Till (1992)
-
M. Goldstein & O. Till (1992).
Assessing segmental intelligibility of two rule-based synthesizers
and natural speech using the ESPRIT/SAMVCV test procedures (SOAP v3.0)
in Swedish and testing for differences between two correlated proportions.
: ESPRIT Project 2589 (SAM), ,
Multilingual speech input/output assessment, methodology and
standardisation. Univeristy College London, London.
SAM Internal Report II.b, Final report, Year three:
1.III.91-28.II.1992.
- Gong (1995)
-
Y. Gong (1995).
Speech recognition in noisy environments: A survey.
Speech Communication 16: 261-291.
- Gonzalez & Thomason (1978)
-
R. Gonzalez & M. Thomason (1978).
Syntactic pattern recognition: An introduction.
Addison-Wesley, Reading, MA.
- Good (1953)
-
I. Good (1953).
The population frequencies of species and the estimation of
population parameters.
Biometrika, December, 40: 237-264.
- Goodine et al. (1992)
-
D. Goodine, L. Hirschman, J. Polifroni, S. Seneff & V. Zue (1992).
Evaluating interactive spoken language systems.
: Proceedings of the International
Conference on Spoken Language Processing, ICSLP'92, 201-204,
Banff, October.
- Goorfin (1989)
-
L. Goorfin (1989).
Electronic dictionary pronounces over 83,000 words.
Speech Technology 4(4): 49-51.
- Gorin et al. (1991)
-
A. Gorin, S. Levinson, A. Gertner & E. Goldman (1991).
Adaptive acquisition of language.
Computer, Speech and Language, April, 5(2): 101-132.
- Gray & Kopp (1944)
-
C. Gray & G. Kopp (1944).
Voiceprint identification.
Bell Telephone Report, Bell Laboratories.
- Green (1986)
-
D. Green (1986).
Control, activation and resource: A framework and a model for the
control of speech in bilinguals.
Brain and Language 27: 210-223.
- Greenspan et al. (1985)
-
S. Greenspan, H. Nusbaum & D. Pisoni (1985).
Perception of speech generated by rule: Effects of training and
attentional limitations.
Research on Speech Perception Progress Report 11, pages 263-287,
Indiana University, Indianapolis.
- Grenier (1977)
-
Y. Grenier (1977).
Identification du locuteur et adaptation au locuteur d'un
système de reconnaissance phonémique.
Ph.D. Thesis.
- Grice (1975)
-
H. Grice (1975).
Logic and conversation.
: P. Cole & J. Morgan, ,
Syntax and semantics 3: Pragmatics, 41-58. Academic Press,
New York.
- Grice et al. (1991)
-
M. Grice, K. Vagges & D. Hirst (1991).
Assessment of intonation in text-to-speech synthesis systems - A
pilot test in English and Italian.
: Proceedings of the Eurospeech '91,
2, 879-882, Genova.
- Grice et al. (1992a)
-
M. Grice, K. Vagges & D. Hirst (1992a).
Prosodic form tests.
: ESPRIT Project 2589 (SAM), ,
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
Final report, Year three, 1.III.91-28.II.1992, Stage report So. 5,
Part One.
- Grice et al. (1992b)
-
M. Grice, K. Vagges & D. Hirst (1992b).
Prosodic function tests.
: ESPRIT Project 2589 (SAM), ,
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
Final report, Year three, 1.III.91-28.II.1992, Stage report So. 5,
Part Two.
- Grosz (1977)
-
B. Grosz (1977).
The representation and use of focus in dialogue understanding.
University of California.
- Guindon (1988)
-
R. Guindon (1988).
A multidisciplinary perspective on dialogue structure in user-advisor
dialogues.
: R. Guindon, , Cognitive Science
and its applications for human-computer interaction, 163-200.
- Guindon et al. (1987)
-
R. Guindon, K. Shuldberg & J. Connor (1987).
Grammatical and ungrammatical structures in user-advisor dialogues:
Evidence for sufficiency of restricted languages in natural language
interfaces to advisory systems.
: Proceedings of the 25th Annual Meeting of
the Association for Computational Linguistics, 41-44, Stanford.
- Guindon et al. (1986)
-
R. Guindon, P. Sladky, H. Brunner & J. Connor (1986).
The structure of user-adviser dialogues: Is there method in their
madness?
: Proceedings of the 24th Annual Meeting of
the Association for Computational Linguistics, 224-230.
- Guyomard & Siroux (1986a)
-
M. Guyomard & J. Siroux (1986a).
PALABRE Phase 1 experimental protocol.
.
CNET/TSS/RCP WP4 task 3, April.
- Guyomard & Siroux (1986b)
-
M. Guyomard & J. Siroux (1986b).
PALABRE Phase 2 experimental protocol.
.
CNET/TSS/RCP WP4 task 3, May.
- Guyomard & Siroux (1987)
-
M. Guyomard & J. Siroux (1987).
Experimentation in the specification of an oral dialogue.
: H. Niemann, M. Lang & G. Sagerer,
, Recent Advances in Speech Understanding and Dialog Systems.
NATO ASI Series. Series F: Computer and Systems Sciences, Vol. 46,
497-501. Springer-Verlag, Berlin, Heidelberg, New York, London, Paris,
Tokyo.
- Guyomard & Siroux (1988)
-
M. Guyomard & J. Siroux (1988).
Constitution incrementale d'un corpus de dialogues oraux cooperatifs.
Journal Acoustique 1.
- Haeb-Umbach & Ney (1994)
-
R. Haeb-Umbach & H. Ney (1994).
Improvements in time-synchronous beam search for 10000-word
continuous speech recognition.
IEEE Transactions on Speech and Audio Processing, April, 2:
353-356.
- Hansen et al. (1992)
-
J. Hansen, C. Pelaez, L. Solana & P. Vossen (1992).
Performance assessment and evaluation: Specification document.
SUNSTAR Report II.4.
- Hauptmann & Rudnicky (1988)
-
A. Hauptmann & A. Rudnicky (1988).
Talking to computers: An empirical investigation.
International Journal of Man-Machine Studies 28: 583-604.
- Hayes (1963)
-
W. Hayes (1963).
Statistics.
Holt, Rinehart and Winston, Inc., New York.
- Hazan & Grice (1989)
-
V. Hazan & M. Grice (1989).
The assessment of synthetic speech intelligibility using semantically
unpredictable sentences.
: Proceedings of the ESCA Workshop on Speech
Input/Output Assessment and Speech Databases, 1.6.1-1.6.4.
- Hazan & Shi (1993)
-
V. Hazan & B. Shi (1993).
Individual variability in the perception of synthetic speech.
: Proceedings of the Eurospeech '93,
3, 1849-1852, Berlin.
- Heemskerk & Van Heuven (1993)
-
J. Heemskerk & V. Van Heuven (1993).
MORPA, a morpheme lexicon based morphological parser.
: V. Van Heuven & L. Pols, ,
Analysis and synthesis of speech, strategic research towards
high-quality text-to-speech generation, 67-85. Mouton de Gruyter,
Berlin.
- Helfrich (1979)
-
H. Helfrich (1979).
Age markers in speech.
: K. Scherer & H. Giles, ,
Social markers in speech, 63-107. Cambridge University
Press, Cambridge.
- Hertz et al. (1985)
-
S. Hertz, J. Kadin & K. Karplus (1985).
The DELTA rule development system for speech synthesis from text.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
1589-1601.
- Hess (1983)
-
W. Hess (1983).
Pitch determination of speech signals.
Springer-Verlag, Heidelberg, F.R.G.
- Hess et al. (1995)
-
W. Hess, K. Kohler & H. Tillmann (1995).
The PhonDat/Verbmobil Speech Corpus.
: Proceedings of the Eurospeech 95, Madrid.
- Heyer et al. (1991)
-
G. Heyer, K. Waldhur & H. Khatchadourian (1991).
Motivation, goals and milestones of ESPRIT II MULTILEX.
: Génie Linguistique 91,
1, Versailles, France, 16-17 January.
- Hieronymus et al. (1990)
- J. Hieronymus, H. Alexander, C. Bennett, I. Cohen, D. Davies, J. Dalby, J. Laver, W. Barry, A. Fourcin & J. Wells (1990).
Proposed speech segmentation criteria for the SCRIBE project.
SCRIBE Project Report.
- Hirschman et al. (1990)
-
L. Hirschman, D. Dahl, D. McKay, L. Norton & M. Linebarger (1990).
Beyond class A: A proposal for automatic evaluation of discourse.
: Proceedings of the DARPA Workshop on
Speech and Natural Language, 109-112, Hidden Valley, PA, June.
- Hjelmquist et al. (1987)
-
E. Hjelmquist, B. Jansson & G. Torell (1987).
Psychological aspects on blind people's reading of radio-distributed
daily newspapers.
: B. Knave & P. Widebäck, ,
Work with display units 86, 187-201. North-Holland, Elsevier
Science Publishers, Amsterdam.
- Hockett (1958)
-
C. Hockett (1958).
A course in modern linguistics.
Macmillan, New York.
- Höge et al. (1985)
-
H. Höge, E. Marschall, O. Schmidbauer & R. Sommer (1985).
Worthypothesengenerierung im Projekt SPICOS.
: H. Niemann, , Mustererkennung 85,
7. DAGM-Symposium Erlangen, Informatik-Fachberichte, vol. 107,
175-179. Springer-Verlag, Berlin.
- Holmes (1988)
-
J. Holmes (1988).
Speech synthesis and recognition.
Van Nostrand Reinhold (UK) Co. Ltd., Wokingham.
- Homayounpour et al. (1993)
-
M. Homayounpour, J. Goldman, G. Chollet & J. Vaissiere (1993).
Performance comparison of machine and human speaker verification.
: Proceedings of the Eurospeech,
2295.
- House (1988)
-
A. House (1988).
The recognition of search by machine - A bibliography.
Academic Press Ltd., New York, N.Y.
- House et al. (1965)
-
A. House, C. Williams, M. Hecker & K. Kryter (1965).
Articulation testing methods: Consonantal differentiation with a
closed response set.
Journal of the Acoustical Society of America, JASA 37:
158-166.
- House et al. (1992)
-
J. House, Y. Shitara, M. Grice & P. Howard-Jones (1992).
Evaluation of prosody in dialogue synthesis.
Speech, Hearing and Language 6: 89-108.
- Houtgast & Verhave (1991)
-
T. Houtgast & J. Verhave (1991).
A physical approach to speech quality assessment: Correlation
patterns in the speech spectrogram.
: Proceedings of the Eurospeech '91,
1, 285-288, Genova.
- Houtgast & Verhave (1992)
-
T. Houtgast & J. Verhave (1992).
An objective approach to speech quality.
: ESPRIT Project 2589 (SAM), ,
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
Stage report So. 9, Final report, Year three: 1.III.91-28.II.1992.
- Howard-Jones (1992a)
-
P. Howard-Jones (1992a).
SOAP, Speech Output Assessment Package.
Version 4.0, ESPRIT SAM-UCL-042.
- Howard-Jones (1992b)
-
P. Howard-Jones (1992b).
Specification of listener dimensions.
: ESPRIT Project 2589 (SAM), ,
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
Stage report So. 8, Part One, Final report, Year three:
1.III.91-28.II.1992.
- Howell (1990)
-
P. Howell (1990).
Clear speech and turn-taking cues in telephone dialogue.
Report to BT, University College London, London.
- Hunt (1991)
-
A. Hunt (1991).
New commercial applications of telephone-network-based speech
recognition and speaker verification.
Proceedings of the Eurospeech 15(2): 431.
- Hunt (1990)
-
M. Hunt (1990).
Figures of merit for assessing connected-word recognizers.
Speech Communication 9: 329-336.
- IPDS (1995)
-
IPDS (1995).
CD-ROM#2: The Kiel corpus of spontaneous speech. vol. 1,
kiel.
- IPDS (1996)
-
IPDS (1996).
CD-ROM#3: The Kiel corpus of spontaneous speech. vol. 2,
kiel.
- ITU-T (1993)
-
ITU-T (1993).
Draft recommendation P.8S - Subjective performance assessment
of the quality of speech voice output devices.
Study group 12 - contribution 6, ITU-T.
- Jakobson et al. (1951)
-
R. Jakobson, G. Fant & M. Halle (1951).
Preliminaries to speech analysis.
The MIT Press, Cambridge.
- Jassem & obacz (1989)
-
W. Jassem & P. obacz (1989).
IPA phonemic transcription using an IBM PC and compatibles.
Journal of the International Phonetic Association 19(1):
16-23.
- Jekosch (1992)
-
U. Jekosch (1992).
The Cluster-Identification Test.
: ESPRIT Project 2589 (SAM), ,
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
Internal report II.e, Final report, Year three: 1.III.91-28.II.1992.
- Jekosch & Pols (1994)
-
U. Jekosch & L. Pols (1994).
A feature-profile for application-specific speech synthesis
assessment and devaluation.
: Proceedings of the 3rd International
Conference on Spoken Language Processing, ICSLP, Yokohama.
- Jelinek (1985)
-
F. Jelinek (1985).
A real-time, isolated-word, speech recognition system for dictation
transcription.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
858-861.
- Jelinek (1991)
-
F. Jelinek (1991).
Self-organized language modeling for speech recognition.
: A. Waibel & K.-F. Lee, ,
Readings in speech recognition, 450-506. Morgan Kaufmann
Publishers, San Mateo, CA.
- Jelinek et al. (1992)
-
F. Jelinek, J. Lafferty & R. Mercer (1992).
Basic methods of probabilistic context free grammars.
: P. Laface & R. De Mori, ,
Speech recognition and understanding, 347-360. Springer,
Berlin.
- Jelinek & Mercer (1980)
-
F. Jelinek & R. Mercer (1980).
Interpolated estimation of Markov source parameters from sparse
data.
: E. Gelsema & L. Kanal, ,
Pattern recognition in practice, 381-397. North-Holland
Publishing Company, Amsterdam.
- Jelinek et al. (1990)
-
F. Jelinek, R. Mercer & S. Roukos (1990).
Classifying words for improved statistical language models.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
621-624, Albuquerque, NM, April.
- Jelinek et al. (1991a)
-
F. Jelinek, R. Mercer & S. Roukos (1991a).
Principles of lexical language modeling for speech recognition.
: S. Furui & M. Sondhi, ,
Advances in Speech Signal Processing, 651-699. Marcel
Dekker, New York.
- Jelinek et al. (1991b)
-
F. Jelinek, B. Merialdo, S. Roukos & M. Strauss (1991b).
A dynamic language model for speech recognition.
: Proceedings of the DARPA Workshop `Speech
and Natural Language Workshop', 293-295, Pacific Grove, CA,
February.
- Johnston (1993)
-
R. Johnston (1993).
An on-going series of subjective experiments to assess speech output
from text-to-speech systems.
Unpublished report to CCITT Study Group, No. 12.
- Jongenburger & Van Bezooijen (1992)
-
W. Jongenburger & R. Van Bezooijen (1992).
Evaluatie van ELK: Attitudes van de gebruikers,
verstaanbaarheid en acceptabiliteit van de spraaksynthese, bruikbaarheid van
het zoeksysteem.
Stichting Spraaktechnologie, Utrecht.
- Jönsson & Dalbäck (1988)
-
A. Jönsson & N. Dalbäck (1988).
Talking to your computer is not like talking to your best friend.
: Proceedings of the First Scandinavian
Conference on Artificial Intelligence, Tromso, Norway.
- Joreskog & Sorbom (1984)
-
J. Joreskog & D. Sorbom (1984).
Lisrel VI. Analysis of linear structural relationships by
maximum likelihood, instrument variables, and least squares methods.
Scientific software, Mooreville, IN.
- Karttunen (1983)
-
L. Karttunen (1983).
KIMMO: A general morphological processor.
Texas Linguistic Forum 22: 165-186.
- Kasuya et al. (1993)
-
H. Kasuya, Y. Endo & S. Saliu (1993).
Novel acoustic measurements of jitter and shimmer characteristics
from pathological voice.
: Proceedings of the Eurospeech '93,
1973-1976.
- Katz (1987)
-
S. Katz (1987).
Estimation of probabilities from sparse data for the language model
component of a speech recognizer.
IEEE Transactions on Acoustics, Speech and Signal Processing,
March, 35: 400-401.
- Kelley (1983a)
-
J. Kelley (1983a).
An empirical methodology for writing user-friendly natural language
computer applications.
: Proceedings of the International
Conference of Computer-Human Interaction, CHI '83.
- Kelley (1983b)
-
J. Kelley (1983b).
Natural language and computers: Six steps for writing an
easy-to-use computer application.
The Johns Hopkins University, Baltimore.
- Kelley (1984)
-
J. Kelley (1984).
An interactive design methodology for user-friendly natural language
office information applications.
Association for Computing Machinery Transactions on Office
Information Systems 2: 26-41.
- Kerkhoff et al. (1984)
-
J. Kerkhoff, J. Wester & L. Boves (1984).
A compiler for implementing the linguistic phase of a text-to-speech
conversion system.
: H. Bennis & W. Van Lessen-Kloecke,
, Linguistics in The Netherlands 1984, 111-119.
Foris, Dordrecht.
- Kersta (1962)
-
L. Kersta (1962).
Voiceprint infallibility.
: Meeting of Acoust. Soc. Am., Seattle.
- Kinsey (1994)
-
G. Kinsey (1994).
Using voice recognition with IVR systems.
: AVIOS conference proceedings,
49-56, San Jose.
- Kirchhoff (1996)
-
K. Kirchhoff (1996).
Phonologisch strukturierte hmms zur automatischen spracherkennung.
: D. Gibbon, , Natural language
processing and speech technology. Results of the 3rd KONVENS Conference,
Bielefeld, October 1996, 55-63. Mouton de Gruyter, Berlin, New
York.
- Klatt (1976)
-
D. Klatt (1976).
The linguistics uses of segmental duration in English: Acoustic
and perceptual evidence.
Journal of the Acoustical Society of America, JASA 59:
1208-1221.
- Klatt (1977)
-
D. Klatt (1977).
Review of the ARPA speech understanding project.
Journal of the Acoustical Society of America, JASA 62(6):
1345-1366.
Also in: A. Waibel, K.-F. Lee, eds., (1990), Readings in speech
recognition, Morgan Kaufmann Publishers, San Mateo, California, 554-575.
- Klatt (1982)
-
D. Klatt (1982).
The KLATTalk text-to-speech conversion system.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
1589-1592.
- Klatt (1987)
-
D. Klatt (1987).
Review of text-to-speech conversion in English.
Journal of the Acoustical Society of America 82: 737-793.
- Kneser & Ney (1995)
-
R. Kneser & H. Ney (1995).
Improved backing-off for m-gram language modeling.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
I, 49-52, Detroit, MI, May.
- Knowles & Alderson (1995)
-
G. Knowles & P. Alderson (1995).
Working with speech: The computational analysis of formal
British English speech.
Longmans, London.
- Knowles et al. (1995)
-
G. Knowles, L. Taylor & B. Williams (1995).
A corpus of formal British English speech.
Longmans, London.
- Knuth (1973)
-
D. Knuth (1973).
The art of computer programming 3: Sorting and searching.
Addison-Wesley, Reading, Massachusetts.
- Kohler et al. (1995)
-
K. Kohler, M. Pätzold & A. Simpson (1995).
From scenario to segment: The controlled elicitation,
transcription, segmentation and labelling of spontaneous speech.
Arbeitsberichte (AIPUK) 29, Institut für Phonetik und Digitale
Sprachverarbeitung, IPDS, Universität Kiel, Kiel/Germany.
- Kornai (1991)
-
A. Kornai (1991).
Formal phonology.
Doctoral dissertation, Stanford University, Stanford.
- Koskenniemi (1983)
-
K. Koskenniemi (1983).
Two-level morphology: A general computational model for
word-form recognition and production.
University of Helsinki, Department of General Linguistics, Helsinki,
Finland.
- Kraft & Portele (1995)
-
V. Kraft & T. Portele (1995).
Quality evaluation of five German speech synthesis systems.
Acta Acustica 3: 351-365.
- Kryter (1962a)
-
K. Kryter (1962a).
Methods for the calculation and use of the Articulation Index.
Journal of the Acoustical Society of America, JASA 34:
1689-1697.
- Kryter (1962b)
-
K. Kryter (1962b).
Validation of the Articulation Index.
Journal of the Acoustical Society of America, JASA 34:
1698-1702.
- Kuhn & De Mori (1990)
-
R. Kuhn & R. De Mori (1990).
A Cache-based natural language model for speech recognition.
IEEE Transactions on Pattern Analysis and Machine Intelligence,
June, 12: 570-583.
- Labov (1972)
-
W. Labov (1972).
Sociolinguistic patterns.
University of Pennsylvania Press, Pennsylvania.
- Labov (1994)
-
W. Labov (1994).
Principles of linguistic change. Volume 1: Internal
factors.
Blackwell, Oxford.
- Labrador & Dinesh (1984)
-
C. Labrador & P. Dinesh (1984).
Experiments in speech interaction with conventional data services.
Interact '84, 104-108.
- Lacouture & Normandin (1993)
-
R. Lacouture & Y. Normandin (1993).
Efficient lexical access strategies.
: Proceedings of the European Conference on
Speech Technology.
- Ladefoged (1975)
-
P. Ladefoged (1975).
A course in phonetics.
Harcourt, Brace, Jovanovich, New York.
- Lafferty et al. (1992)
-
J. Lafferty, D. Sleator & D. Temperley (1992).
Grammatical trigrams: A probabilistic model of link grammars.
: Proceedings of the AAAI Fall Symposium on
Probabilistic Approaches to Natural Language, Cambridge, MA.
- Langer & Gibbon (1992)
-
H. Langer & D. Gibbon (1992).
DATR as a graph representation language for ILEX speech oriented
lexica.
Research Report, March 1992, ASL-TR-43-92/UBI, University of
Bielefeld, Bielefeld, Germany.
- Langeweg (1988)
-
S. Langeweg (1988).
The stress system of Dutch.
Doctoral dissertation, Leiden University, Leiden.
- Larmouth (1986)
-
D. Larmouth (1986).
The legal and ethical status of surreptitious recording in dialect
research: Do human subjects guidelines apply?
: D. Larmouth, T. Murray & C. Murray,
, Legal and ethical issues in surreptitious recording,
Publication of the American Dialect Society, number 76. University of Alabama
Press, Tuscaloosa and London.
- Lau et al. (1993)
-
R. Lau, R. Rosenfeld & S. Roukos (1993).
Trigger-based language models: A maximum entropy approach.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
II, 45-48, Minneapolis, MN, April.
- Laver (1991)
-
J. Laver (1991).
The gift of speech.
Papers in the analysis of speech and voice, Edinburgh University
Press, Edinburgh.
- Laver (1994)
-
J. Laver (1994).
Principles of phonetics.
Cambridge University Press, Cambridge.
- Laver et al. (1988)
-
J. Laver, J. McAllister, M. McAllister & M. Jack (1988).
A Prolog-based automatic text-to-phoneme conversion system for
British English.
: Proceedings of the Second Symposium on
Advanced Man-Machine Interface through Spoken Language, November 19-22,
Hawaii.
- Laver et al. (1989)
-
J. Laver, M. McAllister & J. McAllister (1989).
Pre-processing of anomalous text-strings in an automatic
text-to-speech system.
: S. Ramsaran, , Studies in the
pronunciation of English: A commemorative volume in memory of A.C. Gimson.
Croon Helm, London.
- Lea (1980)
-
W. Lea, (1980).
Trends in speech recognition.
Prentice-Hall, Englewood Cliffs, NJ.
- Lee et al. (1990)
-
K.-F. Lee, H.-W. Hon & R. Reddy (1990).
An overview of the SHPINX speech recognition system.
: A. Waibel & K.-F. Lee, ,
Readings in speech recognition, 600-610. Morgan Kaufmann
Publishers, San Mateo, California.
- Leggett & Williams (1984)
-
J. Leggett & G. Williams (1984).
An empirical investigation of voice as an input modality for computer
programming.
International Journal of Man-Machine Studies 21: 493-520.
- Lehiste (1970)
-
I. Lehiste (1970).
Suprasegmentals.
MIT Press, Cambridge, Mass.
- Lehiste et al. (1976)
-
I. Lehiste, J. Olive & L. Streeter (1976).
Role of duration in disambiguating syntactically ambiguous sentences.
Journal of the Acoustical Society of America, JASA 60:
1199-1202.
- Lehmann (1983)
-
E. Lehmann (1983).
Theory of point estimation.
J. Wiley, New York.
- Lehnert & Giron (1995)
-
H. Lehnert & F. Giron (1995).
Vocal communication in virtual environments.
: Conference documentation of Virtual
Reality World '95, 279-293, Stuttgart/Germany.
- Lesser et al. (1975)
-
V. Lesser, R. Fennell, L. Erman & D. Reddy (1975).
Organization of the HEARSAY-II speech understanding system.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP-23,
11-23.
- Levelt (1989)
-
J. Levelt (1989).
Speaking: From intonation to articulation.
ACL-MIT Press Series in Natural Language Processing. Bradford Book -
The MIT-Press, Cambridge Massachusetts, London, England.
- Levinson et al. (1983)
-
S. Levinson, L. Rabiner & M. Sondhi (1983).
An introduction to the application of the theory of probabilistic
functions of a Markov process to automatic speech recognition.
The Bell System Technical Journal, April, 62(4): 1035-1074.
- Life et al. (1988)
-
M. Life, M. Lee & J. Long (1988).
Assessing the usability of future speech technology: Towards a
method.
: Speech '88: 7th FASE Symposium,
Edinburgh.
- Likert (1932)
-
R. Likert (1932).
A technique for the measurement of attitudes.
Archives of Psychology 140.
- Linggard (1985)
-
R. Linggard (1985).
Electronic synthesis of speech.
Cambridge University Press, Cambridge.
- Llisterri (1994)
-
J. Llisterri (1994).
Prosody Encoding Survey, Multext - LRE Project 62-050.
- Llisterri & Mariño (1993)
-
J. Llisterri & J. Mariño (1993).
Spanish adaptation of SAMPA and automatic phonetic transcription.
: ESPRIT Project 6819 (SAM-A), ,
Speech technology assessment in multilingual applications, Year 1, 1
April 1993-30 September 1993, 1-9. London.
SAM-A periodic progress report, Document No: SAM-A/UPC/001/V1.
- Logan et al. (1989)
-
J. Logan, B. Greene & D. Pisoni (1989).
Measuring the segmental intelligibility of synthetic speech produced
by ten text-to-speech systems.
Journal of the Acoustical Society of America, JASA 86:
566-581.
- Logan et al. (1985)
-
J. Logan, D. Pisoni & B. Greene (1985).
Measuring the segmental intelligibility of synthetic speech:
Results from eight text-to-speech systems.
Research on speech perception Progress Report 11, 3-31, Indiana
University, Indianapolis.
- Loman & Boves (1993)
-
H. Loman & L. Boves (1993).
Development of rule based synthesis for text-to-speech.
: V. Van Heuven & L. Pols, ,
Analysis and synthesis of speech, strategic research towards
high-quality text-to-speech generation, 157-168. Mouton de
Gruyter, Berlin.
- Lowerre & Reddy (1980)
-
B. Lowerre & R. Reddy (1980).
The HARPY speech understanding system.
: W. Lea, , Trends in speech
recognition, 340-360. Prentice Hall, Englewood Cliffs, NJ.
Also in: A. Waibel, K.-F. Lee, eds., (1990), Readings in speech
recognition, Morgan Kaufmann Publishers, San Mateo, California, 576-586.
- Luce et al. (1983)
-
P. Luce, T. Feustel & D. Pisoni (1983).
Capacity demands in short-term memory for synthetic and natural word
lists.
Human Factors 25: 17-32.
- Luzzati & Néeel (1989)
-
D. Luzzati & F. Néel (1989).
Dialogue behaviour induced by machine.
: Proceedings of the Eurospeech '89,
2, 601-604, Paris.
- Lyons (1977)
-
J. Lyons (1977).
Semantics. Volumes I and II.
Cambridge University Press, Cambridge.
- Maassen & Povel (1985)
-
B. Maassen & D.-J. Povel (1985).
The effect of segmental and suprasegmental corrections on the
intelligibility of deaf speech.
Journal of the Acoustical Society of America, JASA 78:
877-886.
- MacDermid (1993)
-
C. MacDermid (1993).
Features of naive callers' dialogues with a simulated speech
understanding and dialogue system.
: Proceedings of the 3rd European Conference
on Speech Communication and Technology, 955-958, Berlin,
September.
- MacWhinney (1995)
-
B. MacWhinney (1995).
The CHILDES Project: Tools for analyzing talk.
Lawrence Erlbaum, Hillsdale, NJ.
- Manous et al. (1985)
-
L. Manous, M. Dedina, H. Nusbaum & D. Pisoni (1985).
Speeded sentence verification of natural and synthetic speech.
Research on Speech Perception Progress Report 11, Indiana
University, Indianapolis.
- Marascuilo & Serlin (1988)
-
L. Marascuilo & R. Serlin (1988).
Statistical methods for the social, and behavioral sciences.
Freeman and company, New York.
- Mariani (1989)
-
J. Mariani (1989).
Recent advances in speech processing.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
429-440.
- Marslen-Wilson (1989)
-
W. Marslen-Wilson, (1989).
Lexical representation and process.
The MIT Press, Cambridge, Massachusetts and London, England.
- Mérialdo (1988)
-
B. Mérialdo (1988).
Multi-level decoding for very-large-size-dictionary speech
recognition.
IBM Journal of Research and Development 32(2): 169-301.
- Michaelis & Strube (1995)
-
D. Michaelis & H. Strube (1995).
Orthogonale akustische Stimmgüteparameter zur
Stimmtherapiedokumentation.
Fortschritte der Akustik - DAGA '95 to be printed.
- Monaghan & Ladd (1989)
-
A. Monaghan & D. Ladd (1989).
Evaluating intonation in the CSTR text-to-speech system.
: Proceedings of the ESCA Workshop on Speech
I/O Assessment and speech databases, Noordwijkerhout.
3.6.1-3.6.4.
- Monaghan & Ladd (1990)
-
A. Monaghan & D. Ladd (1990).
Symbolic output as the basis for evaluating intonation in
text-to-speech systems.
Speech Communication 9: 305-314.
- Moody (1991)
-
A. Moody (1991).
Speaker verification.
Internal Report, January 1991, Ensigma Ltd.
- Moore (1977)
-
R. Moore (1977).
Evaluating speech recognizers.
IEEE Transactions on Acoustics, Speech and Signal Processing
25(2): 178-183.
- Moore (1986)
-
R. Moore (1986).
The NATO research study group on speech processing: RSG10.
: Proceedings of the Speech Tech'86,
201-203, New York, 28-30 April 1986.
- Moore (1988)
-
R. Moore (1988).
The technology of speech recognition.
: Proceedings of the CCTA/Blenheim-Online
Conference on Knowledge Based Systems in Government, Bristol, 8-10 November
1988.
- Moore (1991)
-
R. Moore (1991).
International coordination of research standards in speech science
and technology.
: Proceedings of the ICSLP-90 Workshop on
International Coordination of Spoken Language Database and Assessment
Techniques for Speech Input/Output, Kobe, Japan, November 1991.
- Moore (1992a)
-
R. Moore (1992a).
Speech recognition: Available assessment methods and needs for
standardisation.
: Proceedings of the Workshop on
International Cooperation and Standardisation of Spoken Language Databases
and Speech I/O Assessment Techniques, Chiavari, Italy, 26-28 September
1992.
- Moore (1992b)
-
R. Moore (1992b).
User needs in speech research.
: Proceedings of the Workshop on European
Textual Corpora, Pisa, Italy, 23-26 January 1992.
- Moore (1994a)
-
R. Moore (1994a).
The ``Capability Profile''.
DRA-CSE Research Note DRA CIS CSE1 RN94/08, August 1994, DRA Speech
Research Unit, Malvern, Worcs., UK.
- Moore (1994b)
-
R. Moore (1994b).
The EAGLES working group on spoken language, Advanced
Speech Applications. European research on speech technology.
: K. Varghese, S. Pfleger & J. Lefevre,
, Research Reports ESPRIT Volume 1. Springer-Verlag, Berlin.
- Mori et al. (1992)
-
S. Mori, C. Suen & K. Yamamoto (1992).
Historical review of OCR research and development.
Proceedings of the IEEE, July, 80(7): 1029-1058.
- Morimoto et al. (1990)
-
T. Morimoto, K. Shikano, H. Iida & A. Kurematsu (1990).
Integration of speech recognition and language processing in the
spoken language translation system SL-TRANS.
: Proceedings of the International
Conference on Spoken Language Processing, ICSLP, 921-928, Kyoto.
- Moulines & Charpentier (1990)
-
E. Moulines & F. Charpentier (1990).
Pitch synchronous waveform processing techniques for text-to-speech
synthesis using diphones.
Speech Communication 9: 453-467.
- Müller & Runge (1993)
-
C. Müller & F. Runge (1993).
Dialogue design principles - key for usability of voice processing.
: Proceedings of the 3rd European Conference
on Speech Communication and Technology, 943-946, Berlin,
September.
- Murray & Arnott (1993)
-
I. Murray & J. Arnott (1993).
Toward the simulation of emotion in synthetic speech: A review of
the literature on human vocal emotion.
Journal of the Acoustical Society of America, JASA 93:
1097-1108.
- Murray & Murray (1986)
-
T. Murray & C. Murray (1986).
On the legality and ethics of surreptitious recording.
: D. Larmouth, T. Murray & C. Murray,
, Legal and ethical issues in surreptitious recording,
Publication of the American Dialect Society, number 76. University of Alabama
Press, Tuscaloosa and London.
- Murveit et al. (1993)
-
H. Murveit, J. Butzberger, V. Digalakis & M. Weintraub (1993).
Large vocabulary dictation using SRI's Decipher speech
recognition system: Progressive search techniques.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
II, 319-322, Minneapolis, MN, April.
- Nadas (1984)
-
A. Nadas (1984).
Estimation of probabilities in the language model of the IBM speech
recognition system.
IEEE Transactions on Acoustics, Speech and Signal Processing,
August, 32: 859-861.
- Nadas (1985)
-
A. Nadas (1985).
On Turing's formula for word probabilities.
IEEE Transactions on Acoustics, Speech and Signal Processing,
December, 33: 1414-1416.
- Nespor & Vogel (1986)
-
M. Nespor & I. Vogel (1986).
Prosodic phonology.
Foris, Dordrecht.
- Newell (1978)
-
A. Newell (1978).
The palantype transcription unit - its history and progress to date.
Hearing, 99-104.
May/June.
- Newell (1989)
-
A. Newell (1989).
Speech simulation studies - performance and dialogue specification.
: J. Peckham, , Recent developments
and applications of natural language processing, 141-157. Kogan
Page, London.
- Ney (1984)
-
H. Ney (1984).
The use of a one-stage dynamic programming algorithm for connected
word recognition.
IEEE Transactions on Acoustics, Speech and Signal Processing,
April, 32(2): 263-271.
- Ney & Aubert (1994)
-
H. Ney & X. Aubert (1994).
A word graph algorithm for large vocabulary, continuous speech
recognition.
: Proceedings of the International
Conference on Spoken Language Processing, ICSLP, 1355-1358,
Yokohama, Japan, September.
- Ney & Essen (1993)
-
H. Ney & U. Essen (1993).
Estimating small probabilities by leaving-one-out.
: Third European Conference on Speech
Communication and Technology, 2239-2242, Berlin, September.
- Ney et al. (1994)
-
H. Ney, U. Essen & R. Kneser (1994).
On structuring probabilistic dependencies in language modelling.
Computer Speech and Language 8: 1-38.
- Ney et al. (1992)
-
H. Ney, D. Mergel, A. Noll & A. Paesele (1992).
Data driven search organization for continuous speech recognition.
IEEE Transactions on Signal Processing, February, 40(2):
272-281.
- Ney et al. (1988)
-
H. Ney, D. Mergel, A. Noll & A. Paeseler (1988).
Overview of speech recognition in the SPICOS system.
: H. Niemann, M. Lang & G. Sagerer,
, Recent advances in speech understanding and dialog systems,
46 NATO ASI Series F, 305-310.
Springer-Verlag, Berlin.
- Niemann et al. (1985)
-
H. Niemann, A. Brietzmann, R. Mühlfeld, P. Regel & G. Schukat (1985).
The speech understanding and dialog system EVAR.
: R. De Mori & C. Suen, ,
New systems and architectures for automatic speech recognition and
synthesis, NATO ASI Series F, vol. 16, 271-302. Springer-Verlag,
Berlin.
- Niemann et al. (1992)
-
H. Niemann, E. Nöth, M. Mast & E. Schukat-Talamazzini (1992).
Ein Lexikon für ein natürlich-sprachliches Dialogsystem.
: Beiträge des ASL-Lexikonworkshops,
15-18, Wandlitz, 26-27 November.
ASL-TR-40-92/ZSB.
- Nolan (1987)
-
F. Nolan (1987).
The limits of segmental description.
: Proceedings of the Eleventh International
Conference of Phonetic Sciences, 5, 411-414, 1-7
August 1987, Tallinn, Estonia.
- Nooteboom & Kruijt (1987)
-
S. Nooteboom & J. Kruijt (1987).
Accents, focus distribution, and the perceived distribution of given
and new information.
Journal of the Acoustical Society of America, JASA 82:
1512-1524.
- Nossin (1991)
-
M. Nossin (1991).
Le projet GENELEX: EUREKA pour les dictionnaires
génériques.
Génie Linguistique 91, volume 1.
Versailles, France, 16-17 January 1991.
- Nunn & Van Heuven (1993)
-
A. Nunn & V. Van Heuven (1993).
MORPHON: Lexicon-based text-to-phoneme conversion and
phonological rules.
: V. Van Heuven & L. Pols, ,
Analysis and synthesis of speech, strategic research towards
high-quality text-to-speech generation, 88-113. Mouton de Gruyter,
Berlin.
- Nusbaum et al. (1986)
-
H. Nusbaum, S. Greenspan & D. Pisoni (1986).
Perceptual attention in monitoring natural and synthetic speech.
Research on Speech Perception Progress Report 12, Indiana
University, Indianapolis.
- Nye & Gaitenby (1974)
-
P. Nye & J. Gaitenby (1974).
The intelligibility of synthetic monosyllabic words in short,
syntactically normal sentences.
Haskins Laboratories Status Report on Speech Research, 37/38, pages
169-190.
- Nye et al. (1975)
-
P. Nye, F. Ingemann & L. Donald (1975).
Synthetic speech comprehension: A comparison of listener
performances with and preferences among different speech forms.
Haskins Laboratories Status Report on Speech Research, 41.
- Oerder & Ney (1993)
-
M. Oerder & H. Ney (1993).
Word graphs: An efficient interface between continuous speech
recognition and language understanding.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
II, 119-122, Minneapolis, MN, April.
- Oglesby (1994)
-
J. Oglesby (1994).
What's in a number? Moving beyond the equal error rate.
To appear in Speech Communication, August 1995. Preliminary version
published in Martigny ETRW, pp. 87-90.
- Olsen & Olsen (1990)
-
G. Olsen & J. Olsen (1990).
User-centered design of Collaborative Technology.
Cognitive Science and Machine Intelligence Laboratory.
To appear in Organizational Computing 32.
- O'Malley & Caisse (1987)
-
M. O'Malley & M. Caisse (1987).
How to evaluate text-to-speech systems.
Speech Technology 3: 66-75.
- O'Neill (1975)
-
J. O'Neill (1975).
Measurement of hearing by tests of speech and language.
: S. Singh, , Measurement procedures
in speech, hearing, and language, 219-252. University Park Press,
Baltimore.
- Oppenheim (1978)
-
A. Oppenheim (1978).
Applications of digital signal processing.
Prentice-Hall, Englewood Cliffs, N.J.
- O'Shaughnessy (1986)
-
D. O'Shaughnessy (1986).
Speaker recognition.
IEEE ASSP Magazine, 4-17.
- O'Shaughnessy (1987)
-
D. O'Shaughnessy (1987).
Speech communiacation - human and machine.
Addison-Wesley, New York.
- Pallet et al. (1990)
-
D. Pallet, W. Fisher & J. Garofolo (1990).
DARPA ATIS results, June 1990.
: Proceedings of the DARPA Workshop on
Speech and Natural Language, 114-121, Hidden Valley, PA, June.
- Pallett (1985)
-
D. Pallett (1985).
Performance assessment of automatic speech recognizers.
Journal of the National Bureau of Standards 90(5).
September-October 1985.
- Parducci (1965)
-
A. Parducci (1965).
Category judgement: A range-frequency model.
Psychological Review 72: 407-418.
- Pavlovic et al. (1990)
-
C. Pavlovic, M. Rossi & R. Espesser (1990).
Use of the magnitude estimation technique for assessing the
performance of text-to-speech synthesis system.
Journal of the Acoustical Society of America, JASA 87:
373-381.
- Pavlovic et al. (1991)
-
C. Pavlovic, M. Rossi & R. Espesser (1991).
Perceived spectral energy distributions for EUROM-0 speech and for
some synthetic speech.
: Proceedings of the 12th International
Congress of Phonetic Sciences, 5, 418-421,
Aix-en-Provence.
- Peckels & Rossi (1973)
-
J. Peckels & M. Rossi (1973).
Le test diagnostic par paires minimales. Adaptation au
Français du ``Diagnostic Rhyme Test" de W.D. Voiers.
Revue d'Acoustique 27: 245-262.
- Peckham (1990)
-
J. Peckham (1990).
An overview of speaker verification technology and application over
the telephone.
: Proceedings of the Voice System
Worldwide, 166.
- Peckham (1993)
-
J. Peckham (1993).
A new generation of spoken dialogue systems: Results and lessons
from the SUNDIAL project.
: Proceedings of the 3rd European Conference
on Speech Communication and Technology, 33-40, Berlin, September.
- Peckham & Thomas (1990)
-
J. Peckham & T. Thomas (1990).
Recognizer sensitivity analysis: A method for assessing the
performance of speech recognizers.
Speech Communication 9: 317-328.
- Pérennou et al. (1991)
-
G. Pérennou, D. Cotto, M. De Calmès, I. Ferrané, J. Pécatte & J. Tihoni (1991).
Composantes phonologique et orthographique de BDLEX.
: Deuxièmes Journées Nationales du
GRECO-PRC Communication Homme-Machine, 351-362, Toulouse, 29-30
January.
- Pérennou et al. (1992)
-
G. Pérennou, D. Cotto, M. De Calmès, I. Ferrané & J.-M. Pécatte (1992).
Le projet BDLEX de base de données lexicales du Français
écrit et parlé.
: Proceedings Communication Homme-Machine,
Séminaire LEXIQUE, 153-171, 21-22 January 1992, IRIT-UPS
Toulouse.
- Pérennou & De Calmès (1987)
-
G. Pérennou & M. De Calmès (1987).
BDLEX lexical data and knowledge base of spoken and written
French.
: European Conference on Speech Technology,
1, 393-396, Edinburgh.
- Pérennou & Tihoni (1992)
-
G. Pérennou & J. Tihoni (1992).
Lexique et phonologie en reconnaissance de la parole.
: Proceedings Communication Homme-Machine,
Séminaire LEXIQUE, 41-57, 21-22 January 1992, IRIT-UPS
Toulouse.
- Perkins (1977)
-
W. Perkins (1977).
Speech pathology, an applied behavioral science.
The C.V. Mosby Company, Saint Louis.
- Philips et al. (1987)
-
S. Philips, S. Stelle & C. Tanz (1987).
Language, gender and sex in comparative perspective.
Cambridge University Press, Cambridge.
- Pieraccini et al. (1993)
-
R. Pieraccini, E. Levin & E. Vidal (1993).
Learning how to understand language.
: Third European Conference on Speech
Communication and Technology, 1407-1412, Berlin, September.
- Pierce (1991)
-
A. Pierce (1991).
Acoustics: An introduction to its physical principles and
applications.
McGraw Hill, Inc., New York.
- Pisoni et al. (1985a)
-
D. Pisoni, B. Greene & H. Nusbaum (1985a).
Perception of synthetic speech generated by rule.
Proceedings of the IEEE 73: 1665-1676.
- Pisoni et al. (1985b)
-
D. Pisoni, B. Greene & H. Nusbaum (1985b).
Some human factors issues in the perception of synthetic speech.
: Proceedings Speech Tech '85,
57-61, New York.
- Pitrelli et al. (1994)
-
J. Pitrelli, M. Beckman & J. Hirschberg (1994).
Evaluation of prosodic transcription labeling reliability in the
ToBI framework.
: Proceedings of the International
Conference on Spoken Language Processing, ICSLP, 18-22 September 1994,
Yokohama, Japan.
- Plenat (1991)
-
M. Plenat (1991).
Vers d'une phonémisation des sigles.
: Deuxièmes journées du GDR-PRC
Communication Homme-Machine, EC2 Editeur, 363-371, Toulouse,
29-30 January.
- Plomp & Mimpen (1979)
-
R. Plomp & A. Mimpen (1979).
Improving the reliability of testing the speech reception threshold
for sentences.
Audiology 8: 43-52.
- Pols (1991)
-
L. Pols (1991).
Quality assessment of text-to-speech synthesis-by-rule.
: S. Furui & M. Sondhi, ,
Advances in speech signal processing, 387-416. Marcel Dekker
Inc., New York.
- Pols et al. (1987)
-
L. Pols, J.-P. Lefevre, G. Boxelaar & N. Van Son (1987).
Word intelligibility of a rule synthesis system for French.
: Proceedings of the European Conference on
Speech Technology, 1, 179-182, Edinburgh.
- Ponamale et al. (1990)
-
M. Ponamale, E. Bilange, K. Choukri & S. Soudoplatoff (1990).
A computer-aided approach to the design of an oral dialogue system.
: Proceedings of Eastern Multiconference,
Nashville.
- Portele et al. (1994)
-
T. Portele, B. Heuft, F. Höfer, H. Meyer & W. Hess (1994).
A new high quality speech synthesis system for German.
: Proceedings Yokohama/New Paltz.
- Pratt (1987)
-
R. Pratt (1987).
Quantifying the performance of text-to-speech synthesizers.
Speech Technology, 54-64.
- Price (1990)
-
P. Price (1990).
Evaluation of spoken language systems: The ATIS domain.
: Proceedings of the DARPA Workshop on
Speech and Natural Language, 91-95, Hidden Valley, PA, June.
- Quené (1993)
-
H. Quené (1993).
Segment durations and accent as cues to word segmentation in Dutch.
Journal of the Acoustical Society of America, JASA 94:
2027-2035.
- Rabiner & Schafer (1978)
-
L. Rabiner & R. Schafer (1978).
Digital processing of speech signals.
Prentice-Hall, Englewood Cliffs, N.J.
- Radford (1988)
-
A. Radford (1988).
Transformational grammar: A first course.
CUP, Cambridge.
- Ralston et al. (1991)
-
J. Ralston, D. Pisoni, S. Lively, B. Greene & J. Mullennix (1991).
Comprehension of synthetic speech produced by rule: Word monitoring
and sentence-by-sentence listening times.
Human Factors 33: 471-491.
- Rayner et al. (1993)
- M. Rayner, H. Alshawi, I. Breton, D. Carter, V. Digalakis, B. Gamback, J. Kaja, J. Karlgren, B. Lyberg, S. Pulman, P. Price & C. Samuelsson (1993).
A speech to speech translation system built from standard components.
: Proceedings of a Workshop: Human
Language Technology, 217-222, Princeton, NJ, 21-24 March.
- Reilly (1987)
-
R. Reilly (1987).
Ill-formedness and mis-communication in person-machine dialogue.
Information and Software Technology 29: 69-74.
- Reyelt et al. (1996)
-
M. Reyelt, M. Grice, R. Benzmüller, J. Mayer & A. Batliner (1996).
Prosodische Etikettierung des Deutschen mit ToBI.
: D. Gibbon, , Natural language processing and speech technology. Results of the 3rd KONVENS Conference, Bielefeld, October 1996, 144-155. Mouton de Gruyter, Berlin, New York.
- Reynolds (1994)
-
D. Reynolds (1994).
Speaker identification and verification using Gaussian mixture
speaker models.
To appear in Speech Communication, August 1995. Preliminary version
published in ETRW Martigny, pp. 27-30.
- Richards & Underwood (1984a)
-
M. Richards & K. Underwood (1984a).
How should people and computers speak to each other?
Interact '84, 33-36.
- Richards & Underwood (1984b)
-
M. Richards & K. Underwood (1984b).
Talking to machines. How are people naturally inclined to speak?
: E. Megaw, , Contemporary
Ergonomics. Taylor and Francis, London.
- Ritchie et al. (1992)
-
G. Ritchie, A. Black, G. Russell & S. Pulman (1992).
Computational morphology.
The MIT Press, Cambridge, Massachusetts and London.
- Roach et al. (1993)
-
P. Roach, G. Knowles, T. Varadi & S. Arnfield (1993).
MARSEC: A machine-readable Spoken English corpus.
Journal of the International Phonetic Association 23(2):
47-53.
- Roach et al. (1990)
-
P. Roach, H. Roach, A. Dew & P. Rowlands (1990).
Phonetic analysis and the automatic segmentation and labeling of
speech sounds.
Journal of the International Phonetic Association 20(1):
15-21.
- Roe & Wilpon (1994)
-
D. Roe & J. Wilpon (1994).
Voice communication between humans and machines.
National Academy Press, Washington.
- Roelofs (1987)
-
J. Roelofs (1987).
Synthetic speech in practice: Acceptance and efficiency.
Behaviour and Information Technology 6: 403-410.
- Rose (1971)
-
D. Rose (1971).
Audiological assessment.
Prentice-Hall International, Inc., London.
- Rosenberg (1973)
-
A. Rosenberg (1973).
Listener performance in speaker verification tasks.
IEEE Transactions on Audio Electroacoustic 21: 221-225.
- Rosenberg (1976)
-
A. Rosenberg (1976).
Automatic speaker verification: A review.
Proceedings of the IEEE, April, 64(4): 475.
- Rosenfeld (1994)
-
R. Rosenfeld (1994).
Adaptive statistical language modeling: A maximum entropy
approach.
Ph.D. Thesis, School of Computer Science, Carnegie Mellon
University, Pittsburgh, PA.
CMU-CS-94-138.
- Rossi (1988)
-
M. Rossi (1988).
Acoustics and electroacoustics.
Artech House, Norwood, MA, USA.
- Rowden (1992)
-
C. Rowden (1992).
Speech processing.
McGraw-Hill Book Company, London.
- Rudnicky et al. (1987)
-
A. Rudnicky, L. Baumeister, K. De Graff & E. Lehmann (1987).
The lexical access component of the CMU continuous speech
recognition system.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP.
- Ruske (1985)
-
G. Ruske (1985).
Demisyllables as processing units for automatic speech recognition
and lexical access.
: R. De Mori & C. Suen, ,
New systems and architectures for automatic speech recognition and
synthesis, 16 NATO ASI Series F,
593-611. Springer-Verlag, Berlin.
- Ruske & Schotola (1981)
-
G. Ruske & T. Schotola (1981).
The efficiency of demisyllable segmentation in the recognition of
spoken words.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
971-974, Atlanta.
- Sacks et al. (1974)
-
H. Sacks, E. Schlegloff & G. Jefferson (1974).
A simplest systematics for the organization of turn-taking in
conversation.
Language 50: 697-735.
- Sagerer (1990)
-
G. Sagerer (1990).
Automatisches Verstehen gesprochener Sprache, 74
Reihe Informatik.
Bibliographisches Institut, Mannheim.
- Sakoe (1979)
-
H. Sakoe (1979).
Two-level DP matching - A dynamic programming-based pattern
matching algorithm for connected word recognition.
IEEE Transactions on Acoustics, Speech and Signal Processing
27: 588-595.
- Salza et al. (1993)
-
P. Salza, G. Di Fabbrizio, M. Oreglia, M. Falcone, C. Sementina & C. Delogu (1993).
Development of a context dependent methodology for text-to-speech
synthesis evaluation in interactive dialogue systems.
: ESPRIT Project 6819 (SAM-A), ,
Speech technology assessment in multilingual applications. London.
Report R2, SAM-A Periodic Progress Report. Year 1, 1 April 1993-30
September 1993.
- SAM (1992)
-
SAM (1992).
Multi-lingual speech input/output assessment, methodology and
standardization.
ESPRIT project 2589 (SAM), Final report, Year three,
1 III 91-28 II 1992, Ref: SAM-UCL-G004, Univeristy College London, London.
- SAM-A (1993)
-
SAM-A (1993).
Speech technology assessment in multilingual applications.
ESPRIT Project 6819 (SAM-A), Report No. 2, Year 1, Ref SAM-A/G002.
- Scharpff & Van Heuven (1988)
-
P. Scharpff & V. Van Heuven (1988).
Effects of pause insertion on the intelligibility of low quality
speech.
: Proceedings of the 7th FASE/Speech '88
Symposium, 261-269, Edinburgh.
- Scherer & Giles (1979)
-
K. Scherer & H. Giles, (1979).
Social markers in speech.
Cambridge University Press, Cambridge.
- Schmidt & Watson (1991)
-
M. Schmidt & G. Watson (1991).
The evaluation and optimization of automatic speech segmentation.
: Proceedings of the Second European
Conference on Speech Communication and Technology, Eurospeech 91,
2, 701-704, 24-26 September 1991, Genova, Italy.
- Schröder et al. (1987)
-
S. Schröder, G. Sagerer & H. Niemann (1987).
Wissensakquisition mit semantischen Netzwerken.
: E. Paulus, , Mustererkennung 87,
9. DAGM-Symposium Braunschweig, Informatik-Fachberichte, 305-309.
Springer-Verlag, Berlin.
- Schukat-Talamazzini (1993)
-
E. Schukat-Talamazzini (1993).
Automatische Spracherkennung.
Habilitationsschrift, Erlangen University, Erlangen, Germany.
- Schwab et al. (1985)
-
E. Schwab, H. Nusbaum & D. Pisoni (1985).
Some effects of training on the perception of synthetic speech.
Human Factors 27(4): 395-408.
- Schwartz & Austin (1991)
-
R. Schwartz & S. Austin (1991).
A comparison of several approximate algorithms for finding multiple
(n-best) sentence hypotheses.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
701-704, Toronto, May.
- Searle (1969)
-
J. Searle (1969).
Speech acts: An essay in the philosophy of language.
Cambridge University Press, Cambridge.
- Searle (1979)
-
J. Searle (1979).
Expression and meaning.
Cambridge University Press, Cambridge.
- Sells (1985)
-
P. Sells (1985).
Lectures on contemporary syntactic theories: An introduction to
Government-Binding theory, Generalized Phrase Structure Grammar, and
Lexical-Functional Grammar.
CSLI Center for the Study of Language and Information, Stanford,
California.
- Siegel (1956)
-
S. Siegel (1956).
Nonparametric statistics for the behavioral sciences.
McGraw-Hill, New York.
- Silverman et al. (1990)
-
K. Silverman, S. Basson & S. Levas (1990).
Evaluating synthesizer performance: Is segmental intelligibility
enough?
: Proceedings of the International
Conference on Spoken Language Processing, ICSLP, 981-984, Kobe.
- Silverman et al. (1992)
-
K. Silverman, M. Beckman, J. Pitrelli, M. Ostendorf, C. Wightman, P. Price, J. Pierrehumbert & J. Hirschberg (1992).
ToBI: A standard for labeling English prosody.
: Proceedings of the 1992 International
Conference on Spoken Language Processing, ICSLP, 2,
867-870, 12-16 October 1992, Banff, Canada.
- Simpson & Fraser (1993)
-
A. Simpson & N. Fraser (1993).
Black box and glass box evaluation of the SUNDIAL system.
: Proceedings of the 3rd European Conference
on Speech Communication and Technology, 1423-1426, Berlin,
September.
- Simpson & Ruth (1987a)
-
C. Simpson & J. Ruth (1987a).
The phonetic discrimination test for speech recognizers: Part I.
Speech Technology March/April.
- Simpson & Ruth (1987b)
-
C. Simpson & J. Ruth (1987b).
The phonetic discrimination test for speech recognizers: Part II.
Speech Technology October/November.
- Skinner et al. (1992)
-
T. Skinner, J. Holt & N. Nguyen (1992).
Automatic identity confirmation and adaptive solutions.
Speech Technology 106-111.
February 1992.
- Smith (1979)
-
P. Smith (1979).
Sex markers in speech.
: K. Scherer & H. Giles, ,
Social markers in speech, 109-146. Cambridge University
Press, Cambridge.
- Smith et al. (1992)
-
R. Smith, D. Hipp & A. Biermann (1992).
A dialog control algorithm and its performance.
: Proceedings of the 3rd Conference on
Applied Natural Language Processing, 9-16, Trento, April.
- Soclof (1990)
-
M. Soclof (1990).
A comparison of spontaneous speech and read speech in human-machine
problem solving dialogues.
Massachusetts Institute of Technology.
- Soong & Huang (1991)
-
F. Soong & E.-F. Huang (1991).
A Tree-Trellis Fast Search for finding the n-best sentence
hypotheses.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
705-708, Toronto, May.
- Soong et al. (1987)
-
F. Soong, A. Rosenberg, B. Juang & L. Rabiner (1987).
A Vector Quantization approach to speaker recognition.
AT&T Technical Journal 66.
Issue 2.
- Sorin (1994)
-
C. Sorin (1994).
Towards high-quality multilingual text-to-speech.
: Proceedings of the CRIM/FORWISS workshop,
53-62, Munich.
Also to appear in H. Niemann, ed., Progress and prospects in research
and technology, Infix Publishing Company, Sankt Augustin.
- Sotscheck (1982)
-
J. Sotscheck (1982).
Ein Reimtest für Verständlichkeitsmessungen mit deutscher
Sprache als ein verbessertes Verfahren zur Bestimmung der
Sprachübertragungsgeräte.
Der Fernmeldung 36: 1-84.
- Sperberg-McQueen & Burnard (1994)
-
C. Sperberg-McQueen & L. Burnard, (1994).
Guidelines for electronic text encoding and interchange.
TEI P3. Chapter 1 Transcription of Speech. Association for
Computational Linguistics, Association for Computers and the Humanities,
Association for Literary and Linguistic Computing, Chicago and Oxford.
- Spiegel et al. (1990)
-
M. Spiegel, M. Altom, M. Macchi & K. Wallace (1990).
Comprehensive assessment of the telephone intelligibility of
synthesized and natural speech.
Speech Communication 9: 279-291.
- Sproat et al. (1992)
-
R. Sproat, J. Hirschberg & D. Yarowsky (1992).
A corpus-based synthesizer.
: Proceedings of the 2nd International
Conference on Spoken Language Processing, ICSLP, 1,
563-566, Banff.
- Steeneken (1982)
-
H. Steeneken (1982).
Ontwikkeling en toetsing van een Nederlandstalige Diagnostische
Rijmtest voor het testen van spraakcommunicatiekanalen.
Rapport IZF 1982-13, IZF, Soesterberg.
- Steeneken (1987)
-
H. Steeneken (1987).
Diagnostic information from subjective and objective intelligibility
tests.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP, Dallas.
- Steeneken (1989)
-
H. Steeneken (1989).
Objective and diagnostic assessment of (isolated) word recognizers.
: Proceedings of the European Speech
Conference ESCA, Paris.
- Steeneken (1991)
-
H. Steeneken (1991).
RAMOS - Recognizer Assessment by means of Manipulation Of
Speech applied.
: Proceedings of the European Speech
Conference ESCA, Genova.
- Steinbiss et al. (1994)
-
V. Steinbiss, B.-H. Tran & H. Ney (1994).
Improvements in beam search.
: Proceedings of the International
Conference on Spoken Language Processing, ICSLP, 2143-2146,
Yokohama, Japan, September.
- Stevens et al. (1968)
-
K. Stevens, C. Williams, J. Carbonell & B. Woods (1968).
Speaker authentication and identification: A comparison of
spectrographic and auditory presentations of speech material.
JASA 44: 1596-1607.
- Stubbs (1984)
-
M. Stubbs (1984).
Discourse analysis. The sociolinguistic analysis of natural
language.
Blackwell, Oxford.
- Sundheim (1991)
-
B. Sundheim (1991).
Third message understanding evaluation and conference (MUC-3):
Phase 1 status report.
: Proceedings of the DARPA Workshop on
Speech and Natural Language, 301-305, Pacific Grove, CA, February.
- Syrdal & Sciacca (1994)
-
A. Syrdal & B. Sciacca (1994).
Testing the intelligibility of text-to-speech output with the
Diagnostic Pairs Sentence Intelligibility Evaluation.
ITD-94-23828A, Technical Memorandum. Submitted to the Journal of
the Acoustical Society of America, JASA, AT&T Bell Laboratories.
- 't Hart et al. (1990)
-
J. 't Hart, R. Collier & A. Cohen (1990).
A perceptual study of intonation.
Cambridge University Press, Cambridge.
- Terken (1985)
-
J. Terken (1985).
Use and function of accentuation. Some experiments.
Doctoral dissertation, Leiden University, Leiden.
- Terken (1993)
-
J. Terken (1993).
Human and synthetic intonation: A case study.
: V. Van Heuven & L. Pols, ,
Analysis and synthesis of speech, strategic research towards
high-quality text-to-speech generation, 241-259. Mouton de
Gruyter, Berlin.
- Terken & Collier (1989)
-
J. Terken & R. Collier (1989).
Automatic synthesis of natural-sounding intonation for text-to-speech
conversion in Dutch.
: Proceedings of the Eurospeech '89,
1, 357-359, Paris.
- Thielen (1992)
-
M. Thielen (1992).
Male and female speech.
Ph.D. Thesis, University of Amsterdam, Amsterdam.
- Thorsen (1980)
-
N. Thorsen (1980).
A study of the perception of sentence intonation - Evidence from
Danish.
Journal of the Acoustical Society of America, JASA 67:
1014-1030.
- Thurmair (1986)
-
G. Thurmair (1986).
Linguistische Analyse im Projekt SPICOS.
Kleinheubacher Berichte 29.
- Tomlinson (1990)
-
M. Tomlinson (1990).
Guide to database generation - recording protocol.
: ESPRIT Project 2589 (SAM), ,
Multilingual speech input/output assessment, methodology and
standardisation. University College London, London.
Interim Report Year I, Reference SAM-UCL-G002, Document
SAM-RSRE-012.
- Tosi et al. (1972)
-
O. Tosi, H. Oyer, W. Asbrook, W. Pedrey, C. Nicol & E. Nash (1972).
Experiment of voice identification.
JASA 51: 2030-2043.
- Tubach & Doignon (1991)
-
J. Tubach & P. Doignon (1991).
A system for natural spoken language queries: Design,
implementation and assessment.
: Proceedings of the 2nd European Conference
on Speech Communication and Technology, 1473-1476, Genova,
September.
- Tubach & Bok (1985)
-
J.-P. Tubach & L.-J. Bok (1985).
ZUT - Petit dictionnaire français.
Institut de Phonitique de Grenoble, avec le concours du CNRS (GRECO
Comm. Parlie), Grenoble.
- Turing (1950)
-
A. Turing (1950).
Computing machinery and intelligence.
Mind 59: 433-460.
- Valtech et al. (1994)
-
V. Valtech, J. Odell, P. Woodland & S. Young (1994).
A dynamic network decoder design for large vocabulary speech
recognition.
: Proceedings of the International
Conference on Spoken Language Processing, ICSLP, 1351-1354,
Yokohama, Japan, September.
- Van Bezooijen (1986)
-
R. Van Bezooijen (1986).
Lay ratings of long-term voice-and-speech characteristics.
: F. Beukema & A. Hulk, ,
Linguistics in the Netherlands 1986, 1-7. Foris, Dordrecht.
- Van Bezooijen (1988)
-
R. Van Bezooijen (1988).
Evaluation of two synthesis systems for Dutch - Development and
applications of intelligibility tests.
SPIN-ASSP Report No. 5, Stichting Spraaktechnologie, Utrecht.
- Van Bezooijen (1989)
-
R. Van Bezooijen (1989).
Evaluation of the suitability of Dutch text-to-speech conversion
for application in a digital daily newspaper.
: Proceedings of the ESCA Workshop Speech
I/O Assessment and Speech Databases, 6.3.1-6.3.4, Noordwijkerhout.
- Van Bezooijen & Jongenburger (1993)
-
R. Van Bezooijen & W. Jongenburger (1993).
Evaluation of an electronic newspaper for the blind in the
Netherlands - intelligibility, acceptability, adequacy, and users'attitudes.
: Proceedings of the ESCA Workshop on Speech
and Language Technology for Disabled Persons, 195-198, Stockholm.
- Van Bezooijen & Pols (1987)
-
R. Van Bezooijen & L. Pols (1987).
Evaluation of two synthesis-by-rule systems for Dutch.
: Proceedings of the European Conference on
Speech Technology, 1, 179-183.
- Van Bezooijen & Pols (1989)
-
R. Van Bezooijen & L. Pols (1989).
Evaluation of a sentence accentuation algorithm for a Dutch
text-to-speech system.
: Proceedings of the Eurospeech '89,
1, 218-221, Paris.
- Van Bezooijen & Pols (1990)
-
R. Van Bezooijen & L. Pols (1990).
Evaluating text-to-speech systems: Some methodological aspects.
Speech Communication 9: 263-270.
- Van Bezooijen & Pols (1993)
-
R. Van Bezooijen & L. Pols (1993).
Evaluation of text-to-speech conversion for Dutch.
: V. Van Heuven & L. Pols, ,
Analysis and synthesis of speech: Strategic research towards
high-quality text-to-speech conversion, 339-360. Mouton de Gruyter, Berlin.
- Van Bezooijen & Van Hout (1985)
-
R. Van Bezooijen & R. Van Hout (1985).
Accentedness ratings and phonological variables as measures of
variation in pronunciation.
Language and Speech 28: 129-142.
- Van Coile (1989)
-
B. Van Coile (1989).
The DEPES development system for text-to-speech synthesis.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
250-253.
- Van Compernolle et al. (1991)
-
D. Van Compernolle, J. Smolders, P. Jaspers & T. Hellemans (1991).
Speaker clustering for dialectic robustness in speaker independent
recognition.
: Proceedings of Eurospeech '91,
2, 723-726, Genova.
- Van Dommelen (1993)
-
W. Van Dommelen (1993).
Speaker height and weight identification: A re-evaluation of some
old dates.
Journal of Phonetics 21: 337-341.
- Van Hemert et al. (1987)
-
J. Van Hemert, U. Adriaens-Porzig & L. Adriaens (1987).
Speech synthesis in the SPICOS-project.
: H. Tillmann & G. Willée, ,
Analyse und Synthese gesprochener Sprache: Vorträge im Rahmen der
Jahrestagung 1987 der Gesellschaft für Linguistische Datenverarbeitung
e.V., Bonn, 4-6 March, 34-39. Olms, Hildesheim.
- Van Heuven & Scharpff (1991)
-
V. Van Heuven & P. Scharpff (1991).
Acceptability of several speech pausing strategies in low quality
speech synthesis; interaction with intelligibility.
: Proceedings of the 12th International
Congress of Phonetic Sciences, 458-461, Aix-en-Provence.
- Van Holsteijn (1993)
-
Y. Van Holsteijn (1993).
TextScan: A preprocessing module for automatic text-to-speech
conversion.
: V. Van Heuven & L. Pols, ,
Analysis and synthesis of speech, strategic research towards
high-quality text-to-speech generation, 27-41. Mouton de Gruyter,
Berlin.
- Van Hout (1989)
-
R. Van Hout (1989).
De structuur van taalvariatie, een sociolinguistisch onderzoek naar
het stadsdialect van nijmegen.
Doctoral dissertation, University of Nijmegen, Nijmegen.
- Van Santen (1992)
-
J. Van Santen (1992).
Diagnostic perceptual experiments for text-to-speech system
evaluation.
: Proceedings of the International
Conference on Spoken Language Processing, ICSLP, 1,
555-558.
- Van Santen (1993)
-
J. Van Santen (1993).
Perceptual experiments for diagnostic testing of text-to-speech
systems.
Computer Speech and Language 7: 49-100.
- Van Santen (1994)
-
J. Van Santen (1994).
Using statistics in text-to-speech system construction.
: Proceedings of the ESCA/IEEE Workshop on
Speech Synthesis, 240-243, Mohonk NY.
- Van Son et al. (1988)
-
N. Van Son, L. Pols, S. Sandri & P. Salza (1988).
First quality evaluation of a diphone-based speech synthesis system
for Italian.
: Proceedings of the 7th FASE/Speech '88
Symposium, 2, 429-436, Edinburgh.
- Vergeynst et al. (1993)
-
N. Vergeynst, K. Edwards, J. Foster & M. Jack (1993).
Spoken dialogues for human-computer interaction over the telephone:
Complexity measures.
: Proceedings of the 3rd European Conference
on Speech Communication and Technology, 1415-1418, Berlin,
September.
- Vintsyuk (1971)
-
T. Vintsyuk (1971).
Elementwise recognition of continuous speech composed of words from a
specified dictionary.
Cybernetics, March-April, 7: 133-143.
- Voiers (1977)
-
W. Voiers (1977).
Diagnostic evaluation of speech intelligibility.
Speech intelligibility and speaker recognition 2: 374-384.
Benchmark papers in acoustics, M.E. Hawley (ed.).
- Voiers (1983)
-
W. Voiers (1983).
Evaluating processed speech using the Diagnostic Rhyme Test.
Speech Technology 1: 338-352.
- Voiers et al. (1975)
-
W. Voiers, A. Sharpley & C. Hehmsoth (1975).
Research on diagnostic evaluation of speech intelligibility.
Research Report AFCRL-72-0694, Air Force Cambridge Research
Laboratories, Bedford, Massachusetts.
- Vroomen et al. (1993)
-
J. Vroomen, R. Collier & S. Mozziconacci (1993).
Duration and intonation in emotional speech.
: Proceedings of the Eurospeech '93,
1, 577-580, Berlin.
- Wahlster (1993)
-
W. Wahlster (1993).
VERBMOBIL, translation of face-to-face dialogs.
: Proceedings of the Eurospeech '93, opening
and plenary sessions, 29-38, Berlin.
- Waibel (1988)
-
A. Waibel (1988).
Prosody and speech recognition.
Research notes in artificial intelligence, Pitman Publishing, London.
- Waibel et al. (1991)
-
A. Waibel, A. Jain, A. McNair, H. Saito, A. Hauptmann & J. Tebelskis (1991).
A speech-to-speech translation system using connectionist and
symbolic processing strategies.
: Proceedings of the IEEE International
Conference on Acoustics, Speech, and Signal Processing, ICASSP-91,
793-796.
- Waibel & Lee (1990)
-
A. Waibel & K.-F. Lee, (1990).
Readings in speech recognition.
Morgan Kaufmann Publishers, San Mateo, California.
- Wall & Schwartz (1991)
-
L. Wall & R. Schwartz (1991).
Programming perl.
O'Reilly & Associates Inc., Sebastopol, CA.
- Webers (1985)
-
J. Webers (1985).
Tonstudiotechnik.
Franzis, Munich, Germany.
- Wells (1987)
-
J. Wells (1987).
Computer-coded phonetic transcription.
Journal of the International Phonetic Association 17(2):
94-114.
- Wells (1989)
-
J. Wells (1989).
Computer-coded phonemic notation of individual languages of the
European Community.
Journal of the International Phonetic Association 19(1):
31-54.
- Wells (1993a)
-
J. Wells (1993a).
Applying SAM-PA to Spanish, Portuguese, and Greek: A
preliminary discussion document.
: ESPRIT Project 6819 (SAM-A), ,
Speech technology assessment in multilingual applications. London.
Document No: SAM-A/D1-Appendix B, SAM-A periodic progress report,
Year 1, 1 April 1993-30 September 1993.
- Wells (1993b)
-
J. Wells (1993b).
An update on SAMPA.
: ESPRIT Project 6819 (SAM-A), ,
Speech technology assessment in multilingual applications,
1-6. London.
Document No: SAM-A/D1-Appendix A, SAM-A periodic progress report,
Year 1, 1 April 1993-30 September 1993.
- Whittaker & Stenton (1989)
-
S. Whittaker & P. Stenton (1989).
User studies and the design of natural language systems.
: Proceedings of the 4th conference of the
European Chapter of the Association for Computational Linguistics,
116-123, Manchester.
- Willems et al. (1988)
-
N. Willems, R. Collier & J. 't Hart (1988).
Synthesis scheme for British English intonation.
Journal of the Acoustical Society of America, JASA 84:
1250-1261.
- Winer (1971)
-
B. Winer (1971).
Statistical principles in experimental design.
McGraw-Hill, New York, .
- Winski & Fourcin (1994)
-
R. Winski & A. Fourcin (1994).
A common European approach to assessment, corpora and standards.
: K. Varghese, S. Pfleger & J. Lefevre,
, Advanced speech applications. European research on speech
technology (Research Reports ESPRIT Volume 1). Springer-Verlag, Berlin.
- Winski et al. (1995)
-
R. Winski, R. Moore & D. Gibbon (1995).
EAGLES spoken language working group: Overview and results.
: Proceedings of the 4th European Conference
on Speech Communication and Technology - Eurospeech'95, 841-844,
Madrid, September 1995.
- Witten (1982)
-
I. Witten (1982).
Principles of computer speech.
Academic Press, New York, N.Y.
- Woodland et al. (1995)
-
P. Woodland, C. Leggetter, J. Odell, V. Valtech & S. Young (1995).
The 1994 HTK large vocabulary speech recognition system.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
I, 73-76, Detroit, MI, May.
- Woods & Zue (1976)
-
W. Woods & V. Zue (1976).
Dictionary expansion via phonological rules for a speech
understanding system.
: Proceedings of the IEEE International
Conference on Acoustics, Speech and Signal Processing, ICASSP,
561-564, Philadelphia.
- Wooffitt & Fraser (1992)
-
R. Wooffitt & N. Fraser (1992).
We're off to ring the Wizard, the wonderful Wizard of Oz.
: G. Button, , Technology in Working
Order: Studies of work, interaction and technology, 211-230.
Routeledge, London.
- Woszczyna et al. (1993)
-
M. Woszczyna, N. Coccaro, A. Eisele, A. Lavie, A. McNair, T. Polzin, I. Rogina, C. Rose, T. Sloboda, M. Tomita, J. Tsutsumi, N. Waibel & W. Ward (1993).
Recent advances in Janus: A speech translation system.
: Proceedings of a Workshop: Human Language
Technology, 211-216, 21-24 March, Princeton, NJ.
- Wright et al. (1993)
-
J. Wright, G. Jones & H. Lloyd-Thomas (1993).
A consolidated language model for speech recognition.
: Proceedings of the European Conference on
Speech Communication and Technology, 977-980, Berlin, September.
- Yamron (1994)
-
J. Yamron (1994).
A generalization of n-grams.
: Proceedings of the DARPA Workshop on
Robust Speech Recognition, Rutgers University, Piscataway, NJ, July-August.
- Yarrington & Foulds (1993)
-
D. Yarrington & R. Foulds (1993).
Personalizing synthesized voices.
: Proceedings of the ESCA Workshop on Speech
and Language Technologies for Disabled Persons, 169-172,
Stockholm.
- Young et al. (1989)
-
S. Young, A. Hauptmann, W. Ward, E. Smith & P. Werner (1989).
High level knowledge sources in usable speech recognition systems.
Communications of the ACM 32(2): 183-194.
Also in: A. Waibel and K.-F. Lee, eds., (1990), Readings in speech
recognition, Morgan Kaufmann Publishers, San Mateo, California, 538-549.
- Zue et al. (1991)
-
V. Zue, J. Glass, D. Goodine, L. Hirschman, H. Leung, M. Phillips, J. Polifroni & S. Seneff (1991).
The MIT ATIS system: Preliminary development, spontaneous
speech data collection, and performance evaluation.
: Proceedings of the 2nd European Conference
on Speech Communication and Technology, 537-540, Genova,
September.
EAGLES SWLG SoftEdition, May 1997. Get the book...