Bibliography

Works Cited

Abbott, K. 2002. Voice enabling Web applications: VoiceXML and beyond. Berkeley, CA: Apress.

Allen, J. 1995. Natural language understanding. Redwood City, CA: Benjamin Cummings.

Baber, C. 1997. Beyond the desktop. San Diego, CA: Academic Press.

Bailey, C.J. 1999. "You and your English grammar." In M. Celce-Murcia and D. Larsen-Freeman, eds., The grammar book. 2nd edition. Boston: Heinle and Heinle Publishers.

Balentine, B. 1999. Re-engineering the speech menu: A "Device" approach to interactive list-selection. In D. Gardner-Bonneau, ed., Human factors and voice interface systems, 213-215. Norwell, MA: Kluwer Academic Publishers.

Balogh, J. 2002. Methodologies and best practices: An overview. Usability Workshop. SpeechTek 2002, New York.

Balogh, J., N. LeDuc, and M. Cohen. 2001. "Navigating the voice Web." Proceedings of UAHCI (Universal Access for Human Computer Interaction) 2001.

Boyce, S. 1999. Spoken natural language dialogue systems: User interface issues for the future. In D. Gardner-Bonneau, ed., Human factors and voice interface systems, 205-235. Norwell, MA: Kluwer Academic Publishers.

Bransford, J.D., and M.K. Johnson. 1973. Considerations of some problems of comprehension. In W.G. Chase, ed., Visual information processing. Orlando, FL: Academic Press.

Broadbent, D.E. 1975. The magic number seven after fifteen years. In A. Kennedy and A. Wilkes, eds., Studies in long term memory. London: Wiley.

Campbell, J. 1997. Speaker recognition: A tutorial. Proceedings of the IEEE 85: 1437-1462.

CCIR-1. 1999. Attitudes to recognition accuracy. Technology Report, Dialogues Spotlight Research, Centre for Communication Interface Research, University of Edinburgh. http://spotlight.ccir.ed.ac.uk/

CCIR-2. 1999. Dialogues for speaker verification and operator hand-over. Technology Report, Dialogues Spotlight Research, Centre for Communication Interface Research, University of Edinburgh. http://spotlight.ccir.ed.ac.uk/

CCIR-3. 1999. The effects of speaker state (tone of voice) and speaker style (fast track) in dialogue prompts. Technology Report, Dialogues Spotlight Research, Centre for Communication Interface Research, University of Edinburgh. http://spotlight.ccir.ed.ac.uk/

CCIR-4. 1999. The priming effects of telephone tutorials. Technology Report, Dialogues Spotlight Research, Centre for Communication Interface Research, University of Edinburgh. http://spotlight.ccir.ed.ac.uk/

CCIR-5. 1999. User attitudes towards real and synthetic speech. Technology Report, Dialogues Spotlight Research, Centre for Communication Interface Research, University of Edinburgh. http://spotlight.ccir.ed.ac.uk/

Celce-Murcia, M., and D. Larsen-Freeman. 1999. The grammar book. 2nd edition. Boston: Heinle and Heinle Publishers.

Chu-Carroll, J., and J. Nickerson. 2000. Evaluating automatic dialogue strategy adaptation for a spoken dialogue system. Proceedings of NAACL (North American Chapter of the Association for Computational Linguistics) 2000, Seattle, WA.

Chu-Carroll, J., and M. Brown. 1998. User modeling and user adapted interaction. Special Issue on Computational Models of Mixed Initiative Interaction 8(3+4): 215-253. Also appeared in S. Haller, S. McRoy, and A. Kobsa, eds., Computational models of mixed-initiative interaction, 49-87. Boston: Kluwer Academic Publishers, 1999.

Cohen, M. 1991. Combining linguistic knowledge with statistical pattern recognition techniques for speech recognition. Keynote talk and Proceedings of Expert Systems and Their Microcomputer Applications, Ankara, Turkey.

Cohen, M. 2000. Surfing the voice Web: Issues in the design of a voice browser. Keynote talk, ASR2000, Paris.

Cohen, M. 2001. VUI design under the microscope: User centered design methodology. V-World 2001, San Diego, CA.

Cohen, M., Z. Rivlin, and H. Bratt. 1995. Speech recognition in the ATIS domain using multiple knowledge sources. Proceedings of the Spoken Language Systems Technology Workshop, ARPA, Austin, TX.

Crystal, D. 1992. The Cambridge encyclopedia of language. Cambridge, UK: The Cambridge University Press.

Crystal, D. 1995. The Cambridge encyclopedia of the English language. Cambridge, UK: The Cambridge University Press.

Daneman, M., and P.A. Carpenter. 1980. Individual differences in working memory and reading. Journal of Verbal Learning & Verbal Behavior 19(4): 450-466.

Dumas, J., and J. Redish. 1999. A practical guide to usability testing. Revised edition. Exeter, UK: Intellect.

Dutoit, T. 1997. An introduction to text-to-speech synthesis. Boston: Kluwer Academic Publishers.

Dutton, R., J. Foster, and M. Jack. 1999. Please mind the doors: Do interface metaphors improve the usability of voice response services? BT Technology Journal 17(1): 172-177.

Edgington, M., A. Lowry, P. Jackson, A. Breen, and S. Minnis. 1998a. Overview of current text-to-speech techniques, part I: Text and linguistic analysis. In F.A. Westall, D. Johnston, and A. Lewis, eds., Speech technology for telecommunications. New York: Chapman & Hall.

Edgington, M., A. Lowry, P. Jackson, A. Breen, and S. Minnis. 1998b. Overview of current text-to-speech techniques, part II: Prosody and speech generation. In F.A. Westall, D. Johnston, and A. Lewis, eds., Speech technology for telecommunications. New York: Chapman & Hall.

ETSI (European Telecommunications Standards Institute). 2002. Generic spoken command vocabulary for ICT devices and services. ETSI DES/HF-00021 v 0.0.40 (2002-05-27). www.etsi.org.

Fraser, J., and G. Gilbert. 1991. Simulating speech systems. Computer Speech and Language 5: 81-99.

Furui, S. 1996. An overview of speaker recognition technology. In C.F. Lee and F. Soong, eds., Automatic speech and speaker recognition, 31-66. Boston: Kluwer Academic.

Gardner-Bonneau, D.J. 1992. Human factors in interactive voice response applications: "Common sense" is an uncommon commodity. Journal of the American Voice I/O Society 12: 1-12.

Giangola, J. 2000. Building naturalness into prompt design. V-World 2000, Scottsdale, AZ.

Giles, H., and P. Powesland. 1975. Speech style and social evaluation. (European Monographs in Social Psychology). New York: Harcourt Brace.

Giles, H., and P. Smith. 1979. Accommodation theory: Optimal levels of convergence. In H. Giles and R. St Clair, eds., Language and social psychology, 45-65. Oxford: Blackwell.

Gold, B., and N. Morgan. 2000. Speech and audio signal processing: Processing and perception of speech and music. New York: John Wiley & Sons.

Grice, H. P. 1975. Logic and conversation. In P. Cole and J.L. Morgan, eds., Speech acts, 41-58. New York: Academic Press.

Halliday, M.A.K. 1994. An introduction to functional grammar. 2nd edition. London: Edward Arnold.

Ishihara, R. 2003. Enhancing TTS performance. Proceedings of Telephony Voice User Interface Conference, San Diego.

Jackson, E., D. Appelt, J. Bear, R. Moore, and A. Podlozny. 1991. A template matcher for robust NL interpretation. Proceedings of the Fourth DARPA Workshop on Speech and Natural Language, Pacific Grove, CA.

Jelinek, F. 1997. Statistical methods for speech recognition. Cambridge, MA: MIT Press.

Kamm, C., D. Litman, and M.A. Walker. 1998. From novice to expert: The effect of tutorials on user expertise with spoken dialogue systems. Proceedings of ICSLP (International Conference on Spoken Language Processing) 1998, Sydney, Australia.

Kamm, C., M.A. Walker, and D. Litman. 1999. Evaluating spoken language systems. Proceedings of AVIOS (Applied Voice Input/Output Association) 1999, San Jose, CA.

Kramer, G., ed. 1994. Auditory display: Sonification, audification, and auditory interfaces. Proceedings Volume XVIII, Santa Fe Institute, Studies in the Sciences of Complexity. Reading, MA: Addison-Wesley.

Labov, W. 1966. The social stratification of English in New York City. Washington, DC: Center for Applied Linguistics.

Larson, J. 2003. VoiceXML: Introduction to developing speech applications. Upper Saddle River, NJ: Prentice Hall.

Manning, C., and H. Schütze. 1999. Foundations of statistical natural language processing. Cambridge, MA: MIT Press.

McClelland, I., and F. Brigham. 1990. Marketing ergonomics: How should ergonomics be packaged? Ergonomics 33(5): 519-526.

McConnell, S. 1996. Rapid development: Taming wild software schedules. Redmond, WA: Microsoft Press.

Melrose, S. L. 1999. Must and its periphrastic forms in American English usage. M.A. Thesis, UCLA. In Celce-Murcia and Larsen-Freeman, The grammar book. 2nd edition. Boston: Heinle and Heinle Publishers.

Miller, G. 1956. The magical number seven, plus or minus two: Some limits on our capacity for processing information. Psychological Review 63: 81-97.

Nielsen, J. 1993. Usability engineering. San Diego, CA: Morgan Kaufmann.

Norman, D.A. 2002. The design of everyday things. New York: Basic Books.

Norman, D.A., and S.W. Draper, eds. 1986. User centered system design: New perspectives on human-computer interaction. Hillsdale, NJ: Lawrence Erlbaum Associates.

Nowlin, R. 2001. VUI design under the microscope: Requirements definition. V-World 2001, San Diego, CA.

Nowlin, R. 2001. VUI design under the microscope: Tuning and validation. V-World 2001, San Diego, CA.

Oviatt, S. 1996. User-centered modeling for spoken language and multimodal interfaces. IEEE Multimedia 3(4): 26-35.

Page, J., and A. Breen. 1998. The Laureate text-to-speech system: Architecture and applications. In F.A. Westall, D. Johnston, and A. Lewis, eds., Speech technology for telecommunications. New York: Chapman & Hall.

Pierrehumbert, J. 1980. The phonetics and phonology of English intonation. Doctoral dissertation, MIT.

Preece, J., Y. Rogers, and H. Sharp. 2002. Interaction design: Beyond human-computer interaction. New York: John Wiley and Sons.

Quirk, R., and S. Greenbaum. 1973. A concise grammar of contemporary English. New York: Harcourt Brace Jovanovich.

Rabiner, L. 1989. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE 77: 257-286.

Rabiner, L., and B. Juang. 1993. Fundamentals of speech recognition. Englewood Cliffs, NJ: Prentice Hall.

Raman, T. 1997. Auditory user interfaces. Boston: Kluwer Academic.

Raskin, J. 2000. The humane interface. Boston: Addison-Wesley.

Reeves, B., and C. Nass. 1996. The media equation. Stanford, CA: Center for the Study of Language and Information.

Reynolds, D., and L. Heck. 2001. Speaker verification: From research to reality. International Conference on Acoustics, Speech, and Signal Processing. Tutorial. Salt Lake City, Utah.

Richards, J. 1980. Conversation. TESOL Quarterly XIV(4).

Rubin, J. 1994. Handbook of usability testing. New York: John Wiley and Sons.

Rudnicky, A., and W. Xu. 1999. An agenda-based dialog management architecture for spoken language systems. IEEE Automatic Speech Recognition and Understanding Workshop, Keystone, CO.

Schiffrin, D. 1987. Discourse markers. Cambridge, UK: Cambridge University Press.

Schiffrin, D. 1998. Approaches to discourse. Cambridge, MA: Blackwell Publishers.

Schumacher, R.M., Jr., M.L. Hardzinski, and A.L. Schwarz. 1995. Increasing the usability of interactive voice response systems: Research and guidelines for phone-based interfaces. Human Factors 37(2): 251-264.

Selkirk, E. 1995. Sentence prosody: Intonation, stress, and phrasing. In J.A. Goldsmith, ed., Phonological theory. Cambridge, MA: Blackwell Publishers.

Seneff, S., and J. Polifroni. 2000. Dialogue management in the Mercury flight reservation system. Presented at Satellite Dialogue Workshop, ANLP-NAACL, Seattle.

Sharma, C., and J. Kunins. 2002. VoiceXML: Strategies and techniques for effective voice application development with VoiceXML 2.0. New York: John Wiley and Sons.

Sheeder, T. 2001. VUI design under the microscope: Detailed design. V-World 2001, San Diego, CA.

Sheeder, T. 2001. VUI design under the microscope: High-level design. V-World 2001, San Diego, CA.

Sheeder, T., and J. Balogh. 2003. Say it like you mean it: Priming for structure in caller responses to a spoken dialog system. International Journal of Speech Technology 6(3): 103-111.

Shneiderman, B. 1998. Designing the user interface: Strategies for effective human-computer interaction. 3rd edition. Reading, MA: Addison-Wesley.

Soukup, B. 2000. Y'all come back now, y'hear: Language attitudes in the United States towards Southern American English. MA thesis, University of Vienna.

TSSC (Telephone Speech Standards Committee). 2000. Universal commands for telephony-based spoken language systems. Telephone Speech Standards Committee: Common Dialog Tasks Subcommittee. www.acm.org/sigchi/bulletin/2000.2/telephonepaper.pdf

van Santen, J., R. Sproat, J. Olive, and J. Hirschberg. 1997. Progress in speech synthesis. New York: Springer Publishers.

Weinschenk, S., and D. Barker. 2000. Designing effective speech interfaces. New York: John Wiley & Sons.

Weintraub, M., H. Murveit, M. Cohen, P. Price, J. Bernstein, G. Baldwin, and D. Bell. 1989. Linguistic constraints in hidden Markov based speech recognition. International Conference on Acoustics, Speech and Signal Processing, Glasgow, Scotland.

Wickelgren, W.A. 1964. Size of rehearsal group and short-term memory. Journal of Experimental Psychology 68: 413-419.

Yankelovich, N., G.A. Levow, and M. Marx. 1995. Designing SpeechActs: Issues in speech user interfaces. In I.R. Katz, R. Mack, and L. Marks, eds., Human Factors in Computing Systems. CHI 1995 Conference Proceedings, 369-376.


Works Consulted

Anderson, J.R. 1990. Cognitive psychology and its implications. New York, NY: W.H. Freeman, 167-170.

Attwater, D.J., and S.J. Whittaker. 1999. Large-vocabulary data-centric dialogues. BT Technology Journal 17(1): 149-159.

Balentine, B. 2003. The power of the pause. Speech Recognition Update 118.

Balogh, J. 2001. Strategies for concatenating recordings in a voice user interface: What we can learn from prosody. Extended Abstracts, CHI (Computer Human Interface) 2001, 249-250.

Bowen, J. 1975. Patterns of English pronunciation. Rowley, MA: Newbury House Publishers.

Cowley, C., and D. Jones. 1992. Synthesized or digitized? A guide to the use of computer speech. Applied Ergonomics Jun. 23 (3): 172-176.

Delogu, C., S. Conte, and C. Sementina. 1998. Cognitive factors in the evaluation of synthetic speech. Speech Communication 24(2): 153-168.

Francis, A., and H. Nusbaum. 1999. The effect of lexical complexity on intelligibility. International Journal of Speech Technology 3(1): 15-25.

Fucci, D., M. Reynolds, R. Bettagere, and M. Gonzales. 1995. Synthetic speech intelligibility under several experimental conditions. AAC: Augmentative & Alternative Communication Jun. 11 (2): 113-117.

Gong, L., and J. Lai. 2001. Shall we mix synthetic speech and human speech? Impact on users' performance, perception, and attitude. Proceedings of CHI (Computer-Human Interface) 2001, Seattle, WA. 158-165.

Higginbotham, D., A. Drazek, K. Kowarsky, and C. Scally. 1994. Discourse comprehension of synthetic speech delivered at normal and slow presentation rates. AAC: Augmentative & Alternative Communication Sept. 10 (3): 191-202.

Hoover, J., J. Reichle, D. Van Tasell, and D. Cole. 1987. The intelligibility of synthesized speech: ECHO II versus VOTRAX. Journal of Speech & Hearing Research Sep. 30 (3): 425-431.

Hudson, R. 1980. Sociolinguistics. London: Cambridge University Press.

James, F. 1996. Presenting HTML structure in audio: User satisfaction with audio hypertext. Proceedings of ICAD (International Conference on Auditory Display) 1996, Palo Alto, CA, 97-103.

Jurafsky, D., and J. Martin. 2000. Speech and language processing: An introduction to natural language processing, computational linguistics, and speech recognition. Upper Saddle River, NJ: Prentice Hall.

Kangas, K., and G. Allen. 1990. Intelligibility of synthetic speech for normal-hearing and hearing-impaired listeners. Journal of Speech & Hearing Disorders Nov. 55(4): 751-755.

Lai, J., D. Wood, and M. Considine. 2000. The effect of task conditions on the comprehensibility of synthetic speech. Proceedings of CHI 2000, 321-328. The Hague, Netherlands. New York: ACM.

Lamel, L., S. Rosset, J. Gauvain, S. Bennacef, M. Garnier-Rizet, and B. Prouts. 1998. The LIMSI ARISE system. Proceedings of IVTTA (Interactive Voice Technology for Telecommunication Applications) 1998, Turin, Italy, 209-214.

Lavelle, C-A., M. de Calmes, and G. Perennou. 1998. A study of users' behaviors in different states of a spontaneous oral dialogue with an automatic inquiry system. IEEE, Proceedings of IVTTA (Interactive Voice Technology for Telecommunication Applications) 1998, Turin, Italy, 118-123.

McInnes, F.R., D.J. Attwater, D. Edgington, M.S. Schmidt, and M.A. Jack. 1999. User attitudes to concatenated natural speech and text-to-speech synthesis in an automated information service. Eurospeech 1999, Proceedings of Speech Technology Symposium, Budapest.

Nass, C., Y. Moon, and N. Green. 1997. Are machines gender neutral? Gender-stereotypic responses to computers with voices. Journal of Applied Social Psychology May 27 (10): 864-876.

Oshrin, S., and J. Siders. 1987. The effect of word predictability on the intelligibility of computer synthesized speech. Journal of Computer-Based Instruction 14(3): 89-90.

Paris, C., M. Thomas, R. Gilson, and J. Kincaid. 2000. Linguistic cues and memory for synthetic and natural speech. Human Factors 42(3): 421-431.

Paris, C., R. Gilson, M. Thomas, and N. Silver. 1995. Effect of synthetic voice intelligibility on speech comprehension. Human Factors 37(2): 335-340.

Pinker, S. 1994. The language instinct. New York: William Morrow.

Potjer, J., A. Russel, L. Boves, and E. den Os. 1996. Subjective and objective evaluation of two types of dialogues in a call assistance service. Proceedings of IVTTA (Interactive Voice Technology for Telecommunications Applications) 1996, Basking Ridge, NJ, 89-92.

Ralston, J., D. Pisoni, S. Lively, and B.G. Greene. 1991. Comprehension of synthetic speech produced by rule: Word monitoring and sentence-by-sentence listening times. Human Factors 33(4): 471-491.

Reynolds, M., C. Isaacs-Duvall, B. Sheward, and M. Rotter. 2000. Examination of the effects of listening practice on synthesized speech comprehension. AAC: Augmentative & Alternative Communication 16(4): 250-259.

Reynolds, M., Z. Bond, and D. Fucci. 1996. Synthetic speech intelligibility: Comparison of native and non-native speakers of English. AAC: Augmentative & Alternative Communication 12(1): 32-36.

Rosch, E. 1976. Classification of real-world objects: Origins and representations in cognition. In S. Ehrlich and E. Tulving, eds., La Mémoire sémantique. Paris: Bulletin de Psychologie.

Rosenthal, M. 1974. The magic boxes: Pre-school children's attitudes toward black and standard English. Florida F. L. Reporter 210: 55-93.

Schwab, E., H. Nusbaum, and D. Pisoni. 1985. Some effects of training on the perception of synthetic speech. Human Factors 27(4): 395-408.

Smither, J. 1993. Short term memory demands in processing synthetic speech by old and young adults. Behaviour & Information Technology 12(6): 330-335.

Stern, S., J. Mullennix, C. Dyson, and S. Wilson. 1999. The persuasiveness of synthetic speech versus human speech. Human Factors 41(4): 588-595.

Stifelman, L.J., B. Arons, C. Schmandt, and E. Hulteen. 1993. VoiceNotes: A speech interface for a hand-held voice notetaker. Proceedings of INTERCHI 1993, ACM, New York.

Sutton, B., J. King, K. Hux, and D.R. Beukelman. 1995. Younger and older adults' rate performance when listening to synthetic speech. AAC: Augmentative & Alternative Communication 11(3): 147-153.

Venkatagiri, S. 1994. Effect of sentence length and exposure on the intelligibility of synthesized speech. AAC: Augmentative & Alternative Communication 10(2): 96-104.

Vromans, B., R.J. van Vark, B. Rueber, and A. Kellner. 1999. Extending the SUSI System with negative knowledge. Proceedings of Eurospeech 1999, Budapest.

Waterworth, J.A. 1983. Effect of intonation form and pause durations of automatic telephone number announcements on subjective preference and memory performance. Applied Ergonomics 14(1): 39-42.

Whalen, D., C. Hoequist, and S. Sheffert. 1995. The effects of breath sounds on the perception of synthetic speech. Journal of the Acoustical Society of America 97(5, pt. 1): 3147-3153.

Yankelovich, N. (in press). Using natural dialogs as the basis for speech interface design. In S. Luperfoy, ed., Automated spoken dialog systems. Cambridge, MA: MIT Press.
