|
Bibliography
Works
Cited
Abbott,
K. 2002. Voice enabling Web applications: VoiceXML and beyond. Berkeley,
CA: Apress
Allen,
J. 1995. Natural language understanding. Redwood City, CA: Benjamin
Cummings.
Baber,
C. (1997). Beyond the Desktop. San Diego, CA: Academic Press.
Bailey,
C.J. 1999. "You and your English grammar." In M. Celce-Murcia
and D. Larsen-Freeman, eds., The grammar book. 2nd edition. Boston:
Heinle and Heinle Publishers.
Balentine,
B. (1999). Re-engineering the speech menu: A "Device"
approach to interactive list-selection. In D. Gardner-Bonneau, ed.,
Human factors and voice interface systems, 213-215. Norwell, MA:
Kluwer Academic Publishers.
Balogh,
J. 2002. Methodologies and best practices: An overview. Usability
Workshop. SpeechTek 2002, New York.
Balogh,
J., N. LeDuc, and M. Cohen. 2001. "Navigating the voice Web."
Proceedings of UAHCI (Universal Access for Human Computer Interaction)
2001.
Boyce,
S. 1999. Spoken natural language dialogue systems: User interface
issues for the future. In D. Gardner-Bonnea, ed., Human factors
and voice interface systems, 205-235. Norwell, MA: Kluwer Academic
Publishers.
Bransford,
J.D., and M.K. Johnson. 1973. Considerations of some problems of
comprehension. In W.G. Chase, ed., Visual information processing.
Orlando, FL: Academic Press.
Broadbent,
D.E. 1975. The magic number seven after fifteen years. In A. Kennedy
and A. Wilkes, eds., Studies in long term memory. London: Wiley.
Campbell,
J. 1997. Speaker recognition: A tutorial. Proceedings of the IEEE
85: 1437-1462.
CCIR-1.
1999. Attitudes to recognition accuracy. Technology Report, Dialogues
Spotlight Research, Centre for Communication Interface Research,
University of Edinburgh. http://spotlight.ccir.ed.ac.uk/
CCIR-2.
1999. Dialogues for speaker verification and operator hand-over.
Technology Report, Dialogues Spotlight Research, Centre for Communication
Interface Research, University of Edinburgh. http://spotlight.ccir.ed.ac.uk/
CCIR-3.
1999. The effects of speaker state (tone of voice) and speaker style
(fast track) in dialogue prompts. Technology Report, Dialogues Spotlight
Research, Centre for Communication Interface Research, University
of Edinburgh. http://spotlight.ccir.ed.ac.uk/
CCIR-4.
1999. The priming effects of telephone tutorials. Technology Report,
Dialogues Spotlight Research, Centre for Communication Interface
Research, University of Edinburgh. http://spotlight.ccir.ed.ac.uk/
CCIR-5.
1999. User attitudes towards real and synthetic speech. Technology
Report, Dialogues Spotlight Research, Centre for Communication Interface
Research, University of Edinburgh. http://spotlight.ccir.ed.ac.uk/
Celce-Murcia,
M., and D. Larsen-Freeman. 1999. The grammar book. 2nd edition.
Boston: Heinle and Heinle Publishers.
Chu-Carroll,
J., and J. Nickerson. 2000. Evaluating automatic dialogue strategy
adaptation for a spoken dialogue system. Proceedings of NAACL (North
American Chapter of the Association for Computational Linguistics)
2000, Seattle, WA.
Chu-Carroll,
J., and M. Brown. 1998. User modeling and user adapted interaction.
Special Issue on Computational Models of Mixed Initiative Interaction
8(3+4): 215-253. Also appeared in S. Haller, S. McRoy, and A. Kobsa,
eds., Computational models of mixed-initiative interaction, 49-87.
Boston: Kluwer Academic Publishers, 1999.
Cohen,
M. 1991. Combining linguistic knowledge with statistical pattern
recognition techniques for speech recognition. Keynote talk and
Proceedings of Expert Systems and Their Microcomputer Applications,
Ankara, Turkey.
Cohen,
M. 2000. Surfing the voice Web: Issues in the design of a voice
browser. Keynote talk, ASR2000, Paris.
Cohen,
M. 2001. VUI design under the microscope: User centered design methodology.
V-World 2001, San Diego, CA.
Cohen,
M., Z. Rivlin, and H. Bratt. 1995. Speech recognition in the ATIS
domain using multiple knowledge sources. Proceedings of the Spoken
Language Systems Technology Workshop, ARPA, Austin, TX.
Crystal,
D. 1992. The Cambridge encyclopedia of language. Cambridge, UK:
The Cambridge University Press.
Crystal,
David. 1995. The Cambridge encyclopedia of the English language.
Cambridge, UK: The Cambridge University Press.
Daneman,
M., and P.A. Carpenter. 1980. Individual differences in working
memory and reading. Journal of Verbal Learning & Verbal Behavior
19(4): 450-466.
Dumas,
J., and J. Redish. 1999. A practical guide to usability testing.
Revised edition. Exeter, UK: Intellect.
Dutoit,
T. 1997. An introduction to text-to-speech synthesis. Boston: Kluwer
Academic Publishers.
Dutton,
R., J. Foster, and M. Jack. 1999. Please mind the doors: Do interface
metaphors improve the usability of voice response services? BT Technological
Journal 17(1): 172-177.
Edgington
M., A. Lowry, P. Jackson, A. Breen, and S. Minnis. 1998a. Overview
of current text-to-speech techniques, part I: Text and linguistic
analysis. In F.A. Westall, D. Johnston, and A. Lewis, eds., Speech
technology for telecommunications. New York: Chapman & Hall.
Edgington
M., A. Lowry, P. Jackson, A. Breen, and S. Minnis. 1998b. Overview
of current text-to-speech techniques, part II: Prosody and speech
generation. In F.A. Westall, D. Johnston, and A. Lewis, eds., Speech
technology for telecommunications. New York: Chapman & Hall.
ETSI
(European Telecommunications Standards Institute). 2002. Generic
spoken command vocabulary for ICT devices and services. ETSI DES/HF-00021
v 0.0.40 (2002-05-27). www.etsi.org.
Fraser,
J., and G. Gilbret. 1991. Simulating speech systems. Computer, Speech,
and Language 5: 81-99.
Furui,
S. 1996. An overview of speaker recognition technology. In C.F.
Lee and F. Soong, eds., Automatic speech and speaker recognition,
31-66. Boston: Kluwer Academic.
Gardner-Bonneau,
D.J. 1992. Human factors in interactive voice response applications:
"Common sense" is an uncommon commodity. Journal of the
American Voice I/O Society 12: 1-12.
Giangola,
J. 2000. Building naturalness into prompt design. V-World 2000,
Scottsdale, AZ.
Giles,
H., and P. Powesland. 1975. Speech style and social evaluation.
(European Monographs in Social Psychology). New York: Harcourt Brace.
Giles,
H., and P. Smith. 1979. Accommodation theory: Optimal levels of
convergence. In H. Giles and R. St Clair, eds., Language and social
psychology, 45-65. Oxford: Blackwell.
Gold,
B., and N. Morgan. 2000. Speech and audio signal processing: Processing
and perception of speech and music. New York: John Wiley & Sons.
Grice,
H. P. 1975. Logic and conversation. In P. Cole and J.L. Morgan,
eds., Speech acts, 41-58. New York: Academic Press.
Halliday,
M.A.K. 1994. An introduction to functional grammar. 2nd edition.
London: Edward Arnold.
Ishihara,
R. 2003. Enhancing TTS performance. Proceedings of Telephony Voice
User Interface Conference, San Diego.
Jackson,
E., D. Appelt, J. Bear, R. Moore, and A. Podlozny. 1991. A template
matcher for robust NL interpretation. Proceedings of the Fourth
DARPA Workshop on Speech and Natural Language, Pacific Grove, CA.
Jelinek,
F. 1997. Statistical methods for speech recognition. Cambridge,
MA: MIT Press.
Kamm,
C., D. Litman, and M.A. Walker. 1998. From novice to expert: The
effect of tutorials on user expertise with spoken dialogue systems.
Proceedings of ICSLP (International Conference on Spoken Language
Processing) 1998, Sydney, Australia.
Kamm,
C., M.A. Walker, and D. Litman. 1999. Evaluating spoken language
systems. Proceedings of AVIOS (Applied Voice Input/Output Association)
1999, San Jose, CA.
Kramer,
G., ed. 1994. Auditory display: Sonification, audification, and
auditory interfaces. Proceedings Volume XVIII, Santa Fe Institute,
Studies in the Sciences of Complexity. Reading, MA: Addison-Wesley.
Labov,
W. 1966. The social stratification of English in New York City.
Washington, DC: Center for Applied Linguistics.
Larson,
J. 2003. VoiceXML: Introduction to developing speech applications.
Upper Saddle River, NJ: Prentice Hall.
Manning,
C., and H. Schutze. 1999. Foundations of statistical natural language
understanding. Cambridge, MA: MIT Press.
McClelland,
I., and F. Brigham. 1990. Marketing ergonomics: How should ergonomics
be packaged? Ergonomics 33(5): 519-526.
McConnell,
S. 1996. Rapid development: Taming wild software schedules. Redmond,
WA: Microsoft Press.
Melrose,
S. L. 1999. Must and its periphrastic forms in American English
usage. M.A. Thesis, UCLA. In Celce-Murcia and Larsen-Freeman, The
grammar book. 2nd edition. Boston: Heinle and Heinle Publishers.
Miller,
G. 1956. The magical number seven, plus or minus two: Some limits
on our capacity for processing information. Psychological Review
63: 81-97.
Nielsen,
J. 1993. Usability engineering. San Diego, CA: Morgan Kaufman.
Norman,
D.A. 2002. The design of everyday things. New York: Basic Books.
Norman,
D.A., and S.W. Draper, eds. 1986. User centered system design: New
perspectives on human-computer interaction. Hillsdale, NJ: Lawrence
Erlbaum Associates.
Nowlin,
R. 2001. VUI design under the microscope: Requirements definition.
V-World 2001, San Diego, CA.
Nowlin,
R. 2001. VUI design under the microscope: Tuning and validation.
V-World 2001, San Diego, CA.
Oviatt,
S. 1996. User-centered modeling for spoken language and multimodal
interfaces. IEEE Multimedia 3(4): 26-35.
Page,
J., and A. Breen. 1998. The Laureate text-to-speech system: Architecture
and applications. In F.A. Westall, D. Johnston, and A. Lewis, eds.,
Speech technology for telecommunications. New York: Chapman &
Hall.
Pierrehumbert,
J. 1980. The phonetics and phonology of English intonation. Doctoral
dissertation, MIT.
Preece,
J., Y. Rogers, and H. Sharp. 2002. Interaction design: Beyond human-computer
interaction. New York: John Wiley and Sons.
Quirk,
R., and S. Greenbaum. 1973. A concise grammar of contemporary English.
New York: Harcourt Brace Jovanovich.
Rabiner,
L. 1989. A tutorial on hidden Markov models and selected applications
in speech recognition. Proceedings of the IEEE 77: 257-286.
Rabiner,
L., and B. Juang. 1993. Fundamentals of speech recognition. Englewood
Cliffs, NJ: Prentice Hall.
Raman,
T. 1997. Auditory user interfaces. Boston: Kluwer Academic.
Raskin,
J. 2000. The humane interface. Boston: Addison-Wesley.
Reeves,
B., and C. Nass. 1996. The media equation. Stanford, CA: Center
for the Study of Language and Information.
Reynolds,
D., and L. Heck. 2001. Speaker verification: From research to reality.
International Conference on Acoustics, Speech, and Signal Processing.
Tutorial. Salt Lake City, Utah.
Richards,
J. 1980. Conversation. TESOL Quarterly XIV(4).
Rubin,
J. 1994. Handbook of usability testing. New York: John Wiley and
Sons.
Rudnicky,
A., and W. Xu. 1999. An agenda-based dialog management architecture
for spoken language systems. IEEE Automatic Speech Recognition and
Understanding Workshop, Keystone, CO.
Schiffrin,
D. 1987. Discourse markers. Cambridge, UK: Cambridge University
Press.
Schiffrin,
D. 1998. Approaches to discourse. Cambridge, MA: Blackwell Publishers.
Schumacher,
R.M., Jr., M.L. Hardzinski, and A.L. Schwarz. 1995. Increasing the
usability of interactive voice response systems: Research and guidelines
for phone-based interfaces. Human Factors 37(2): 251-264.
Selkirk,
E. 1995. Sentence prosody: Intonation, stress, and phrasing. In
John A. Goldsmith, ed., Phonological theory. Cambridge, MA: Blackwell
Publishers.
Seneff,
S., and J. and Polifroni. 2000. Dialogue management in the Mercury
flight reservation system. Presented at Satellite Dialogue Workshop,
ANLP-NAACL, Seattle.
Sharma,
C., and J. Kunins. 2002. VoiceXML: Strategies and techniques for
effective voice application development with VoiceXML 2.0. New York:
John Wiley and Sons.
Sheeder,
T. 2001. VUI design under the microscope: Detailed design. V-World
2001, San Diego, CA.
Sheeder,
T. 2001. VUI design under the microscope: High-Level design. V-World
2001, San Diego, CA.
Sheeder,
T., and J. Balogh. 2003. Say it like you mean it: Priming for structure
in caller responses to a spoken dialog system. International Journal
of Speech Technology 6(3): 103-111.
Shneiderman,
B. 1998. Designing the user interface: Strategies for effective
human-computer interaction. 3rd Edition. Reading, MA: Addison-Wesley.
Soukup,
B. 2000. Y'all come back now, y'hear: Language attitudes in the
United States towards Southern American English. MA thesis, University
of Vienna.
TSSC
(Telephone Speech Standards Committee). 2000. Universal commands
for telephony-based spoken language systems. Telephone Speech Standards
Committee: Common Dialog Tasks Subcommittee. www.acm.org/sigchi/bulletin/2000.2/telephonepaper.pdf
van
Santen, J., R. Sproat, J. Olive, and J. Hirschbereg. 1997. Progress
in speech synthesis. New York: Springer Publishers.
Weinschenk,
S., and D. Barker. 2000. Designing effective speech interfaces.
New York: John Wiley & Sons.
Weintraub,
M., H. Murveit, M. Cohen, P. Price, J. Bernstein, G. Baldwin, and
D. Bell. 1989. Linguistic constraints in hidden Markov based speech
recognition. International Conference on Acoustics, Speech and Signal
Processing, Glasgow, Scotland.
Wickelgren,
W.A. 1974. Size of rehearsal group and short-term memory. Journal
of Experimental Psychology 68: 413-419.
Yankelovich,
N., G.A. Levow, and M. Marx. 1995. Designing SpeechActs: Issues
in speech user interfaces. In I.R. Katz, R. Mack, and L. Marks,
eds., Human Factors in Computing Systems. CHI 1995 Conference Proceedings,
369-376.
Works Consulted
Anderson,
J.R. 1990. Cognitive psychology and its implications. New York,
NY: W.H. Freeman, 167-170.
Attwater,
D.J., and S.J. Whittaker. 1999. Large-vocabulary data-centric dialogues.
BT Technology Journal 17(1), 149-159.
Balentine,
B. 2003. The power of the pause. Speech Recognition Update 118.
Balogh,
J. 2001. Strategies for concatenating recordings in a voice user
interface: What we can learn from prosody. Extended Abstracts, CHI
(Computer Human Interface) 2001, 249-250.
Bowen,
J. 1975. Patterns of English pronunciation. Rowley, MA: Newbury
House Publishers.
Cowley,
C., and D. Jones. 1992. Synthesized or digitized? A guide to the
use of computer speech. Applied Ergonomics Jun. 23 (3): 172-176.
Delogu,
C., S. Conte, and C. Sementina. 1998. Cognitive factors in the evaluation
of synthetic speech. Speech Communication 24(2): 153-168.
Francis,
A., and H. Nusbaum. 1999. The effect of lexical complexity on intelligibility.
International Journal of Speech Technology 3(1): 15-25.
Fucci,
D., M. Reynolds, R. Bettagere, and M. Gonzales. 1995. Synthetic
speech intelligibility under several experimental conditions. AAC:
Augmentative & Alternative Communication Jun. 11 (2): 113-117.
Gong,
L., and J. Lai. 2001. Shall we mix synthetic speech and human speech?
Impact on users' performance, perception, and attitude. Proceedings
of CHI (Computer-Human Interface) 2001, Seattle, WA. 158-165.
Higginbotham,
D., A. Drazek, K. Kowarsky, and C. Scally. 1994. Discourse comprehension
of synthetic speech delivered at normal and slow presentation rates.
AAC: Augmentative & Alternative Communication Sept. 10 (3):
191-202.
Hoover,
J., J. Reichle, D. Van Tasell, and D. Cole. 1987. The intelligibility
of synthesized speech: ECHO II versus VOTRAX. Journal of Speech
& Hearing Research Sep. 30 (3): 425-431.
Hudson,
R. 1980. Sociolinguistics. London: Cambridge University Press.
James,
F. 1996. Presenting HTML structure in audio: User satisfaction with
audio hypertext. Proceedings of ICAD (International Conference on
Auditory Display) 1996, Palo Alto, CA, 97-103.
Jurafsky,
D., and J. Martin. 2000. Speech and language processing: An introduction
to natural language processing, computational linguistics, and speech
recognition. Upper Saddle River, NJ: Prentice Hall.
Kangas,
K., and G. Allen. 1990. Intelligibility of synthetic speech for
normal-hearing and hearing-impaired listeners. Journal of Speech
& Hearing Disorders Nov. 55(4): 751-755.
Lai,
J., D. Wood, and M. Considine. 2000. The effect of task conditions
on the comprehensibility of synthetic speech. Proceedings of CHI
2000, 321-328. The Hague, Netherlands. New York: ACM.
Lamel,
L., S. Rosset, J. Gauvain, S. Bennacef, M. Garnier-Rizet, and B.
Prouts. 1998. The LIMSI ARISE system. Proceedings of IVTTA (Interactive
Voice Technology for Telecommunication Applications) 1998, Turin,
Italy, 209-214.
Lavelle,
C-A., M. de Calmes, and G. Perennou. 1998. A study of users' behaviors
in different states of a spontaneous oral dialogue with an automatic
inquiry system. IEEE, Proceedings of IVTTA (Interactive Voice Technology
for Telecommunication Applications) 1998, Turin, Italy, 118-123.
McInnes,
F.R., D.J. Attwater, D. Edgington, M.S. Schmidt, and M.A. Jack.
1999. User attitudes to concatenated natural speech and text-to-speech
synthesis in an automated information service. Eurospeech 1999,
Proceedings of Speech Technology Symposium, Budapest.
Nass,
C., Y. Moon, and N. Green. 1997. Are machines gender neutral? Gender-stereotypic
responses to computers with voices. Journal of Applied Social Psychology
May 27 (10): 864-876.
Oshrin,
S., and J. Siders. 1987. The effect of word predictability on the
intelligibility of computer synthesized speech. Journal of Computer-Based
Instruction 14(3): 89-90.
Paris,
C., M. Thomas, R. Gilson, and J. Kincaid. 2000. Linguistic cues
and memory for synthetic and natural speech. Human Factors 42(3):
421-431.
Paris,
C., R. Gilson, M. Thomas, and N. Silver. 1995. Effect of synthetic
voice intelligibility on speech comprehension. Human Factors 37(2):
335-340.
Pinker,
Steven. 1994. The language instinct. New York: William Morrow.
Potjer,
J., A. Russel, L. Boves, and E. den Os. 1996. Subjective and objective
evaluation of two types of dialogues in a call assistance service.
Proceedings of IVTTA (Interactive Voice Technology for Telecommunications
Applications) 1996, Basking Ridge, NJ, 89-92.
Ralston,
J., D. Pisoni, S. Lively, and Beth G. Greene. 1991. Comprehension
of synthetic speech produced by rule: Word monitoring and sentence-by-sentence
listening times. Human Factors 33(4): 471-491.
Reynolds,
M., C. Isaacs-Duvall, B. Sheward, and M. Rotter. 2000. Examination
of the effects of listening practice on synthesized speech comprehension.
AAC: Augmentative & Alternative Communication 16(4): 250-259.
Reynolds,
M., Z. Bond, and D. Fucci. 1996. Synthetic speech intelligibility:
Comparison of native and non-native speakers of English. AAC: Augmentative
& Alternative Communication 12(1): 32-36.
Rosch,
E. 1976. Classification of real-world objects: Origins and representations
in cognition. In S. Erlich and E. Tulvings, eds., La Mémoire
sémantique. Paris: Bulletin de Psychologie.
Rosenthal,
M. 1974. The magic boxes: Pre-school children's attitudes toward
black and standard English. Florida F. L. Reporter 210: 55-93.
Schwab,
E., H. Nusbaum, and D. Pisoni. 1985. Some effects of training on
the perception of synthetic speech. Human Factors 27(4): 395-408.
Smither,
J. 1993. Short term memory demands in processing synthetic speech
by old and young adults. Behaviour & Information Technology
12(6): 330-335.
Stern,
S., J. Mullennix, C. Dyson, and S. Wilson. 1999. The persuasiveness
of synthetic speech versus human speech. Human Factors 41(4): 588-595.
Stifelman,
L.J., B. Arons, C. Schmandt, and E. Hulteen. 1993. VoiceNotes: A
speech interface for a hand-held voice notetaker. Proceedings of
INTERCHI 1993, ACM, New York.
Sutton,
B., J. King, K. Hux, and D.R. Beukelman. 1995. Younger and older
adults' rate performance when listening to synthetic speech. AAC:
Augmentative & Alternative Communication 11(3): 147-153.
Venkatagiri,
S. 1994. Effect of sentence length and exposure on the intelligibility
of synthesized speech. AAC: Augmentative & Alternative Communication
10(2): 96-104.
Vromans,
B., R.J. van Vark, B. Rueber, and A. Kellner. Extending the SUSI
System with negative knowledge. Proceedings of Eurospeech 1999,
Budapest.
Waterworth,
J.A. 1983. Effect of intonation form and pause durations of automatic
telephone number announcements on subjective preference and memory
performance. Applied Ergonomics 14(1): 39-42.
Whalen,
D., C. Hoequist, and S. Sheffert. 1995. The effects of breath sounds
on the perception of synthetic speech. Journal of the Acoustical
Society of America 97(5, pt. 1): 3147-3153.
Yankelovich,
N. (in press). Using natural dialogs as the basis for speech interface
design. In S. Luperfoy, ed., Automated spoken dialog systems. Cambridge,
MA: MIT Press.
|