Automatic speech recognition (ASR) research has progressed from the recognition of read speech to the recognition of spontaneous
conversational speech in the past decade, prompting some in the field to re-evaluate ASR pronunciation models and their role
of capturing the increased phonetic variability within unscripted speech. Two basic approaches for modeling pronunciation
variation have emerged: encoding linguistic knowledge to pre-specify possible alternative pronunciations of words and deriving
alternatives directly from a pronunciation corpus. This tutorial is intended to ground the reader in the basic linguistic
concepts in phonetics and phonology that guide both of these techniques and to outline several pronunciation modeling strategies
that have been employed through the years. The chapter will conclude with a summary of some promising recent research directions.