Stony Brook University Computer Science

Course CSE542
Title Speech Processing
Description

CSE542 covers three main topics, each over a 4-week period:

  • Introduction to the collection and analysis of speech data for speech processing:
    Includes a brief introduction to corpus linguistics. Students will learn the range and types of spoken-language collections and how to analyze speech data using the Praat tool.
  • Introduction to speech recognition:
    Students will learn basic technologies for speech recognition, using the Hidden Markov Model Toolkit (HTK).
  • Introduction to concatenative text-to-speech synthesis:
    Students will learn the basics of text-to-speech synthesis (TTS), as well as current technologies for concatenative TTS. The TTS system Festival (or its Java version, FreeTTS) will be used.

Integration of speech recognition and TTS into other technologies (e.g., by means of VoiceXML and/or the speech SDKs under development by Microsoft, Sun (Java), and IBM) will also be discussed.
Prerequisite or Corequisite

Knowledge of C/C++, Java, and/or Perl preferred. Knowledge of phonetics a plus.

Credit Information 3 credits
Course Goals  
Course Webpage N/A
Course Coordinator  