audio2_siaprueb
Speech to Text (STT):
Definition: A Speech-to-Text (STT) system, also called an Automatic Speech Recognition (ASR) system, transforms a speech recording into output text that contains, word by word, what is said in the recording. Example: raw audio -> STT -> "we are learning about speech to text"
We can describe the problem in two ways. Sequence labelling: the computer receives a continuous sequence of data (the audio waveform) and has to assign discrete "labels" (words or letters) to parts of that sound.
Sequence-to-Sequence Mapping: This is a more general term. It simply means mapping one type of sequence (audio) to another type of sequence (text).
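To make the two views concrete, here is a minimal toy sketch in Python. It assumes a recognizer has already assigned one label to each short audio frame (the sequence labelling view), then merges repeated labels so that a long input sequence becomes a shorter output text (the sequence-to-sequence view). The frame labels, the "_" silence marker, and the function name collapse_frames are invented for illustration and do not describe any particular STT system.

```python
from itertools import groupby

# Hypothetical per-frame labels: one label per short slice of audio.
# "_" marks frames with no speech. These values are made up for illustration.
frame_labels = ["_", "h", "h", "e", "e", "e", "_", "y", "y", "_"]


def collapse_frames(labels, silence="_"):
    """Merge runs of identical frame labels, then drop silence frames."""
    # groupby yields one key per run of consecutive equal labels.
    merged = [label for label, _run in groupby(labels)]
    return "".join(label for label in merged if label != silence)


text = collapse_frames(frame_labels)
# The input has 10 frames, the output has 3 letters:
# the two sequences do not need to have the same length.
print(len(frame_labels), "frames ->", repr(text))
```

Note that a silence frame between two identical labels keeps them separate, so ["a", "a", "_", "a"] collapses to "aa" rather than "a".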
STT EVALUATION:
STT systems make mistakes, so before evaluating, we must accept that these systems...
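One widely used way to quantify such mistakes is the word error rate (WER): the number of word substitutions, deletions, and insertions needed to turn the system output into the reference transcript, divided by the number of words in the reference. The sketch below is a plain Python illustration of that idea and is an assumption about how evaluation could be done, not necessarily the method these notes go on to describe. The function name word_error_rate and the lowercase, whitespace-based tokenization are choices made for this example only.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    # Assumption: lowercase and split on whitespace; real scoring tools
    # usually apply more careful text normalization.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion of a reference word
                curr[j - 1] + 1,                  # insertion of an extra word
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


reference = "we are learning about speech to text"
hypothesis = "we are learning about speech to test"  # one wrong word
print(word_error_rate(reference, hypothesis))
```

With one substituted word out of seven reference words, the WER here is 1/7. Because insertions count as errors, WER can exceed 1 when the hypothesis is much longer than the reference.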