Joke Collection Website - Public benefit messages - What is TTS?

What is TTS?

TTS is the abbreviation of Text To Speech, which means "from text to speech". It is an outstanding work that uses both linguistics and psychology. With the support of built-in chips and the design of neural networks, it intelligently converts text into natural speech streams. TTS technology converts text files in real time, and the conversion time can be calculated in seconds. Under the action of its unique intelligent voice controller, the voice of the text output is smooth, making the listener feel natural when listening to the information, without the coldness and jerkiness of machine voice output. TTS speech synthesis technology will soon cover the first and second level Chinese characters of the national standard. It has an English interface, automatically recognizes Chinese and English, and supports mixed reading of Chinese and English. All voices use real-person Mandarin as the standard pronunciation, achieving fast speech synthesis of 120-150 Chinese characters/second, and a reading speed of 3-4 Chinese characters/second, allowing users to hear clear and pleasant sound quality and coherent and smooth intonation. Nowadays, a small number of MP3 players have TTS function. \x0d\\x0d\ TTS text-to-speech conversion has a wide range of uses, including reading emails, voice prompts for IVR systems, etc. At present, IVR systems have been widely used in various industries (such as telecommunications, transportation, etc.). \x0d\ The key technology used in TTS is speech synthesis (SpeechSynthesis). Early TTS was generally implemented using dedicated chips, such as Texas Instruments' TMS50C10/TMS50C57, Philips' PH84H36, etc., but they were mainly used in household appliances or children's toys. \x0d\ TTS based on microcomputer applications is generally implemented with pure software, which mainly includes the following parts:\x0d\ ●Text analysis - Linguistic analysis of the input text, lexical, grammatical and semantic analysis sentence by sentence, to Determine the low-level structure of the sentence and the phoneme composition of each word, including text segmentation, word segmentation, polyphone processing, number processing, abbreviation processing, etc. \x0d\ ●Speech synthesis-Extract the words or phrases corresponding to the processed text from the speech synthesis library, and convert the linguistic description into speech waveforms. \x0d\ ●Rhyme processing - Quality of Synthetic Speech (Quality of Synthetic Speech) refers to the quality of the speech output by the speech synthesis system. It is generally evaluated subjectively in terms of clarity (or intelligibility), naturalness and coherence. Clarity is the percentage of correctly hearing and distinguishing meaningful words; naturalness is used to evaluate whether the sound quality of the synthesized speech is close to the human voice and whether the intonation of the synthesized words is natural; coherence is used to evaluate whether the synthesized sentences are fluent. \x0d\ To synthesize high-quality speech, the algorithm used is extremely complex, so the requirements for the machine are also very high. The complexity of the algorithm determines the system capacity of current microcomputers for concurrent multi-channel TTS. \x0d\\x0d\This is TTS