0. 说明
工具在: https://github.com/ruclion/G2P_Kb
英文用
- CMUDict
- 0,1,2重音
找了个不用搭环境的, 确实不错, 谢谢作者: https://github.com/Kyubyong/g2p
但是具体的原理和标准还没有特别关注, 以下摘抄自Git
0.1. 简介
此模块旨在将英语字素(拼写)转换为音素(读音)。在语音合成等多项任务中,它被认为是必不可少的。不像西班牙语或德语这样的许多语言可以通过拼写来推断单词的发音,英语单词通常远没有人们期望的那样。因此,如果我们想知道某个单词的发音,最好参考字典。但是,这种方法至少要考虑两个问题。首先,您不能消除同形异义词(具有多个发音的单词)的发音的歧义。 (请参阅下文。)其次,您无法检查单词是否不在词典中。 (请参阅下面的b。)
例子
a. I refuse to collect the refuse around here. (rɪ|fju:z as verb vs. |refju:s as noun)
b. I am an activationist. (activationist: newly coined word which means n. A person who designs and implements programs of treatment or therapy that use recreation and activities to help people whose functional abilities are affected by illness or disability. from WORD SPY
0.2. 方案
对