Speech examples of "EditSpeech: A Text Based Speech Editing System Using Bidirectional Partial Inference"

paper: Arxiv

We use the VCTK and MST datasets for the English and Chinese demos, respectively. For each dataset, we provide 10 samples for each edit operation: insertion, replacement and deletion. For insertion and replacement, we provide the original audio and the edited audio from the baseline systems and our proposed system, both to demonstrate the editing effect and to let listeners compare the editing performance of the different systems. For deletion, we provide only the original audio and the edited audio from our proposed system, as the difference between systems is not noticeable.
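
Each sample below lists an original word list and an edited word list. As a reading aid, the following minimal sketch shows one way to recover the edit operation and the affected words from such a pair. It uses Python's standard `difflib`; the function name `edit_spans` is hypothetical and is not part of EditSpeech.

```python
from difflib import SequenceMatcher


def edit_spans(original, edited):
    """Return (operation, original_words, edited_words) for each changed span.

    `original` and `edited` are whitespace-separated word lists, as shown
    for each sample on this page. The operation is one of "insert",
    "replace" or "delete" (the opcode tags used by difflib).
    """
    a, b = original.split(), edited.split()
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    spans = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":  # keep only the spans that differ
            spans.append((tag, a[i1:i2], b[j1:j2]))
    return spans


if __name__ == "__main__":
    print(edit_spans(
        "both men were on one year contracts",
        "both men and women were on one year contracts",
    ))
```

The diff works on word tokens only, so it says nothing about where the edit falls in the audio; locating the corresponding frames requires a separate alignment step.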


Baseline systems for comparison:
Baseline systems 1 to 3 indicate how taking the text and speech context into account contributes to the naturalness of the edited speech, while baseline system 4 evaluates the naturalness improvement gained when both the left and right context are taken into consideration. We mainly compare EditSpeech and the baseline systems on the insertion and replacement operations. Because new content is added in these two operations, the differences between the systems' outputs are obvious.
Baseline system 1: The whole edited text after insertion/replacement and the speaker embedding of the original speaker are used to generate the speech, where only the outputs of the forward decoder are used.
Baseline system 2: The text to insert/replace and the speaker embedding of the original speaker are used to generate the mel-spectrogram segment to insert/replace, where only the outputs of the forward decoder are used. This mel-spectrogram segment is then concatenated with the unmodified part of the given mel-spectrogram to construct the edited mel-spectrogram.
Baseline system 3: A candidate mel-spectrogram is first synthesized as in baseline system 1. Then dynamic time warping (DTW) on mel-cepstral coefficients (MCEPs) is used to align the unmodified parts of the given and candidate mel-spectrograms. In this way, the modified part of the candidate mel-spectrogram can be located, and it is then concatenated with the unmodified part of the given mel-spectrogram to construct the edited mel-spectrogram.
Baseline system 4: This system is similar to the proposed system, except that the bidirectional fusion step is removed and only the outputs of the forward decoder are used.
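
The align-and-splice idea behind baseline system 3 (and the plain concatenation in baseline system 2) can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' implementation: it assumes the alignment features (for example MCEPs) and the mel-spectrograms share the same frame indices, that the edit boundaries in the original are already known as frame indices, and it uses an open-end DTW variant to find where each unmodified context ends in the candidate. The names `open_end_dtw` and `splice_edit` are hypothetical.

```python
import numpy as np


def open_end_dtw(query, ref):
    """Align all of `query` (T1, D) to a prefix of `ref` (T2, D).

    Returns how many leading frames of `ref` the best alignment covers.
    """
    n, m = len(query), len(ref)
    if n == 0:  # no context on this side of the edit
        return 0
    # Pairwise Euclidean distances between frames.
    dist = np.linalg.norm(query[:, None, :] - ref[None, :, :], axis=-1)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            best_prev = min(cost[i - 1, j - 1], cost[i - 1, j], cost[i, j - 1])
            cost[i, j] = dist[i - 1, j - 1] + best_prev
    # Open end: the query must be fully consumed, the reference need not be.
    return int(np.argmin(cost[n, 1:])) + 1


def splice_edit(orig_mel, cand_mel, orig_feat, cand_feat, edit_start, edit_end):
    """Build an edited mel-spectrogram from an original and a candidate.

    Frames [edit_start, edit_end) of the original are the part being
    replaced (edit_start == edit_end for a pure insertion).
    """
    # Locate the end of the left context in the candidate.
    left_len = open_end_dtw(orig_feat[:edit_start], cand_feat)
    # Locate the start of the right context by aligning both reversed.
    right_len = open_end_dtw(orig_feat[edit_end:][::-1], cand_feat[::-1])
    modified = cand_mel[left_len:len(cand_mel) - right_len]
    # Keep the original frames for the unmodified context.
    return np.concatenate(
        [orig_mel[:edit_start], modified, orig_mel[edit_end:]], axis=0
    )
```

Because the unmodified context is copied from the original recording and only the located segment comes from the synthesized candidate, any mismatch at the two joins is what a listener would hear as a discontinuity in the baseline samples.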


1. English VCTK dataset:


1.1 Insertion:
Each sample has 6 audio clips: one is the original human recording, and the other 5 are synthesized by the baseline and proposed systems. The synthesized audio is based on the modified text, in which some words are inserted into the text of the original audio.


Sample:1
original text: For that reason cover should not be given.
original word list: for that reason cover should not be given
edited word list: for that theoretical and realistic reason cover should not be given

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:2
original text: What do we want to do that for?
original word list: what do we want to do that for
edited word list: what do we this afternoon want to do that for

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:3
original text: Some have accepted it as a miracle without physical explanation.
original word list: some have accepted it as a miracle without physical explanation
edited word list: some have accepted it as a miracle never seen before without physical explanation

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:4
original text: Both men were on one-year contracts.
original word list: both men were on one year contracts
edited word list: both men and women were on one year contracts

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:5
original text: We're not seeing an image of the person, he said.
original word list: we're not seeing an image of the person he said
edited word list: we're not actually seeing an image of the person he said

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:6
original text: Many complicated ideas about the rainbow have been formed.
original word list: many complicated ideas about the rainbow have been formed
edited word list: many complicated ideas about the rainbow from researchers have been formed

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:7
original text: The parties include the Scottish Executive.
original word list: the parties include the scottish executive
edited word list: the parties found recently include the scottish executive

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:8
original text: And she is a successful actor with three decades of experience.
original word list: and she is a successful actor with three decades of experience
edited word list: and she is a successful talented beautiful actor with three decades of experience

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:9
original text: I am totally devoted to this job.
original word list: i am totally devoted to this job
edited word list: i am totally interested in and devoted to this job

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:10
original text: You should always be able to get out in some direction.
original word list: you should always be able to get out in some direction
edited word list: you should always be able to take courage and get out in some direction

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


1.2 Replacement:
Each sample has 6 audio clips: one is the original human recording, and the other 5 are synthesized by the baseline and proposed systems. The synthesized audio is based on the modified text, in which some words in the text of the original audio are replaced by other words.


Sample:1
original text: For that reason cover should not be given.
original word list: for that reason cover should not be given
edited word list: for that reason cover is impossible to be given

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:2
original text: What do we want to do that for?
original word list: what do we want to do that for
edited word list: what do we want to figure out for

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:3
original text: Some have accepted it as a miracle without physical explanation.
original word list: some have accepted it as a miracle without physical explanation
edited word list: some have accepted it as an undeniable fact without physical explanation

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:4
original text: Both men were on one-year contracts.
original word list: both men were on one year contracts
edited word list: both men were on ten months contracts

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:5
original text: We're not seeing an image of the person, he said.
original word list: we're not seeing an image of the person he said
edited word list: we're not hearing an audio of the person he said

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:6
original text: Many complicated ideas about the rainbow have been formed.
original word list: many complicated ideas about the rainbow have been formed
edited word list: many complicated ideas about how the universe evolves have been formed

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:7
original text: The parties include the Scottish Executive.
original word list: the parties include the scottish executive
edited word list: the parties include the English and American executive

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:8
original text: And she is a successful actor with three decades of experience.
original word list: and she is a successful actor with three decades of experience
edited word list: and she is a successful actor with about thirty years of experience

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:9
original text: I am totally devoted to this job.
original word list: i am totally devoted to this job
edited word list: i am totally fall in love with this job

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:10
original text: You should always be able to get out in some direction.
original word list: you should always be able to get out in some direction
edited word list: you should always be able to focus your research on some direction

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


1.3 Deletion:
Each sample has 2 audio clips: one is the original human recording, and the other is synthesized by the proposed system. The synthesized audio is based on the modified text, in which some words in the text of the original audio are deleted.


Sample:1
original text: For that reason cover should not be given.
original word list: for that reason cover should not be given
edited word list: for that reason cover given

original_audio

proposed system


Sample:2
original text: What do we want to do that for?
original word list: what do we want to do that for
edited word list: what do we do that for

original_audio

proposed system


Sample:3
original text: Some have accepted it as a miracle without physical explanation.
original word list: some have accepted it as a miracle without physical explanation
edited word list: some have accepted it without physical explanation

original_audio

proposed system


Sample:4
original text: Both men were on one-year contracts.
original word list: both men were on one year contracts
edited word list: both men were on contracts

original_audio

proposed system


Sample:5
original text: We're not seeing an image of the person, he said.
original word list: we're not seeing an image of the person he said
edited word list: we're not seeing person he said

original_audio

proposed system


Sample:6
original text: Many complicated ideas about the rainbow have been formed.
original word list: many complicated ideas about the rainbow have been formed
edited word list: many complicated ideas have been formed

original_audio

proposed system


Sample:7
original text: The parties include the Scottish Executive.
original word list: the parties include the scottish executive
edited word list: the parties include the executive

original_audio

proposed system


Sample:8
original text: And she is a successful actor with three decades of experience.
original word list: and she is a successful actor with three decades of experience
edited word list: and she is a successful actor with experience

original_audio

proposed system


Sample:9
original text: I am totally devoted to this job.
original word list: i am totally devoted to this job
edited word list: i am devoted to this job

original_audio

proposed system


Sample:10
original text: You should always be able to get out in some direction.
original word list: you should always be able to get out in some direction
edited word list: you should always get out in some direction

original_audio

proposed system


2. Chinese MST dataset:


2.1 Insertion:
Each sample has 6 audio clips: one is the original human recording, and the other 5 are synthesized by the baseline and proposed systems. The synthesized audio is based on the modified text, in which some words are inserted into the text of the original audio.


Sample:1
original text: 最后,一些粗浅观察吧,就多说几点吧。一,越南确实在勃兴。
original word list: zui4 hou4 yi1 xie1 cu1 qian3 guan1 cha2 ba5 jiu4 duo1 shuo1 ji3 dian3 ba5 yi1 yue4 nan2 que4 shi2 zai4 bo2 xing1
edited word list: zui4 hou4 yi1 xie1 我自己的 cu1 qian3 guan1 cha2 ba5 jiu4 duo1 shuo1 ji3 dian3 ba5 yi1 yue4 nan2 que4 shi2 zai4 bo2 xing1

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:2
original text: 这些差异和不同实际上就是横亘在穷人和富人之间的一道分界线。
original word list: zhe4 xie1 cha1 yi4 he2 bu4 tong2 shi2 ji4 shang4 jiu4 shi4 heng2 gen4 zai4 qiong2 ren2 he2 fu4 ren2 zhi1 jian1 de5 yi1 dao4 fen1 jie4 xian4
edited word list: zhe4 xie1 cha1 yi4 he2 bu4 tong2 shi2 ji4 shang4 jiu4 shi4 heng2 gen4 zai4 社会的 qiong2 ren2 he2 fu4 ren2 zhi1 jian1 de5 yi1 dao4 fen1 jie4 xian4

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:3
original text: 没多久,全世界的人都知道了,这个世界上还有一个美丽善良的姑娘叫拇指姑娘。
original word list: mei2 duo1 jiu3 quan2 shi4 jie4 de5 ren2 dou1 zhi1 dao4 le5 zhe4 ge4 shi4 jie4 shang4 hai2 you3 yi2 ge4 mei3 li4 shan4 liang2 de5 gu1 niang2 jiao4 mu3 zhi3 gu1 niang2
edited word list: mei2 duo1 jiu3 quan2 shi4 jie4 de5 ren2 dou1 不约而同 zhi1 dao4 le5 zhe4 ge4 shi4 jie4 shang4 hai2 you3 yi2 ge4 mei3 li4 shan4 liang2 de5 gu1 niang2 jiao4 mu3 zhi3 gu1 niang2

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:4
original text: “你怎么说我们也不会相信,如果真是你的,叫你的家人到局里领取。”
original word list: ni3 zen3 me5 shuo1 wo3 men5 ye3 bu2 hui4 xiang1 xin4 ru2 guo3 zhen1 shi4 ni3 de5 jiao4 ni3 de5 jia1 ren2 dao4 ju2 li3 ling3 qu3
edited word list: ni3 zen3 me5 shuo1 wo3 men5 ye3 bu2 hui4 xiang1 xin4 ru2 guo3 zhen1 shi4 ni3 de5 jiao4 ni3 de5 jia1 ren2 dao4 镇上的警察 ju2 li3 ling3 qu3

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:5
original text: 喂您好,我这边是瓜子二手车全国营销中心,今天专门给您打个电话,至少可以给您再便宜好几千,
original word list: wei4 nin2 hao3 wo3 zhe4 bian1 shi4 gua1 zi5 er4 shou3 che1 quan2 guo2 ying2 xiao1 zhong1 xin1 jin1 tian1 zhuan1 men2 gei3 nin2 da3 ge4 dian4 hua4 zhi4 shao3 ke3 yi3 gei3 nin2 zai4 pian2 yi2 hao3 ji3 qian1
edited word list: wei4 nin2 hao3 wo3 zhe4 bian1 shi4 最近成立的 gua1 zi5 er4 shou3 che1 quan2 guo2 ying2 xiao1 zhong1 xin1 jin1 tian1 zhuan1 men2 gei3 nin2 da3 ge4 dian4 hua4 zhi4 shao3 ke3 yi3 gei3 nin2 zai4 pian2 yi2 hao3 ji3 qian1

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:6
original text: 她请求老巫婆赶紧施行法术,让拉斯姆斯赶紧回来。
original word list: ta1 qing3 qiu2 lao3 wu1 po2 gan3 jin3 shi1 xing2 fa3 shu4 rang4 la1 si1 mu3 si1 gan3 jin3 hui2 lai2
edited word list: ta1 qing3 qiu2 lao3 wu1 po2 gan3 jin3 shi1 xing2 钻研二十年的 fa3 shu4 rang4 la1 si1 mu3 si1 gan3 jin3 hui2 lai2

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:7
original text: 受苦难折磨的中国人并不是我们几个,而是成千上万的。
original word list: shou4 ku3 nan4 zhe2 mo2 de5 zhong1 guo2 ren2 bing4 bu2 shi4 wo3 men5 ji3 ge4 er2 shi4 cheng2 qian1 shang4 wan4 de5
edited word list: shou4 ku3 nan4 zhe2 mo2 de5 zhong1 guo2 ren2 bing4 bu2 shi4 wo3 men5 现场在座的 ji3 ge4 er2 shi4 cheng2 qian1 shang4 wan4 de5

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:8
original text: 他们只记得,他心狠手辣地处理了那三位可怜的老家伙。
original word list: ta1 men5 zhi3 ji4 de5 ta1 xin1 hen3 shou3 la4 di4 chu3 li3 le5 na4 san1 wei4 ke3 lian2 de5 lao3 jia1 huo3
edited word list: ta1 men5 zhi3 凭记忆依稀 ji4 de5 ta1 xin1 hen3 shou3 la4 di4 chu3 li3 le5 na4 san1 wei4 ke3 lian2 de5 lao3 jia1 huo3

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:9
original text: 一个好的婚姻更容易获得好的收益。
original word list: yi2 ge4 hao3 de5 hun1 yin1 geng4 rong2 yi4 huo4 de2 hao3 de5 shou1 yi4
edited word list: yi2 ge4 hao3 de5 hun1 yin1 一般来说 geng4 rong2 yi4 huo4 de2 hao3 de5 shou1 yi4

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:10
original text: 那个被人讽来赞去的不可一世的诗人安静地坐在他的房子里。
original word list: na4 ge4 bei4 ren2 feng3 lai2 zan4 qu4 de5 bu4 ke3 yi1 shi4 de5 shi1 ren2 an1 jing4 di4 zuo4 zai4 ta1 de5 fang2 zi5 li3
edited word list: na4 ge4 bei4 ren2 feng3 lai2 zan4 qu4 de5 bu4 ke3 yi1 shi4 de5 shi1 ren2 an1 jing4 di4 zuo4 zai4 ta1 价值上亿 de5 fang2 zi5 li3

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


2.2 Replacement:
Each sample has 6 audio clips: one is the original human recording, and the other 5 are synthesized by the baseline and proposed systems. The synthesized audio is based on the modified text, in which some words in the text of the original audio are replaced by other words.


Sample:1
original text: 最后,一些粗浅观察吧,就多说几点吧。一,越南确实在勃兴。
original word list: zui4 hou4 yi1 xie1 cu1 qian3 guan1 cha2 ba5 jiu4 duo1 shuo1 ji3 dian3 ba5 yi1 yue4 nan2 que4 shi2 zai4 bo2 xing1
edited word list: zui4 hou4 yi1 xie1 个人看法 ba5 jiu4 duo1 shuo1 ji3 dian3 ba5 yi1 yue4 nan2 que4 shi2 zai4 bo2 xing1

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:2
original text: 这些差异和不同实际上就是横亘在穷人和富人之间的一道分界线。
original word list: zhe4 xie1 cha1 yi4 he2 bu4 tong2 shi2 ji4 shang4 jiu4 shi4 heng2 gen4 zai4 qiong2 ren2 he2 fu4 ren2 zhi1 jian1 de5 yi1 dao4 fen1 jie4 xian4
edited word list: zhe4 xie1 cha1 yi4 he2 bu4 tong2 shi2 ji4 shang4 jiu4 shi4 heng2 gen4 zai4 男生和女生 zhi1 jian1 de5 yi1 dao4 fen1 jie4 xian4

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:3
original text: 没多久,全世界的人都知道了,这个世界上还有一个美丽善良的姑娘叫拇指姑娘。
original word list: mei2 duo1 jiu3 quan2 shi4 jie4 de5 ren2 dou1 zhi1 dao4 le5 zhe4 ge4 shi4 jie4 shang4 hai2 you3 yi2 ge4 mei3 li4 shan4 liang2 de5 gu1 niang2 jiao4 mu3 zhi3 gu1 niang2
edited word list: mei2 duo1 jiu3 quan2 shi4 jie4 de5 ren2 dou1 zhi1 dao4 le5 zhe4 ge4 shi4 jie4 shang4 hai2 you3 yi2 ge4 沉鱼落雁 de5 gu1 niang2 jiao4 mu3 zhi3 gu1 niang2

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:4
original text: “你怎么说我们也不会相信,如果真是你的,叫你的家人到局里领取。”
original word list: ni3 zen3 me5 shuo1 wo3 men5 ye3 bu2 hui4 xiang1 xin4 ru2 guo3 zhen1 shi4 ni3 de5 jiao4 ni3 de5 jia1 ren2 dao4 ju2 li3 ling3 qu3
edited word list: ni3 zen3 me5 百般狡辩 wo3 men5 ye3 bu2 hui4 xiang1 xin4 ru2 guo3 zhen1 shi4 ni3 de5 jiao4 ni3 de5 jia1 ren2 dao4 ju2 li3 ling3 qu3

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:5
original text: 喂您好,我这边是瓜子二手车全国营销中心,今天专门给您打个电话,至少可以给您再便宜好几千,
original word list: wei4 nin2 hao3 wo3 zhe4 bian1 shi4 gua1 zi5 er4 shou3 che1 quan2 guo2 ying2 xiao1 zhong1 xin1 jin1 tian1 zhuan1 men2 gei3 nin2 da3 ge4 dian4 hua4 zhi4 shao3 ke3 yi3 gei3 nin2 zai4 pian2 yi2 hao3 ji3 qian1
edited word list: wei4 nin2 hao3 wo3 zhe4 bian1 shi4 华为手机 quan2 guo2 ying2 xiao1 zhong1 xin1 jin1 tian1 zhuan1 men2 gei3 nin2 da3 ge4 dian4 hua4 zhi4 shao3 ke3 yi3 gei3 nin2 zai4 pian2 yi2 hao3 ji3 qian1

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:6
original text: 她请求老巫婆赶紧施行法术,让拉斯姆斯赶紧回来。
original word list: ta1 qing3 qiu2 lao3 wu1 po2 gan3 jin3 shi1 xing2 fa3 shu4 rang4 la1 si1 mu3 si1 gan3 jin3 hui2 lai2
edited word list: ta1 qing3 qiu2 碰到的路人 gan3 jin3 shi1 xing2 fa3 shu4 rang4 la1 si1 mu3 si1 gan3 jin3 hui2 lai2

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:7
original text: 受苦难折磨的中国人并不是我们几个,而是成千上万的。
original word list: shou4 ku3 nan4 zhe2 mo2 de5 zhong1 guo2 ren2 bing4 bu2 shi4 wo3 men5 ji3 ge4 er2 shi4 cheng2 qian1 shang4 wan4 de5
edited word list: shou4 ku3 nan4 zhe2 mo2 de5 社会底层打工人 bing4 bu2 shi4 wo3 men5 ji3 ge4 er2 shi4 cheng2 qian1 shang4 wan4 de5

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:8
original text: 他们只记得,他心狠手辣地处理了那三位可怜的老家伙。
original word list: ta1 men5 zhi3 ji4 de5 ta1 xin1 hen3 shou3 la4 di4 chu3 li3 le5 na4 san1 wei4 ke3 lian2 de5 lao3 jia1 huo3
edited word list: ta1 men5 zhi3 ji4 de5 ta1 迅雷不及掩耳 di4 chu3 li3 le5 na4 san1 wei4 ke3 lian2 de5 lao3 jia1 huo3

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:9
original text: 一个好的婚姻更容易获得好的收益。
original word list: yi2 ge4 hao3 de5 hun1 yin1 geng4 rong2 yi4 huo4 de2 hao3 de5 shou1 yi4
edited word list: yi2 ge4 hao3 de5 工作平台 geng4 rong2 yi4 huo4 de2 hao3 de5 shou1 yi4

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


Sample:10
original text: 那个被人讽来赞去的不可一世的诗人安静地坐在他的房子里。
original word list: na4 ge4 bei4 ren2 feng3 lai2 zan4 qu4 de5 bu4 ke3 yi1 shi4 de5 shi1 ren2 an1 jing4 di4 zuo4 zai4 ta1 de5 fang2 zi5 li3
edited word list: na4 ge4 bei4 ren2 feng3 lai2 zan4 qu4 de5 bu4 ke3 yi1 shi4 de5 理论物理学家 an1 jing4 di4 zuo4 zai4 ta1 de5 fang2 zi5 li3

original_audio

baseline system 1

baseline system 2

baseline system 3

baseline system 4

proposed system


2.3 Deletion:
Each sample has 2 audio clips: one is the original human recording, and the other is synthesized by the proposed system. The synthesized audio is based on the modified text, in which some words in the text of the original audio are deleted.


Sample:1
original text: 最后,一些粗浅观察吧,就多说几点吧。一,越南确实在勃兴。
original word list: zui4 hou4 yi1 xie1 cu1 qian3 guan1 cha2 ba5 jiu4 duo1 shuo1 ji3 dian3 ba5 yi1 yue4 nan2 que4 shi2 zai4 bo2 xing1
edited word list: zui4 hou4 yi1 xie1 guan1 cha2 ba5 jiu4 duo1 shuo1 ji3 dian3 ba5 yi1 yue4 nan2 que4 shi2 zai4 bo2 xing1

original_audio

proposed system


Sample:2
original text: 这些差异和不同实际上就是横亘在穷人和富人之间的一道分界线。
original word list: zhe4 xie1 cha1 yi4 he2 bu4 tong2 shi2 ji4 shang4 jiu4 shi4 heng2 gen4 zai4 qiong2 ren2 he2 fu4 ren2 zhi1 jian1 de5 yi1 dao4 fen1 jie4 xian4
edited word list: zhe4 xie1 cha1 yi4 shi2 ji4 shang4 jiu4 shi4 heng2 gen4 zai4 qiong2 ren2 he2 fu4 ren2 zhi1 jian1 de5 yi1 dao4 fen1 jie4 xian4

original_audio

proposed system


Sample:3
original text: 没多久,全世界的人都知道了,这个世界上还有一个美丽善良的姑娘叫拇指姑娘。
original word list: mei2 duo1 jiu3 quan2 shi4 jie4 de5 ren2 dou1 zhi1 dao4 le5 zhe4 ge4 shi4 jie4 shang4 hai2 you3 yi2 ge4 mei3 li4 shan4 liang2 de5 gu1 niang2 jiao4 mu3 zhi3 gu1 niang2
edited word list: mei2 duo1 jiu3 quan2 shi4 jie4 de5 ren2 dou1 zhi1 dao4 le5 zhe4 ge4 shi4 jie4 shang4 hai2 you3 yi2 ge4 gu1 niang2 jiao4 mu3 zhi3 gu1 niang2

original_audio

proposed system


Sample:4
original text: “你怎么说我们也不会相信,如果真是你的,叫你的家人到局里领取。”
original word list: ni3 zen3 me5 shuo1 wo3 men5 ye3 bu2 hui4 xiang1 xin4 ru2 guo3 zhen1 shi4 ni3 de5 jiao4 ni3 de5 jia1 ren2 dao4 ju2 li3 ling3 qu3
edited word list: ni3 zen3 me5 shuo1 wo3 men5 ye3 bu2 hui4 xiang1 xin4 ru2 guo3 zhen1 shi4 ni3 de5 jiao4 ni3 dao4 ju2 li3 ling3 qu3

original_audio

proposed system


Sample:5
original text: 喂您好,我这边是瓜子二手车全国营销中心,今天专门给您打个电话,至少可以给您再便宜好几千,
original word list: wei4 nin2 hao3 wo3 zhe4 bian1 shi4 gua1 zi5 er4 shou3 che1 quan2 guo2 ying2 xiao1 zhong1 xin1 jin1 tian1 zhuan1 men2 gei3 nin2 da3 ge4 dian4 hua4 zhi4 shao3 ke3 yi3 gei3 nin2 zai4 pian2 yi2 hao3 ji3 qian1
edited word list: wei4 nin2 hao3 wo3 zhe4 bian1 shi4 gua1 zi5 er4 shou3 che1 quan2 guo2 ying2 xiao1 zhong1 xin1 jin1 tian1 zhuan1 men2 gei3 nin2 da3 ge4 dian4 hua4 zhi4 shao3 ke3 yi3 gei3 nin2 hao3 ji3 qian1

original_audio

proposed system


Sample:6
original text: 她请求老巫婆赶紧施行法术,让拉斯姆斯赶紧回来。
original word list: ta1 qing3 qiu2 lao3 wu1 po2 gan3 jin3 shi1 xing2 fa3 shu4 rang4 la1 si1 mu3 si1 gan3 jin3 hui2 lai2
edited word list: ta1 qing3 qiu2 lao3 wu1 po2 gan3 jin3 shi1 xing2 fa3 shu4 rang4 la1 si1 mu3 si1 hui2 lai2

original_audio

proposed system


Sample:7
original text: 受苦难折磨的中国人并不是我们几个,而是成千上万的。
original word list: shou4 ku3 nan4 zhe2 mo2 de5 zhong1 guo2 ren2 bing4 bu2 shi4 wo3 men5 ji3 ge4 er2 shi4 cheng2 qian1 shang4 wan4 de5
edited word list: shou4 ku3 nan4 zhe2 mo2 de5 zhong1 guo2 ren2 bing4 bu2 shi4 ji3 ge4 er2 shi4 cheng2 qian1 shang4 wan4 de5

original_audio

proposed system


Sample:8
original text: 他们只记得,他心狠手辣地处理了那三位可怜的老家伙。
original word list: ta1 men5 zhi3 ji4 de5 ta1 xin1 hen3 shou3 la4 di4 chu3 li3 le5 na4 san1 wei4 ke3 lian2 de5 lao3 jia1 huo3
edited word list: ta1 men5 zhi3 ji4 de5 ta1 xin1 hen3 shou3 la4 di4 chu3 li3 le5 lao3 jia1 huo3

original_audio

proposed system


Sample:9
original text: 一个好的婚姻更容易获得好的收益。
original word list: yi2 ge4 hao3 de5 hun1 yin1 geng4 rong2 yi4 huo4 de2 hao3 de5 shou1 yi4
edited word list: yi2 ge4 hao3 de5 hun1 yin1 huo4 de2 hao3 de5 shou1 yi4

original_audio

proposed system


Sample:10
original text: 那个被人讽来赞去的不可一世的诗人安静地坐在他的房子里。
original word list: na4 ge4 bei4 ren2 feng3 lai2 zan4 qu4 de5 bu4 ke3 yi1 shi4 de5 shi1 ren2 an1 jing4 di4 zuo4 zai4 ta1 de5 fang2 zi5 li3
edited word list: na4 ge4 shi1 ren2 an1 jing4 di4 zuo4 zai4 ta1 de5 fang2 zi5 li3

original_audio

proposed system