Voiced vs. Unvoiced Detection
Detect the unvoiced segments of a recording and attenuate them.
In[1]:=
a = ExampleData[{"Audio", "NoisyTalk"}, "Audio"]
Use AudioIntervals to find the segments that have both low RMS amplitude and high spectral flatness.
In[2]:=
nonVoicedIntervals =
AudioIntervals[
a, #RMSAmplitude < .02 && #SpectralFlatness > .0001 &, .1,
PartitionGranularity -> {.06, .01}]
Out[2]=
![](assets.zh/voiced-vs-unvoiced-detection/O_20.png)
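To see where thresholds such as .02 and .0001 sit relative to the signal, it can help to plot the two measurements on their own. The following is a minimal sketch, not part of the original example: it assumes that AudioLocalMeasurements accepts the same PartitionGranularity setting as the AudioIntervals call, and the names rms and flatness are introduced here only for illustration.

```wolfram
(* Repeated from above so this cell runs on its own. *)
a = ExampleData[{"Audio", "NoisyTalk"}, "Audio"];

(* Frame-wise measurements, using the same PartitionGranularity
   setting as the AudioIntervals call (an assumption of this sketch). *)
rms = AudioLocalMeasurements[a, "RMSAmplitude",
   PartitionGranularity -> {.06, .01}];
flatness = AudioLocalMeasurements[a, "SpectralFlatness",
   PartitionGranularity -> {.06, .01}];

(* Plot each measurement with its threshold as a horizontal grid line. *)
{ListLinePlot[rms, PlotRange -> All, GridLines -> {None, {.02}},
  PlotLabel -> "RMSAmplitude"],
 ListLinePlot[flatness, PlotRange -> All, GridLines -> {None, {.0001}},
  PlotLabel -> "SpectralFlatness"]}
```

Frames that fall below the line in the first plot and above the line in the second are the ones the criterion selects, provided they form a run of at least .1 seconds.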
Visualize the detected intervals.
In[3]:=
AudioPlot[a,
Epilog -> {RGBColor[1, 0, 0, .3],
Rectangle[{#[[1]], -1}, {#[[2]], 1}] & /@ nonVoicedIntervals},
ImageSize -> Medium]
Out[3]=
![](assets.zh/voiced-vs-unvoiced-detection/O_21.png)
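Alongside the plot, a quick numeric check is to measure how much of the recording was flagged. The sketch below assumes that the result of AudioIntervals is a list of {start, end} pairs in seconds, as the Rectangle code above also assumes; flaggedSeconds, totalSeconds and flaggedFraction are illustrative names.

```wolfram
(* Repeated from above so this cell runs on its own. *)
a = ExampleData[{"Audio", "NoisyTalk"}, "Audio"];
nonVoicedIntervals =
  AudioIntervals[a,
   #RMSAmplitude < .02 && #SpectralFlatness > .0001 &, .1,
   PartitionGranularity -> {.06, .01}];

(* Total length of the flagged intervals, in seconds. *)
flaggedSeconds = Total[(#[[2]] - #[[1]]) & /@ nonVoicedIntervals];

(* Length of the whole recording, in seconds. *)
totalSeconds = QuantityMagnitude[UnitConvert[Duration[a], "Seconds"]];

(* Fraction of the recording classified as unvoiced. *)
flaggedFraction = flaggedSeconds/totalSeconds
```

A fraction close to 0 or close to 1 suggests that the thresholds are too strict or too loose for this recording.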
Attenuate the detected intervals.
In[4]:=
AudioJoin[
Riffle[AudioFade /@ AudioTrim[a, Except@nonVoicedIntervals],
0.3*AudioTrim[a, nonVoicedIntervals]]]
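A different way to apply the same attenuation is to scale the samples directly with a gain mask. This is an alternative sketch, not the method used above: it applies a hard gain step with no fades, so clicks at the interval boundaries are possible, and it assumes that the intervals are {start, end} pairs in seconds. The names sr, n, mask and attenuated are introduced here for illustration.

```wolfram
(* Repeated from above so this cell runs on its own. *)
a = ExampleData[{"Audio", "NoisyTalk"}, "Audio"];
nonVoicedIntervals =
  AudioIntervals[a,
   #RMSAmplitude < .02 && #SpectralFlatness > .0001 &, .1,
   PartitionGranularity -> {.06, .01}];

(* Sample rate as a plain number, and number of samples per channel. *)
sr = QuantityMagnitude[AudioSampleRate[a]];
n = Length[First[AudioData[a]]];

(* Gain of 1 everywhere, lowered to 0.3 inside each detected interval. *)
mask = ConstantArray[1., n];
Scan[
  (mask[[Max[1, Floor[sr #[[1]]] + 1] ;; Min[n, Ceiling[sr #[[2]]]]]] = 0.3) &,
  nonVoicedIntervals];

(* Apply the mask to every channel and rebuild the audio object. *)
attenuated = Audio[(# mask) & /@ AudioData[a], SampleRate -> sr]
```

Because the mask is indexed by sample position, the result has the same number of samples as the input and does not depend on whether the recording starts with a voiced or an unvoiced segment.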