Skip to content
项目
群组
代码片段
帮助
当前项目
正在载入...
登录 / 注册
切换导航面板
F
Fairseq-S2T
概览
Overview
Details
Activity
Cycle Analytics
版本库
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
问题
0
Issues
0
列表
Board
标记
里程碑
合并请求
0
Merge Requests
0
CI / CD
CI / CD
流水线
作业
日程表
图表
维基
Wiki
代码片段
Snippets
成员
Collapse sidebar
Close sidebar
活动
图像
聊天
创建新问题
作业
提交
Issue Boards
Open sidebar
xuchen
Fairseq-S2T
Commits
21734086
Commit
21734086
authored
Jul 26, 2022
by
xuchen
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
fix the bugs
parent
e1d3d2ed
显示空白字符变更
内嵌
并排
正在显示
1 个修改的文件
包含
12 行增加
和
3 行删除
+12
-3
examples/speech_to_text/prep_audio_data.py
+12
-3
没有找到文件。
examples/speech_to_text/prep_audio_data.py
查看文件 @
21734086
...
...
@@ -185,7 +185,7 @@ class AudioDataset(Dataset):
if
need_waveform
:
offset
=
item
.
get
(
'offset'
,
False
)
if
offset
:
if
offset
is
not
False
:
waveform
,
sample_rate
=
torchaudio
.
load
(
audio
,
frame_offset
=
offset
,
num_frames
=
item
[
"n_frames"
])
...
...
@@ -272,7 +272,11 @@ def process(args):
waveform
,
sample_rate
,
_
=
dataset
.
get
(
idx
,
need_waveform
=
True
)
if
waveform
.
shape
[
1
]
==
0
:
continue
try
:
features
=
extract_fbank_features
(
waveform
,
sample_rate
,
Path
(
features_path
))
except
AssertionError
:
logger
.
warning
(
"Extract file
%
s failed."
%
utt_id
)
if
split
==
'train'
and
args
.
cmvn_type
==
"global"
and
not
utt_id
.
startswith
(
"sp"
):
if
len
(
gcmvn_feature_list
)
<
args
.
gcmvn_max_num
:
...
...
@@ -326,16 +330,21 @@ def process(args):
_
,
sample_rate
,
n_frames
=
dataset
.
get
(
idx
,
need_waveform
=
False
)
utt_id
=
item
[
"id"
]
manifest
[
"id"
]
.
append
(
utt_id
)
if
use_raw
:
audio_path
=
item
[
"audio"
]
# add offset and frames info
if
item
.
get
(
"offset"
,
False
):
if
item
.
get
(
"offset"
,
False
)
is
not
False
:
audio_path
=
f
"{audio_path}:{item['offset']}:{n_frames}"
manifest
[
"audio"
]
.
append
(
audio_path
)
else
:
if
utt_id
in
zip_manifest
:
manifest
[
"audio"
]
.
append
(
zip_manifest
[
utt_id
])
else
:
logger
.
warning
(
"
%
s is not in the zip"
%
utt_id
)
continue
manifest
[
"id"
]
.
append
(
utt_id
)
duration_ms
=
int
(
n_frames
/
sample_rate
*
1000
)
manifest
[
"n_frames"
]
.
append
(
int
(
1
+
(
duration_ms
-
25
)
/
10
))
...
...
编写
预览
Markdown
格式
0%
重试
或
添加新文件
添加附件
取消
您添加了
0
人
到此讨论。请谨慎行事。
请先完成此评论的编辑!
取消
请
注册
或者
登录
后发表评论