Skip to content
项目
群组
代码片段
帮助
当前项目
正在载入...
登录 / 注册
切换导航面板
M
mtbookv2
概览
Overview
Details
Activity
Cycle Analytics
版本库
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
问题
0
Issues
0
列表
Board
标记
里程碑
合并请求
0
Merge Requests
0
CI / CD
CI / CD
流水线
作业
日程表
图表
维基
Wiki
代码片段
Snippets
成员
Collapse sidebar
Close sidebar
活动
图像
聊天
创建新问题
作业
提交
Issue Boards
Open sidebar
NiuTrans
mtbookv2
Commits
a1d41738
Commit
a1d41738
authored
4 years ago
by
曹润柘
Browse files
Options
Browse Files
Download
Plain Diff
合并分支 'caorunzhe' 到 'master'
Caorunzhe 查看合并请求
!858
parents
df6f2b8e
99defeba
全部展开
隐藏空白字符变更
内嵌
并排
正在显示
10 个修改的文件
包含
62 行增加
和
32 行删除
+62
-32
Chapter13/Figures/figure-bpe.tex
+3
-3
Chapter13/Figures/figure-exposure-bias.tex
+13
-13
Chapter13/Figures/figure-framework-of-Adversarial-Neural-machine-translation.tex
+4
-4
Chapter13/Figures/figure-label-smoothing.tex
+7
-7
Chapter13/Figures/figure-of-scheduling-sampling-method.tex
+5
-5
Chapter13/chapter13.tex
+0
-0
Chapter18/chapter18.tex
+0
-0
ChapterAppend/chapterappend.tex
+0
-0
ChapterPostscript/postscript.tex
+0
-0
bibliography.bib
+30
-0
没有找到文件。
Chapter13/Figures/figure-bpe.tex
查看文件 @
a1d41738
\begin{tikzpicture}
\tikzstyle
{
tnode
}
= [rectangle,inner sep=0em,minimum width=8em,minimum height=6.6em,rounded corners=5pt,fill=green!
20
]
\tikzstyle
{
pnode
}
= [rectangle,inner sep=0em,minimum width=8em,minimum height=6.6em,rounded corners=5pt,fill=yellow!
30
]
\tikzstyle
{
mnode
}
= [rectangle,inner sep=0em,minimum width=8em,minimum height=6.6em,rounded corners=5pt,fill=red!
20
]
\tikzstyle
{
tnode
}
= [rectangle,inner sep=0em,minimum width=8em,minimum height=6.6em,rounded corners=5pt,fill=green!
5,drop shadow
]
\tikzstyle
{
pnode
}
= [rectangle,inner sep=0em,minimum width=8em,minimum height=6.6em,rounded corners=5pt,fill=yellow!
5,drop shadow
]
\tikzstyle
{
mnode
}
= [rectangle,inner sep=0em,minimum width=8em,minimum height=6.6em,rounded corners=5pt,fill=red!
5,drop shadow
]
\tikzstyle
{
wnode
}
= [inner sep=0em,minimum height=1.5em]
%第一排
...
...
This diff is collapsed.
Click to expand it.
Chapter13/Figures/figure-exposure-bias.tex
查看文件 @
a1d41738
...
...
@@ -18,11 +18,11 @@
\node
[anchor=west,rnnnode] (n3) at ([xshift=2em,yshift=0em]n2.east)
{$
\mathbi
{
h
}_{
j
-
1
}$}
;
\node
[anchor=west,rnnnode] (n4) at ([xshift=2em,yshift=0em]n3.east)
{$
\mathbi
{
h
}_{
j
}$}
;
\node
[anchor=south,snode,font=
\footnotesize
] (n5) at ([xshift=0em,yshift=1em]n3.north)
{
Softmax
}
;
\node
[anchor=south,ynode] (n6) at ([xshift=0em,yshift=1em]n5.north)
{$
\
tilde
{{
y
}}_{
j
-
1
}$}
;
\node
[anchor=south,ynode] (n6) at ([xshift=0em,yshift=1em]n5.north)
{$
\
hat
{{
y
}}_{
j
-
1
}$}
;
\node
[anchor=south,snode,font=
\footnotesize
] (n7) at ([xshift=0em,yshift=1em]n4.north)
{
Softmax
}
;
\node
[anchor=south,ynode] (n8) at ([xshift=0em,yshift=1em]n7.north)
{$
\
tilde
{{
y
}}_{
j
}$}
;
\node
[anchor=south,ynode] (n8) at ([xshift=0em,yshift=1em]n7.north)
{$
\
hat
{{
y
}}_{
j
}$}
;
\node
[anchor=south,snode,font=
\footnotesize
] (n13) at ([xshift=0em,yshift=1em]n1.north)
{
Softmax
}
;
\node
[anchor=south,ynode] (n14) at ([xshift=0em,yshift=1em]n13.north)
{$
\
tilde
{{
y
}}_{
1
}$}
;
\node
[anchor=south,ynode] (n14) at ([xshift=0em,yshift=1em]n13.north)
{$
\
hat
{{
y
}}_{
1
}$}
;
\node
[anchor=north] (n11) at ([xshift=0em,yshift=-1em]n3.south)
{${{
y
}}_{
j
-
2
}$}
;
\node
[anchor=north] (n12) at ([xshift=0em,yshift=-1em]n4.south)
{${{
y
}}_{
j
-
1
}$}
;
...
...
@@ -132,16 +132,16 @@
\node
[anchor=west,rnnnode] (n3) at ([xshift=2em,yshift=0em]n2.east)
{$
\mathbi
{
h
}_{
j
-
1
}$}
;
\node
[anchor=west,rnnnode] (n4) at ([xshift=2em,yshift=0em]n3.east)
{$
\mathbi
{
h
}_{
j
}$}
;
\node
[anchor=south,snode,font=
\footnotesize
] (n5) at ([xshift=0em,yshift=1em]n3.north)
{
Softmax
}
;
\node
[anchor=south,ynode] (n6) at ([xshift=0em,yshift=1em]n5.north)
{$
\
tilde
{{
y
}}_{
j
-
1
}$}
;
\node
[anchor=south,ynode] (n6) at ([xshift=0em,yshift=1em]n5.north)
{$
\
hat
{{
y
}}_{
j
-
1
}$}
;
\node
[anchor=south,snode,font=
\footnotesize
] (n7) at ([xshift=0em,yshift=1em]n4.north)
{
Softmax
}
;
\node
[anchor=south,ynode] (n8) at ([xshift=0em,yshift=1em]n7.north)
{$
\
tilde
{{
y
}}_{
j
}$}
;
\node
[anchor=south,ynode] (n8) at ([xshift=0em,yshift=1em]n7.north)
{$
\
hat
{{
y
}}_{
j
}$}
;
\node
[anchor=south,snode,font=
\footnotesize
] (n13) at ([xshift=0em,yshift=1em]n1.north)
{
Softmax
}
;
\node
[anchor=south,ynode] (n14) at ([xshift=0em,yshift=1em]n13.north)
{$
\
tilde
{{
y
}}_{
1
}$}
;
\node
[anchor=south,ynode] (n14) at ([xshift=0em,yshift=1em]n13.north)
{$
\
hat
{{
y
}}_{
1
}$}
;
\node
[anchor=north] (n9) at ([xshift=0em,yshift=-3em]n4.south)
{
\small
{
(b) 推断阶段
}}
;
\node
[anchor=north] (n11) at ([xshift=0em,yshift=-1em]n3.south)
{$
\
tilde
{{
y
}}_{
j
-
2
}$}
;
\node
[anchor=north] (n12) at ([xshift=0em,yshift=-1em]n4.south)
{$
\
tilde
{{
y
}}_{
j
-
1
}$}
;
\node
[anchor=north] (n11) at ([xshift=0em,yshift=-1em]n3.south)
{$
\
hat
{{
y
}}_{
j
-
2
}$}
;
\node
[anchor=north] (n12) at ([xshift=0em,yshift=-1em]n4.south)
{$
\
hat
{{
y
}}_{
j
-
1
}$}
;
\node
[anchor=north] (x1) at ([xshift=0em,yshift=-1em]n1.south)
{$
\langle
$
sos
$
\rangle
$}
;
...
...
@@ -172,13 +172,13 @@
\draw
[->,thick] ([xshift=-0em,yshift=0em]n11.north)--([xshift=-0em,yshift=0em]n3.south);
\draw
[->,thick] ([xshift=0em,yshift=0em]n12.north)--([xshift=-0em,yshift=-0em]n4.south);
\node
[anchor=west] (n10) at ([xshift=4em,yshift=0em]n8.east)
{
模型输出
$
\
tilde
{{
y
}}$
:
}
;
\node
[anchor=west] (n10) at ([xshift=4em,yshift=0em]n8.east)
{
模型输出
$
\
hat
{{
y
}}$
:
}
;
\node
[anchor=north west] (y1) at ([xshift=0.5em,yshift=-7em]n10.south west)
{$
\
tilde
{
y
}_{
1
}$}
;
\node
[anchor=west] (y2) at ([xshift=1em,yshift=0em]y1.east)
{$
\
tilde
{
y
}_{
2
}$}
;
\node
[anchor=west] (y3) at ([xshift=1em,yshift=0em]y2.east)
{$
\
tilde
{
y
}_{
3
}$}
;
\node
[anchor=north west] (y1) at ([xshift=0.5em,yshift=-7em]n10.south west)
{$
\
hat
{
y
}_{
1
}$}
;
\node
[anchor=west] (y2) at ([xshift=1em,yshift=0em]y1.east)
{$
\
hat
{
y
}_{
2
}$}
;
\node
[anchor=west] (y3) at ([xshift=1em,yshift=0em]y2.east)
{$
\
hat
{
y
}_{
3
}$}
;
\node
[anchor=west] (y4) at ([xshift=0.5em,yshift=0em]y3.east)
{$
\ldots
$}
;
\node
[anchor=west] (y5) at ([xshift=0.5em,yshift=0em]y4.east)
{$
\
tilde
{
y
}_{
n
}$}
;
\node
[anchor=west] (y5) at ([xshift=0.5em,yshift=0em]y4.east)
{$
\
hat
{
y
}_{
n
}$}
;
\node
[anchor=center,prob,minimum size=0.3em] (label11) at ([xshift=-0.1em,yshift=1em]y1.north)
{}
;
...
...
This diff is collapsed.
Click to expand it.
Chapter13/Figures/figure-framework-of-Adversarial-Neural-machine-translation.tex
查看文件 @
a1d41738
...
...
@@ -3,16 +3,16 @@
\begin{tikzpicture}
\tikzstyle
{
rnnnode
}
= [draw,inner sep=4pt,minimum width=2em,minimum height=2em,rounded corners=1pt,fill=green!
20
]
\tikzstyle
{
snode
}
= [draw,inner sep=4pt,minimum width=2em,minimum height=2em,rounded corners=1pt,fill=red!
20
]
\tikzstyle
{
rnnnode
}
= [draw,inner sep=4pt,minimum width=2em,minimum height=2em,rounded corners=1pt,fill=green!
15
]
\tikzstyle
{
snode
}
= [draw,inner sep=4pt,minimum width=2em,minimum height=2em,rounded corners=1pt,fill=red!
15
]
\tikzstyle
{
wode
}
= [inner sep=0pt,minimum width=2em,minimum height=2em,rounded corners=0pt]
\node
[anchor=west,wode] (n1) at (0,0)
{$
y
$}
;
\node
[anchor=north west,wode] (n2) at ([xshift=3em,yshift=-2.5em]n1.south east)
{$
x
$}
;
\node
[anchor=south west,rnnnode] (n3) at ([xshift=8em,yshift=0.5em]n2.north east)
{
生成模型
$
G
$}
;
\node
[anchor=south east,wode] (n4) at ([xshift=-2em,yshift=0em]n3.north west)
{$
\
tilde
{
y
}$}
;
\node
[anchor=south east,wode] (n4) at ([xshift=-2em,yshift=0em]n3.north west)
{$
\
hat
{
y
}$}
;
\node
[anchor=south,snode] (n5) at ([xshift=0em,yshift=6em]n2.north)
{
判别网络
$
D
$}
;
\node
[anchor=west,align=left,font=
\small
] (n6) at ([xshift=15em,yshift=-3em]n5.east)
{
根据
$
(
\seq
{
x
}
,
\seq
{
\
tilde
{
y
}}
)
$
生
\\
成奖励信号
}
;
\node
[anchor=west,align=left,font=
\small
] (n6) at ([xshift=15em,yshift=-3em]n5.east)
{
根据
$
(
\seq
{
x
}
,
\seq
{
\
hat
{
y
}}
)
$
生
\\
成奖励信号
}
;
\draw
[->,thick] ([xshift=0em,yshift=-0.3em]n1.north)--([xshift=-0.3em,yshift=-0.1em]n5.south);
...
...
This diff is collapsed.
Click to expand it.
Chapter13/Figures/figure-label-smoothing.tex
查看文件 @
a1d41738
...
...
@@ -38,20 +38,20 @@
\node
[anchor=south,font=
\scriptsize
] (one
_
hot
_
w7) at (one
_
hot
_
label7.north)
{$
0
$}
;
%label smoothing
\node
[anchor=west,minimum width=1.2em,minimum height=0.2em,fill=
red
!50,inner sep=0pt] (label1) at ([xshift=1.5em,yshift=-4.4em]model.east)
{}
;
\node
[anchor=west,minimum width=1.2em,minimum height=0.2em,fill=
orange
!50,inner sep=0pt] (label1) at ([xshift=1.5em,yshift=-4.4em]model.east)
{}
;
\node
[anchor=south,font=
\scriptsize
] (w1) at (label1.north)
{$
0
.
1
$}
;
\node
[anchor=south west,minimum width=1.2em,minimum height=0.2em,fill=
red
!50,inner sep=0pt] (label2) at (label1.south east)
{}
;
\node
[anchor=south west,minimum width=1.2em,minimum height=0.2em,fill=
orange
!50,inner sep=0pt] (label2) at (label1.south east)
{}
;
\node
[anchor=south,font=
\scriptsize
] (w2) at (label2.north)
{$
0
.
1
$}
;
\node
[anchor=south west,minimum width=1.2em,minimum height=0.8em,fill=
red
!50,inner sep=0pt] (label3) at (label2.south east)
{}
;
\node
[anchor=south west,minimum width=1.2em,minimum height=0.8em,fill=
orange
!50,inner sep=0pt] (label3) at (label2.south east)
{}
;
\node
[anchor=south,font=
\scriptsize
] (w3) at (label3.north)
{{
\color
{
red
}
$
0
.
4
$}}
;
\node
[anchor=south west,minimum width=1.2em,minimum height=0.2em,fill=
red
!50,inner sep=0pt] (label4) at (label3.south east)
{}
;
\node
[anchor=south west,minimum width=1.2em,minimum height=0.2em,fill=
orange
!50,inner sep=0pt] (label4) at (label3.south east)
{}
;
\node
[anchor=south,font=
\scriptsize
] (w5) at (label4.north)
{$
0
.
1
$}
;
\node
[anchor=south west,minimum width=1.2em,minimum height=0.2em,fill=
red
!50,inner sep=0pt] (label5) at (label4.south east)
{}
;
\node
[anchor=south west,minimum width=1.2em,minimum height=0.2em,fill=
orange
!50,inner sep=0pt] (label5) at (label4.south east)
{}
;
\node
[anchor=south,font=
\scriptsize
] (w6) at (label5.north)
{$
0
.
1
$}
;
\node
[anchor=south west,minimum width=1.2em,minimum height=0.2em,fill=
red
!50,inner sep=0pt] (label6) at (label5.south east)
{}
;
\node
[anchor=south west,minimum width=1.2em,minimum height=0.2em,fill=
orange
!50,inner sep=0pt] (label6) at (label5.south east)
{}
;
\node
[anchor=south,font=
\scriptsize
] (w7) at (label6.north)
{$
0
.
1
$}
;
\node
[anchor=south west,minimum width=1.2em,minimum height=0.2em,fill=
red
!50,inner sep=0pt] (label7) at (label6.south east)
{}
;
\node
[anchor=south west,minimum width=1.2em,minimum height=0.2em,fill=
orange
!50,inner sep=0pt] (label7) at (label6.south east)
{}
;
\node
[anchor=south,font=
\scriptsize
] (w8) at (label7.north)
{$
0
.
1
$}
;
...
...
This diff is collapsed.
Click to expand it.
Chapter13/Figures/figure-of-scheduling-sampling-method.tex
查看文件 @
a1d41738
...
...
@@ -14,18 +14,18 @@
\node
[anchor=west,rnnnode] (n3) at ([xshift=3em,yshift=0em]n2.east)
{$
\mathbi
{
h
}_{
j
-
1
}$}
;
\node
[anchor=west,rnnnode] (n4) at ([xshift=3em,yshift=0em]n3.east)
{$
\mathbi
{
h
}_{
j
}$}
;
\node
[anchor=south,snode] (n5) at ([xshift=0em,yshift=1em]n3.north)
{
Softmax
}
;
\node
[anchor=south,ynode] (n6) at ([xshift=0em,yshift=1em]n5.north)
{$
\
tilde
{{
y
}}_{
j
-
1
}$}
;
\node
[anchor=south,ynode] (n6) at ([xshift=0em,yshift=1em]n5.north)
{$
\
hat
{{
y
}}_{
j
-
1
}$}
;
\node
[anchor=south,snode] (n7) at ([xshift=0em,yshift=1em]n4.north)
{
Softmax
}
;
\node
[anchor=south,ynode] (n8) at ([xshift=0em,yshift=1em]n7.north)
{$
\
tilde
{{
y
}}_{
j
}$}
;
\node
[anchor=south,ynode] (n8) at ([xshift=0em,yshift=1em]n7.north)
{$
\
hat
{{
y
}}_{
j
}$}
;
\node
[anchor=south,snode,font=
\footnotesize
] (n13) at ([xshift=0em,yshift=1em]n1.north)
{
Softmax
}
;
\node
[anchor=south,ynode] (n14) at ([xshift=0em,yshift=1em]n13.north)
{$
\
tilde
{{
y
}}_{
1
}$}
;
\node
[anchor=south,ynode] (n14) at ([xshift=0em,yshift=1em]n13.north)
{$
\
hat
{{
y
}}_{
1
}$}
;
\node
[anchor=north] (x1) at ([xshift=0em,yshift=-1em]n1.south)
{$
\langle
$
sos
$
\rangle
$}
;
\node
[anchor=north,font=
\small
] (x2) at ([xshift=-1.3em,yshift=-2.3em]n3.south)
{$
\
tilde
{{
y
}}_{
j
-
2
}$}
;
\node
[anchor=north,font=
\small
] (x2) at ([xshift=-1.3em,yshift=-2.3em]n3.south)
{$
\
hat
{{
y
}}_{
j
-
2
}$}
;
\node
[anchor=north,font=
\small
] (x3) at ([xshift=1.3em,yshift=-2.5em]n3.south)
{${
y
}_{
j
-
2
}$}
;
\node
[anchor=north,font=
\small
] (x4) at ([xshift=1.3em,yshift=-2.5em]n4.south)
{${
y
}_{
j
-
1
}$}
;
\node
[anchor=north,font=
\small
] (x5) at ([xshift=-1.3em,yshift=-2.3em]n4.south)
{$
\
tilde
{{
y
}}_{
j
-
1
}$}
;
\node
[anchor=north,font=
\small
] (x5) at ([xshift=-1.3em,yshift=-2.3em]n4.south)
{$
\
hat
{{
y
}}_{
j
-
1
}$}
;
\node
[anchor=south,inner sep=2pt] (st1) at (n6.north)
{
\scriptsize
{
\textbf
{
[step
$
j
-
1
$
]
}}}
;
\node
[anchor=south,inner sep=2pt] (st2) at (n8.north)
{
\scriptsize
{
\textbf
{
[step
$
j
$
]
}}}
;
...
...
This diff is collapsed.
Click to expand it.
Chapter13/chapter13.tex
查看文件 @
a1d41738
This source diff could not be displayed because it is too large. You can
view the blob
instead.
This diff is collapsed.
Click to expand it.
Chapter18/chapter18.tex
查看文件 @
a1d41738
差异被折叠。
点击展开。
ChapterAppend/chapterappend.tex
查看文件 @
a1d41738
差异被折叠。
点击展开。
ChapterPostscript/postscript.tex
查看文件 @
a1d41738
差异被折叠。
点击展开。
bibliography.bib
查看文件 @
a1d41738
...
...
@@ -9243,6 +9243,36 @@ author = {Zhuang Liu and
publisher = {Asian Conference on Machine Learning},
year = {2018}
}
@inproceedings{DBLP:journals/corr/abs-1810-02525,
author = {Peter Henderson and
Joshua Romoff and
Joelle Pineau},
title = {Where Did My Optimum Go?: An Empirical Analysis of Gradient Descent
Optimization in Policy Gradient Methods},
publisher = {CoRR},
volume = {abs/1810.02525},
year = {2018}
}
@inproceedings{DBLP:conf/nips/Kakade01,
author = {Sham M. Kakade},
title = {A Natural Policy Gradient},
pages = {1531--1538},
publisher = {Advances in Neural Information Processing Systems},
year = {2001}
}
@inproceedings{DBLP:conf/icml/KoolHW19,
author = {Wouter Kool and
Herke van Hoof and
Max Welling},
title = {Stochastic Beams and Where To Find Them: The Gumbel-Top-k Trick for
Sampling Sequences Without Replacement},
series = {Proceedings of Machine Learning Research},
volume = {97},
pages = {3499--3508},
publisher = {International Conference on Machine Learning},
year = {2019}
}
%%%%% chapter 15------------------------------------------------------
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
...
...
This diff is collapsed.
Click to expand it.
编写
预览
Markdown
格式
0%
重试
或
添加新文件
添加附件
取消
您添加了
0
人
到此讨论。请谨慎行事。
请先完成此评论的编辑!
取消
请
注册
或者
登录
后发表评论