Skip to content
项目
群组
代码片段
帮助
当前项目
正在载入...
登录 / 注册
切换导航面板
T
Toy-MT-Introduction
概览
Overview
Details
Activity
Cycle Analytics
版本库
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
问题
0
Issues
0
列表
Board
标记
里程碑
合并请求
0
Merge Requests
0
CI / CD
CI / CD
流水线
作业
日程表
图表
维基
Wiki
代码片段
Snippets
成员
Collapse sidebar
Close sidebar
活动
图像
聊天
创建新问题
作业
提交
Issue Boards
Open sidebar
单韦乔
Toy-MT-Introduction
Commits
d53fb7d4
Commit
d53fb7d4
authored
Dec 19, 2019
by
xiaotong
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
summary pages
parent
7e80636a
显示空白字符变更
内嵌
并排
正在显示
2 个修改的文件
包含
155 行增加
和
144 行删除
+155
-144
Section06-Neural-Machine-Translation/section06-test.tex
+60
-109
Section06-Neural-Machine-Translation/section06.tex
+95
-35
没有找到文件。
Section06-Neural-Machine-Translation/section06-test.tex
查看文件 @
d53fb7d4
...
...
@@ -145,119 +145,70 @@
\subsection
{
注意力机制
}
%%%------------------------------------------------------------------------------------------------------------
\begin{frame}
{
模块1:词嵌入层
}
%%% 做个小结
\begin{frame}
{
我们赶上了好时代 ...
}
\begin{itemize}
\item
词嵌入
\item
神经机器翻译的火爆这几年有目共睹,好事情!!!
\begin{itemize}
\item
https://arxiv.org上搜索neural machine translation
\item
ACL、EMNLP等顶会神经机器翻译论文数量近些年几乎呈线性增长
\item
神经机器翻译系统在各大比赛中霸榜,开源机器翻译满天飞,大厂秀肌肉,小作坊刷存在感
\end{itemize}
\vspace
{
0.3em
}
\item
<2-> 这里只介绍了最基本的概念,NMT的内容远不止这些
\begin{itemize}
\item
各种专题:解码、压缩、先验知识、低资源翻译、无指导方法、篇章级翻译等等等等
\item
推荐一个survey,有些基础的可以参考一下,很全面
\\
``Neural Machine Translation: A Review'' by Felix Stahlberg
\\
\url
{
https://arxiv.org/abs/1912.02047
}
\item
如何搭建一个优秀的NMT系统?- 有许多技巧
\\
下一章介绍
\item
回忆一下第一章介绍的NMT开源系统,可以试试
\end{itemize}
\end{itemize}
%%% 图
\end{frame}
%%%------------------------------------------------------------------------------------------------------------
%%% last slide
\begin{frame}
{
结束~
}
\vspace
{
2em
}
\begin{center}
\hspace*
{
-0.6cm
}
\begin{tikzpicture}
\setlength
{
\base
}{
0.9cm
}
\tikzstyle
{
rnnnode
}
= [rounded corners=1pt,minimum height=0.5
\base
,minimum width=1
\base
,draw,inner sep=0pt,outer sep=0pt]
\tikzstyle
{
wordnode
}
= [font=
\tiny
]
% RNN translation model
\begin{scope}
[local bounding box=RNNMT]
% RNN Encoder
\coordinate
(eemb0) at (0,0);
\foreach
\x
[count=
\y
from 0] in
{
1,2,...,3
}
\node
[rnnnode,minimum height=0.5\base,fill=green!30!white,anchor=west]
(eemb
\x
) at ([xshift=0.4
\base
]eemb
\y
.east)
{
\tiny
{$
e
_
x
()
$}}
;
\foreach
\x
in
{
1,2,...,3
}
\node
[rnnnode,fill=blue!30!white,anchor=south]
(enc
\x
) at ([yshift=0.3
\base
]eemb
\x
.north)
{}
;
\node
[]
(enclabel1) at (enc1)
{
\tiny
{$
\textbf
{
h
}_{
m
-
2
}$}}
;
\node
[]
(enclabel2) at (enc2)
{
\tiny
{$
\textbf
{
h
}_{
m
-
1
}$}}
;
\node
[rnnnode,fill=purple!30!white]
(enclabel3) at (enc3)
{
\tiny
{$
\textbf
{
h
}_{
m
}$}}
;
\node
[wordnode,left=0.4\base of enc1]
(init1)
{$
\cdots
$}
;
\node
[wordnode,left=0.4\base of eemb1]
(init2)
{$
\cdots
$}
;
\node
[wordnode,below=0pt of eemb1]
()
{
走
}
;
\node
[wordnode,below=0pt of eemb2]
()
{
吗
}
;
\node
[wordnode,below=0pt of eemb3]
()
{$
\langle
$
eos
$
\rangle
$}
;
% RNN Decoder
\foreach
\x
in
{
1,2,...,3
}
\node
[rnnnode,minimum height=0.5\base,fill=green!30!white,anchor=south]
(demb
\x
) at ([yshift=
\base
]enc
\x
.north)
{
\tiny
{$
e
_
y
()
$}}
;
\foreach
\x
in
{
1,2,...,3
}
\node
[rnnnode,fill=blue!30!white,anchor=south]
(dec
\x
) at ([yshift=0.3
\base
]demb
\x
.north)
{{
\tiny
{$
\textbf
{
s
}_
\x
$}}}
;
\foreach
\x
in
{
1,2,...,3
}
\node
[rnnnode,minimum height=0.5\base,fill=red!30!white,anchor=south]
(softmax
\x
) at ([yshift=0.3
\base
]dec
\x
.north)
{
\tiny
{
Softmax
}}
;
\node
[wordnode,right=0.4\base of demb3]
(end1)
{$
\cdots
$}
;
\node
[wordnode,right=0.4\base of dec3]
(end2)
{$
\cdots
$}
;
\node
[wordnode,right=0.4\base of softmax3]
(end3)
{$
\cdots
$}
;
% Decoder input words
\node
[wordnode,below=0pt of demb1]
(decwordin)
{$
\langle
$
sos
$
\rangle
$}
;
\ExtractX
{$
(
demb
2
.south
)
$}
\ExtractY
{$
(
decwordin.base
)
$}
\node
[wordnode,anchor=base]
() at (
\XCoord
,
\YCoord
)
{
Do
}
;
\ExtractX
{$
(
demb
3
.south
)
$}
\ExtractY
{$
(
decwordin.base
)
$}
\node
[wordnode,anchor=base]
() at (
\XCoord
,
\YCoord
)
{
you
}
;
% Decoder output words
\node
[wordnode,above=0pt of softmax1]
(decwordout)
{
Do
}
;
\ExtractX
{$
(
softmax
2
.north
)
$}
\ExtractY
{$
(
decwordout.base
)
$}
\node
[wordnode,anchor=base]
() at (
\XCoord
,
\YCoord
)
{
you
}
;
\ExtractX
{$
(
softmax
3
.north
)
$}
\ExtractY
{$
(
decwordout.base
)
$}
\node
[wordnode,anchor=base]
() at (
\XCoord
,
\YCoord
)
{
know
}
;
% Connections
\draw
[-latex']
(init1.east) to (enc1.west);
\draw
[-latex']
(dec3.east) to (end2.west);
\foreach
\x
in
{
1,2,...,3
}
\draw
[-latex']
(eemb
\x
) to (enc
\x
);
\foreach
\x
in
{
1,2,...,3
}
\draw
[-latex']
(demb
\x
) to (dec
\x
);
\foreach
\x
in
{
1,2,...,3
}
\draw
[-latex']
(dec
\x
.north) to (softmax
\x
.south);
\foreach
\x
[count=
\y
from 2] in
{
1,2
}
{
\draw
[-latex']
(enc
\x
.east) to (enc
\y
.west);
\draw
[-latex']
(dec
\x
.east) to (dec
\y
.west);
}
\coordinate
(bridge) at ([yshift=0.4
\base
]enc2.north west);
\draw
[-latex']
(enc3.north) .. controls +(north:0.3
\base
) and +(east:
\base
) .. (bridge) .. controls +(west:2.7
\base
) and +(west:0.3
\base
) .. (dec1.west);
\end{scope}
\begin{scope}
\coordinate
(start) at (5.8
\base
,0.3
\base
);
\node
[anchor=south west] (one) at (start)
{
\scriptsize
{$
\begin
{
bmatrix
}
0
\\
0
\\
0
\\
\vdots
\\
0
\\
{
\color
{
ugreen
}
1
}
\\
0
\\
0
\end
{
bmatrix
}$}}
;
\node
[anchor=north] (w) at ([yshift=3pt]one.south)
{
\scriptsize
{
\color
{
ugreen
}
you
}}
;
\node
[anchor=north west] (words) at ([xshift=10pt]one.north east)
{
\scriptsize
{$
\begin
{
matrix
}
\langle\textrm
{
eos
}
\rangle
\\
\langle\textrm
{
sos
}
\rangle
\\
\textrm
{
Do
}
\\
\vdots
\\
\textrm
{
know
}
\\
\textrm
{
you
}
\\
\textrm
{
?
}
\\
\textrm
{
have
}
\end
{
matrix
}$}}
;
\node
[anchor=north west] (mat) at ([xshift=-6pt]words.north east)
{
\scriptsize
{$
\begin
{
bmatrix
}
.
1
&
-
4
&
\cdots
&
2
\\
5
&
2
&
\cdots
&
.
2
\\
2
&
.
1
&
\cdots
&
.
3
\\
\vdots
&
\vdots
&
\ddots
&
\vdots
\\
0
&
.
8
&
\cdots
&
4
\\
-
1
&
-
2
&
\cdots
&
-
3
\\
.
7
&
.
5
&
\cdots
&
3
\\
-
2
&
.
3
&
\cdots
&
.
1
\end
{
bmatrix
}$
}}
;
\begin{pgfonlayer}
{
background
}
\node
[draw=ugreen,fill=green!20!white,rounded corners=0.3em,minimum width=3.8cm,minimum height=0.9em,anchor=south west] (emb) at ([shift=
{
(1.25cm,0.8cm)
}
]start)
{}
;
\end{pgfonlayer}
\draw
[decorate,decoration=
{
brace,mirror
}
] ([shift=
{
(6pt,2pt)
}
]mat.south west) to node [auto,swap,font=
\scriptsize
]
{
词嵌入矩阵
}
([shift=
{
(-6pt,2pt)
}
]mat.south east);
\draw
[-latex'] ([xshift=-2pt,yshift=-0.65cm]one.east) to ([yshift=-0.65cm]words.west);
\draw
[-latex'] (emb.east) -| ([yshift=0.4cm]mat.north east);
\draw
[-latex'] ([yshift=-0.4cm]w.south) to ([yshift=2pt]w.south);
\node
[draw=ugreen,densely dashed,thick,rounded corners=3pt,fit=(one) (words) (mat) (w)] (input)
{}
;
\end{scope}
\draw
[->,thick,densely dashed,ugreen] ([yshift=-0.2em]demb3.east) to [out=0,in=180] ([yshift=-1cm]input.west);
\end{tikzpicture}
\begin{tikzpicture}
\begin{scope}
\small
{
\node
[anchor=south west,minimum width=15em] (source) at (0,0)
{
\Large
{
\textbf
{
source
}
: 谢谢 大家 !
}}
;
\node
[anchor=south west,minimum width=15em] (target) at ([yshift=12em]source.north west)
{
\Large
{
\textbf
{
target
}
: Thank You !
}}
;
\node
[anchor=center,minimum width=9.6em,minimum height=1.8em,draw,rounded corners=0.3em] (hidden) at ([yshift=6em]source.north)
{}
;
\node
[anchor=west,minimum width=1.5em,minimum size=1.5em,fill=ugreen!20] (cell01) at ([xshift=0.2em]hidden.west)
{
\footnotesize
{
.2
}}
;
\node
[anchor=west,minimum width=1.5em,minimum size=1.5em,fill=ugreen!10] (cell02) at (cell01.east)
{
\footnotesize
{
-1
}}
;
\node
[anchor=west,minimum width=1.5em,minimum size=1.5em,fill=ugreen!70] (cell03) at (cell02.east)
{
\footnotesize
{
6
}}
;
\node
[anchor=west,minimum width=1.5em,minimum size=1.5em,fill=ugreen!50] (cell04) at (cell03.east)
{
\footnotesize
{
5
}}
;
\node
[anchor=west,minimum width=1.5em,minimum size=1.5em,fill=ugreen!30] (cell05) at (cell04.east)
{
\footnotesize
{
.7
}}
;
\node
[anchor=west,minimum width=1.5em,minimum size=1.5em,fill=ugreen!10] (cell06) at (cell05.east)
{
\footnotesize
{
-2
}}
;
\filldraw
[fill=red!20,draw=white] (source.north west) -- (source.north east) -- ([xshift=-0.2em,yshift=-0.1em]hidden.south east) -- ([xshift=0.2em,yshift=-0.1em]hidden.south west);
\filldraw
[fill=blue!20,draw=white] (target.south west) -- (target.south east) -- ([xshift=-0.2em,yshift=0.1em]hidden.north east) -- ([xshift=0.2em,yshift=0.1em]hidden.north west);
\draw
[->,thick] (source.north west) -- ([xshift=0.2em,yshift=-0.1em]hidden.south west);
\draw
[->,thick] (source.north east) -- ([xshift=-0.2em,yshift=-0.1em]hidden.south east);
\draw
[->,thick] ([xshift=0.2em,yshift=0.1em]hidden.north west) -- (target.south west);
\draw
[->,thick] ([xshift=-0.2em,yshift=0.1em]hidden.north east) -- (target.south east);
}
\node
[anchor=south] (enclabel) at ([yshift=2em]source.north)
{
\large
{
Encoder
}}
;
\node
[anchor=north] (declabel) at ([yshift=-2em]target.south)
{
\large
{
Decoder
}}
;
\end{scope}
\end{tikzpicture}
\vspace
{
2em
}
\end{center}
\end{frame}
%%%------------------------------------------------------------------------------------------------------------
...
...
Section06-Neural-Machine-Translation/section06.tex
查看文件 @
d53fb7d4
...
...
@@ -1133,10 +1133,14 @@ NLP问题的隐含结构假设 & 无隐含结构假设,端到端学习 \\
%%% 词嵌入
\begin{frame}
{
模块1:词嵌入层
}
\begin{itemize}
\item
把输入的词转换成唯一对应的词表大小的0-1向量
\item
根据0-1向量,从词嵌入矩阵中取出对应的词嵌入
$
e
_
y
$
\item
取出的词嵌入
$
e
_
y
$
作为循环神经网络的输入
\item
词嵌入的作用是把离散化的单词表示转换为连续空间上的分布式表示
\begin{itemize}
\item
把输入的词转换成唯一对应的词表大小的0-1向量
\item
根据0-1向量,从词嵌入矩阵中取出对应的词嵌入
$
e
_
y
$
\item
取出的词嵌入
$
e
_
y
$
作为循环神经网络的输入
\end{itemize}
\end{itemize}
\vspace
{
-1em
}
%%% 图
\begin{center}
\hspace*
{
-0.6cm
}
...
...
@@ -1237,8 +1241,9 @@ NLP问题的隐含结构假设 & 无隐含结构假设,端到端学习 \\
\draw
[decorate,decoration=
{
brace,mirror
}
] ([shift=
{
(6pt,2pt)
}
]mat.south west) to node [auto,swap,font=
\scriptsize
]
{
词嵌入矩阵
}
([shift=
{
(-6pt,2pt)
}
]mat.south east);
\draw
[-latex'] ([xshift=-2pt,yshift=-0.65cm]one.east) to ([yshift=-0.65cm]words.west);
\draw
[-latex'] (emb.east) -| ([yshift=0.4cm]mat.north east);
\draw
[-latex'] (emb.east) -| ([yshift=0.4cm]mat.north east)
node [pos=1,above]
{
\scriptsize
{
RNN输入
}}
;
\draw
[-latex'] ([yshift=-0.4cm]w.south) to ([yshift=2pt]w.south);
\node
[anchor=north] (wlabel) at ([yshift=-0.6em]w.south)
{
\scriptsize
{
输入的单词
}}
;
\node
[draw=ugreen,densely dashed,thick,rounded corners=3pt,fit=(one) (words) (mat) (w)] (input)
{}
;
\end{scope}
...
...
@@ -1252,9 +1257,11 @@ NLP问题的隐含结构假设 & 无隐含结构假设,端到端学习 \\
%%% 输出
\begin{frame}
{
模块2:输出层
}
\begin{itemize}
\item
循环网络输出
$
s
$
经过权重矩阵
$
W
$
变换成词表大小的向量
\item
获得的向量经过Softmax变换得到不同词作为输出的概率
\item
一般选取概率最高的词作为模型最终的输出
\item
输出层需要得到每个目标语单词的生成概率,进而选取概率最高的词作为输出。但RNN中的隐藏层并不会输出单词概率,而是输出
$
s
$
,其每一行对应一个单词表示
\begin{itemize}
\item
循环网络输出
$
s
$
经过权重矩阵
$
W
$
变换成词表大小的向量
\item
获得的向量经过Softmax变换得到不同词作为输出的概率,即单词
$
i
$
的概率
$
p
_
i
=
\textrm
{
Softmax
}
(
i
)
=
\frac
{
e
^{
s
_
i
}}{
\sum
_{
j
}
e
^{
s
_{
j
}}}
$
\end{itemize}
\end{itemize}
%%% 图
\begin{center}
...
...
@@ -1391,16 +1398,9 @@ NLP问题的隐含结构假设 & 无隐含结构假设,端到端学习 \\
%%%------------------------------------------------------------------------------------------------------------
%%% LSTM
\begin{frame}
{
模块3:循环单元 - 长短时记忆模型(LSTM)
}
\only
<1>
{
遗忘门根据输入的
$
x
_
t
$
和
$
h
_
t
$
决定保留多大比例的
$
c
_
t
$
\\
[5pt]
}
\only
<2>
{
输入门根据
$
x
_
t
$
和
$
h
_
t
$
计算需要保存的记忆
$
\hat
{
c
}_
t
$
和其比例
\\
[5pt]
}
\only
<3>
{
把
$
c
_
t
$
和
$
\hat
{
c
}_
t
$
组合得到新的记忆
$
c
_{
t
+
1
}$
\\
[5pt]
}
\only
<4>
{
输出门根据
$
x
_
t
$
,
$
h
_
t
$
和
$
c
_{
t
+
1
}$
得到新的隐藏状态
$
h
_{
t
+
1
}$
\\
[5pt]
}
\only
<5>
{
如此反复,不断更新
$
c
$
和
$
h
$
直到不再有新的
$
x
$
输入
\\
[5pt]
}
{
\scriptsize
\begin{tabular}
{
rl
}
*
$
x
_
t
$
:
&
上一层的输出
\\
*
$
h
_
t
$
:
&
同一层上一时刻的隐藏状态
\\
*
$
c
_
t
$
:
&
同一层上一时刻的记忆
\end{tabular}
}
\begin{itemize}
\item
LSTM是最常用的循环单元结构,它一种典型的记忆网络,通过``门''单元来动态地选择遗忘多少以前的信息
\end{itemize}
%%% 图
\begin{center}
\begin{tikzpicture}
...
...
@@ -1576,11 +1576,16 @@ NLP问题的隐含结构假设 & 无隐含结构假设,端到端学习 \\
\end{scope}
\end{tikzpicture}
\end{center}
{
\scriptsize
\begin{tabular}
{
l
}
*
$
x
_
t
$
: 上一层的输出,
$
h
_
t
$
: 同一层上一时刻的隐藏状态
\\
*
$
c
_
t
$
: 同一层上一时刻的记忆
\end{tabular}
}
\end{frame}
%%%------------------------------------------------------------------------------------------------------------
%%% GRU
\begin{frame}
{
改进
- 门循环单元(GRU)
}
\begin{frame}
{
另一种循环单元
- 门循环单元(GRU)
}
\begin{itemize}
\item
GRU
\end{itemize}
...
...
@@ -5004,7 +5009,7 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
\item
需要考虑的问题:
\begin{itemize}
\item
古文短,现代文长,过翻译或者欠翻译对性能影响很大,如何对长度进行更精确的建模
\item
不同时
期、问题的文字差异性很
大,如何进行自动适应和风格迁移
\item
不同时
代语言差异性
大,如何进行自动适应和风格迁移
\end{itemize}
\end{itemize}
...
...
@@ -5019,7 +5024,7 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
\begin{frame}
{
NMT应用
}
\begin{itemize}
\item
古文翻译实例
\vspace
{
0.
5
em
}
\vspace
{
0.
0
em
}
\begin{tcolorbox}
[size=normal,left=2mm,right=1mm,colback=red!5!white,colframe=red!75!black]
{
...
...
@@ -5062,7 +5067,7 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
\begin{itemize}
\item
除了古文翻译,对联也可以用机器翻译系统生成,只需将输入输出变为对联的上联和下联
\end{itemize}
\vspace
{
-0.
5
em
}
\vspace
{
-0.
8
em
}
\begin{center}
\begin{tikzpicture}
...
...
@@ -5127,16 +5132,16 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
\node
[lnode,anchor=west] (l1) at (0,0)
{
上联:翠竹千支歌盛世
}
;
\node
[rnode,anchor=west] (l2) at ([xshift=1em]l1.east)
{
下联:红梅万点报新春
}
;
\node
[lnode,anchor=north] (l3) at ([yshift=-
1
em]l1.south)
{
上联:一帆风顺年年好
}
;
\node
[lnode,anchor=north] (l3) at ([yshift=-
0.8
em]l1.south)
{
上联:一帆风顺年年好
}
;
\node
[rnode,anchor=west] (l4) at ([xshift=1em]l3.east)
{
下联:万事如意步步高
}
;
\node
[lnode,anchor=north] (l5) at ([yshift=-
1
em]l3.south)
{
上联:佳节迎春春生笑脸
}
;
\node
[lnode,anchor=north] (l5) at ([yshift=-
0.8
em]l3.south)
{
上联:佳节迎春春生笑脸
}
;
\node
[rnode,anchor=west] (l6) at ([xshift=1em]l5.east)
{
下联:新年纳福富华满堂
}
;
\node
[lnode,anchor=north] (l7) at ([yshift=-
1
em]l5.south)
{
上联:腊梅吐芳迎红日
}
;
\node
[lnode,anchor=north] (l7) at ([yshift=-
0.8
em]l5.south)
{
上联:腊梅吐芳迎红日
}
;
\node
[rnode,anchor=west] (l8) at ([xshift=1em]l7.east)
{
下联:绿柳展枝舞春风
}
;
\node
[lnode,anchor=north] (l9) at ([yshift=-
1
em]l7.south)
{
上联:雪兆丰年丛岭翠
}
;
\node
[lnode,anchor=north] (l9) at ([yshift=-
0.8
em]l7.south)
{
上联:雪兆丰年丛岭翠
}
;
\node
[rnode,anchor=west] (l10) at ([xshift=1em]l9.east)
{
下联:春回大地满园红
}
;
\end{scope}
...
...
@@ -5158,7 +5163,7 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
\begin{itemize}
\item
还可以用机器翻译系统来写诗。如藏头诗,给定诗句的第一个字,生成一首完整的诗。还可以根据意境生成诗句
\end{itemize}
\vspace
{
0.
3
em
}
\vspace
{
0.
0
em
}
\begin{center}
\begin{tikzpicture}
...
...
@@ -5195,22 +5200,77 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
\end{frame}
%%%------------------------------------------------------------------------------------------------------------
\section
{
Transformer
}
%%% 做个小结
\begin{frame}
{
我们赶上了好时代 ...
}
\begin{itemize}
\item
神经机器翻译的火爆这几年有目共睹,好事情!!!
\begin{itemize}
\item
https://arxiv.org上搜索neural machine translation
\item
ACL、EMNLP等顶会神经机器翻译论文数量近些年几乎呈线性增长
\item
神经机器翻译系统在各大比赛中霸榜,开源机器翻译满天飞,大厂秀肌肉,小作坊刷存在感
\end{itemize}
\vspace
{
0.3em
}
\item
<2-> 这里只介绍了最基本的概念,NMT的内容远不止这些
\begin{itemize}
\item
各种专题:解码、压缩、先验知识、低资源翻译、无指导方法、篇章级翻译等等等等
\item
推荐一个survey,有些基础的可以参考一下,很全面
\\
``Neural Machine Translation: A Review'' by Felix Stahlberg
\\
\url
{
https://arxiv.org/abs/1912.02047
}
\item
如何搭建一个优秀的NMT系统?- 有许多技巧
\\
下一章介绍
\item
回忆一下第一章介绍的NMT开源系统,可以试试
\end{itemize}
\end{itemize}
\end{frame}
%%%------------------------------------------------------------------------------------------------------------
\subsection
{
多头自注意力模型
}
%%% open source NMT
\begin{frame}
{
一些开源NMT系统
}
\end{frame}
%%%------------------------------------------------------------------------------------------------------------
\subsection
{
训练
}
%%% last slide
\begin{frame}
{
结束~
}
%%%------------------------------------------------------------------------------------------------------------
\subsection
{
推断
}
\vspace
{
2em
}
%%%------------------------------------------------------------------------------------------------------------
\subsection
{
深层网络
}
\begin{center}
\begin{tikzpicture}
\begin{scope}
\small
{
\node
[anchor=south west,minimum width=15em] (source) at (0,0)
{
\Large
{
\textbf
{
source
}
: 谢谢 大家 !
}}
;
\node
[anchor=south west,minimum width=15em] (target) at ([yshift=12em]source.north west)
{
\Large
{
\textbf
{
target
}
: Thank You !
}}
;
\node
[anchor=center,minimum width=9.6em,minimum height=1.8em,draw,rounded corners=0.3em] (hidden) at ([yshift=6em]source.north)
{}
;
\node
[anchor=west,minimum width=1.5em,minimum size=1.5em,fill=ugreen!20] (cell01) at ([xshift=0.2em]hidden.west)
{
\footnotesize
{
.2
}}
;
\node
[anchor=west,minimum width=1.5em,minimum size=1.5em,fill=ugreen!10] (cell02) at (cell01.east)
{
\footnotesize
{
-1
}}
;
\node
[anchor=west,minimum width=1.5em,minimum size=1.5em,fill=ugreen!70] (cell03) at (cell02.east)
{
\footnotesize
{
6
}}
;
\node
[anchor=west,minimum width=1.5em,minimum size=1.5em,fill=ugreen!50] (cell04) at (cell03.east)
{
\footnotesize
{
5
}}
;
\node
[anchor=west,minimum width=1.5em,minimum size=1.5em,fill=ugreen!30] (cell05) at (cell04.east)
{
\footnotesize
{
.7
}}
;
\node
[anchor=west,minimum width=1.5em,minimum size=1.5em,fill=ugreen!10] (cell06) at (cell05.east)
{
\footnotesize
{
-2
}}
;
\filldraw
[fill=red!20,draw=white] (source.north west) -- (source.north east) -- ([xshift=-0.2em,yshift=-0.1em]hidden.south east) -- ([xshift=0.2em,yshift=-0.1em]hidden.south west);
\filldraw
[fill=blue!20,draw=white] (target.south west) -- (target.south east) -- ([xshift=-0.2em,yshift=0.1em]hidden.north east) -- ([xshift=0.2em,yshift=0.1em]hidden.north west);
\draw
[->,thick] (source.north west) -- ([xshift=0.2em,yshift=-0.1em]hidden.south west);
\draw
[->,thick] (source.north east) -- ([xshift=-0.2em,yshift=-0.1em]hidden.south east);
\draw
[->,thick] ([xshift=0.2em,yshift=0.1em]hidden.north west) -- (target.south west);
\draw
[->,thick] ([xshift=-0.2em,yshift=0.1em]hidden.north east) -- (target.south east);
}
\node
[anchor=south] (enclabel) at ([yshift=2em]source.north)
{
\large
{
Encoder
}}
;
\node
[anchor=north] (declabel) at ([yshift=-2em]target.south)
{
\large
{
Decoder
}}
;
\end{scope}
\end{tikzpicture}
\vspace
{
2em
}
\end{center}
\end{frame}
%%%------------------------------------------------------------------------------------------------------------
\section
{
其它应用
}
\end{CJK}
\end{document}
编写
预览
Markdown
格式
0%
重试
或
添加新文件
添加附件
取消
您添加了
0
人
到此讨论。请谨慎行事。
请先完成此评论的编辑!
取消
请
注册
或者
登录
后发表评论