Skip to content
项目
群组
代码片段
帮助
当前项目
正在载入...
登录 / 注册
切换导航面板
T
Toy-MT-Introduction
概览
Overview
Details
Activity
Cycle Analytics
版本库
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
问题
0
Issues
0
列表
Board
标记
里程碑
合并请求
0
Merge Requests
0
CI / CD
CI / CD
流水线
作业
日程表
图表
维基
Wiki
代码片段
Snippets
成员
Collapse sidebar
Close sidebar
活动
图像
聊天
创建新问题
作业
提交
Issue Boards
Open sidebar
单韦乔
Toy-MT-Introduction
Commits
8c083c20
Commit
8c083c20
authored
Dec 19, 2019
by
姜雨帆
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
Update Transformer
parent
e46c4b29
隐藏空白字符变更
内嵌
并排
正在显示
1 个修改的文件
包含
123 行增加
和
98 行删除
+123
-98
Section06-Neural-Machine-Translation/section06.tex
+123
-98
没有找到文件。
Section06-Neural-Machine-Translation/section06.tex
查看文件 @
8c083c20
...
@@ -3398,86 +3398,7 @@ $\textrm{``you''} = \argmax_{y} \textrm{P}(y|\textbf{s}_1, \alert{\textbf{C}})$
...
@@ -3398,86 +3398,7 @@ $\textrm{``you''} = \argmax_{y} \textrm{P}(y|\textbf{s}_1, \alert{\textbf{C}})$
\end{itemize}
\end{itemize}
\end{frame}
\end{frame}
%
%
%%%%------------------------------------------------------------------------------------------------------------
%%%% 自注意力机制(续)
%\begin{frame}{自注意力机制(续)}
%\begin{itemize}
%\item 基于前面对注意力机制的介绍,自注意力机制则是将源语言每个位置的表示$\textbf{h}_{i}$看做query,同时将所有位置的表示看做{\color{ugreen} \textbf{key}}和{\color{red} \textbf{value}}
%
%\vspace{-1.5em}
%\begin{center}
%\begin{tikzpicture}
%\begin{scope}
%
%\tikzstyle{rnode} = [draw,minimum width=3.5em,minimum height=1.2em]
%
%\node [rnode,anchor=south west,fill=red!20!white] (value1) at (0,0) {\scriptsize{$\textbf{h}(\textrm{``你''})$}};
%\node [rnode,anchor=south west,fill=red!20!white] (value2) at ([xshift=1em]value1.south east) {\scriptsize{$\textbf{h}(\textrm{``什么''})$}};
%\node [rnode,anchor=south west,fill=red!20!white] (value3) at ([xshift=1em]value2.south east) {\scriptsize{$\textbf{h}(\textrm{``也''})$}};
%\node [rnode,anchor=south west,fill=red!20!white] (value4) at ([xshift=1em]value3.south east) {\scriptsize{$\textbf{h}(\textrm{``没''})$}};
%
%
%\node [rnode,anchor=south west,fill=green!20!white] (key1) at ([yshift=0.2em]value1.north west) {\scriptsize{$\textbf{h}(\textrm{``你''})$}};
%\node [rnode,anchor=south west,fill=green!20!white] (key2) at ([yshift=0.2em]value2.north west) {\scriptsize{$\textbf{h}(\textrm{``什么''})$}};
%\node [rnode,anchor=south west,fill=green!20!white] (key3) at ([yshift=0.2em]value3.north west) {\scriptsize{$\textbf{h}(\textrm{``也''})$}};
%\node [rnode,anchor=south west,fill=green!20!white] (key4) at ([yshift=0.2em]value4.north west) {\scriptsize{$\textbf{h}(\textrm{``没''})$}};
%
%\node [rnode,anchor=east,fill=white] (query) at ([xshift=-2em]key1.west) {\scriptsize{$\textbf{h}(\textrm{``没''})$}};
%\node [anchor=south] (sep) at ([yshift=0.8em]query.north) {\scriptsize{$\textbf{...}$}};
%
%\draw [->] ([yshift=1pt,xshift=6pt]query.north) .. controls +(90:1em) and +(90:1em) .. ([yshift=1pt]key1.north);
%\draw [->] ([yshift=1pt,xshift=3pt]query.north) .. controls +(90:1.4em) and +(90:1.4em) .. ([yshift=1pt]key2.north);
%\draw [->] ([yshift=1pt]query.north) .. controls +(90:1.8em) and +(90:1.8em) .. ([yshift=1pt]key3.north);
%\draw [->] ([yshift=1pt,xshift=-3pt]query.north) .. controls +(90:2.2em) and +(90:2.2em) .. ([yshift=1pt]key4.north);
%\node [anchor=south east] (alpha1) at ([xshift=1em]key1.north east) {\scriptsize{$\alpha_1=.1$}};
%\node [anchor=south east] (alpha2) at ([xshift=1em]key2.north east) {\scriptsize{$\alpha_2=.1$}};
%\node [anchor=south east] (alpha3) at ([xshift=1em]key3.north east) {\scriptsize{$\alpha_3=.3$}};
%\node [anchor=south east] (alpha4) at ([xshift=1em]key4.north east) {\scriptsize{$\alpha_4=.4$}};
%
%
%\node [rnode,anchor=south west,fill=green!20!white] (key5) at ([yshift=2em]key1.north west) {\scriptsize{$\textbf{h}(\textrm{``你''})$}};
%\node [rnode,anchor=south west,fill=green!20!white] (key6) at ([yshift=2em]key2.north west) {\scriptsize{$\textbf{h}(\textrm{``什么''})$}};
%\node [rnode,anchor=south west,fill=green!20!white] (key7) at ([yshift=2em]key3.north west) {\scriptsize{$\textbf{h}(\textrm{``也''})$}};
%\node [rnode,anchor=south west,fill=green!20!white] (key8) at ([yshift=2em]key4.north west) {\scriptsize{$\textbf{h}(\textrm{``没''})$}};
%
%\node [rnode,anchor=east,fill=white] (query1) at ([xshift=-2em]key5.west) {\scriptsize{$\textbf{h}(\textrm{``什么''})$}};
%
%\draw [->] ([yshift=1pt,xshift=6pt]query1.north) .. controls +(90:1em) and +(90:1em) .. ([yshift=1pt]key5.north);
%\draw [->] ([yshift=1pt,xshift=3pt]query1.north) .. controls +(90:1.4em) and +(90:1.4em) .. ([yshift=1pt]key6.north);
%\draw [->] ([yshift=1pt]query1.north) .. controls +(90:1.8em) and +(90:1.8em) .. ([yshift=1pt]key7.north);
%\draw [->] ([yshift=1pt,xshift=-3pt]query1.north) .. controls +(90:2.2em) and +(90:2.2em) .. ([yshift=1pt]key8.north);
%\node [anchor=south east] (alpha1) at ([xshift=1em]key5.north east) {\scriptsize{$\alpha_1=.2$}};
%\node [anchor=south east] (alpha2) at ([xshift=1em]key6.north east) {\scriptsize{$\alpha_2=.5$}};
%\node [anchor=south east] (alpha3) at ([xshift=1em]key7.north east) {\scriptsize{$\alpha_3=.1$}};
%\node [anchor=south east] (alpha4) at ([xshift=1em]key8.north east) {\scriptsize{$\alpha_4=.1$}};
%
%
%\node [rnode,anchor=south west,fill=green!20!white] (key9) at ([yshift=2em]key5.north west) {\scriptsize{$\textbf{h}(\textrm{``你''})$}};
%\node [rnode,anchor=south west,fill=green!20!white] (key10) at ([yshift=2em]key6.north west) {\scriptsize{$\textbf{h}(\textrm{``什么''})$}};
%\node [rnode,anchor=south west,fill=green!20!white] (key11) at ([yshift=2em]key7.north west) {\scriptsize{$\textbf{h}(\textrm{``也''})$}};
%\node [rnode,anchor=south west,fill=green!20!white] (key12) at ([yshift=2em]key8.north west) {\scriptsize{$\textbf{h}(\textrm{``没''})$}};
%
%\node [rnode,anchor=east,fill=white] (query2) at ([xshift=-2em]key9.west) {\scriptsize{$\textbf{h}(\textrm{``你''})$}};
%\node [anchor=east] (querylabel2) at ([xshift=-0.2em]query2.west) {\scriptsize{query}};
%
%\draw [->] ([yshift=1pt,xshift=6pt]query2.north) .. controls +(90:1em) and +(90:1em) .. ([yshift=1pt]key9.north);
%\draw [->] ([yshift=1pt,xshift=3pt]query2.north) .. controls +(90:1.4em) and +(90:1.4em) .. ([yshift=1pt]key10.north);
%\draw [->] ([yshift=1pt]query2.north) .. controls +(90:1.8em) and +(90:1.8em) .. ([yshift=1pt]key11.north);
%\draw [->] ([yshift=1pt,xshift=-3pt]query2.north) .. controls +(90:2.2em) and +(90:2.2em) .. ([yshift=1pt]key12.north);
%\node [anchor=south east] (alpha1) at ([xshift=1em]key9.north east) {\scriptsize{$\alpha_1=.4$}};
%\node [anchor=south east] (alpha2) at ([xshift=1em]key10.north east) {\scriptsize{$\alpha_2=.4$}};
%\node [anchor=south east] (alpha3) at ([xshift=1em]key11.north east) {\scriptsize{$\alpha_3=0$}};
%\node [anchor=south east] (alpha4) at ([xshift=1em]key12.north east) {\scriptsize{$\alpha_4=.1$}};
%
%\end{scope}
%\end{tikzpicture}
%\end{center}
%
%\item<2->自注意力模型通过计算源语各位置的匹配程度对value进行加权求和,完成对源语信息的提取
%
%\end{itemize}
%\end{frame}
%
%%%------------------------------------------------------------------------------------------------------------
%%%------------------------------------------------------------------------------------------------------------
%%% 自注意力机制(续)
%%% 自注意力机制(续)
\begin{frame}
{
自注意力机制(续)
}
\begin{frame}
{
自注意力机制(续)
}
...
@@ -3687,7 +3608,7 @@ $\textrm{``you''} = \argmax_{y} \textrm{P}(y|\textbf{s}_1, \alert{\textbf{C}})$
...
@@ -3687,7 +3608,7 @@ $\textrm{``you''} = \argmax_{y} \textrm{P}(y|\textbf{s}_1, \alert{\textbf{C}})$
% colback=blue!10!white,colbacklower=black!5!white]
% colback=blue!10!white,colbacklower=black!5!white]
\begin{itemize}
\begin{itemize}
\item
Transformer
总体结构
\item
Transformer
输入和位置编码
\end{itemize}
\end{itemize}
...
@@ -3751,8 +3672,13 @@ $\textrm{``you''} = \argmax_{y} \textrm{P}(y|\textbf{s}_1, \alert{\textbf{C}})$
...
@@ -3751,8 +3672,13 @@ $\textrm{``you''} = \argmax_{y} \textrm{P}(y|\textbf{s}_1, \alert{\textbf{C}})$
\draw
[->,standard]
(res2.north) -- ([yshift=0.5em]res2.north) -- ([xshift=5em,yshift=0.5em]res2.north) -- ([xshift=5em,yshift=-2.2em]res2.north) -- ([xshift=6.5em,yshift=-2.2em]res2.north);
\draw
[->,standard]
(res2.north) -- ([yshift=0.5em]res2.north) -- ([xshift=5em,yshift=0.5em]res2.north) -- ([xshift=5em,yshift=-2.2em]res2.north) -- ([xshift=6.5em,yshift=-2.2em]res2.north);
\node
[rectangle,inner sep=0.7em,rounded corners=1pt,very thick,dotted,draw=ugreen!70] [fit = (sa1) (res1) (ffn1) (res2)] (box0)
{}
;
%\node [rectangle,inner sep=0.7em,rounded corners=1pt,very thick,dotted,draw=ugreen!70] [fit = (sa1) (res1) (ffn1) (res2)] (box0) {};
\node
[rectangle,inner sep=0.7em,rounded corners=1pt,very thick,dotted,draw=red!60] [fit = (sa2) (res3) (res5)] (box1)
{}
;
%\node [rectangle,inner sep=0.7em,rounded corners=1pt,very thick,dotted,draw=red!60] [fit = (sa2) (res3) (res5)] (box1) {};
\begin{pgfonlayer}
{
background
}
\node
[rectangle,inner sep=0.2em,rounded corners=1pt,very thick,dotted,fill=red!40] [fit = (input1) (pos1)] (box1)
{}
;
\node
[rectangle,inner sep=0.2em,rounded corners=1pt,very thick,dotted,fill=red!40] [fit = (input2) (pos2)] (box2)
{}
;
\end{pgfonlayer}
\end{scope}
\end{scope}
\end{tikzpicture}
\end{tikzpicture}
...
@@ -3918,7 +3844,7 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
...
@@ -3918,7 +3844,7 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
% colback=blue!10!white,colbacklower=black!5!white]
% colback=blue!10!white,colbacklower=black!5!white]
\begin{itemize}
\begin{itemize}
\item
Transformer
总体结构
\item
Transformer
多头自注意力机制
\end{itemize}
\end{itemize}
...
@@ -3982,8 +3908,14 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
...
@@ -3982,8 +3908,14 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
\draw
[->,standard]
(res2.north) -- ([yshift=0.5em]res2.north) -- ([xshift=5em,yshift=0.5em]res2.north) -- ([xshift=5em,yshift=-2.2em]res2.north) -- ([xshift=6.5em,yshift=-2.2em]res2.north);
\draw
[->,standard]
(res2.north) -- ([yshift=0.5em]res2.north) -- ([xshift=5em,yshift=0.5em]res2.north) -- ([xshift=5em,yshift=-2.2em]res2.north) -- ([xshift=6.5em,yshift=-2.2em]res2.north);
\node
[rectangle,inner sep=0.7em,rounded corners=1pt,very thick,dotted,draw=ugreen!70] [fit = (sa1) (res1) (ffn1) (res2)] (box0)
{}
;
%\node [rectangle,inner sep=0.7em,rounded corners=1pt,very thick,dotted,draw=ugreen!70] [fit = (sa1) (res1) (ffn1) (res2)] (box0) {};
\node
[rectangle,inner sep=0.7em,rounded corners=1pt,very thick,dotted,draw=red!60] [fit = (sa2) (res3) (res5)] (box1)
{}
;
%\node [rectangle,inner sep=0.7em,rounded corners=1pt,very thick,dotted,draw=red!60] [fit = (sa2) (res3) (res5)] (box1) {};
\begin{pgfonlayer}
{
background
}
\node
[rectangle,inner sep=0.2em,rounded corners=1pt,very thick,dotted,fill=red!40] [fit = (sa1)] (box1)
{}
;
\node
[rectangle,inner sep=0.2em,rounded corners=1pt,very thick,dotted,fill=red!40] [fit = (sa2)] (box2)
{}
;
\node
[rectangle,inner sep=0.2em,rounded corners=1pt,very thick,dotted,fill=red!40] [fit = (ed1)] (box3)
{}
;
\end{pgfonlayer}
\end{scope}
\end{scope}
\end{tikzpicture}
\end{tikzpicture}
...
@@ -4204,7 +4136,7 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
...
@@ -4204,7 +4136,7 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
% colback=blue!10!white,colbacklower=black!5!white]
% colback=blue!10!white,colbacklower=black!5!white]
\begin{itemize}
\begin{itemize}
\item
Transformer
总体结构
\item
Transformer
残差和层正则化
\end{itemize}
\end{itemize}
...
@@ -4268,8 +4200,15 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
...
@@ -4268,8 +4200,15 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
\draw
[->,standard]
(res2.north) -- ([yshift=0.5em]res2.north) -- ([xshift=5em,yshift=0.5em]res2.north) -- ([xshift=5em,yshift=-2.2em]res2.north) -- ([xshift=6.5em,yshift=-2.2em]res2.north);
\draw
[->,standard]
(res2.north) -- ([yshift=0.5em]res2.north) -- ([xshift=5em,yshift=0.5em]res2.north) -- ([xshift=5em,yshift=-2.2em]res2.north) -- ([xshift=6.5em,yshift=-2.2em]res2.north);
\node
[rectangle,inner sep=0.7em,rounded corners=1pt,very thick,dotted,draw=ugreen!70] [fit = (sa1) (res1) (ffn1) (res2)] (box0)
{}
;
%\node [rectangle,inner sep=0.7em,rounded corners=1pt,very thick,dotted,draw=ugreen!70] [fit = (sa1) (res1) (ffn1) (res2)] (box0) {};
\node
[rectangle,inner sep=0.7em,rounded corners=1pt,very thick,dotted,draw=red!60] [fit = (sa2) (res3) (res5)] (box1)
{}
;
%\node [rectangle,inner sep=0.7em,rounded corners=1pt,very thick,dotted,draw=red!60] [fit = (sa2) (res3) (res5)] (box1) {};
\begin{pgfonlayer}
{
background
}
\node
[rectangle,inner sep=0.2em,rounded corners=1pt,very thick,dotted,fill=red!40] [fit = (res1)] (box1)
{}
;
\node
[rectangle,inner sep=0.2em,rounded corners=1pt,very thick,dotted,fill=red!40] [fit = (res2)] (box2)
{}
;
\node
[rectangle,inner sep=0.2em,rounded corners=1pt,very thick,dotted,fill=red!40] [fit = (res3)] (box3)
{}
;
\node
[rectangle,inner sep=0.2em,rounded corners=1pt,very thick,dotted,fill=red!40] [fit = (res4)] (box4)
{}
;
\end{pgfonlayer}
\end{scope}
\end{scope}
\end{tikzpicture}
\end{tikzpicture}
...
@@ -4277,7 +4216,6 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
...
@@ -4277,7 +4216,6 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
%\end{tcolorbox}
%\end{tcolorbox}
\end{frame}
\end{frame}
%%%------------------------------------------------------------------------------------------------------------
%%%------------------------------------------------------------------------------------------------------------
\subsection
{
残差和层正则化
}
\subsection
{
残差和层正则化
}
\begin{frame}
{
残差
\&
层正则化
}
\begin{frame}
{
残差
\&
层正则化
}
...
@@ -4315,7 +4253,12 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
...
@@ -4315,7 +4253,12 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
\end{center}
\end{center}
\vspace
{
0.5em
}
\vspace
{
0.5em
}
\item
<2-> 在这里引入了在图像领域用来训练深层网络的技术,
\\
{
\color
{
red
}
\textbf
{
残差网络
}}
来避免上述问题
\item
<2-> 在这里引入了在图像领域用来训练深层网络的技术,
\\
{
\color
{
red
}
\textbf
{
残差网络
}}
来避免上述问题
\vspace
{
1.5em
}
%\vspace{0.5em}
\begin{displaymath}
x
_{
l+1
}
= x
_
l+
\mathcal
{
F
}
(x
_
l)
\end{displaymath}
\vspace
{
0em
}
\begin{center}
\begin{center}
\begin{tikzpicture}
\begin{tikzpicture}
\begin{scope}
\begin{scope}
...
@@ -4362,14 +4305,18 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
...
@@ -4362,14 +4305,18 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
\begin{itemize}
\begin{itemize}
\item
在Transformer的训练过程中,由于引入了残差操作,将前面所有层的输出加到一起。这样会导致高层的参数分布不断变大,造成训练过程不稳定、训练时间较长。
\item
在Transformer的训练过程中,由于引入了残差操作,将前面所有层的输出加到一起。这样会导致高层的参数分布不断变大,造成训练过程不稳定、训练时间较长。
为了避免这种情况,在每层中加入了层正则化操作
为了避免这种情况,在每层中加入了层正则化操作
\vspace
{
0.5em
}
\begin{itemize}
\begin{itemize}
\item
使用均值和方差对样本进行平移缩放,将数据规范化为均值为0,方差为1的标准分布
\item
使用均值和方差对样本进行平移缩放,将数据规范化为均值为0,方差为1的标准分布
\item
层正则化针对单个训练样本
\end{itemize}
\end{itemize}
\vspace
{
0.5em
}
\vspace
{
-0.3em
}
\begin{displaymath}
\textrm
{
LN(
}
x
\textrm
{
)
}
= g
\cdot\frac
{
x-
\mu
}{
\sigma
}
+b
\end{displaymath}
\item
<2->在Transformer中经常使用的层正则化操作有两种,分别是后正则化和前正则化
\item
<2->在Transformer中经常使用的层正则化操作有两种,分别是后正则化和前正则化
\vspace
{
0.5em
}
%
\vspace{0.5em}
\begin{center}
\begin{center}
\begin{tikzpicture}
\begin{tikzpicture}
\begin{scope}
\begin{scope}
...
@@ -4407,6 +4354,7 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
...
@@ -4407,6 +4354,7 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
\end{frame}
\end{frame}
%%%------------------------------------------------------------------------------------------------------------
%%%------------------------------------------------------------------------------------------------------------
\section
{
Transformer
}
\section
{
Transformer
}
\begin{frame}
{
Transformer
}
\begin{frame}
{
Transformer
}
...
@@ -4415,7 +4363,7 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
...
@@ -4415,7 +4363,7 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
% colback=blue!10!white,colbacklower=black!5!white]
% colback=blue!10!white,colbacklower=black!5!white]
\begin{itemize}
\begin{itemize}
\item
Transformer
总体结构
\item
Transformer
前馈全连接网络
\end{itemize}
\end{itemize}
...
@@ -4479,8 +4427,13 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
...
@@ -4479,8 +4427,13 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
\draw
[->,standard]
(res2.north) -- ([yshift=0.5em]res2.north) -- ([xshift=5em,yshift=0.5em]res2.north) -- ([xshift=5em,yshift=-2.2em]res2.north) -- ([xshift=6.5em,yshift=-2.2em]res2.north);
\draw
[->,standard]
(res2.north) -- ([yshift=0.5em]res2.north) -- ([xshift=5em,yshift=0.5em]res2.north) -- ([xshift=5em,yshift=-2.2em]res2.north) -- ([xshift=6.5em,yshift=-2.2em]res2.north);
\node
[rectangle,inner sep=0.7em,rounded corners=1pt,very thick,dotted,draw=ugreen!70] [fit = (sa1) (res1) (ffn1) (res2)] (box0)
{}
;
%\node [rectangle,inner sep=0.7em,rounded corners=1pt,very thick,dotted,draw=ugreen!70] [fit = (sa1) (res1) (ffn1) (res2)] (box0) {};
\node
[rectangle,inner sep=0.7em,rounded corners=1pt,very thick,dotted,draw=red!60] [fit = (sa2) (res3) (res5)] (box1)
{}
;
%\node [rectangle,inner sep=0.7em,rounded corners=1pt,very thick,dotted,draw=red!60] [fit = (sa2) (res3) (res5)] (box1) {};
\begin{pgfonlayer}
{
background
}
\node
[rectangle,inner sep=0.2em,rounded corners=1pt,very thick,dotted,fill=red!40] [fit = (ffn1)] (box1)
{}
;
\node
[rectangle,inner sep=0.2em,rounded corners=1pt,very thick,dotted,fill=red!40] [fit = (ffn2)] (box2)
{}
;
\end{pgfonlayer}
\end{scope}
\end{scope}
\end{tikzpicture}
\end{tikzpicture}
...
@@ -4961,6 +4914,78 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
...
@@ -4961,6 +4914,78 @@ PE_{(pos,2i+1)} = cos(pos/10000^{2i/d_{model}})
\end{frame}
\end{frame}
%%%------------------------------------------------------------------------------------------------------------
%%%------------------------------------------------------------------------------------------------------------
%%% 应用
\begin{frame}
{
有趣的应用
}
\vspace
{
5.0em
}
\begin{tcolorbox}
[enhanced,size=normal,left=13mm,right=1mm,colback=red!5!white,colframe=red!75!black,drop fuzzy shadow]
{
\Large
\textbf
{
一些有趣的神经机器翻译应用
}
}
\end{tcolorbox}
\begin{center}
\begin{tikzpicture}
\begin{scope}
[scale=0.7]
\tikzstyle
{
rnnnode
}
= [draw,inner sep=5pt,minimum width=3em,minimum height=0.8em,fill=green!30!white,blur shadow=
{
shadow xshift=1pt,shadow yshift=-1pt
}
]
\node
[anchor=west,rnnnode] (node11) at (0,0)
{
\tiny
{
Self-Attention
}}
;
\node
[anchor=west,rnnnode] (node12) at ([xshift=1em]node11.east)
{
\tiny
{
Self-Attention
}}
;
\node
[anchor=west,rnnnode] (node13) at ([xshift=1em]node12.east)
{
\tiny
{
Self-Attention
}}
;
\node
[anchor=west,rnnnode] (node14) at ([xshift=1em]node13.east)
{
\tiny
{
Self-Attention
}}
;
\node
[anchor=north,rnnnode,fill=blue!30!white] (e1) at ([yshift=-2em]node11.south)
{
\tiny
{}}
;
\node
[anchor=north,rnnnode,fill=blue!30!white] (e2) at ([yshift=-2em]node12.south)
{
\tiny
{}}
;
\node
[anchor=north,rnnnode,fill=blue!30!white] (e3) at ([yshift=-2em]node13.south)
{
\tiny
{}}
;
\node
[anchor=north,rnnnode,fill=blue!30!white] (e4) at ([yshift=-2em]node14.south)
{
\tiny
{}}
;
\node
[anchor=north,inner sep=2pt] (w1) at ([yshift=-1em]e1.south)
{
\tiny
{$
<
$
s
$
>
$}}
;
\node
[anchor=north,inner sep=2pt] (w2) at ([yshift=-1em]e2.south)
{
\tiny
{
让
}}
;
\node
[anchor=north,inner sep=2pt] (w3) at ([yshift=-1em]e3.south)
{
\tiny
{
我们
}}
;
\node
[anchor=north,inner sep=2pt] (w4) at ([yshift=-1em]e4.south)
{
\tiny
{
开始
}}
;
\draw
[->,thick] ([yshift=0.1em]w1.north)--([yshift=-0.1em]e1.south);
\draw
[->,thick] ([yshift=0.1em]w2.north)--([yshift=-0.1em]e2.south);
\draw
[->,thick] ([yshift=0.1em]w3.north)--([yshift=-0.1em]e3.south);
\draw
[->,thick] ([yshift=0.1em]w4.north)--([yshift=-0.1em]e4.south);
\draw
[->,thick] ([yshift=0.1em]e1.north)--([yshift=-0.1em]node11.south);
\draw
[->,thick] ([yshift=0.1em]e2.north)--([yshift=-0.1em]node12.south);
\draw
[->,thick] ([yshift=0.1em]e3.north)--([yshift=-0.1em]node13.south);
\draw
[->,thick] ([yshift=0.1em]e4.north)--([yshift=-0.1em]node14.south);
\draw
[->,thick] ([yshift=0.1em]e1.north)--([xshift=-0.5em,yshift=-0.2em]node12.south);
\draw
[->,thick] ([yshift=0.1em]e3.north)--([xshift=0.5em,yshift=-0.2em]node12.south);
\draw
[->,thick] ([yshift=0.1em]e4.north)--([xshift=1.5em,yshift=-0.2em]node12.south);
\node
[anchor=south,rnnnode,fill=blue!30!white] (node21) at ([yshift=2.0em]node11.north)
{
\tiny
{}}
;
\node
[anchor=south,rnnnode,fill=blue!30!white] (node22) at ([yshift=2.0em]node12.north)
{
\tiny
{}}
;
\node
[anchor=south,rnnnode,fill=blue!30!white] (node23) at ([yshift=2.0em]node13.north)
{
\tiny
{}}
;
\node
[anchor=south,rnnnode,fill=blue!30!white] (node24) at ([yshift=2.0em]node14.north)
{
\tiny
{}}
;
\node
[anchor=south] (output1) at ([yshift=1em]node21.north)
{
\Large
{
\textbf
{}}}
;
\node
[anchor=south] (output2) at ([yshift=1em]node22.north)
{
\Large
{
\textbf
{}}}
;
\node
[anchor=south] (output3) at ([yshift=1em]node23.north)
{
\Large
{
\textbf
{}}}
;
\node
[anchor=south] (output4) at ([yshift=1em]node24.north)
{
\Large
{
\textbf
{}}}
;
\draw
[->,thick] ([yshift=0.1em]node21.north)--([yshift=-0.1em]output1.south);
\draw
[->,thick] ([yshift=0.1em]node22.north)--([yshift=-0.1em]output2.south);
\draw
[->,thick] ([yshift=0.1em]node23.north)--([yshift=-0.1em]output3.south);
\draw
[->,thick] ([yshift=0.1em]node24.north)--([yshift=-0.1em]output4.south);
\draw
[->,thick] ([yshift=0.1em]node11.north)--([yshift=-0.1em]node21.south);
\draw
[->,thick] ([yshift=0.1em]node12.north)--([yshift=-0.1em]node22.south);
\draw
[->,thick] ([yshift=0.1em]node13.north)--([yshift=-0.1em]node23.south);
\draw
[->,thick] ([yshift=0.1em]node14.north)--([yshift=-0.1em]node24.south);
\end{scope}
\end{tikzpicture}
\end{center}
\end{frame}
%%%------------------------------------------------------------------------------------------------------------
\subsection
{
应用
}
\subsection
{
应用
}
\begin{frame}
{
NMT应用
}
\begin{frame}
{
NMT应用
}
\begin{itemize}
\begin{itemize}
...
...
编写
预览
Markdown
格式
0%
重试
或
添加新文件
添加附件
取消
您添加了
0
人
到此讨论。请谨慎行事。
请先完成此评论的编辑!
取消
请
注册
或者
登录
后发表评论