updates

6817d376 · xiaotong · 99f5a117 · 6817d376 · 6817d376
Commit 6817d376 authored Jan 13, 2020 by xiaotong
--- a/Section04-Phrasal-and-Syntactic-Models/section04-test.tex
+++ b/Section04-Phrasal-and-Syntactic-Models/section04-test.tex
@@ -152,83 +152,13 @@
 \subsection{规则匹配}
 %%%------------------------------------------------------------------------------------------------------------
-%%%  基于串的解码方法
+%%%  基于串的解码 - 规则匹配
-\begin{frame}{基于串的解码}
+\begin{frame}{基于串的解码 - 规则匹配}
 \begin{itemize}
-\item 不同于基于树的解码，\alert{基于串的解码}方法并不要求输入句法树，它直接对输入词串进行翻译，最终得到译文。
+\item 相比基于树的解码，基于串的解码的实现要复杂许多，这主要是因为对于每一个片段，需要判断每条规则是否能匹配
-    \begin{itemize}
-    \item 这种方法适用于树到串、串到树、树到树等多种模型
-    \item 本质上，由于并不受固定输入的句法树约束，基于串的解码可以探索更多潜在的树结构，这也增大了搜索空间(相比基于串的解码)，因此该方法更有可能找到高质量翻译结果
-    \end{itemize}
-\item<2-> 在基于串的方法中，句法结构被看做是翻译的隐含变量，而非线性的输入和输出。比如，层次短语翻译解码就是一种典型的基于串的解码方法，所有的翻译推导在翻译过程里动态生成，但是并不要输入或者输出这些推导所对应的层次结构
 \end{itemize}
-\visible<2->{
-\begin{center}
-\begin{tikzpicture}
-\begin{scope}[scale=0.9,level distance=15pt,sibling distance=0pt]
-{\scriptsize
-\Tree[.\node(bsn0){IP};
-          [.\node(bsn1){NP};
-               [.\node(bsn2){NN}; \node(bsw1){猫}; ]
-          ]
-          [.\node(bsn3){VP};
-               [.\node(bsn4){VV}; \node(bsw2){喜欢}; ]
-               [.\node(bsn5){VP}; \edge[roof]; \node(bsw3){吃 \ 鱼}; ]
-          ]
-     ]
-\node [anchor=west] (target) at ([xshift=1em]bsw3.east) {Cats like eating fish};
-\node [anchor=north,inner sep=3pt] (cap1) at (target.south west) {(a) 基于树的解码};
-\draw [->,thick] (bsw3.east) -- (target.west);
-\node [anchor=west] (sourcelabel) at ([xshift=2em]bsn0.east) {显式输入的结构};
-\node [anchor=west] (source2) at ([xshift=2em]target.east) {猫 喜欢 吃 鱼};
-\node [anchor=west] (target2) at ([xshift=1em]source2.east) {Cats like eating fish};
-\node [anchor=north,inner sep=3pt] (cap2) at (target2.south west) {(b) 基于串的解码};
-\draw [->,thick] (source2.east) -- (target2.west);
-\begin{scope}[xshift=2.45in,sibling distance=3pt]
-\Tree[.\node(bsn0){IP};
-          [.\node(bsn1){NP};
-               [.\node(bsn2){NN}; ]
-          ]
-          [.\node(bsn3){VP};
-               [.\node(bsn4){VV}; ]
-               [.\node(bsn5){VP}; ]
-          ]
-     ]
-\begin{pgfonlayer}{background}
-\node [draw,dashed,inner sep=2pt] (box) [fit = (bsn0) (bsn1) (bsn2) (bsn3) (bsn4) (bsn5)] {};
-\node [anchor=north west] (boxlabel) at (box.north east) {隐含结构};
-\end{pgfonlayer}
-\end{scope}
-}
-\end{scope}
-\end{tikzpicture}
-\end{center}
-}
-\end{frame}
-%%%------------------------------------------------------------------------------------------------------------
-%%%  基于串的解码方法本质上和句法分析一样
-\begin{frame}{基于串的解码 $\approx$ 句法分析}
-\begin{itemize}
-\item 基于串的翻译和传统\alert{句法分析}的任务很像：对于一个输入的词串，找到生成这个词串的最佳推导。唯一不同的地方，在于机器翻译需要考虑译文的生成(语言模型的引入会使问题稍微复杂一些)，但是源语言部分的处理和句法分析一模一样
-\item<2-> 这个过程仍然可以用基于chart的方法实现，即对于每一个源语言片段，都匹配可能的翻译规则，之后填入相应的表格单元，这也构成了一个超图，最佳推导可以从这个超图得到
-\end{itemize}
-\visible<2->{
 \begin{center}
 \begin{tikzpicture}
@@ -263,7 +193,6 @@
 \end{tikzpicture}
 \end{center}
-}
 \end{frame}

--- a/Section06-Neural-Machine-Translation/section06.tex
+++ b/Section06-Neural-Machine-Translation/section06.tex
@@ -5366,7 +5366,7 @@ x_{l+1} = x_l+\mathcal{F}(x_l)
 \item 由于自回归性，Transformer在推断阶段无法进行并行化操作，导致推断速度非常慢！
-\item<2-> 加速手段：低精度、Cache(缓存需要重复计算的变量) 、Average Attention Network、Share Attention Network
+\item<2-> 加速手段：低精度、Cache(缓存需要重复计算的变量) 、Average Attention Network、Shared Attention Network
 \end{itemize}