第二章格式和参考文献

0001aff6 · 单韦乔 · 7b394bd1 · 0001aff6 · 0001aff6
Commit 0001aff6 authored Aug 23, 2020 by 单韦乔
--- a/Chapter2/chapter2.tex
+++ b/Chapter2/chapter2.tex
@@ -618,7 +618,7 @@ F(x)=\int_{-\infty}^x f(x)\textrm{d}x
 \subsubsection{2.古德-图灵估计法}

 \vspace{-0.5em}
-\parinterval {\small\bfnew{古德-图灵估计法}}\index{古德-图灵估计法}（Good-Turing Estimate）\index{Good-Turing Estimate}是Alan Turing和他的助手I.J.Good开发的，作为他们在二战期间破解德国密码机Enigma所使用的方法的一部分，在1953 年I.J.Good将其发表。这一方法也是很多平滑算法的核心，其基本思路是：把非零的$n$元语法单元的概率降低匀给一些低概率$n$元语法单元，以减小最大似然估计与真实概率之间的偏离\upcite{good1953population,gale1995good}。
+\parinterval {\small\bfnew{古德-图灵估计法}}\index{古德-图灵估计法}（Good-Turing Estimate）\index{Good-Turing Estimate}是Alan Turing和他的助手Irving John Good开发的，作为他们在二战期间破解德国密码机Enigma所使用的方法的一部分，在1953 年Irving John Good将其发表。这一方法也是很多平滑算法的核心，其基本思路是：把非零的$n$元语法单元的概率降低匀给一些低概率$n$元语法单元，以减小最大似然估计与真实概率之间的偏离\upcite{good1953population,gale1995good}。

 \parinterval 假定在语料库中出现$r$次的$n$-gram有$n_r$个，特别的，出现0次的$n$-gram（即未登录词及词串）出现的次数为$n_0$个。语料库中全部单词的总个数为$N$，显然
 \begin{eqnarray}
@@ -841,7 +841,6 @@ c_{\textrm{KN}}(\cdot) = \left\{\begin{array}{ll}
 \parinterval 在这种序列生成方式的基础上，实现搜索通常有两种方法\ \dash\ 深度优先遍历和宽度优先遍历\upcite{DBLP:books/mg/CormenLR89}。在深度优先遍历中，每次从词表中可重复地选择一个单词，然后从左至右地生成序列，直到<eos>被选择，此时一个完整的单词序列被生成出来。然后从<eos>回退到上一个单词，选择之前词表中未被选择到的候选单词代替<eos>，并继续挑选下一个单词直到<eos>被选到，如果上一个单词的所有可能都被枚举过，那么回退到上上一个单词继续枚举，直到回退到<sos>，这时候枚举结束。在宽度优先遍历中，每次不是只选择一个单词，而是枚举所有单词。

 有一个一个简单的例子。假设词表只含两个单词\{a, b\}，从<sos>开始枚举所有单词，有三种可能：
-
 \begin{eqnarray}
 \text{\{<sos> a, <sos> b, <sos> <eos>\}} \nonumber
 \end{eqnarray}
@@ -900,14 +899,12 @@ c_{\textrm{KN}}(\cdot) = \left\{\begin{array}{ll}
 %-------------------------------------------

 \parinterval 从这个角度来看，在树的遍历中，可以很自然地引入语言模型打分：在解空间树中引入节点的权重\ \dash\ 将当前节点$i$的得分重设为语言模型打分$\log \funp{P}(w_i | w_1 w_2 \ldots w_{i-1})$，其中$w_1 w_2 \ldots w_{i-1}$是该节点的全部祖先。与先前不同的是，由于在使用语言模型打分时，词的概率通常小于1，因此句子很长时概率会非常小，容易造成浮点误差,所以这里使用概率的对数形式$\log \funp{P}(w_i | w_1 w_2 \ldots w_{i-1})$代替$\funp{P}(w_i | w_1 w_2 \ldots w_{i-1})$。此时对于图中一条包含<eos>的完整序列来说，它的最终得分$\textrm{score}(\cdot)$可以被定义为：
-
 \begin{eqnarray}
 \textrm{score}(w_1 w_2 \ldots w_m) & = & \log \funp{P}(w_1 w_2 \ldots w_m) \nonumber \\
                                   & = & \sum_{i=1}^{m}\log \funp{P}(w_i | w_1 w_2 \ldots w_{i-1})
 \end{eqnarray}

 通常，$\textrm{score}(\cdot)$也被称作{\small\bfnew{模型得分}}\index{模型得分}（Model Score\index{Model Score}）。如图\ref{fig:2-15}所示，可知红线所示单词序列“<sos>\ I\ agree\ <eos>”的模型得分为：
-
 \begin{eqnarray}
 &&\textrm{score(<sos>\ I\ agree\ <eos>)}   \nonumber \\
 & = & \log \funp{P}(\textrm{<sos>}) + \log \funp{P}(\textrm{I} | \textrm{<sos>}) + \log \funp{P}(\textrm{agree} | \textrm{<sos>\ I}) + \log \funp{P}(\textrm{<sos>}| \textrm{<sos>\ I\ agree})   \nonumber \\

--- a/bibliography.bib
+++ b/bibliography.bib
@@ -648,7 +648,7 @@
  author    = {Yilin Yang and
               Liang Huang and
               Mingbo Ma},
-  editor    = {Ellen Riloff and
+  //editor    = {Ellen Riloff and
               David Chiang and
               Julia Hockenmaier and
               Jun'ichi Tsujii},
@@ -660,11 +660,11 @@
  publisher = {Association for Computational Linguistics},
  year      = {2018}
 }
-%缩写
+
 @article{jelinek1980interpolated,
 	title={Interpolated estimation of Markov source parameters from sparse data},
 	author={F. {Jelinek}},
-	journal={Proc. Workshop on Pattern Recognition in Practice, 1980},
+	journal={Pattern Recognition in Practice},
 	pages={381--397},
 	year={1980}
 }
@@ -842,7 +842,7 @@
 @inproceedings{DBLP:conf/acl/WangW97,
  author    = {Ye{-}Yi Wang and
               Alex Waibel},
-  editor    = {Philip R. Cohen and
+  //editor    = {Philip R. Cohen and
               Wolfgang Wahlster},
  title     = {Decoding Algorithm in Statistical Machine Translation},
  booktitle = {35th Annual Meeting of the Association for Computational Linguistics
@@ -908,7 +908,7 @@

 @inproceedings{DBLP:conf/amta/Koehn04,
  author    = {Philipp Koehn},
-  editor    = {Robert E. Frederking and
+  //editor    = {Robert E. Frederking and
               Kathryn Taylor},
  title     = {Pharaoh: {A} Beam Search Decoder for Phrase-Based Statistical Machine
               Translation Models},