Skip to content
项目
群组
代码片段
帮助
当前项目
正在载入...
登录 / 注册
切换导航面板
T
Toy-MT-Introduction
概览
Overview
Details
Activity
Cycle Analytics
版本库
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
问题
0
Issues
0
列表
Board
标记
里程碑
合并请求
0
Merge Requests
0
CI / CD
CI / CD
流水线
作业
日程表
图表
维基
Wiki
代码片段
Snippets
成员
Collapse sidebar
Close sidebar
活动
图像
聊天
创建新问题
作业
提交
Issue Boards
Open sidebar
单韦乔
Toy-MT-Introduction
Commits
aa6c2d09
Commit
aa6c2d09
authored
Feb 29, 2020
by
曹润柘
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
updates
parent
e31fccc6
全部展开
隐藏空白字符变更
内嵌
并排
正在显示
15 个修改的文件
包含
149 行增加
和
78 行删除
+149
-78
Book/Chapter1/Figures/figure-Example-RBMT.tex
+7
-7
Book/Chapter1/Figures/figure-Example-SMT.tex
+6
-6
Book/Chapter1/Figures/figure-zh-sentences-into-en-sentences.tex
+4
-4
Book/Chapter1/chapter1.tex
+0
-0
Book/Chapter2/Figures/figure-Self-information-function.tex
+11
-6
Book/Chapter2/Figures/figure-evaluation-of-probability-for-grammar.tex
+6
-6
Book/Chapter2/Figures/figure-examples-of-Chinese-word-segmentation-based-on-1-gram-model.tex
+5
-5
Book/Chapter2/Figures/figure-full-probability-word-segmentation-3.tex
+6
-6
Book/Chapter2/Figures/figure-no-smoothing&smoothed-probability-distributions.tex
+49
-22
Book/Chapter2/Figures/figure-the-dice-game-model.tex
+6
-6
Book/Chapter2/Figures/figure-word-frequency-distribution.tex
+47
-8
Book/Chapter2/chapter2.tex
+0
-0
Book/Chapter3/Chapter3.tex
+0
-0
Book/Chapter3/Figures/figure-a-more-detailed-explanation-of-formula-3.40.tex
+1
-1
Book/mt-book.tex
+1
-1
没有找到文件。
Book/Chapter1/Figures/figure-Example-RBMT.tex
查看文件 @
aa6c2d09
...
...
@@ -10,14 +10,14 @@
\begin{scope}
{
{
\footnotesize
\node
[anchor=north west] (rule1) at (0,0)
{
\textbf
{
1: If
}
源=`
我',
\textbf
{
then
}
译=`I
'
}
;
\node
[anchor=north west] (rule2) at ([yshift=0.1em]rule1.south west)
{
\textbf
{
2: If
}
源=`
你',
\textbf
{
then
}
译=`you
'
}
;
\node
[anchor=north west] (rule3) at ([yshift=0.1em]rule2.south west)
{
\textbf
{
3: If
}
源=`
感到 满意
',
}
;
\node
[anchor=north west] (rule3part2) at ([yshift=0.5em]rule3.south west)
{
\textbf
{
\hspace
{
0.95em
}
then
}
译=`
be satisfied with
'
}
;
\node
[anchor=north west] (rule4) at ([yshift=0.1em]rule3part2.south west)
{
\textbf
{
4: If
}
源=`
对 ... 动词[表态度]
'
}
;
\node
[anchor=north west] (rule1) at (0,0)
{
\textbf
{
1: If
}
源=`
`我'',
\textbf
{
then
}
译=``I'
'
}
;
\node
[anchor=north west] (rule2) at ([yshift=0.1em]rule1.south west)
{
\textbf
{
2: If
}
源=`
`你'',
\textbf
{
then
}
译=``you'
'
}
;
\node
[anchor=north west] (rule3) at ([yshift=0.1em]rule2.south west)
{
\textbf
{
3: If
}
源=`
`感到 满意'
',
}
;
\node
[anchor=north west] (rule3part2) at ([yshift=0.5em]rule3.south west)
{
\textbf
{
\hspace
{
0.95em
}
then
}
译=`
`be satisfied with'
'
}
;
\node
[anchor=north west] (rule4) at ([yshift=0.1em]rule3part2.south west)
{
\textbf
{
4: If
}
源=`
`对 ... 动词[表态度]'
'
}
;
\node
[anchor=north west] (rule4part2) at ([yshift=0.5em]rule4.south west)
{
\textbf
{
\hspace
{
0.95em
}
then
}
调序[动词 + 对象]
}
;
\node
[anchor=north west] (rule5) at ([yshift=0.1em]rule4part2.south west)
{
\textbf
{
5: If
}
译文主语是`
I
'
}
;
\node
[anchor=north west] (rule5part2) at ([yshift=0.5em]rule5.south west)
{
\textbf
{
\hspace
{
0.95em
}
then
}
be动词为`
am/was
'
}
;
\node
[anchor=north west] (rule5) at ([yshift=0.1em]rule4part2.south west)
{
\textbf
{
5: If
}
译文主语是`
`I'
'
}
;
\node
[anchor=north west] (rule5part2) at ([yshift=0.5em]rule5.south west)
{
\textbf
{
\hspace
{
0.95em
}
then
}
be动词为`
`am/was'
'
}
;
\node
[anchor=north west] (rule6) at ([yshift=0.1em]rule5part2.south west)
{
\textbf
{
6: If
}
源语是主谓结构
}
;
\node
[anchor=north west] (rule6part2) at ([yshift=0.5em]rule6.south west)
{
\textbf
{
\hspace
{
0.95em
}
then
}
译文为主谓结构
}
;
\node
[anchor=south west] (rulebaselabel) at (rule1.north west)
{{
\color
{
ublue
}
资源:规则库
}}
;
...
...
Book/Chapter1/Figures/figure-Example-SMT.tex
查看文件 @
aa6c2d09
...
...
@@ -10,12 +10,12 @@
{
\begin{scope}
{
\scriptsize
\node
[anchor=north west] (example1) at (0,0)
{
\textbf
{
1:
}
源=`
他 在 哪 ?
'
}
;
\node
[anchor=north west] (example1part2) at ([yshift=0.2em]example1.south west)
{
\hspace
{
1em
}
译=`
Where is he ?
'
}
;
\node
[anchor=north west] (example2) at ([yshift=0.1em]example1part2.south west)
{
\textbf
{
2:
}
源=`
我 真高兴
'
}
;
\node
[anchor=north west] (example2part2) at ([yshift=0.2em]example2.south west)
{
\hspace
{
1em
}
译=`
I'm so happy
'
}
;
\node
[anchor=north west] (example3) at ([yshift=0.1em]example2part2.south west)
{
\textbf
{
3:
}
源=`
出发 !
'
}
;
\node
[anchor=north west] (example3part2) at ([yshift=0.2em]example3.south west)
{
\hspace
{
1em
}
译=`
Let's go!
'
}
;
\node
[anchor=north west] (example1) at (0,0)
{
\textbf
{
1:
}
源=`
`他 在 哪 ?'
'
}
;
\node
[anchor=north west] (example1part2) at ([yshift=0.2em]example1.south west)
{
\hspace
{
1em
}
译=`
`Where is he ?'
'
}
;
\node
[anchor=north west] (example2) at ([yshift=0.1em]example1part2.south west)
{
\textbf
{
2:
}
源=`
`我 真高兴'
'
}
;
\node
[anchor=north west] (example2part2) at ([yshift=0.2em]example2.south west)
{
\hspace
{
1em
}
译=`
`I'm so happy'
'
}
;
\node
[anchor=north west] (example3) at ([yshift=0.1em]example2part2.south west)
{
\textbf
{
3:
}
源=`
`出发 !'
'
}
;
\node
[anchor=north west] (example3part2) at ([yshift=0.2em]example3.south west)
{
\hspace
{
1em
}
译=`
`Let's go!'
'
}
;
\node
[anchor=north west] (example4) at ([yshift=0.1em]example3part2.south west)
{
\hspace
{
1em
}
...
}
;
\node
[anchor=north west] (example5) at ([yshift=0.1em]example4.south west)
{
\hspace
{
1em
}
\quad
}
;
\node
[anchor=north west] (example6) at ([yshift=0.1em]example5.south west)
{
\hspace
{
1em
}
\quad
}
;
...
...
Book/Chapter1/Figures/figure-zh-sentences-into-en-sentences.tex
查看文件 @
aa6c2d09
...
...
@@ -10,10 +10,10 @@
\begin{scope}
{
{
\footnotesize
\node
[anchor=north west] (example1) at (0,0)
{
\textbf
{
1:
}
源=`
什么 时候 开始 ?
'
}
;
\node
[anchor=north west] (example1part2) at ([yshift=0.5em]example1.south west)
{
\hspace
{
1em
}
译=`
When will it start ?
'
}
;
\node
[anchor=north west] (example2) at ([yshift=0.1em]example1part2.south west)
{
\textbf
{
2:
}
源=`
我 对 他 感到 高兴
'
}
;
\node
[anchor=north west] (example2part2) at ([yshift=0.5em]example2.south west)
{
\hspace
{
1em
}
译=`
I am happy with him
'
}
;
\node
[anchor=north west] (example1) at (0,0)
{
\textbf
{
1:
}
源=`
`什么 时候 开始 ?'
'
}
;
\node
[anchor=north west] (example1part2) at ([yshift=0.5em]example1.south west)
{
\hspace
{
1em
}
译=`
`When will it start ?'
'
}
;
\node
[anchor=north west] (example2) at ([yshift=0.1em]example1part2.south west)
{
\textbf
{
2:
}
源=`
`我 对 他 感到 高兴'
'
}
;
\node
[anchor=north west] (example2part2) at ([yshift=0.5em]example2.south west)
{
\hspace
{
1em
}
译=`
`I am happy with him'
'
}
;
\node
[anchor=north west] (example3) at ([yshift=0.1em]example2part2.south west)
{
\hspace
{
1em
}
...
}
;
\node
[anchor=south west] (examplebaselabel) at (example1.north west)
{{
\color
{
ublue
}
资源1:翻译实例库
}}
;
}
...
...
Book/Chapter1/chapter1.tex
查看文件 @
aa6c2d09
差异被折叠。
点击展开。
Book/Chapter2/Figures/figure-Self-information-function.tex
查看文件 @
aa6c2d09
...
...
@@ -3,11 +3,14 @@
\begin{axis}
[
width=12cm,
height=4cm,
xlabel=
{$
\textrm
{
P
}
(
x
)
$}
,
ylabel=
{$
\textrm
{
I
}
(
x
)
$}
,
ylabel style=
{
yshift=-0.5cm
}
,
xlabel=
{{$
\textrm
{
P
}
(
x
)
$}}
,
ylabel=
{{$
\textrm
{
I
}
(
x
)
$}}
,
ylabel style=
{
yshift=-1.5em,font=
\footnotesize
}
,
xlabel style=
{
yshift=0.3em,font=
\footnotesize
}
,
xtick=
{
0,0.2,...,1.0
}
,
ytick=
{
0,1,2
}
,
y tick style=
{
opacity=0
}
,
x tick style=
{
opacity=0
}
,
domain=0:1,
enlarge x limits=true,
enlarge y limits=
{
upper
}
,
...
...
@@ -15,10 +18,12 @@
xmin=0,
xmax=1,
ymin=0,
ymax=2
ymax=2,
xticklabel style=
{
font=
\small
}
,
%坐标轴字体大小
yticklabel style=
{
font=
\small
}
]
\addplot
[draw=
ublue
,thick]
{
170*exp(-x-4)-1
}
;
\legend
{
$
\textrm
{
I
}
(
x
)
\cdot
\textrm
{
P
}
(
x
)
$
}
\addplot
[draw=
blue!40
,thick]
{
170*exp(-x-4)-1
}
;
\legend
{
\footnotesize
{$
\textrm
{
I
}
(
x
)
\cdot
\textrm
{
P
}
(
x
)
$}
}
\end{axis}
\end{tikzpicture}
...
...
Book/Chapter2/Figures/figure-evaluation-of-probability-for-grammar.tex
查看文件 @
aa6c2d09
...
...
@@ -56,18 +56,18 @@
\node
[rectangle,draw=ublue, inner sep=0.2em]
[fit = (treebanklabel) (t1n1) (t2w1) (t2wn)] (treebank)
{}
;
\end{pgfonlayer}
\node
[anchor=north west] (math1) at ([xshift=2em]treebank.north east)
{
P(
'VP
$
\to
$
VV NN
')
}
;
\node
[anchor=north west] (math1part2) at ([xshift=-1em,yshift=0.2em]math1.south west)
{$
=
\frac
{
\textrm
{
'VP'和'VV NN'同时出现的次数
=
1
}}{
\textrm
{
'VP
'出现的次数
}
=
4
}$}
;
\node
[anchor=north west] (math1) at ([xshift=2em]treebank.north east)
{
P(
``VP
$
\to
$
VV NN'
')
}
;
\node
[anchor=north west] (math1part2) at ([xshift=-1em,yshift=0.2em]math1.south west)
{$
=
\frac
{
\textrm
{
``VP''和``VV NN''同时出现的次数
=
1
}}{
\textrm
{
``VP'
'出现的次数
}
=
4
}$}
;
\node
[anchor=north west] (math1part3) at ([yshift=0.2em]math1part2.south west)
{$
=
\frac
{
1
}{
4
}$}
;
\node
[anchor=north west] (math2) at ([yshift=-6em]math1.north west)
{
P(
'NP
$
\to
$
NN
')
}
;
\node
[anchor=north west] (math2part2) at ([xshift=-1em,yshift=0.2em]math2.south west)
{$
=
\frac
{
\textrm
{
'NP'和'NN'同时出现的次数
=
2
}}{
\textrm
{
'NP
'出现的次数
}
=
3
}$}
;
\node
[anchor=north west] (math2) at ([yshift=-6em]math1.north west)
{
P(
``NP
$
\to
$
NN'
')
}
;
\node
[anchor=north west] (math2part2) at ([xshift=-1em,yshift=0.2em]math2.south west)
{$
=
\frac
{
\textrm
{
``NP''和``NN''同时出现的次数
=
2
}}{
\textrm
{
``NP'
'出现的次数
}
=
3
}$}
;
\node
[anchor=north west] (math2part3) at ([yshift=0.2em]math2part2.south west)
{$
=
\frac
{
2
}{
3
}$}
;
\node
[anchor=north west] (math3) at ([yshift=-6em]math2.north west)
{
P(
'IP
$
\to
$
NP NP
')
}
;
\node
[anchor=north west] (math3part2) at ([xshift=-1em,yshift=0.2em]math3.south west)
{$
=
\frac
{
\textrm
{
'IP'和'NP NP'同时出现的次数
=
0
}}{
\textrm
{
'IP
'出现的次数
}
=
3
}$}
;
\node
[anchor=north west] (math3) at ([yshift=-6em]math2.north west)
{
P(
``IP
$
\to
$
NP NP'
')
}
;
\node
[anchor=north west] (math3part2) at ([xshift=-1em,yshift=0.2em]math3.south west)
{$
=
\frac
{
\textrm
{
``IP''和``NP NP''同时出现的次数
=
0
}}{
\textrm
{
``IP'
'出现的次数
}
=
3
}$}
;
\begin{pgfonlayer}
{
background
}
...
...
Book/Chapter2/Figures/figure-examples-of-Chinese-word-segmentation-based-on-1-gram-model.tex
查看文件 @
aa6c2d09
...
...
@@ -64,7 +64,7 @@
\node
[anchor=west] (label1) at (0,-6.8em)
{
\textbf
{
自动分词系统
}
:对任意的数据句子
$
S
$
,找到最佳的分词结果
$
W
^{
*
}$
输出
}
;
}
{
\node
[anchor=north west] (label2) at (label1.south west)
{
假设输入
$
S
$
=
'确实现在数据很多'
}
;
\node
[anchor=north west] (label2) at (label1.south west)
{
假设输入
$
S
$
=
“确实现在数据很多”
}
;
}
{
\node
[anchor=north west,draw,thick,inner sep=2pt] (data11) at (label2.south west)
{
枚举所有可能的切分
}
;
...
...
@@ -87,12 +87,12 @@
\node
[anchor=north west] (data21) at (data11.south west)
{
确/实现/在/数/据很/多
}
;
}
{
\node
[anchor=north west] (data22) at (data12.south west)
{$
\textrm
{
P
}
(
\textrm
{
'确'
}
)
\cdot
\textrm
{
P
}
(
\textrm
{
'实现'
}
)
\cdot
\textrm
{
P
}
(
\textrm
{
'在'
}
)
\cdot
\textrm
{
P
}
(
\textrm
{
'数'
}
)
\cdot
$}
;
\node
[anchor=north west] (data22) at (data12.south west)
{$
\textrm
{
P
}
(
\textrm
{
“确”
}
)
\cdot
\textrm
{
P
}
(
\textrm
{
“实现”
}
)
\cdot
\textrm
{
P
}
(
\textrm
{
“在”
}
)
\cdot
\textrm
{
P
}
(
\textrm
{
“数”
}
)
\cdot
$}
;
}
\node
[anchor=north west,minimum height=1.6em] (data23) at (data13.south west)
{}
;
\node
[anchor=north west,minimum height=1.6em] (data31) at ([yshift=0.3em]data21.south west)
{}
;
{
\node
[anchor=north west] (data32) at ([yshift=0.3em]data22.south west)
{$
\textrm
{
P
}
(
\textrm
{
'据很'
}
)
\cdot
\textrm
{
P
}
(
\textrm
{
'多'
}
)
=
2
.
13
\times
10
^{
-
45
}$}
;
\node
[anchor=north west] (data32) at ([yshift=0.3em]data22.south west)
{$
\textrm
{
P
}
(
\textrm
{
“据很”
}
)
\cdot
\textrm
{
P
}
(
\textrm
{
“多”
}
)
=
2
.
13
\times
10
^{
-
45
}$}
;
}
\node
[anchor=north west,minimum height=1.6em] (data33) at ([yshift=0.3em]data23.south west)
{}
;
...
...
@@ -100,7 +100,7 @@
\node
[anchor=north west] (data41) at (data31.south west)
{
确实/现在/数据/很多
}
;
}
{
\node
[anchor=north west] (data42) at (data32.south west)
{$
\textrm
{
P
}
(
\textrm
{
'确实'
}
)
\cdot
\textrm
{
P
}
(
\textrm
{
'现在'
}
)
\cdot
\textrm
{
P
}
(
\textrm
{
'数据'
}
)
\cdot
$}
;
\node
[anchor=north west] (data42) at (data32.south west)
{$
\textrm
{
P
}
(
\textrm
{
“确实”
}
)
\cdot
\textrm
{
P
}
(
\textrm
{
“现在”
}
)
\cdot
\textrm
{
P
}
(
\textrm
{
“数据”
}
)
\cdot
$}
;
}
{
\node
[anchor=north west] (data43) at ([yshift=-0.2em]data33.south west)
{
\color
{
red
}{
\textbf
{
输出
}}}
;
...
...
@@ -110,7 +110,7 @@
\node
[anchor=north west] (data51) at (data41.south west)
{
...
}
;
}
{
\node
[anchor=north west] (data52) at ([yshift=0.3em]data42.south west)
{$
\textrm
{
P
}
(
\textrm
{
'很'
}
)
\cdot
\textrm
{
P
}
(
\textrm
{
'多'
}
)
=
1
.
54
\times
10
^{
-
25
}$}
;
\node
[anchor=north west] (data52) at ([yshift=0.3em]data42.south west)
{$
\textrm
{
P
}
(
\textrm
{
“很”
}
)
\cdot
\textrm
{
P
}
(
\textrm
{
“多”
}
)
=
1
.
54
\times
10
^{
-
25
}$}
;
}
\node
[anchor=north west] (data53) at ([yshift=0.3em]data43.south west)
{}
;
}
...
...
Book/Chapter2/Figures/figure-full-probability-word-segmentation-3.tex
查看文件 @
aa6c2d09
...
...
@@ -11,16 +11,16 @@
\begin{tikzpicture}
\node
[anchor=west] (label1) at (0,0)
{
总词数:
$
6
+
8
+
5
=
20
$}
;
\node
[anchor=north west] (p1) at (label1.south west)
{$
\textrm
{
P
}
(
'
\textrm
{
很
}
'
)=
1
/
20
=
0
.
05
$}
;
\node
[anchor=north west] (p2) at (p1.south west)
{$
\textrm
{
P
}
(
'
\textrm
{
。
}
'
)=
3
/
20
=
0
.
15
$}
;
\node
[anchor=north west] (p3) at (p2.south west)
{$
\textrm
{
P
}
(
'
\textrm
{
确实
}
'
)=
1
/
20
=
0
.
05
$}
;
\node
[anchor=north west] (p1) at (label1.south west)
{$
\textrm
{
P
}
(
\textrm
{
“很”
}
)=
1
/
20
=
0
.
05
$}
;
\node
[anchor=north west] (p2) at (p1.south west)
{$
\textrm
{
P
}
(
\textrm
{
“。”
}
)=
3
/
20
=
0
.
15
$}
;
\node
[anchor=north west] (p3) at (p2.south west)
{$
\textrm
{
P
}
(
\textrm
{
“确实”
}
)=
1
/
20
=
0
.
05
$}
;
\node
[anchor=north west] (label11) at ([xshift=18.0em,yshift=1.63em]label1.south west)
{
更多数据-总词数:100K
$
\sim
$
1M
}
;
\node
[anchor=north west] (p12) at (label11.south west)
{$
\textrm
{
P
}
(
'
\textrm
{
很
}
'
)=
0
.
000010
$}
;
\node
[anchor=north west] (p22) at (p12.south west)
{$
\textrm
{
P
}
(
'
\textrm
{
。
}
'
)=
0
.
001812
$}
;
\node
[anchor=north west] (p32) at (p22.south west)
{$
\textrm
{
P
}
(
'
\textrm
{
确实
}
'
)=
0
.
000001
$}
;
\node
[anchor=north west] (p12) at (label11.south west)
{$
\textrm
{
P
}
(
\textrm
{
“很”
}
)=
0
.
000010
$}
;
\node
[anchor=north west] (p22) at (p12.south west)
{$
\textrm
{
P
}
(
\textrm
{
“。”
}
)=
0
.
001812
$}
;
\node
[anchor=north west] (p32) at (p22.south west)
{$
\textrm
{
P
}
(
\textrm
{
“确实”
}
)=
0
.
000001
$}
;
...
...
Book/Chapter2/Figures/figure-no-smoothing&smoothed-probability-distributions.tex
查看文件 @
aa6c2d09
\definecolor
{
ublue
}{
rgb
}{
0.152,0.250,0.545
}
\begin{tikzpicture}
\begin{axis}
[
%align=left,
%axis x line=right,
%axis y line=left,
width=12cm, height=5.5cm,
\begin{tikzpicture}
%画图中的属性如xshift应该是通用的,前面的关键字如xlabel规定了修改的部分
\begin{axis}
[
width=10cm, height=4.5cm,
symbolic x coords=
{
未抽取词,do,want,what,am,people,look
}
,
%自定义x坐标
%enlargelimits=0.15,%x轴移动
xtick=data,
%自定义x坐标
ytick=
{
0,0.05,0.1,0.15,0.2,0.25
}
,
xlabel=
{
低概率词汇
}
,
ylabel=
{
词汇概率
}
,
legend pos=outer north east,
%图标位置
xlabel style=
{
align=right,xshift=6.5cm,yshift=1cm
}
,
ylabel style=
{
rotate=-90,yshift=3cm,xshift=1cm
}
,
y tick label style=
{
/pgf/number format/.cd,fixed,precision=2
}
,
%y轴精度,不用科学表示
xlabel style=
{
align=right,xshift=5.3cm,yshift=0.8cm,font=
\footnotesize
}
,
ylabel style=
{
rotate=-90,yshift=2cm,xshift=1cm,font=
\footnotesize
}
,
y tick style=
{
opacity=0
}
,
%隐藏y轴刻度线
x tick style=
{
opacity=0
}
,
%隐藏x轴刻度线
x tick label style=
{
anchor=base,font=
\footnotesize
,yshift=-0.5cm
}
,
y tick label style=
{
font=
\footnotesize
,/pgf/number format/.cd,fixed,precision=2
}
,
%y轴精度,不用科学表示
y axis line style=
{
opacity=0
}
,
%隐藏y轴
tick align=inside,
%原本的横行线
ymajorgrids,
%显示横行线
axis x line*=bottom,
%显示汉字
%legend entries={未平滑,平滑后},%右上图标
%clip=false,%不剪切
%xlabel shift=5cm,%整体右移
ymajorgrids,
%显示横行网格线
axis x line*=bottom,
%显示x轴坐标汉字(应该是对齐)
major grid style=
{
dotted,draw=ublue
}
,
%横行线颜色
%enlarge x limits=true,%扩展x轴长度,防止边缘bar部分显示
axis on top,
%网格线位于顶层
legend style=
{
anchor=north west
}
,
%图标格式
legend style=
{
anchor=north west
,font=
\footnotesize
}
,
%图标格式
ymin=0,
ymax=0.25]
\addplot
[ybar,bar shift=-
3mm,bar width=6mm,fill=blue!20,draw=blue!20,area legend] coordinates
{
(未抽取词,0) (do,0.05) (want,0.05) (what,0.05) (am,0.1) (people,0.15) (look,0.2)
}
;
\addplot
[ybar,bar shift=
3mm,bar width=6mm,fill=red!20,draw=blue!2
0,area legend] coordinates
{
(未抽取词,0.03) (do,0.062) (want,0.062) (what,0.062) (am,0.09) (people,0.122) (look,0.138)
}
;
\legend
{
未平滑,平滑后
}
\addplot
[ybar,bar shift=-
2mm,bar width=4mm,fill=blue!40,draw=blue!40,area legend] coordinates
{
(未抽取词,0) (do,0.05) (want,0.05) (what,0.05) (am,0.1) (people,0.15) (look,0.2)
}
;
%area legend图例显示长方形颜色
\addplot
[ybar,bar shift=
2mm,bar width=4mm,fill=red!40,draw=red!4
0,area legend] coordinates
{
(未抽取词,0.03) (do,0.062) (want,0.062) (what,0.062) (am,0.09) (people,0.122) (look,0.138)
}
;
\legend
{
未平滑,平滑后
}
%右上图例
\end{axis}
\end{tikzpicture}
%---------------------------------------------------------------------
\ No newline at end of file
%---------------------------------------------------------------------
Book/Chapter2/Figures/figure-the-dice-game-model.tex
查看文件 @
aa6c2d09
...
...
@@ -13,7 +13,7 @@
\node
[draw,thick,minimum size=10pt] at (
\i
,0)
{
1
}
;
}
}
\node
[anchor=west] at (33em,0)
{$
\textrm
{
P
}
(
"
1
"
)
=
5
/
30
$}
;
\node
[anchor=west] at (33em,0)
{$
\textrm
{
P
}
(
\text
{
“
1
”
}
)
=
5
/
30
$}
;
\end{scope}
\begin{scope}
[yshift=-2.5em]
...
...
@@ -22,7 +22,7 @@
\node
[draw,thick,minimum size=10pt] at (
\i
,0)
{{
\color
{
red
}
2
}}
;
}
}
\node
[anchor=west] at (33em,0)
{$
\textrm
{
P
}
(
"
2
"
)
=
4
/
30
$}
;
\node
[anchor=west] at (33em,0)
{$
\textrm
{
P
}
(
\text
{
“
2
”
}
)
=
4
/
30
$}
;
\end{scope}
\begin{scope}
[yshift=-5.0em]
...
...
@@ -31,7 +31,7 @@
\node
[draw,thick,minimum size=10pt] at (
\i
,0)
{{
\color
{
ublue
}
3
}}
;
}
}
\node
[anchor=west] at (33em,0)
{$
\textrm
{
P
}
(
"
3
"
)
=
6
/
30
$}
;
\node
[anchor=west] at (33em,0)
{$
\textrm
{
P
}
(
\text
{
“
3
”
}
)
=
6
/
30
$}
;
\end{scope}
\begin{scope}
[yshift=-7.5em]
...
...
@@ -40,7 +40,7 @@
\node
[draw,thick,minimum size=10pt] at (
\i
,0)
{{
\color
{
ugreen
}
4
}}
;
}
}
\node
[anchor=west] at (33em,0)
{$
\textrm
{
P
}
(
"
4
"
)
=
12
/
30
$}
;
\node
[anchor=west] at (33em,0)
{$
\textrm
{
P
}
(
\text
{
“
4
”
}
)
=
12
/
30
$}
;
\end{scope}
\begin{scope}
[yshift=-10.0em]
...
...
@@ -49,7 +49,7 @@
\node
[draw,thick,minimum size=10pt] at (
\i
,0)
{{
\color
{
purple
}
5
}}
;
}
}
\node
[anchor=west] at (33em,0)
{$
\textrm
{
P
}
(
"
5
"
)
=
2
/
30
$}
;
\node
[anchor=west] at (33em,0)
{$
\textrm
{
P
}
(
\text
{
“
5
”
}
)
=
2
/
30
$}
;
\end{scope}
\begin{scope}
[yshift=-12.5em]
...
...
@@ -58,7 +58,7 @@
\node
[draw,thick,minimum size=10pt] at (
\i
,0)
{{
\color
{
orange
}
6
}}
;
}
}
\node
[anchor=west] at (33em,0)
{$
\textrm
{
P
}
(
"
6
"
)
=
1
/
30
$}
;
\node
[anchor=west] at (33em,0)
{$
\textrm
{
P
}
(
\text
{
“
6
”
}
)
=
1
/
30
$}
;
\end{scope}
\end{tikzpicture}
...
...
Book/Chapter2/Figures/figure-word-frequency-distribution.tex
查看文件 @
aa6c2d09
\definecolor
{
ublue
}{
rgb
}{
0.152,0.250,0.545
}
\begin{tikzpicture}
\begin{axis}
[
width=1
2
cm,
height=
6
cm,
xlabel=
{
某语料中的词汇
}
,
width=1
1
cm,
height=
5.5
cm,
xlabel=
{
某语料
库
中的词汇
}
,
ylabel=
{
词汇出现总次数
}
,
xlabel style=
{
xshift=6.
6cm,yshift=1cm
}
,
ylabel style=
{
rotate=-90,yshift=
3cm,xshift=1.2cm
}
,
xlabel style=
{
xshift=6.
2cm,yshift=0.8cm,font=
\footnotesize
}
,
ylabel style=
{
rotate=-90,yshift=
2.8cm,xshift=1.2cm,font=
\footnotesize
}
,
xticklabel style=
{
opacity=0
}
,
ytick=
{
0,10000000,20000000,30000000,40000000,50000000,60000000
}
,
tick align=inside,
y tick style=
{
opacity=0
}
,
x tick style=
{
opacity=0
}
,
y axis line style=
{
opacity=0
}
,
ymajorgrids,
major grid style=
{
dotted,draw=ublue
}
,
ybar,
bar width=0.5,
axis x line*=bottom,
xmin=0,
xmax=105,
ymin=0,
ymax=60000000
]
\addplot
[fill=blue!
20,draw=ublue
]
coordinates
{
(1,56000000)
\addplot
[fill=blue!
40,draw=blue!40
]
coordinates
{
(1,56000000)
(2,34000000)
(3,30000000)
(4,27000000)
...
...
@@ -127,4 +130,41 @@
\end{axis}
\end{tikzpicture}
%---------------------------------------------------------------------
\ No newline at end of file
%---------------------------------------------------------------------
Book/Chapter2/chapter2.tex
查看文件 @
aa6c2d09
差异被折叠。
点击展开。
Book/Chapter3/Chapter3.tex
查看文件 @
aa6c2d09
差异被折叠。
点击展开。
Book/Chapter3/Figures/figure-a-more-detailed-explanation-of-formula-3.40.tex
查看文件 @
aa6c2d09
...
...
@@ -44,7 +44,7 @@
}
{
\draw
[decorate,thick,decoration={brace,amplitude=5pt,mirror}]
([yshift=-0.2em]eq5.south west) -- ([yshift=-0.2em]eq6.south east) node [pos=0.4,below,xshift=-0.0em,yshift=-0.3em] (expcount1)
{
\footnotesize
{
\textbf
{
'
$
t
_
v
$
翻译为
$
s
_
u
$
'这个事件
}}}
;
\draw
[decorate,thick,decoration={brace,amplitude=5pt,mirror}]
([yshift=-0.2em]eq5.south west) -- ([yshift=-0.2em]eq6.south east) node [pos=0.4,below,xshift=-0.0em,yshift=-0.3em] (expcount1)
{
\footnotesize
{
\textbf
{
``
$
t
_
v
$
翻译为
$
s
_
u
$
'
'这个事件
}}}
;
\node
[anchor=north west] (expcount2) at ([yshift=0.5em]expcount1.south west)
{
\footnotesize
{
\textbf
{
出现次数的期望的估计
}}}
;
\node
[anchor=north west] (expcount3) at ([yshift=0.5em]expcount2.south west)
{
\footnotesize
{
\textbf
{
称之为期望频次expected count
}}}
;
}
...
...
Book/mt-book.tex
查看文件 @
aa6c2d09
...
...
@@ -55,7 +55,7 @@
\IfFileExists
{
C:/WINDOWS/win.ini
}
{
\newcommand
{
\mycfont
}{
song
}}
{
\newcommand
{
\mycfont
}{
gbsn
}}
%
{\newcommand{\mycfont}{gbsn}}
\begin{CJK}
{
UTF8
}{
\mycfont
}
\end{CJK}
...
...
编写
预览
Markdown
格式
0%
重试
或
添加新文件
添加附件
取消
您添加了
0
人
到此讨论。请谨慎行事。
请先完成此评论的编辑!
取消
请
注册
或者
登录
后发表评论