Merge pull request #4262 from ChrisLeeGit/master

完成翻译:Learn How to Use Awk Special Patterns begin and end
This commit is contained in:
Chang Liu 2016-08-02 10:01:42 +08:00 committed by GitHub
commit 3c299763a8
2 changed files with 166 additions and 168 deletions

View File

@ -1,168 +0,0 @@
Being translated by ChrisLeeGit
Learn How to Use Awk Special Patterns BEGIN and END Part 9
===============================================================
In Part 8 of this Awk series, we introduced some powerful Awk command features, that is variables, numeric expressions and assignment operators.
As we advance, in this segment, we shall cover more Awk features, and that is the special patterns: `BEGIN` and `END`.
![](http://www.tecmint.com/wp-content/uploads/2016/07/Learn-Awk-Patterns-BEGIN-and-END.png)
>Learn Awk Patterns BEGIN and END
These special features will prove helpful as we try to expand on and explore more methods of building complex Awk operations.
To get started, let us drive our thoughts back to the introduction of the Awk series, remember when we started this series, I pointed out that the general syntax of a running an Awk command is:
```
# awk 'script' filenames
```
And in the syntax above, the Awk script has the form:
```
/pattern/ { actions }
```
When you consider the pattern in the script, it is normally a regular expression, additionally, you can also think of pattern as special patterns `BEGIN` and `END`. Therefore, we can also write an Awk command in the form below:
```
awk '
BEGIN { actions }
/pattern/ { actions }
/pattern/ { actions }
……….
END { actions }
' filenames
```
In the event that you use the special patterns: `BEGIN` and `END` in an Awk script, this is what each of them means:
- `BEGIN` pattern: means that Awk will execute the action(s) specified in `BEGIN` once before any input lines are read.
- `END` pattern: means that Awk will execute the action(s) specified in `END` before it actually exits.
And the flow of execution of the an Awk command script which contains these special patterns is as follows:
- When the `BEGIN` pattern is used in a script, all the actions for `BEGIN` are executed once before any input line is read.
- Then an input line is read and parsed into the different fields.
- Next, each of the non-special patterns specified is compared with the input line for a match, when a match is found, the action(s) for that pattern are then executed. This stage will be repeated for all the patterns you have specified.
- Next, stage 2 and 3 are repeated for all input lines.
- When all input lines have been read and dealt with, in case you specify the `END` pattern, the action(s) will be executed.
You should always remember this sequence of execution when working with the special patterns to achieve the best results in an Awk operation.
To understand it all, let us illustrate using the example from part 8, about the list of domains owned by Tecmint, as stored in a file named domains.txt.
```
news.tecmint.com
tecmint.com
linuxsay.com
windows.tecmint.com
tecmint.com
news.tecmint.com
tecmint.com
linuxsay.com
tecmint.com
news.tecmint.com
tecmint.com
linuxsay.com
windows.tecmint.com
tecmint.com
```
```
$ cat ~/domains.txt
```
![](http://www.tecmint.com/wp-content/uploads/2016/07/View-Contents-of-File.png)
>View Contents of File
In this example, we want to count the number of times the domain `tecmint.com` is listed in the file domains.txt. So we wrote a small shell script to help us do that using the idea of variables, numeric expressions and assignment operators which has the following content:
```
#!/bin/bash
for file in $@; do
if [ -f $file ] ; then
#print out filename
echo "File is: $file"
#print a number incrementally for every line containing tecmint.com
awk '/^tecmint.com/ { counter+=1 ; printf "%s\n", counter ; }' $file
else
#print error info incase input is not a file
echo "$file is not a file, please specify a file." >&2 && exit 1
fi
done
#terminate script with exit code 0 in case of successful execution
exit 0
```
Let us now employ the two special patterns: `BEGIN` and `END` in the Awk command in the script above as follows:
We shall alter the script:
```
awk '/^tecmint.com/ { counter+=1 ; printf "%s\n", counter ; }' $file
```
To:
```
awk ' BEGIN { print "The number of times tecmint.com appears in the file is:" ; }
/^tecmint.com/ { counter+=1 ; }
END { printf "%s\n", counter ; }
' $file
```
After making the changes to the Awk command, the complete shell script now looks like this:
```
#!/bin/bash
for file in $@; do
if [ -f $file ] ; then
#print out filename
echo "File is: $file"
#print the total number of times tecmint.com appears in the file
awk ' BEGIN { print "The number of times tecmint.com appears in the file is:" ; }
/^tecmint.com/ { counter+=1 ; }
END { printf "%s\n", counter ; }
' $file
else
#print error info incase input is not a file
echo "$file is not a file, please specify a file." >&2 && exit 1
fi
done
#terminate script with exit code 0 in case of successful execution
exit 0
```
![](http://www.tecmint.com/wp-content/uploads/2016/07/Awk-BEGIN-and-END-Patterns.png)
>Awk BEGIN and END Patterns
When we run the script above, it will first of all print the location of the file domains.txt, then the Awk command script is executed, where the `BEGIN` special pattern helps us print out the message “`The number of times tecmint.com appears in the file is:`” before any input lines are read from the file.
Then our pattern, `/^tecmint.com/` is compared against every input line and the action, `{ counter+=1 ; }` is executed for each input line, which counts the number of times `tecmint.com` appears in the file.
Finally, the `END` pattern will print the total the number of times the domain `tecmint.com` appears in the file.
```
$ ./script.sh ~/domains.txt
```
![](http://www.tecmint.com/wp-content/uploads/2016/07/Script-to-Count-Number-of-Times-String-Appears.png)
>Script to Count Number of Times String Appears
To conclude, we walked through more Awk features exploring on the concepts of special pattern: `BEGIN` and `END`.
As I pointed out before, these Awk features will help us build more complex text filtering operations, there is more to cover under Awk features and in part 10, we shall approach the idea of Awk built-in variables, so stay connected.
--------------------------------------------------------------------------------
via: http://www.tecmint.com/learn-use-awk-special-patterns-begin-and-end/
作者:[Aaron Kili][a]
译者:[ChrisLeeGit](https://github.com/chrisleegit)
校对:[校对ID](https://github.com/校对ID)
本文由 [LCTT](https://github.com/LCTT/TranslateProject) 原创编译,[Linux中国](https://linux.cn/) 荣誉推出
[a]: http://www.tecmint.com/author/aaronkili/

View File

@ -0,0 +1,166 @@
awk 系列:如何使用 awk 的特殊模式 BEGIN 和 END
===============================================================
在 awk 系列的第八节,我们介绍了一些强大的 awk 命令功能,它们是变量、数字表达式和赋值运算符。
本节我们将学习更多的 awk 功能,即 awk 的特殊模式:`BEGIN` 和 `END`
![](http://www.tecmint.com/wp-content/uploads/2016/07/Learn-Awk-Patterns-BEGIN-and-END.png)
> 学习 awk 的模式 BEGIN 和 END
随着我们逐渐展开,并探索出更多构建复杂 awk 操作的方法,将会证明 awk 的这些特殊功能的是多么强大。
开始前,先让我们回顾一下 awk 系列的介绍,记得当我们开始这个系列时,我就指出 awk 指令的通用语法是这样的:
```
# awk 'script' filenames
```
在上述语法中awk 脚本拥有这样的形式:
```
/pattern/ { actions }
```
当你看脚本中的模式(`/pattern`)时,你会发现它通常是一个正则表达式,此外,你也可以将模式(`/pattern`)当成特殊模式 `BEGIN``END`
因此,我们也能按照下面的形式编写一条 awk 命令:
```
awk '
BEGIN { actions }
/pattern/ { actions }
/pattern/ { actions }
……….
END { actions }
' filenames
```
假如你在 awk 脚本中使用了特殊模式:`BEGIN` 和 `END`,以下则是它们对应的含义:
- `BEGIN` 模式:是指 awk 将在读取任何输入行之前立即执行 `BEGIN` 中指定的动作。
- `END` 模式:是指 awk 将在它正式退出前执行 `END` 中指定的动作。
含有这些特殊模式的 awk 命令脚本的执行流程如下:
- 当在脚本中使用了 `BEGIN` 模式,则 `BEGIN` 中所有的动作都会在读取任何输入行之前执行。
- 然后,读入一个输入行并解析成不同的段。
- 接下来,每一条指定的非特殊模式都会和输入行进行比较匹配,当匹配成功后,就会执行模式对应的动作。对所有你指定的模式重复此执行该步骤。
- 再接下来,对于所有输入行重复执行步骤 2 和 步骤 3。
- 当读取并处理完所有输入行后,假如你指定了 `END` 模式,那么将会执行相应的动作。
当你使用特殊模式时,想要在 awk 操作中获得最好的结果,你应当记住上面的执行顺序。
为了便于理解,让我们使用第八节的例子进行演示,那个例子是关于 Tecmint 拥有的域名列表,并保存在一个叫做 domains.txt 的文件中。
```
news.tecmint.com
tecmint.com
linuxsay.com
windows.tecmint.com
tecmint.com
news.tecmint.com
tecmint.com
linuxsay.com
tecmint.com
news.tecmint.com
tecmint.com
linuxsay.com
windows.tecmint.com
tecmint.com
```
```
$ cat ~/domains.txt
```
![](http://www.tecmint.com/wp-content/uploads/2016/07/View-Contents-of-File.png)
> 查看文件内容
在这个例子中,我们希望统计出 domains.txt 文件中域名 `tecmint.com` 出现的次数。所以,我们编写了一个简单的 shell 脚本帮助我们完成任务,它使用了变量、数学表达式和赋值运算符的思想,脚本内容如下:
```
#!/bin/bash
for file in $@; do
if [ -f $file ] ; then
# 输出文件名
echo "File is: $file"
# 输出一个递增的数字记录包含 tecmint.com 的行数
awk '/^tecmint.com/ { counter+=1 ; printf "%s\n", counter ; }' $file
else
# 若输入不是文件,则输出错误信息
echo "$file 不是一个文件,请指定一个文件。" >&2 && exit 1
fi
done
# 成功执行后使用退出代码 0 终止脚本
exit 0
```
现在让我们像下面这样在上述脚本的 awk 命令中应用这两个特殊模式:`BEGIN` 和 `END`
我们应当把脚本:
```
awk '/^tecmint.com/ { counter+=1 ; printf "%s\n", counter ; }' $file
```
改成:
```
awk ' BEGIN { print "文件中出现 tecmint.com 的次数是:" ; }
/^tecmint.com/ { counter+=1 ; }
END { printf "%s\n", counter ; }
' $file
```
在修改了 awk 命令之后,现在完整的 shell 脚本就像下面这样:
```
#!/bin/bash
for file in $@; do
if [ -f $file ] ; then
# 输出文件名
echo "File is: $file"
# 输出文件中 tecmint.com 出现的总次数
awk ' BEGIN { print "文件中出现 tecmint.com 的次数是:" ; }
/^tecmint.com/ { counter+=1 ; }
END { printf "%s\n", counter ; }
' $file
else
# 若输入不是文件,则输出错误信息
echo "$file 不是一个文件,请指定一个文件。" >&2 && exit 1
fi
done
# 成功执行后使用退出代码 0 终止脚本
exit 0
```
![](http://www.tecmint.com/wp-content/uploads/2016/07/Awk-BEGIN-and-END-Patterns.png)
> awk 模式 BEGIN 和 END
当我们运行上面的脚本时,它会首先输出 domains.txt 文件的位置,然后执行 awk 命令脚本,该命令脚本中的特殊模式 `BEGIN` 将会在从文件读取任何行之前帮助我们输出这样的消息“`文件中出现 tecmint.com 的次数是:`”。
接下来,我们的模式 `/^tecmint.com/` 会在每个输入行中进行比较,对应的动作 `{ counter+=1 ; }` 会在每个匹配成功的行上执行,它会统计出 `tecmint.com` 在文件中出现的次数。
最终,`END` 模式将会输出域名 `tecmint.com` 在文件中出现的总次数。
```
$ ./script.sh ~/domains.txt
```
![](http://www.tecmint.com/wp-content/uploads/2016/07/Script-to-Count-Number-of-Times-String-Appears.png)
> 用于统计字符串出现次数的脚本
最后总结一下,我们在本节中演示了更多的 awk 功能,并学习了特殊模式 `BEGIN``END` 的概念。
正如我之前所言,这些 awk 功能将会帮助我们构建出更复杂的文本过滤操作。第十节将会给出更多的 awk 功能,我们将会学习 awk 内置变量的思想,所以,请继续保持关注。
--------------------------------------------------------------------------------
via: http://www.tecmint.com/learn-use-awk-special-patterns-begin-and-end/
作者:[Aaron Kili][a]
译者:[ChrisLeeGit](https://github.com/chrisleegit)
校对:[校对ID](https://github.com/校对ID)
本文由 [LCTT](https://github.com/LCTT/TranslateProject) 原创编译,[Linux中国](https://linux.cn/) 荣誉推出
[a]: http://www.tecmint.com/author/aaronkili/