Merge pull request #8393 from MjSeven/master

20180102 The Uniq Command Tutorial With Examples For Beginners.md 翻译完毕
This commit is contained in:
Xingyu.Wang 2018-04-08 09:25:19 +08:00 committed by GitHub
commit 282d75a18e
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23
2 changed files with 158 additions and 163 deletions

View File

@ -1,163 +0,0 @@
Translating by MjSeven
The Uniq Command Tutorial With Examples For Beginners
======
If you're working mostly on command line and dealing with a lot of text files every day, you should be aware of **Uniq** command. This command helps you to find repeated/duplicate lines from a file easily. It is not just for finding duplicates, but also we can use uniq command to remove the duplicates, display the number of occurrences of the duplicate lines, display only the repeated lines and display only the unique lines etc. Since the uniq command is part of GNU coreutils package, it comes preinstalled in most Linux distributions. Let us not bother with installation and see some practical examples.
Please note that the 'uniq' command will not detect repeated lines unless they are adjacent. So, you might need to sort them first or combine the sort command with uniq to get the results. Allow me to show you some examples.
First, let us create a file with some duplicate lines.
```
vi ostechnix.txt
```
```
welcome to ostechnix
welcome to ostechnix
Linus is the creator of Linux.
Linux is secure by default
Linus is the creator of Linux.
Top 500 super computers are powered by Linux
```
As you see in the above file, we have few repeated lines (the first, second, third, and fifth lines are duplicates).
### **1\. Remove consecutive duplicate lines in a file using Uniq command**
If you use 'uniq' command without any arguments, it will remove all consecutive duplicate lines and display only the unique lines.
```
uniq ostechnix.txt
```
Sample output would be:
[![][1]][2]
As you can see, uniq command removed all consecutive duplicate lines in the given file. You might also have noticed that the above output still has the duplicates in second and fourth lines. It is because the uniq command will only omit the repeated lines only if they are adjacent. We can, of course, remove that non-consecutive duplicates too. Look at the second example below.
### **2\. Remove all duplicate lines**
```
sort ostechnix.txt | uniq
```
Sample output would be:
[![][1]][3]
See? There are no duplicates or repeated lines. In other words, the above command will display each line once from file ostechnix.txt. We used the sort command in conjunction with uniq, because, as I already mentioned, uniq will not find the duplicate/repeated lines unless they are adjacent.
### **3\. Display only unique lines from a file**
To display only the unique lines from a file, the command would be:
```
sort ostechnix.txt | uniq -u
```
Sample output:
```
Linux is secure by default
Top 500 super computers are powered by Linux
```
As you can see, we have only two unique lines in the given file.
### **4\. Display only duplicate lines**
Similarly, we can also display duplicates lines from a file like below.
```
sort ostechnix.txt | uniq -d
```
Sample output:
```
Linus is the creator of Linux.
welcome to ostechnix
```
These two are the repeated/duplicated lines in ostechnix.txt file. Please note that -d (small d) will **only print duplicate lines** , **one for each group**. To print **all duplicate lines** , use -D (capital d) like below.
```
sort ostechnix.txt | uniq -D
```
See the difference between both flags in the below screenshot.
[![][1]][4]
### **5\. Display number of occurrences of each line in a file
**
For some reason, you might want to check how many times a line is repeated in the given file. To do so, use **-c** flag like below.
```
sort ostechnix.txt | uniq -c
```
**Sample output:**
```
2 Linus is the creator of Linux.
1 Linux is secure by default
1 Top 500 super computers are powered by Linux
2 welcome to ostechnix
```
We can also display number of occurrences of each line along with that line, sorted by the most frequent like below.
```
sort ostechnix.txt | uniq -c | sort -nr
```
Sample output:
```
2 welcome to ostechnix
2 Linus is the creator of Linux.
1 Top 500 super computers are powered by Linux
1 Linux is secure by default
```
### 6\. Limit the comparison to 'N' characters
We can limit the comparison to a particular number of characters of lines in a file using **-w** flag. For example, let us limit the comparison to first 4 characters of lines in a file and display the repeated lines as shown below.
```
uniq -d -w 4 ostechnix.txt
```
### 7\. Avoid the comparison with the first 'N' characters
Like limit comparison to N characters of lines in a file, we can also avoid comparing the first N characters using -s flag.
The following command will avoid the comparison with the first 4 characters of lines in a file:
```
uniq -d -s 4 ostechnix.txt
```
To avoid comparing the first N fields instead of characters, use '-f' flag in the above command.
For more details, refer the help section;
```
uniq --help
```
and man pages.
```
man uniq
```
And, that's all for today! I hope you now get a basic idea about uniq command and its purpose. If you find our guides useful, please share them on your social, professional networks and support OSTechNix. More good stuffs to come. Stay tuned!
Cheers!
--------------------------------------------------------------------------------
via: https://www.ostechnix.com/uniq-command-tutorial-examples-beginners/
作者:[][a]
译者:[译者ID](https://github.com/译者ID)
校对:[校对者ID](https://github.com/校对者ID)
本文由 [LCTT](https://github.com/LCTT/TranslateProject) 原创编译,[Linux中国](https://linux.cn/) 荣誉推出
[a]:https://www.ostechnix.com
[1]:
[2]:http://www.ostechnix.com/wp-content/uploads/2018/01/uniq-2.png ()
[3]:http://www.ostechnix.com/wp-content/uploads/2018/01/uniq-1-1.png ()
[4]:http://www.ostechnix.com/wp-content/uploads/2018/01/uniq-4.png ()

View File

@ -0,0 +1,158 @@
为初学者提供的 Uniq 命令教程及示例
=====
如果你主要是在命令行上工作,并且每天处理大量的文本文件,那么你应该了解下 **Uniq** 命令。该命令会帮助你轻松地从文件中找到重复的行。它不仅用于查找重复项,而且我们还可以使用它来删除重复项,显示重复项的出现次数,只显示重复的行,只显示唯一的行等。由于 uniq 命令是 GNU coreutils 包的一部分,所以它预装在大多数 Linux 发行版中,让我们不需要费心安装。来看一些实际的例子。
请注意,除非重复行是相邻的,否则 'uniq' 不会删除它们。因此,你可能需要先对它们进行排序,或将排序命令与 uniq 组合以获得结果。让我给你看一些例子。
首先,让我们创建一个带有一些重复行的文件:
```
vi ostechnix.txt
```
```
welcome to ostechnix
welcome to ostechnix
Linus is the creator of Linux.
Linux is secure by default
Linus is the creator of Linux.
Top 500 super computers are powered by Linux
```
正如你在上面的文件中看到的,我们有一些重复的行(第一行和第二行,第三行和第五行是重复的)。
### **1\. 使用 Uniq 命令删除文件中的连续重复行**
如果你在不使用任何参数的情况下使用 'uniq' 命令,它将删除所有连续的重复行,只显示唯一的行。
```
uniq ostechnix.txt
```
示例输出:
![][2]
如你所见, uniq 命令删除了给定文件中的所有连续重复行。你可能还注意到,上面的输出仍然有第二行和第四行重复了。这是因为 uniq 命令只有在相邻的情况下才会删除重复的行,当然,我们也可以删除非连续的重复行。请看下面的第二个例子。
### **2\. 删除所有重复的行**
```
sort ostechnix.txt | uniq
```
示例输出:
![][3]
看到了吗?没有重复的行。换句话说,上面的命令将显示在 ostechnix.txt 中只出现一次的行。我们使用 sort 命令与 uniq 命令结合,因为,就像我提到的,除非重复行是相邻的,否则 uniq 不会删除它们。
### **3\. 只显示文件中唯一的一行**
为了只显示文件中唯一的一行,可以这样做:
```
sort ostechnix.txt | uniq -u
```
示例输出:
```
Linux is secure by default
Top 500 super computers are powered by Linux
```
如你所见,在给定的文件中只有两行是唯一的。
### **4\. 只显示重复的行**
同样的,我们也可以显示文件中重复的行就像下面这样:
```
sort ostechnix.txt | uniq -d
```
示例输出:
```
Linus is the creator of Linux.
welcome to ostechnix
```
这两行在 ostechnix.txt 文件中是重复的行。请注意 -d(小写 d) 将会**只打印重复的行****每组有一个**。打印**所有重复的行**,使用 -D(大写 D),如下所示:
```
sort ostechnix.txt | uniq -D
```
在下面的截图中看两个选项的区别:
![][4]
### **5\. 显示文件中每一行的出现次数**
由于某种原因,你可能想要检查给定文件中每一行重复出现的次数。要做到这一点,使用 **-c** 选项,如下所示:
```
sort ostechnix.txt | uniq -c
```
**示例输出:**
```
2 Linus is the creator of Linux.
1 Linux is secure by default
1 Top 500 super computers are powered by Linux
2 welcome to ostechnix
```
我们还可以按照每一行的出现次数进行排序,然后显示,如下所示:
```
sort ostechnix.txt | uniq -c | sort -nr
```
示例输出:
```
2 welcome to ostechnix
2 Linus is the creator of Linux.
1 Top 500 super computers are powered by Linux
1 Linux is secure by default
```
### 6\. 将比较限制为 'N' 个字符
我们可以使用 -w 选项来限制对文件中特定数量字符的比较。例如,让我们比较文件中的前四个字符,并显示重复行,如下所示:
```
uniq -d -w 4 ostechnix.txt
```
### 7\. 忽略比较指定的 'N' 个字符
像对文件中行的 N 个字符进行限制比较一样,我们也可以使用 -s 选项来忽略比较前 N 个字符。
下面的命令将忽略在文件中每行的前四个字符进行比较:
```
uniq -d -s 4 ostechnix.txt
```
为了忽略比较前 N 字段(译者注:即前几列)而不是字符,在上面的命令中使用 '-f' 选项。
欲了解更多详情,请参考帮助部分:
```
uniq --help
```
也可以使用 man 命令查看:
```
man uniq
```
今天就到这里!我希望你现在对 uniq 命令及其目的有一个基本的了解。如果你发现我们的指南有用,请在你的社交网络上分享,并继续支持 OSTechNix。更多好东西要来了请继续关注
干杯!
--------------------------------------------------------------------------------
via: https://www.ostechnix.com/uniq-command-tutorial-examples-beginners/
作者:[][a]
译者:[MjSeven](https://github.com/MjSeven)
校对:[校对者ID](https://github.com/校对者ID)
本文由 [LCTT](https://github.com/LCTT/TranslateProject) 原创编译,[Linux中国](https://linux.cn/) 荣誉推出
[a]:https://www.ostechnix.com
[1]:
[2]:http://www.ostechnix.com/wp-content/uploads/2018/01/uniq-2.png
[3]:http://www.ostechnix.com/wp-content/uploads/2018/01/uniq-1-1.png
[4]:http://www.ostechnix.com/wp-content/uploads/2018/01/uniq-4.png