mirror of
https://github.com/LCTT/TranslateProject.git
synced 2025-01-19 22:51:41 +08:00
1ee2fc7cbe
renamed: 10 Useful Chaining Operators in Linux with Practical Examples.md -> 201401/10 Useful Chaining Operators in Linux with Practical Examples.md renamed: 10 basic examples of linux netstat command.md -> 201401/10 basic examples of linux netstat command.md renamed: 12 Advanced Commands For Linux Server Admins!.md -> 201401/12 Advanced Commands For Linux Server Admins!.md renamed: 14 New Linux Distros That Were Introduced In 2013.md -> 201401/14 New Linux Distros That Were Introduced In 2013.md renamed: 15 Basic MySQL Interview Questions for Database Administrators.md -> 201401/15 Basic MySQL Interview Questions for Database Administrators.md renamed: "2013\357\274\232The Golden Year for Linux \342\200\223 10 Biggest Linux Achievements.md" -> "201401/2013\357\274\232The Golden Year for Linux \342\200\223 10 Biggest Linux Achievements.md" renamed: 2014--The year of the Linux car.md -> 201401/2014--The year of the Linux car.md renamed: 5 Things To Love And Hate About Ubuntu 13.10.md -> 201401/5 Things To Love And Hate About Ubuntu 13.10.md renamed: 8 Interesting Linux Tips And Tricks!.md -> 201401/8 Interesting Linux Tips And Tricks!.md renamed: CentOS 6.5 desktop installation guide with screenshots.md -> 201401/CentOS 6.5 desktop installation guide with screenshots.md renamed: "Command Line Basics \342\200\223 watch.md" -> "201401/Command Line Basics \342\200\223 watch.md" renamed: Configure Your Browser To Use Tor On Ubuntu or Debian or Linux Mint.md -> 201401/Configure Your Browser To Use Tor On Ubuntu or Debian or Linux Mint.md renamed: Daily Ubuntu Tips - Mount Partitions In Ubuntu From Your Desktop GUI.md -> 201401/Daily Ubuntu Tips - Mount Partitions In Ubuntu From Your Desktop GUI.md renamed: "Daily Ubuntu Tips \342\200\223 Do Nothing When Laptop Lid Is Closed.md" -> "201401/Daily Ubuntu Tips \342\200\223 Do Nothing When Laptop Lid Is Closed.md" renamed: "Daily Ubuntu Tips \342\200\224 Install VMware Workstation In Ubuntu.md" -> "201401/Daily Ubuntu Tips \342\200\224 Install VMware Workstation In Ubuntu.md" renamed: "Daily Ubuntu Tips \342\200\224 Windows Disk Management Equivalent In Ubuntu.md" -> "201401/Daily Ubuntu Tips \342\200\224 Windows Disk Management Equivalent In Ubuntu.md" renamed: "Gnu--toward the post-scarcity world \342\200\223 the Free Software Column.md" -> "201401/Gnu--toward the post-scarcity world \342\200\223 the Free Software Column.md" renamed: How to Dual Boot Ubuntu and Windows Properly.md -> 201401/How to Dual Boot Ubuntu and Windows Properly.md renamed: "How to Install and Configure UFW \342\200\223 An Un-complicated FireWall in Debian or Ubuntu.md" -> "201401/How to Install and Configure UFW \342\200\223 An Un-complicated FireWall in Debian or Ubuntu.md" renamed: How to Upgrade to GNOME 3.10 in Ubuntu 13.10.md -> 201401/How to Upgrade to GNOME 3.10 in Ubuntu 13.10.md renamed: How to install and configure Nagios on Linux.md -> 201401/How to install and configure Nagios on Linux.md renamed: How to set password policy on Linux.md -> 201401/How to set password policy on Linux.md renamed: How to stitch photos together on Linux.md -> 201401/How to stitch photos together on Linux.md renamed: How to upgrade MySQL server on Debian or Ubuntu.md -> 201401/How to upgrade MySQL server on Debian or Ubuntu.md renamed: Juju ice-cream icon design.md -> 201401/Juju ice-cream icon design.md renamed: Linus Torvalds Releases Last Linux Kernel 3.13 RC for 2013.md -> 201401/Linus Torvalds Releases Last Linux Kernel 3.13 RC for 2013.md renamed: Linus Torvalds Says All Contributor License Agreements Are Broken.md -> 201401/Linus Torvalds Says All Contributor License Agreements Are Broken.md renamed: Linux free Command - Display Free and used Memory in the System.md -> 201401/Linux free Command - Display Free and used Memory in the System.md renamed: Linux id Command - Print user ID and group ID information.md -> 201401/Linux id Command - Print user ID and group ID information.md renamed: Linux is Everywhere. We show you exactly where.md -> 201401/Linux is Everywhere. We show you exactly where.md renamed: Linux lsusb Command to Print information about USB on System.md -> 201401/Linux lsusb Command to Print information about USB on System.md renamed: Linux vmstat Command - Tool to Report Virtual Memory Statistics.md -> 201401/Linux vmstat Command - Tool to Report Virtual Memory Statistics.md renamed: "Linux who command \342\200\223 Displays who is on the system.md" -> "201401/Linux who command \342\200\223 Displays who is on the system.md" renamed: "Move Dropbox\342\200\231s Folder To An External Drive In Ubuntu.md" -> "201401/Move Dropbox\342\200\231s Folder To An External Drive In Ubuntu.md" renamed: New Ubuntu 14.04 Icons Are Drop-Dead Gorgeous, Might Not Arrive in Desktop Version.md -> 201401/New Ubuntu 14.04 Icons Are Drop-Dead Gorgeous, Might Not Arrive in Desktop Version.md renamed: Our Top 10 Linux Applications of 2013.md -> 201401/Our Top 10 Linux Applications of 2013.md renamed: Setup your personal Cloud server in minutes using ownCloud On RHEL, CentOS, Scientific Linux 6.5.md -> 201401/Setup your personal Cloud server in minutes using ownCloud On RHEL, CentOS, Scientific Linux 6.5.md renamed: Software May Be Eating The World, But Open Source Software Is Eating Itself.md -> 201401/Software May Be Eating The World, But Open Source Software Is Eating Itself.md renamed: The Debian Administrator's Handbook updated for Debian 7 Wheezy published and freely available for download.md -> 201401/The Debian Administrator's Handbook updated for Debian 7 Wheezy published and freely available for download.md renamed: The Fedora Project Will No Longer Name Its Linux Distributions.md -> 201401/The Fedora Project Will No Longer Name Its Linux Distributions.md renamed: The Genius Of Linux Is Community, Not Technology.md -> 201401/The Genius Of Linux Is Community, Not Technology.md renamed: Top 10 Linux Distros For Hackers!.md -> 201401/Top 10 Linux Distros For Hackers!.md renamed: Tunnel SSH Connections Over SSL Using 'Stunnel' On Debian 7 Ubuntu 13.10.md -> 201401/Tunnel SSH Connections Over SSL Using 'Stunnel' On Debian 7 Ubuntu 13.10.md renamed: Ubuntu Stores Your Wi-Fi Passwords By Default!.md -> 201401/Ubuntu Stores Your Wi-Fi Passwords By Default!.md renamed: Ubuntu Will Reach True Convergence Before Microsoft, Says Shuttleworth.md -> 201401/Ubuntu Will Reach True Convergence Before Microsoft, Says Shuttleworth.md renamed: Understanding Linux cd Command with Examples.md -> 201401/Understanding Linux cd Command with Examples.md renamed: look--Linux Command To Verify Spellings And Display Lines Beginning With A String.md -> 201401/look--Linux Command To Verify Spellings And Display Lines Beginning With A String.md
364 lines
12 KiB
Markdown
364 lines
12 KiB
Markdown
Linux下Nagios的安装与配置
|
||
================================================================================
|
||
[Nagios][1]是企业普遍使用的最具影响力的网络信息监视系统之一,它可以动态监视指定的网络状态,并在状态异常时发出警告音或邮件报警通知运维人员。监控的类型和警报定时器是完全可定制的。
|
||
|
||
Nagios的另一强大功能是它能同时监测主机和服务。例如,它可以同时监测到IP地址和TCP/UDP端口号。为进一步阐述此功能,我们假定有台需要监测的web服务器,Nagios可运用在服务器上基于IP/服务器名运行ping命令的方法检测服务器是否在线,同时当服务器的RTT(往返时延)增加时,Nagios会随时告警。另外,Nagios还能检测TCP的80端口(web服务器)是否可达,比如可能出现服务器在线但Apache/IIS没有响应的情况。
|
||
|
||
而基于Nagios的第三方监测工具如[Centreon][2], [FAN][3] , [op5 Monitor][4]
|
||
在界面设计,自动化运行和技术支持方面在独立的Nagios引擎基础上提供了相应的补充。
|
||
|
||
本段教程将展示**Linux下Nagios的安装与配置**.
|
||
|
||
## Debian或Ubuntu下Nagios的安装 ##
|
||
|
||
Debian系统用户使用apt-get命令就可安装Nagios:
|
||
|
||
root@mrtg:~# apt-get install nagios3
|
||
|
||
Nagios安装过程中可以设置邮件服务器,安装后也可以进行自定义设置。
|
||
|
||
提示:为使Nagios可正常发送邮件通知,需正确配置SMTP。
|
||
|
||
[![](http://farm4.staticflickr.com/3809/11198373625_cdacef2275_z.jpg)][5]
|
||
|
||
可以看到Nagios支持多种邮件传送方式。最常见的莫过于服务器(Internet Site)直接向接收者发送邮件,另一种常见的形式是用智能主机或中继服务器,将邮件传送至中继邮件服务器,然后它负责将邮件传送给接收者。
|
||
|
||
进行下一步操作时服务器的域名需要包含进去。
|
||
|
||
[![](http://farm8.staticflickr.com/7384/11198394746_f4d42300b5_z.jpg)][6]
|
||
|
||
最后,设置一下Nagios管理员‘nagiosadmin’的密码,用户可以之后自行修改密码。
|
||
|
||
[![](http://farm4.staticflickr.com/3759/11198378964_5d7f6638e8_z.jpg)][7]
|
||
|
||
## CentOS或RHEL下Nagios的安装 ##
|
||
|
||
使用yum命令来安装,[建立repoforge库][8]之后运行如下yum命令:
|
||
|
||
[root@mrtg ~]# yum install nagios nagios-plugins
|
||
|
||
## 监测需求 ##
|
||
|
||
本段教程中,我们希望完成以下监测内容:
|
||
|
||
1. 每三分钟监测一次所有Linux服务器。
|
||
2. 每三分钟监测一次所有思科路由器。
|
||
3. 所有的邮件告警集中发送至sentinel@example.tst。
|
||
4. 发出告警前Nagios都将进行3次验证以确保问题确有发生。
|
||
5. 当设备的往返时延超过100ms并且/或包丢失量超过20%将发出邮件告警。
|
||
|
||
余下的教程将指导您如何在Linux系统中配置Nagios。
|
||
|
||
## Ubuntu下Nagios的配置 ##
|
||
|
||
明确Nagios配置文件的地址至关重要,以下路径指明了Debian系统下Nagios的配置文件地址。
|
||
|
||
/etc/nagios-plugins 实现监测功能的专有脚本存放地址
|
||
/etc/nagios3 添加主机、服务,定义检测和定时器的配置文件
|
||
/usr/lib/nagios/plugins 用于监测的可执行文件
|
||
|
||
接下来的步骤相互关联,由此开始定义主机,主机组及向主机组添加服务操作。
|
||
|
||
### 添加主机模板 ###
|
||
|
||
我们对同一类型的主机定义了对应的模板,这里使用安装文件作举例说明。
|
||
|
||
首先,为Linux设备定义主机模板。
|
||
|
||
root@mrtg:~# cd /etc/nagios3/conf.d
|
||
root@mrtg:/etc/nagios3/conf.d/# cp generic-host_nagios2.cfg linux-server.cfg
|
||
|
||
如下编辑linux-server.cfg。需要修改部分已经标出。
|
||
|
||
root@mrtg:/etc/nagios3/conf.d/# vim linux-server.cfg
|
||
-
|
||
define host{
|
||
name linux-server ; 名称,需修改
|
||
notifications_enabled 1
|
||
event_handler_enabled 1
|
||
flap_detection_enabled 1
|
||
failure_prediction_enabled 1
|
||
process_perf_data 1
|
||
retain_status_information 1
|
||
retain_nonstatus_information 1
|
||
check_command example-host-check ; 检查所用脚本,需修改
|
||
check_interval 3 ; 连续检查的间隔,需修改
|
||
max_check_attempts 3 ; 产生邮件告警前的自检次数,需修改
|
||
notification_interval 0
|
||
notification_period 24x7
|
||
notification_options d,u,r
|
||
contact_groups admins ; 邮件将要发送至的组,需修改
|
||
register0
|
||
}
|
||
|
||
接下来,为Cisco设备定义主机模板。
|
||
|
||
root@mrtg:/etc/nagios3/conf.d/# cp linux-server.cfg cisco-device.cfg
|
||
|
||
如下修改cisco-device.cfg。需要修改部分已经标出。
|
||
|
||
root@mrtg:/etc/nagios3/conf.d/# vim cisco-device.cfg
|
||
-
|
||
define host{
|
||
name cisco-device ;名称,需修改
|
||
notifications_enabled 1
|
||
event_handler_enabled 1
|
||
flap_detection_enabled 1
|
||
failure_prediction_enabled 1
|
||
process_perf_data 1
|
||
retain_status_information 1
|
||
retain_nonstatus_information 1
|
||
check_command example-host-check ; 检查时使用的脚本,需修改
|
||
check_interval 3 ; 连续检查间隔,需修改
|
||
max_check_attempts 3 ; 产生邮件告警前的自检次数,需修改
|
||
notification_interval 0
|
||
notification_period 24x7
|
||
notification_options d,u,r
|
||
contact_groups admins ; 邮件将要发至的组,需修改
|
||
register 0
|
||
}
|
||
|
||
### 添加主机 ###
|
||
|
||
现在已定义主机模板,就可以添加需要监测的主机。以默认的文件作例子展示如下内容。
|
||
|
||
root@mrtg:/etc/nagios3/conf.d/# cp localhost_nagios2.cfg example.cfg
|
||
root@mrtg:/etc/nagios3/conf.d/# vim example.cfg
|
||
-
|
||
# Host 1
|
||
define host{
|
||
use linux-server ; 使用的主机模板名
|
||
host_name our-server ; nagios使用的主机名
|
||
alias our-server
|
||
address 172.17.1.23 ; 主机的IP地址
|
||
}
|
||
|
||
# Host 2
|
||
define host{
|
||
use cisco-device ; 使用的主机模板名
|
||
host_name our-router ; nagios使用的主机名
|
||
alias our-router
|
||
address 172.17.1.1 ; 主机的IP地址
|
||
}
|
||
|
||
### 主机组定义 ###
|
||
|
||
当有多个主机时,为方便管理,建议将相似的主机组成一组。
|
||
|
||
root@mrtg:/etc/nagios3/conf.d/# vim hostgroups_nagios2.cfg
|
||
-
|
||
define hostgroup {
|
||
hostgroup_name linux-server ; 主机组名
|
||
alias Linux Servers
|
||
members our-server ; 组员列表
|
||
}
|
||
|
||
define hostgroup {
|
||
hostgroup_name cisco-device ; 主机组名
|
||
alias Cisco Devices
|
||
members our-server ; comma separated list of members
|
||
}
|
||
|
||
### 定义服务 ###
|
||
|
||
首先,定义一个服务example-host-check:当往返时延达到100ms预警值并且有20%包丢失时发出警告,而紧急告警设置为5000ms且包丢失比率为100%,只执行一个IPv4的ping请求检测。
|
||
|
||
root@mrtg:~# vim /etc/nagios-plugins/config/ping.cfg
|
||
-
|
||
define command{
|
||
command_name example-host-check
|
||
command_line /usr/lib/nagios/plugins/check_ping -H '$HOSTADDRESS$' -w 100,20% -c 5000,100% -p 1 -4
|
||
}
|
||
|
||
然后,将命令关联至主机组。
|
||
|
||
root@mrtg:/etc/nagios3/conf.d/# vim services_nagios2.cfg
|
||
-
|
||
define service {
|
||
hostgroup_name linux-server
|
||
service_description Linux Servers
|
||
check_command example-host-check
|
||
use generic-service
|
||
notification_interval 0 ; 初始化设置为0
|
||
}
|
||
|
||
define service {
|
||
hostgroup_name cisco-device
|
||
service_description Cisco Devices
|
||
check_command example-host-check
|
||
use generic-service
|
||
notification_interval 0 ; 初始化设置为0
|
||
}
|
||
|
||
### 联系人定义 ###
|
||
|
||
进行如下定义将发送邮件需要的地址添加至Nagios。
|
||
|
||
root@mrtg:/etc/nagios3/conf.d/# vim contacts.cfg
|
||
-
|
||
define contact{
|
||
contact_name root
|
||
alias Root
|
||
service_notification_period 24x7
|
||
host_notification_period 24x7
|
||
service_notification_options w,u,c,r
|
||
host_notification_options d,r
|
||
service_notification_commands notify-service-by-email
|
||
host_notification_commands notify-host-by-email
|
||
email root@localhost, sentinel@example.tst
|
||
}
|
||
|
||
最后,试运行初始化检测是否有配置错误。如果没有错误,Nagios开始安全运行。
|
||
|
||
root@mrtg:~#nagios –v /etc/nagios3/nagios.cfg
|
||
root@mrtg:~# service nagios3 restart
|
||
|
||
## CentOS/RHEL上的Nagios配置 ##
|
||
|
||
Redhat系统中Nagios的配置文件地址如下所示。
|
||
|
||
/etc/nagios/objects 添加主机、服务,定义监测和定时器的配置文件
|
||
|
||
/usr/lib/nagios/plugins 实现监测的可执行文件
|
||
|
||
### 添加主机模板 ###
|
||
|
||
为特定类型的主机创建所需的模板,相应修改安装所需的文件。
|
||
|
||
[root@mrtg objects]# cd /etc/nagios/objects/
|
||
[root@mrtg objects]# vim templates.cfg
|
||
-
|
||
define host{
|
||
name linux-server
|
||
use generic-host
|
||
check_period 24x7
|
||
check_interval 3
|
||
retry_interval 1
|
||
max_check_attempts 3
|
||
check_command example-host-check
|
||
notification_period 24x7
|
||
notification_interval 0
|
||
notification_options d,u,r
|
||
contact_groups admins
|
||
register 0
|
||
}
|
||
|
||
define host{
|
||
name cisco-router
|
||
use generic-host
|
||
check_period 24x7
|
||
check_interval 3
|
||
retry_interval 1
|
||
max_check_attempts 3
|
||
check_command example-host-check
|
||
notification_period 24x7
|
||
notification_interval 0
|
||
notification_options d,u,r
|
||
contact_groups admins
|
||
register 0
|
||
}
|
||
|
||
### 添加主机和主机组 ###
|
||
|
||
这里以默认的配置文件为例,主机和主机组添加至配置文件中。
|
||
|
||
[root@mrtg objects]# cp localhost.cfg example.cfg
|
||
[root@mrtg objects]# vim example.cfg
|
||
-
|
||
#Adding Linux server
|
||
define host{
|
||
use linux-server
|
||
host_name our-server
|
||
alias our-server
|
||
address 172.17.1.23
|
||
}
|
||
|
||
#Adding Cisco Router
|
||
define host{
|
||
use cisco-router
|
||
host_name our-router
|
||
alias our-router
|
||
address 172.17.1.1
|
||
}
|
||
|
||
# HOST GROUP DEFINITION
|
||
define hostgroup{
|
||
hostgroup_name linux-servers
|
||
alias Linux Servers
|
||
members our-server
|
||
}
|
||
|
||
define hostgroup{
|
||
hostgroup_name cisco-router
|
||
alias cisco-router
|
||
members our-router
|
||
}
|
||
|
||
### 服务定义 ###
|
||
|
||
定义一个服务example-host-check:当往返时延达到100ms预警值并且有20%包丢失时发出警告,而紧急告警设置为5000ms且包丢失比率为100%,只执行一个IPv4的ping请求检测。
|
||
|
||
[root@mrtg objects]# vim commands.cfg
|
||
-
|
||
define command{
|
||
command_name example-host-check
|
||
command_line $USER1$/check_ping -H $HOSTADDRESS$ -w 100.0,20% -c 5000.0,100% -p 1 -4
|
||
}
|
||
|
||
### 联系人定义 ###
|
||
|
||
告警要发送的邮件地址添加至Nagios中。
|
||
|
||
[root@objects objects]# vim contacts.cfg
|
||
-
|
||
define contact{
|
||
contact_name nagiosadmin
|
||
use generic-contact
|
||
alias Nagios Admin
|
||
email nagios@localhost, sentinel@example.tst
|
||
}
|
||
|
||
最后,准备启动Nagios服务,可先试运行检测配置是否出错。
|
||
|
||
[root@mrtg ~]# nagios –v /etc/nagios/nagios.cfg
|
||
[root@mrtg ~]# service nagios restart
|
||
[root@mrtg ~]# chkconfig nagios on
|
||
|
||
### 配置后访问Nagios ###
|
||
|
||
现在一切就绪,可以开始Nagios之旅了。Ubuntu/Debian用户可以通过打开http://IP地址/nagios3网页访问Nagios,CentOS/RHEL用户可以打开http://IP地址/nagios,如http://172.17.1.23/nagios3来访问Nagios。“nagiosadmin”用户则需要认证来访问页面。
|
||
|
||
[![](http://farm4.staticflickr.com/3834/11198394806_4f4a753778_z.jpg)][9]
|
||
|
||
若Nagios没有依原设定运行,首先要做的是建立一个测试运行(dry run)。
|
||
|
||
Debian或Ubuntu系统:
|
||
|
||
# nagios3 -v /etc/nagios3/nagios.cfg
|
||
|
||
CentOS或RHEL系统:
|
||
|
||
# nagios -v /etc/nagios/nagios.cfg
|
||
|
||
日志文件也会提供重要线索,若需查看可以转至路径/var/log/nagios/nagios.log。
|
||
|
||
希望本文有所帮助。
|
||
|
||
--------------------------------------------------------------------------------
|
||
|
||
via: http://xmodulo.com/2013/12/install-configure-nagios-linux.html
|
||
|
||
译者:[icybreaker](https://github.com/icybreaker) 校对:[wxy](https://github.com/wxy)
|
||
|
||
本文由 [LCTT](https://github.com/LCTT/TranslateProject) 原创翻译,[Linux中国](http://linux.cn/) 荣誉推出
|
||
|
||
[1]:http://www.nagios.org/
|
||
[2]:http://www.centreon.com/
|
||
[3]:http://www.fullyautomatednagios.org/
|
||
[4]:http://www.op5.com/
|
||
[5]:http://www.flickr.com/photos/xmodulo/11198373625/
|
||
[6]:http://www.flickr.com/photos/xmodulo/11198394746/
|
||
[7]:http://www.flickr.com/photos/xmodulo/11198378964/
|
||
[8]:http://xmodulo.com/2013/01/how-to-set-up-rpmforge-repoforge-repository-on-centos.html
|
||
[9]:http://www.flickr.com/photos/xmodulo/11198394806/
|