您的位置:首页 > 移动开发 > IOS开发

nagios运用飞信实现短信报警监控服务器 推荐

2009-07-27 16:13 806 查看
此平台是基于apache2.2.1+php5.3.0+nagios3.1.2+fetion2009等软件的组合搭建而出,这些软件的版本是现阶段最新的。本人经过一段时间的搭建与测试,各个功能都能够成功实现,现源于开源软件,本人却不敢独享,现把搭建与配置过程介绍如下:
一、服务器端的设置
1、下载需求软件:
nagios主程序:
http://prdownloads.sourceforge.net/sourceforge/nagios/nagios-3.1.2.tar.gz
nagios插件程序:
http://prdownloads.sourceforge.net/sourceforge/nagiosplug/nagios-plugins-1.4.13.tar.gz
apache程序:
http://apache.freelamp.com/httpd/httpd-2.2.11.tar.gz
php程序:
http://cn.php.net/distributions/php-5.3.0.tar.gz
libACE程序:
http://www.it-adv.net/fetion/downng/library_linux.tar.gz
fetion机器人程序:
http://www.it-adv.net/fetion/downng/fetion20090406003-linux.tar.gz
2、软件的安装
(1) apache的安装

[root@localhost httpd-2.2.11]# ./configure --prefix=/usr/local/apache --enable-so
[root@localhost httpd-2.2.11]# make && make install容

(2) php的安装

[root@localhost php-5.2.10]# ./configure --prefix=/usr/local/php \
> --with-apxs2=/usr/local/apache/bin/apxs
[root@localhost php-5.2.10]# make && make install
[root@localhost php-5.2.10]# cp php.ini-dist /usr/local/lib/php.ini
[root@localhost php-5.2.10]# vi /usr/local/apache/conf/httpd.conf请

为了配置文件的工整,在配置文件的相关地方加入如下配置:

LoadModule php5_module modules/libphp5.so(如果有,不用加)
AddType application/x-httpd-php .php .html
找到DirectoryIndex index.html
改为:DirectoryIndex index.php index.html
在最后加入nagios的访问目录:
ScriptAlias /nagios/cgi-bin /usr/local/nagios/sbin
<Directory "/usr/local/nagios/sbin">
AuthType Basic
Options ExecCGI
AllowOverride None
Order allow,deny
Allow from all
</Directory>
Alias /nagios /usr/local/nagios/share
<Directory "/usr/local/nagios/share">
AuthType Basic
Options None
AllowOverride None
Order allow,deny
Allow from all
</Directory> 请输入内容

(3) nagios的安装
为了支持Nagios安装完后status map,trend等完全显示,需要gd库的支持,首先看一下gd与gd-devel有没有安装


[root@localhost php-5.2.10]# rpm -qa|grep gd


gdb-6.5-37.el5


libgdiplus-1.2.5-1.el5.centos


gd-2.0.33-9.4.el5_1.1


gdbm-1.8.0-26.2.1


sysklogd-1.4.1-44.el5


libgdiplus-devel-1.2.5-1.el5.centos


gd-devel-2.0.33-9.4.el5_1.1
如果没有,请拿出光盘安装上,或者用yum -y install gd*安装也可


[root@localhost nagios-3.1.2]# useradd nagios


[root@localhost nagios-3.1.2]# ./configure --prefix=/usr/local/nagios \


> --with-gd-lib=/usr/lib --with-gd-inc=/usr/include --with-nagios-user=nagios \


> --with-nagios-group=nagios


[root@localhost nagios-3.1.2]# make all


[root@localhost nagios-3.1.2]# make install


[root@localhost nagios-3.1.2]# make install-init


[root@localhost nagios-3.1.2]# make install-commandmode


[root@localhost nagios-3.1.2]# make install-config
(4) nagios-plugin的安装


[root@localhost nagios-plugins-1.4.13]# ./configure --prefix=/usr/local/nagios


[root@localhost nagios-plugins-1.4.13]# make && make install
(5) 飞信机器人的安装
在安装飞信之前需要在/usr/lib目录下加上libACE库文件,以便使飞信能够正常使用


[root@localhost src]# tar zxvf library_linux.tar.gz


[root@localhost src]# mv libACE* libcrypto.so.0.9.8 libssl.so.0.9.8 /usr/lib
好了,现在来正式安装飞信


[root@localhost src]# tar zxvf fetion20090406003-linux.tar.gz


[root@localhost src]# mv install /usr/local/fetion


[root@localhost src]# chmod -R 755 /usr/local/fetion


[root@localhost src]# chown -R nagios:nagios /usr/local/fetion
测试飞信能否成功发送短信(发送对象需是自己飞信好友):


[root@localhost src]# /usr/local/fetion/fetion --mobile=137*** --pwd=*** \


> --to=136*** --msg-utf8="test"
如果136***能成功收到137***的短信就是正常的哟,如果不能成功看日志。
3、nagios的配置
首先讲一下,我建有host(主机),hostgroup(主机组),contact(联系人),contactgroup(联系人组),分别讲一下它们的相关情况:
host包含:TH-02,VH-178,VH-132,MH-110,MH-13
hostgroup包含:TG(别名:testgroup) , VG(别名:virtualgroup),MG(别名:maintaingroup);TG包括主机:TH-02,VG包括主机:VH-132,VH-178,MG包括主机:MH-110,MH-13
contact包含:WSP,HW,ZY
contactgroup包含:testers,virtualers,maintainers;testers包含用户:WSP,virtualers包含用户:WSP和HW,maintainers包含用户:ZY
为了直观起见,请看下图,



好了,有了大致模块以后,现在我们正式开始配置nagios吧
(1) host的编辑与添加
为了灵活使用,我们直接进入/usr/local/nagios/etc/objects目录下面,


[root@localhost src]# cd /usr/local/nagios/etc/objects/


[root@localhost objects]# vi hosts.cfg
##########################################################################


### Define whole host for all the machines






# Define testgroup host for the testers machine




define host{


host_name TH-02


alias testhost02


max_check_attempts 5


contact_groups testers


address 60.191.*.*


}






# Define virtualgroup host for the virtualers machine




define host{


host_name VH-178


alias Virutalhost178


max_check_attempts 5


contact_groups virtualers


address *.*.*.178


}




define host{


host_name VH-132


alias Virtualhost132


max_check_attempts 5


contact_groups virtualers


address *.*.*.132


}




# Define maintaingroup host for the maintainers machine




define host{


host_name MH-110


alias maintainhost110


max_check_attempts 5


contact_groups maintainers


address *.*.*.110


}




define host{


host_name MH-13


alias maintainhost13


max_check_attempts 5


contact_groups maintainers


address *.*.*.13


}






##############################################################################


### Define all hostgroup for the whole machine






# Define testgroup




define hostgroup{


hostgroup_name TG


alias testgroup


members TH-02


}




# Define virtualgroup




define hostgroup{


hostgroup_name VG


alias virtualgroup


members VH-178,VH-132


}




# Define maintaingroup




define hostgroup{


hostgroup_name MG


alias maintaingroup


members MH-110,MH-13


}




#############################################################################


### SERVICE DEFINITIONS






# Define a service to "ping" the whole machine




define service{


host_name TH-02,VH-178,VH-132,MH-110


service_description PING


is_volatile 0


check_period 24x7


max_check_attempts 1


normal_check_interval 1


retry_check_interval 1


notification_options w,u,c,r


notification_interval 1


notification_period 24x7


check_command check_ping!100.0,20%!500.0,60%
}


# Define a service to "HTTP" the whole machine




define service{


host_name H-02,VH-178,VH-132,MH-13,MH-110
service_description HTTP


is_volatile 0


check_period 24x7


max_check_attempts 1


normal_check_interval 1


retry_check_interval 1


notification_options w,u,c,r


notification_interval 1


notification_period 24x7


check_command check_http


notifications_enabled 0


}

service 可以根据自己的需要加的,我这里只对HTTP与PING来作阐述的。
(2) contacts.cfg的编辑


###############################################################################


# CONTACTS.CFG - SAMPLE CONTACT/CONTACTGROUP DEFINITIONS


#


# Last Modified: 05-31-2007


#


# NOTES: This config file provides you with some example contact and contact


# group definitions that you can reference in host and service


# definitions.


#


# You don't need to keep these definitions in a separate file from your


# other object definitions. This has been done just to make things


# easier to understand.


#


###############################################################################








###############################################################################


###############################################################################


#


# CONTACTS


#


###############################################################################


###############################################################################


### Define contact information for all the contacter






# Define contact information for WSP




define contact{


contact_name WSP


use generic-contact


alias WY-wsp


service_notification_commands notify-service-by-email,service-notify-by-fei


host_notification_commands notify-host-by-email,host-notify-by-fei


email wsp@wy.cn


pager 150***


}




# Define contact information for HW


define contact{


contact_name HW


use generic-contact


alias WY-hw


service_notification_commands notify-service-by-email,service-notify-by-fei


host_notification_commands notify-host-by-email,host-notify-by-fei


email hewei@wy.cn


pager 135***


}




# Define contact information for ZY


define contact{


contact_name ZY


use generic-contact


alias WY-zy


service_notification_commands notify-service-by-email,service-notify-by-fei


host_notification_commands notify-host-by-email,host-notify-by-fei


email zy@wy.cn


pager 150***


}






###############################################################################


###############################################################################


#


# CONTACT GROUPS


#


###############################################################################


###############################################################################


### Define contact group for all ther whole contacter




# Define testers contact group




define contactgroup{


contactgroup_name testers


alias testhost contacters


members WSP


}




# Define virtualers contact group




define contactgroup{


contactgroup_name virtualers


alias virtualhost contacters


members WSP,HW


}






# Define maintainers contact group




define contactgroup{


contactgroup_name maintainers


alias maintainhost contacters


members ZY


}



(3) commands.cfg的编辑
为了支持飞信发短信的功能需要在此文件中添加飞信命令,以促使信息的发送,代码如下:


# 'notify-host-by-fei' command definition


define command {


command_name host-notify-by-fei ;


command_line /usr/local/fetion/fetion --mobile=137*** --pwd=*** --to=$CONTACTPAGER$ --msg-utf8="Host $HOSTSTATE$ alert for $HOSTNAME$! on '$LONGDATETIME$'" $CONTACTPAGER$


}




# 'notify-service-by-fei' command definition


define command {


command_name service-notify-by-fei


command_line /usr/local/fetion/fetion --mobile=137*** --pwd=*** --to=$CONTACTPAGER$ --msg-utf8="$HOSTADDRESS$ $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ on $LONGDATETIME$" $CONTACTPAGER$


}
(4) 编辑nagios.cfg


为了不监控本机,现把localhost注释掉,找到此行,并且加以注释:


# cfg_file=/usr/local/nagios/etc/objects/localhost.cfg


为了使刚才编辑的文件生效,需加入一行:


cfg_file=/usr/local/nagios/etc/objects/hosts.cfg
(5) 编辑cgi.cfg
为了保证权限能正常访问nagios的web页面,需把这些行的值改掉:


authorized_for_system_information=*


authorized_for_configuration_information=*


authorized_for_system_commands=nagios


authorized_for_all_services=*


authorized_for_all_hosts=*


authorized_for_all_service_commands=*


authorized_for_all_host_commands=*


*号在此表示所有用户,也可以改成相关用户,不过必须逐一添加。
服务器端的配置基本上大致如此了。
4、启动nagios与apache服务
(1) nagios自启动


[root@localhost objects]# service nagios restart


Running configuration check...done.


Stopping nagios: done.


Starting nagios: done.


[root@localhost objects]# chkconfig nagios on


[root@localhost objects]# chkconfig --list|grep nagios


nagios 0:关闭 1:关闭 2:启用 3:启用 4:启用 5:启用 6:关闭

(2) apache的启动


[root@localhost ~]# export PATH=$PATH:/usr/local/apache/bin


[root@localhost ~]# apachectl -k start


httpd (pid 3108) already running
注:为了让apache可以自启动,可以加入到/etc/rc.local之中。

5、测试与浏览
在启动nagios服务之前,我们可以先测试一下是否配置有没有错误,或者你启动的时候出现错误也可以用下述命令进行测试


[root@localhost ~]# /usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg


Nagios 3.1.2


Copyright (c) 2009 Nagios Core Development Team and Community Contributors


Copyright (c) 1999-2009 Ethan Galstad


Last Modified: 06-23-2009


License: GPL


Website: http://www.nagios.org

Reading configuration data...


Read main config file okay...


Processing object config file '/usr/local/nagios/etc/objects/commands.cfg'...


Processing object config file '/usr/local/nagios/etc/objects/contacts.cfg'...


Processing object config file '/usr/local/nagios/etc/objects/timeperiods.cfg'...


Processing object config file '/usr/local/nagios/etc/objects/templates.cfg'...


Processing object config file '/usr/local/nagios/etc/objects/hosts.cfg'...


Read object config files okay...


Running pre-flight check on configuration data...


Checking services...


Checked 9 services.


Checking hosts...


Checked 5 hosts.


Checking host groups...


Checked 3 host groups.


Checking service groups...


Checked 0 service groups.


Checking contacts...


Checked 3 contacts.


Checking contact groups...


Checked 3 contact groups.


Checking service escalations...


Checked 0 service escalations.


Checking service dependencies...


Checked 0 service dependencies.


Checking host escalations...


Checked 0 host escalations.


Checking host dependencies...


Checked 0 host dependencies.


Checking commands...


Checked 26 commands.


Checking time periods...


Checked 5 time periods.


Checking for circular paths between hosts...


Checking for circular host and service dependencies...


Checking global event handlers...


Checking obsessive compulsive processor commands...


Checking misc settings...


Total Warnings: 0


Total Errors: 0

呵呵,没有错误,可能您在配置之前或多或少会出现问题的,根据相关错误提示,逐一排查,相信您定会成功的。
现在我们打开nagios的web页面,输入:http://***.com/nagios样例
先看map页面,大致页面如下



再看hosts相关链接页面:



最后,我们再看一下hostgroups页面:



二、客户端的配置:
1、需求软件的下载:
nagios插件程序:
http://prdownloads.sourceforge.net/sourceforge/nagiosplug/nagios-plugins-1.4.13.tar.gz
nrpe软件:
http://prdownloads.sourceforge.net/sourceforge/nagios/nrpe-2.12.tar.gz
2、软件的安装
(1) nagios插件的安装:
首先在安装前建立nagios用户,命令如下:


[root@yibaner src]# useradd nagios -s /sbin/nologin -M


[root@yibaner src]# tar zxvf nagios-plugins-1.4.13.tar.gz


[root@yibaner src]# cd nagios-plugins-1.4.13


[root@yibaner nagios-plugins-1.4.13]# ./configure --prefix=/usr/local/nagios


[root@yibaner nagios-plugins-1.4.13]# make && make install


[root@yibaner nagios-plugins-1.4.13]# chown -R nagios.nagios /usr/local/nagios/
(2) nrpe的程序的安装:


[root@yibaner src]# tar zxvf nrpe-2.12.tar.gz


[root@yibaner nrpe-2.12]# cd nrpe-2.12


[root@yibaner nrpe-2.12]# ./configure


[root@yibaner nrpe-2.12]# make all


[root@yibaner nrpe-2.12]# make install-plugin


[root@yibaner nrpe-2.12]# make install-daemon


[root@yibaner nrpe-2.12]# make install-daemon-config
(3) 修改nrpe.cfg配置文件:


[root@yibaner nrpe-2.12]# vi /usr/local/nagios/etc/nrpe.cfg


将server_address的值修改为本机的IP地址:


server_address=*.*.*.110


将allowed_hosts的值修改为远程监控服务器的IP地址:


allowed_hosts=*.*.*.27
(4) 运行nrpe服务:


[root@yibaner nrpe-2.12]# /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d

(5) 查看nrpe运行情况:


[root@yibaner nrpe-2.12]# netstat -nap | grep nrpe
tcp 0 0 *.*.*.110:5666 0.0.0.0:* LISTEN 3538/nrpe


从上述情况来看,nrpe成功运行,并且占用了5666端口
(6) 设置nrpe自启动:


[root@yibaner nrpe-2.12]#vi /etc/rc.local


在其中加中此行:


/usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d
(7) 在监控服务器端测试被监控服务器是否正常运行


[root@localhost src]# telnet *.*.*.110 5666
测试如果一切正常,整个配置过程全部告一段落,只需到监控服务器中设置对被监控服务器的所需服务的监控即可。

这样一来,整个nagios的配置全部告一段落了,把被监控服务器的某个服务暂时停用一下,马上就会收到飞信的短消息了,一切表明,配置成功了。
内容来自用户分享和网络整理,不保证内容的准确性,如有侵权内容,可联系管理员处理 点击这里给我发消息