快捷搜索:   nginx

用MRTG监测Linux系统CPU温度

在Linux下有个叫lm_sensors的软件可以帮助我们来监控主板,CPU的工作电压,风扇转速、温度等数据。这些数据我们通常在主板的 BIOS也可以看到。当我们可以在机器运行的时候通过lm_sensors随时来监测着CPU的温度变化,可以预防呵保护因为CPU过热而会烧掉。
 
  1. 安装lm_sensors
 
  现在基本上每个Linux都已经有lm_sensors包了,我们主要安装了就可以了。或者我们也可以通过源文件来自己编译。
 
  1)在FC,RH, CENTOS下,用rpm:
 
  [root@securitycn ~]# rpm -ivh lm_sensors-2.10.0-3.1.i386.rpm
 
  3)编译源文件安装
 
  我们可以通过:这里下载源文件
 
  这里我们要注意的问题是要先安装libsysfs库,是Sysfsutils,Sysfsutils-devel软件
 
  tar xzvf lm-sensors-xxx.tar.gz
 
  make user
 
  make user_install testing
 
  下面我们就用一些简单的命令来利用lm_sensors来得到CPU的数据。我们要用root的身份来:
 
  sensors-detect,然后它会自动搜索主板上的chipset和相应的driver,我们全部答YES就可以了
 
  [root@securitycn ~]# sensors-detect
 
  # sensors-detect revision 1.413 (2006/01/19 20:28:00)
 
  This program will help you determine which I2C/SMBus modules you need to
 
  load to use lm_sensors most effectively. You need to have i2c and
 
  lm_sensors installed before running this program.
 
  Also, you need to be `root', or at least have access to the /dev/i2c-*
 
  files, for most things.
 
  If you have patched your kernel and have some drivers built in, you can
 
  safely answer NO if asked to load some modules. In this case, things may
 
  seem a bit confusing, but they will still work.
 
  It is generally safe and recommended to accept the default answers to all
 
  questions, unless you know what you're doing.
 
  We can start with probing for (PCI) I2C or SMBus adapters.
 
  You do not need any special privileges for this.
 
  Do you want to probe now? (YES/no):
 
  全部默认YES即可。
 
  然后我们启动lm_sensors :
 
  /etc/init.d/lm_sensors start
 
  Starting lm_sensors: [ OK ]
 
  我们可以通过lsmod来确定我们需要的driver已经加载了没有
 
  lsmod | grep i2c
 
  i2c_isa 9153 2 w83627hf,w83781d
 
  i2c_i801 11341 0
 
  i2c_dev 12613 0
 
  i2c_ec 9025 1 sbs
 
  i2c_core 23745 6 w83627hf,w83781d,i2c_isa,i2c_i801,i2c_dev,i2c_ec
 
  然后我们用sensors的命令就可以了:
 
  [root@securitycn ~]# sensors
 
  w83627hf-isa-0290
 
  Adapter: ISA adapter
 
  VCore 1: +3.33 V (min = +0.00 V, max = +0.00 V) ALarm
 
  VCore 2: +3.36 V (min = +0.00 V, max = +0.00 V) ALarm
 
  +3.3V: +0.93 V (min = +3.14 V, max = +3.46 V) ALarm
 
  +5V: +5.11 V (min = +4.73 V, max = +5.24 V)
 
  +12V: +4.56 V (min = +10.82 V, max = +13.19 V) ALarm
 
  -12V: -7.10 V (min = -13.18 V, max = -10.88 V) ALarm
 
  -5V: -1.93 V (min = -5.25 V, max = -4.75 V) ALarm
 
  V5SB: +5.51 V (min = +4.73 V, max = +5.24 V) ALarm
 
  VBat: +0.02 V (min = +2.40 V, max = +3.60 V) ALarm
 
  fan1: 0 RPM (min = 2732 RPM, div = 2) ALarm
 
  fan2: 0 RPM (min = 0 RPM, div = 2)
 
  fan3: 0 RPM (min = 0 RPM, div = 2)
 
  temp1: +38癈 (high = +50癈, hyst = +45癈) sensor = thermistor
 
  temp2: +33.5癈 (high = +80癈, hyst = +75癈) sensor = thermistor
 
  temp3: +33.5癈 (high = +80癈, hyst = +75癈) sensor = thermistor
 
  vid: +0.000 V (VRM Version 10.0)
 
  alarms:
 
  beep_enable:
 
  Sound alarm enabled
 
  这里我们可以看到温度还没有相对应CPU,我们主要稍微修改一下/etc/sensors.conf就可以了,不过其实都不用我们自己去动手,一般我们都可以从主板生产商那里下载到配置文件。通过lmsensors我们就可以得到主板温度,CPU电压,风扇转速这些信息。我们可以根据这些数据来监察系统的运行情况来预防系统的问题。
 
  接下来让它和MRTG整合在一起吧
 
  [root@securitycn ~]# cd /usr/local/mrtg/bin
 
  [root@securitycn ~]# vi temp.sh
 
  #内容如下
 
  #!/bin/bash
 
  cputemp=`/usr/bin/sensors | grep temp1 |awk '{print $2}'|cut -c 2-4` #这句是说找出有temp1那一行,印出第二个列的2-4个字
 
  systemp =`/usr/bin/sensors | grep temp2 |awk '{print $2}'|cut -c 2-5` 不用解释了吧
 
  echo $cputemp
 
  echo $systemp
 
  # the uptime
 
  uptime | sed 's:^.* up \(。*\), [0-9][0-9]* users.*$:\1:'
 
  # my name
 
  uname -n
 
  [root@securitycn ~]# chmod +x temp.sh 改成可执行
 
  [root@securitycn ~]# ./temp.sh 试试看有没有问题
 
  39
 
  33.5
 
  15:36:19 up 22:28, 1 user, load average: 0.04, 0.09, 0.04
 
  securitycn
 
  再来写下面的文件
 
  [root@securitycn ~]# cd ……/etc/
 
  [root@securitycn ~]# vi temp.cfg
 
  WorkDir: /data1/usr/apache/htdocs/mrtg/temp/
 
  Target[index]: `/usr/local/mrtg/bin/temp.sh`
 
  MaxBytes[index]:80
 
  Options[index]: gauge, nopercent, growright
 
  YLegend[index]: Temp (度)
 
  ShortLegend[index]: 度
 
  LegendO[index]: 系统温度;
 
  LegendI[index]: CPU温度;
 
  Title[index]: 系统温度表
 
  PageTop[index]:主机温度表
 
  [root@securitycn ~]# /usr/local/mrtg/bin/mrtg /usr/local/mrtg/etc/temp.cfg
 
  执行3次就不报错了
 
  然后加入到crontab里面
 
  */5 * * * * /usr/local/mrtg/bin/mrtg /usr/local/mrtg/etc/temp.cfg
顶(0)
踩(0)

您可能还会对下面的文章感兴趣:

最新评论