Difference between revisions of "SUNScholar/Disaster Recovery/System Monitor"

From Libopedia
Jump to navigation Jump to search
 
(25 intermediate revisions by the same user not shown)
Line 1: Line 1:
 +
<center>
 +
'''[[SUNScholar/Disaster Recovery|Back to Disaster Recovery]]'''
 +
</center>
 
==Introduction==
 
==Introduction==
 
Now that you have a large number of servers you would like to know how they are performing and be informed of potential problems. At our library we use '''munin''' to do this.
 
Now that you have a large number of servers you would like to know how they are performing and be informed of potential problems. At our library we use '''munin''' to do this.
==Server Setup==
 
It is assumed that you will be using the same server for monitoring and backup. To setup munin to gather client statistics, follow the procedure below.
 
  
Add a firewall rule to allow the server to get the stats:
+
To setup a similar system, first configure the server and then configure the clients. We selected the backup server with the most resources to act as our monitoring server as well.
ufw allow 4949
+
==Instructions==
 +
===[[SUNScholar/Disaster Recovery/System Monitor/Server|Step 1 - Server Setup]]===
  
To install munin, type the following:
+
===[[SUNScholar/Disaster Recovery/System Monitor/Client|Step 2 - Client Setup]]===
apt-get install munin
 
Add the clients to the '''/etc/munin/munin.conf''' file as follows.
 
  
Open the file for editing.
+
==YouTube Video==
nano /etc/munin/munin.conf
+
<html5media width="560" height="315">https://www.youtube.com/watch?v=pGSn2CA5u7Y</html5media>
Add one of the following for each client:
 
<pre>
 
[%hostname-of-client%]
 
    use_node_name yes
 
    address %ip-address-of-client%
 
</pre>
 
Setup the munin server.
 
munin-check
 
Wait for about 5 to 10 minutes for munin to gather data and then check out the stats as follows.
 
  
Open a web browser and type the following in the address bar:
+
==References==
http://%hostname-of-monitoring-server%/munin
+
*http://en.wikipedia.org/wiki/Munin_(network_monitoring_application)
You should get a page like this:
+
*http://munin-monitoring.org
 
+
*http://lintut.com/best-free-monitoring-system-for-linux
[[File:Munin-1.png|900px|border]]
+
*http://www.tecmint.com/linux-system-monitoring-troubleshooting-tools
 
+
*http://www.tecmint.com/install-munin-network-monitoring-in-rhel-centos-fedora
==Client Setup==
+
[[Category:System Administration]]
Login and become the root user. Install munin as follows:
 
apt-get install libdbd-pg-perl
 
 
 
apt-get install munin-node
 
Setup munin to allow the monitoring server to gather statistics as follow:
 
nano /etc/munin/munin-node.conf
 
Add the following to the bottom of the file:
 
allow $ip-address-of-monitoring-server%
 
Change the following:
 
host_name %hostname-of-client%
 
Save the file.
 
Run the following command to check to update stats available:
 
munin-node-configure --shell | bash -
 
Check which stats are available
 
cd /etc/munin/plugins
 
 
 
Add a firewall rule to allow the monitoring server to get the stats:
 
ufw allow 4949
 
 
 
Thats it. As usual there is a lot of documentation about Munin out there.
 

Latest revision as of 10:26, 28 August 2016

Back to Disaster Recovery

Introduction

Now that you have a large number of servers you would like to know how they are performing and be informed of potential problems. At our library we use munin to do this.

To setup a similar system, first configure the server and then configure the clients. We selected the backup server with the most resources to act as our monitoring server as well.

Instructions

Step 1 - Server Setup

Step 2 - Client Setup

YouTube Video

References