Difference between revisions of "SUNScholar/Disaster Recovery/System Monitor"

From Libopedia
Jump to navigation Jump to search
(Created page with " ==System Monitoring== Now that you have a large number of servers you would like to know how they are performing and be informed of potential problems. At our library we use '''...")
 
 
(29 intermediate revisions by the same user not shown)
Line 1: Line 1:
 
+
<center>
==System Monitoring==
+
'''[[SUNScholar/Disaster Recovery|Back to Disaster Recovery]]'''
 +
</center>
 +
==Introduction==
 
Now that you have a large number of servers you would like to know how they are performing and be informed of potential problems. At our library we use '''munin''' to do this.
 
Now that you have a large number of servers you would like to know how they are performing and be informed of potential problems. At our library we use '''munin''' to do this.
===Server Setup===
 
It is assumed that you will be using the same server for monitoring and backup. To setup munin to gather client statistics, follow the procedure below.
 
 
Add a firewall rule to allow the server to get the stats:
 
ufw allow 4949
 
 
To install munin, type the following:
 
apt-get install munin
 
Add the clients to the '''/etc/munin/munin.conf''' file as follows.
 
 
Open the file for editing.
 
nano /etc/munin/munin.conf
 
Add one of the following for each client:
 
<pre>
 
[%hostname-of-client%]
 
    use_node_name yes
 
    address %ip-address-of-client%
 
</pre>
 
Setup the munin server.
 
munin-check
 
Wait for about 5 to 10 minutes for munin to gather data and then check out the stats as follows.
 
 
Open a web browser and type the following in the address bar:
 
http://%hostname-of-monitoring-server%/munin
 
You should get a page like this:
 
  
[[File:Munin-1.png|900px|border]]
+
To setup a similar system, first configure the server and then configure the clients. We selected the backup server with the most resources to act as our monitoring server as well.
 +
==Instructions==
 +
===[[SUNScholar/Disaster Recovery/System Monitor/Server|Step 1 - Server Setup]]===
  
===Client Setup===
+
===[[SUNScholar/Disaster Recovery/System Monitor/Client|Step 2 - Client Setup]]===
Login and become the root user. Install munin as follows:
 
apt-get install munin-node
 
Setup munin to allow the monitoring server to gather statistics as follow:
 
nano /etc/munin/munin-node.conf
 
Add the following to the bottom of the file:
 
allow $ip-address-of-monitoring-server%
 
Change the following:
 
host_name %hostname-of-client%
 
Save the file.
 
Run the following command to check to update stats available:
 
munin-node-configure --shell | bash -
 
Check which stats are available
 
cd /etc/munin/plugins
 
  
Add a firewall rule to allow the monitoring server to get the stats:
+
==YouTube Video==
ufw allow 4949
+
<html5media width="560" height="315">https://www.youtube.com/watch?v=pGSn2CA5u7Y</html5media>
  
Thats it. As usual there is a lot of documentation about Munin out there.
+
==References==
 +
*http://en.wikipedia.org/wiki/Munin_(network_monitoring_application)
 +
*http://munin-monitoring.org
 +
*http://lintut.com/best-free-monitoring-system-for-linux
 +
*http://www.tecmint.com/linux-system-monitoring-troubleshooting-tools
 +
*http://www.tecmint.com/install-munin-network-monitoring-in-rhel-centos-fedora
 +
[[Category:System Administration]]

Latest revision as of 10:26, 28 August 2016

Back to Disaster Recovery

Introduction

Now that you have a large number of servers you would like to know how they are performing and be informed of potential problems. At our library we use munin to do this.

To setup a similar system, first configure the server and then configure the clients. We selected the backup server with the most resources to act as our monitoring server as well.

Instructions

Step 1 - Server Setup

Step 2 - Client Setup

YouTube Video

References