Categories

Saturday 20 July 2013

Sungrid Engine client Installation Centos

Hi,

 Oracle Sungrid engine is mainly used for clustering, in which user can submit jobs on the machines on the cluster.

There will be a master node , submit node and then slave machines in which jobs are submitted.

Below is the steps to install sungrid engine client on the machines. In this scenario master is already installed on the cluster. Here we will install and configure a machine as slave on that cluster.

yum -y install binutils csh java ksh libXpm openmotif22 tcl
yum --enablerepo=epel install gridengine gridengine-qmaster gridengine-execd gridengine-qmon gridengine-devel



cd /usr/share/gridengine/
./inst_sge -x -m



just proceed with the installation.



 
When the grid engine is installed , there will be two directories in which grid

 engine data is found.

 One is /usr/share/gridengine = $SGE_ROOT

in which all configuration files resides

another configuration root is $SGE_CELL which is named as default/common directory

 inside the $SGE_ROOT derectory

 source /usr/share/gridengine/default/common/settings.sh

chkconfig --level 35 sge_execd on
chkconfig --level 35 sgemaster on
qconf -as `hostname`
qconf -msconf


Now you have to make sure that the machine must need  a hostname. Set it in

/etc/sysconfig/network.Then add the hostname in DNS so that hostname is pinging

from any where especially from the master node.



now go to


cd /usr/share/gridengine/defaults/common


Edit the configuration file act_qmaster.


vi act_qmaster


add the hostmame of the master node here.


suppose say the hostname of master kmaster.example.com


and hostname of the new node we are creating is knode01.example.com





Now ssh into the master node 
kmaster.example.com





run the command qconf -shgrpl



this command will show the list of available host ques.



the queues will be lised as



@queue1

@queue2



etc



now you need to add the new machine to any of the queue on the master node.



using the command qconf -mhgrp @queue1



  a new text file will open and you need to add the hostname of the new machine

(knode01.example.com) to it



save the file in VI.



Now you need to add the machine as submit host and administrative host to run the

 jobs in them



so use the command



qconf -ah knode01.example.com

qconf -as knode01.example.com







now exit from the master node and ssh again into the new slave node.



go to directory /usr/share/gridengine



run the command ./install_execd



type in yes when prompted.



This will install the slave node and will add it to the master node.





after the installation is finished.



Please start the service /etc/init,d/sge_execd





and then run the command qstat -f



if every thing is correct , this command will list all the available nodes

on the grid.









Regards

Syamkumar.M















Ad